VictoriaMetrics

mirror of https://github.com/VictoriaMetrics/VictoriaMetrics.git synced 2024-12-21 07:56:26 +01:00

Author	SHA1	Message	Date
Roman Khavronenko	25317b4e70	vmalert: follow-up after `d4ac4b7813` (#4659 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-07-18 15:53:37 +02:00
hagen1778	99f4f6a653	docs: mention change from `6f3fee197e` Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-07-18 11:48:06 +02:00
Aliaksandr Valialkin	9c3717412a	docs/VictoriaLogs: add CHANGELOG.md	2023-07-17 23:14:05 -07:00
Aliaksandr Valialkin	8815080030	app/vmselect/promql: add the ability to copy all the labels from `one` side of group_left()/group_right() operation This is performed by specifying `` inside group_left()/group_right(). Also allow specifying prefix for the copied labels via `group_left(...) prefix "..."` and `group_right(...) prefix "..."` syntax. For example, the following query adds all the namespace-related labels to pod info, and prefixes all the copied label names with "ns_" prefix: kube_pod_info on(namespace) group_left(*) prefix "ns_" kube_namespace_labels This resolves the following StackOverflow questions: - https://stackoverflow.com/questions/76661818/how-to-add-namespace-labels-to-pod-labels-in-prometheus - https://stackoverflow.com/questions/76653997/how-can-i-make-a-new-copy-of-kube-namespace-labels-metric-with-a-different-name	2023-07-17 19:07:39 -07:00
Aliaksandr Valialkin	be31bdc88c	app/vmselect/promql: recommend to use `(a op b) keep_metric_names` instead of `a op b keep_metric_names` The `a op b keep_metric_names` is ambigouos to `a op (b keep_metric_names)` when `b` is a transform or rollup function. For example, `a + rate(b) keep_metric_names`. So it is better to use more clear syntax: `(a op b) keep_metric_names` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3710	2023-07-16 23:46:34 -07:00
Zakhar Bessarab	e2367b6d1c	metricsql: add support of using keep_metric_names for binary operations (#4109 ) * metricsql: add support of using keep_metric_names for binary operations This should help to avoid confusion with queries like one in the issue #3710. Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * wip --------- Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-07-16 03:00:39 -07:00
Aliaksandr Valialkin	4cb024d8a3	all: add support for `or` filters in series selectors This commit adds ability to select series matching distinct filters via a single series selector. For example, the following selector selects series with either {env="prod",job="a"} or {env="dev",job="b"} labels: {env="prod",job="a" or env="dev",job="b"} The `or` filter is supported in all the VictoriaMetrics tools now. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3997 Uses https://github.com/VictoriaMetrics/metricsql/pull/14	2023-07-16 00:06:33 -07:00
Haleygo	b002e2a743	vmalert: fix evalTS after modify group interval (#4629 )	2023-07-14 14:45:24 +02:00
Aliaksandr Valialkin	71f3898f84	docs/CHANGELOG.md: refer to the commit `7094fa38bc`	2023-07-13 16:14:28 -07:00
Aliaksandr Valialkin	7094fa38bc	lib/storage: switch from global to per-day index for `MetricName -> TSID` mapping Previously all the newly ingested time series were registered in global `MetricName -> TSID` index. This index was used during data ingestion for locating the TSID (internal series id) for the given canonical metric name (the canonical metric name consists of metric name plus all its labels sorted by label names). The `MetricName -> TSID` index is stored on disk in order to make sure that the data isn't lost on VictoriaMetrics restart or unclean shutdown. The lookup in this index is relatively slow, since VictoriaMetrics needs to read the corresponding data block from disk, unpack it, put the unpacked block into `indexdb/dataBlocks` cache, and then search for the given `MetricName -> TSID` entry there. So VictoriaMetrics uses in-memory cache for speeding up the lookup for active time series. This cache is named `storage/tsid`. If this cache capacity is enough for all the currently ingested active time series, then VictoriaMetrics works fast, since it doesn't need to read the data from disk. VictoriaMetrics starts reading data from `MetricName -> TSID` on-disk index in the following cases: - If `storage/tsid` cache capacity isn't enough for active time series. Then just increase available memory for VictoriaMetrics or reduce the number of active time series ingested into VictoriaMetrics. - If new time series is ingested into VictoriaMetrics. In this case it cannot find the needed entry in the `storage/tsid` cache, so it needs to consult on-disk `MetricName -> TSID` index, since it doesn't know that the index has no the corresponding entry too. This is a typical event under high churn rate, when old time series are constantly substituted with new time series. Reading the data from `MetricName -> TSID` index is slow, so inserts, which lead to reading this index, are counted as slow inserts, and they can be monitored via `vm_slow_row_inserts_total` metric exposed by VictoriaMetrics. Prior to this commit the `MetricName -> TSID` index was global, e.g. it contained entries sorted by `MetricName` for all the time series ever ingested into VictoriaMetrics during the configured -retentionPeriod. This index can become very large under high churn rate and long retention. VictoriaMetrics caches data from this index in `indexdb/dataBlocks` in-memory cache for speeding up index lookups. The `indexdb/dataBlocks` cache may occupy significant share of available memory for storing recently accessed blocks at `MetricName -> TSID` index when searching for newly ingested time series. This commit switches from global `MetricName -> TSID` index to per-day index. This allows significantly reducing the amounts of data, which needs to be cached in `indexdb/dataBlocks`, since now VictoriaMetrics consults only the index for the current day when new time series is ingested into it. The downside of this change is increased indexdb size on disk for workloads without high churn rate, e.g. with static time series, which do no change over time, since now VictoriaMetrics needs to store identical `MetricName -> TSID` entries for static time series for every day. This change removes an optimization for reducing CPU and disk IO spikes at indexdb rotation, since it didn't work correctly - see https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1401 . At the same time the change fixes the issue, which could result in lost access to time series, which stop receving new samples during the first hour after indexdb rotation - see https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2698 The issue with the increased CPU and disk IO usage during indexdb rotation will be addressed in a separate commit according to https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1401#issuecomment-1553488685 This is a follow-up for `1f28b46ae9`	2023-07-13 16:07:30 -07:00
Aliaksandr Valialkin	8eeaf9b1f6	docs/CHANGELOG.md: clarify the description of the bugfix at `177a0c1ca9` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4555	2023-07-13 12:20:03 -07:00
Dmytro Kozlov	c76084b529	app/vmctl: fix panic `--remote-read-filter-time-start` flag not defined (#4605 ) * app/vmctl: fix panic `--remote-read-filter-time-start` flag not defined * app/vmctl: update CHANGELOG.md --------- Co-authored-by: Nikolay <nik@victoriametrics.com>	2023-07-13 17:14:43 +02:00
Dmytro Kozlov	177a0c1ca9	app/vmctl: fix issue with adding many seconds (#4617 ) * app/vmctl: fix issue with adding many seconds * app/vmagent: add CHANGELOG.md	2023-07-13 17:11:48 +02:00
Roman Khavronenko	cbc28ccdb2	vmalert: check for negative offset for missed rounds (#4628 ) It could happen for low evaluation intervals and irregular delays during execution that evaluation time would get a negative offset. This could result into cumulative discrepancy between the actual time and evaluation time for rules. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-07-13 17:11:22 +02:00
Aliaksandr Valialkin	30cdcc751d	all: update Go builder from 1.20.5 to 1.20.6 See https://github.com/golang/go/issues?q=milestone%3AGo1.20.6+label%3ACherryPickApproved	2023-07-12 00:59:59 -07:00
Roman Khavronenko	fb03762d4d	vmselect: introduce `search.skipSlowReplicas` cmd-line flag (#4538 ) * vmselect: introduce `search.skipSlowReplicas` cmd-line flag vmselect has two logical conditions during request processing when `-replicationFactor` cmd-line flag is set: 1. If at least `len(storageNodes) - replicationFactor` responded, it could skip waiting for the rest of nodes to respond. This could lead to problems described here https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1207. 2. Mark response as partial if less than `len(storageNodes) - replicationFactor` responded without an error. The P1 showed itself error-prone and became the main reason why `-replicationFactor` wasn't recommended to use at vmselect level. However, this optimization could be still very useful in situations when there are slow and fast replicas in cluster. But P2 remains viable and important conditionless. Hiding P1 behind the feature-flag `search.skipSlowReplicas` should make `-replicationFactor` flag usable again. And let users choose whether they want P1 to be respected. Related issues https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1207 https://github.com/VictoriaMetrics/VictoriaMetrics/issues/711 Signed-off-by: hagen1778 <roman@victoriametrics.com> * docs: update changelog Signed-off-by: hagen1778 <roman@victoriametrics.com> --------- Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-07-09 12:30:06 -07:00
Haleygo	20e7db47ee	vmselect: fix result in Prometheus query when time is small (#4578 ) vmselect: fix result in Prometheus query when time is small Co-authored-by: Roman Khavronenko <roman@victoriametrics.com>	2023-07-07 11:48:05 +02:00
Haleygo	bca8ae034f	vmalert:fix query request using rfc3339 format (#4577 ) vmalert: consistently use time.RFC3339 format for time in queries Co-authored-by: hagen1778 <roman@victoriametrics.com>	2023-07-07 10:39:25 +02:00
Roman Khavronenko	7c8a215a7c	vmalert: allow disabling of `step` param attached to instant queries (#4574 ) vmalert: allow disabling of `step` param attached to instant queries This might be useful for using vmalert with datasources that to not support this param, unlike VictoriaMetrics. See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4573 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-07-07 07:44:34 +02:00
Aliaksandr Valialkin	152ca00fb8	docs/CHANGELOG.md: clarify description for https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4336 bugfix This is a follow-up for `5eb5df96e2`	2023-07-06 17:09:03 -07:00
Aliaksandr Valialkin	c851d78c93	docs/CHANGELOG.md: use the proper link to the issue related to the commit `7a92263459` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4402	2023-07-06 16:59:49 -07:00
Aliaksandr Valialkin	4991d9b299	docs/CHANGELOG.md: remove redundant info from the url to consulagent_sd_configs docs This is a follow-up for `40d12be607`	2023-07-06 16:53:05 -07:00
Aliaksandr Valialkin	c473dcaac8	docs/CHANGELOG.md: clarify the description of the bugfix at `ce7141383d`	2023-07-06 16:24:03 -07:00
Aliaksandr Valialkin	10a0533417	docs/CHANGELOG.md: remove the change regarding http2 support at vmagent This is a follow-up for `8a07621a0c` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4283	2023-07-06 16:06:30 -07:00
Aliaksandr Valialkin	7f3b5431a1	app/vmselect/graphite: follow-up after `c7884f8686` - Consistently use -search.maxGraphiteTagValues for limiting tag values from auto-complete API - Use -search.maxGraphiteSeries for limiting paths (aka series), which can be returned from Graphite series API - Clarify the change in docs/CHANGELOG.md Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4339 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2841	2023-07-06 15:21:56 -07:00
Alexander Marshalov	af53c7cc78	fix removing storage data dir before restoring from backup (#598 ) * fix removing storage data dir before restoring from backup Signed-off-by: Alexander Marshalov <_@marshalov.org> * fix review comment Signed-off-by: Alexander Marshalov <_@marshalov.org> * fix review comment Signed-off-by: Alexander Marshalov <_@marshalov.org> * fixes after merge with `enterprise-single-node` branch Signed-off-by: Alexander Marshalov <_@marshalov.org> --------- Signed-off-by: Alexander Marshalov <_@marshalov.org>	2023-07-06 14:16:18 -07:00
Aliaksandr Valialkin	3d28357bd3	app/vmselect/netstorage: follow-up after `11ac551d52` - Clarify the scope of the fix at docs/CHANGELOG.md - Handle the case when -search.maxSamplesPerSeries limit is exceeded in the same way as the -search.maxSamplesPerQuery limit. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/4472	2023-07-05 21:25:06 -07:00
Roman Khavronenko	2f710ec77d	vmctl: interrupt explore procedure in influx mode if no numeric fields were found (#4576 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-07-04 13:45:42 +02:00
Roman Khavronenko	8fe5b37978	docs: follow-up after `9da638aa66` (#4572 ) `9da638aa66` Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-07-04 11:27:38 +04:00
Dmytro Kozlov	9bde95bfff	app/vmalert: show on UI groups error after reload config (#4543 ) show on UI groups error after reload config https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4076 Co-authored-by: hagen1778 <roman@victoriametrics.com>	2023-07-03 14:59:52 +02:00
Haleygo	5fc0ee43d4	fix parse for invalid partial RFC3339 format (#4539 ) The validation was needed for covering corner cases when storage is tested with data from 1970. This resulted into unexpected search results, as year was parsed incorrectly from the given timestamp. Co-authored-by: hagen1778 <roman@victoriametrics.com>	2023-07-03 13:11:49 +02:00
Nikolay	c30492312f	docs: adds v1.91.3 release docs (#4561 )	2023-07-03 10:31:02 +02:00
Yury Molodov	3cdba1b1c6	vmui: fix app routing issues (#4408 ) The change focuses on rectifying inconsistencies in the navigation behavior of the application and eliminating issues encountered when manually altering the URL. The key updates include: - Refactoring of the routing mechanism to handle all possible routes and their states. - Enhancement of the React Router usage to ensure a smoother navigation experience. - Handling application state when the URL is manually changed.	2023-06-30 10:13:10 +02:00
Alexander Marshalov	1cc06e39cd	show backup progress percentage in vmbackup log during backup uploading and restoring progress percentage in vmrestore log during backup downloading (#4460 ) (#4530 ) Signed-off-by: Alexander Marshalov <_@marshalov.org>	2023-06-28 14:44:45 +02:00
Roman Khavronenko	72edc31ffb	vmauth: expose latency metrics per user (#4525 ) expose `vmauth_user_request_duration_seconds` and `vmauth_unauthorized_user_request_duration_seconds` summary metrics for measuring requests latency per user. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-06-27 20:15:17 +02:00
Haleygo	a97887a2d9	vmalert: add `vmalert_remotewrite_sent_duration_seconds_total` metric (#4517 ) add `vmalert_remotewrite_sent_duration_seconds_total` metric	2023-06-26 07:34:51 +02:00
Roman Khavronenko	5f9ad22884	vmalert: update retry policy for pushing data to `-remoteWrite.url` (#4504 ) By default, vmalert will make multiple retry attempts with exponential delay. The total time spent during retry attempts shouldn't exceed `-remoteWrite.retryMaxTime` (default is 30s). When retry time is exceeded vmalert drops the data dedicated for `-remoteWrite.url`. Before, vmalert dropped data after 5 retry attempts with 1s delay between attempts (not configurable). See `-remoteWrite.retryMinInterval` and `-remoteWrite.retryMaxTime` cmd-line flags. Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: Nikolay <nik@victoriametrics.com>	2023-06-22 15:14:23 +02:00
Roman Khavronenko	4aad7a43df	vmalert: properly interrupt remotewrite retries on shutdown (#4505 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-06-22 15:07:32 +02:00
Zakhar Bessarab	57a4ad3fa8	docs/changelog: followup for `830dac177f` (#4499 ) Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2023-06-22 11:24:37 +02:00
Roman Khavronenko	79a5499cb2	vmalert: retry all errors except 4XX status codes (#4461 ) vmalert: retry all errors except 4XX status codes Retry all errors except 4XX status codes while pushing via remote-write to the remote storage. Previously, errors like broken connection could prevent vmalert from retrying the request. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-06-20 13:24:45 +02:00
Yury Molodov	66b42a6772	vmui: memory leak fix (#4455 ) * fix: optimize the preparation of data for the graph * fix: optimize tooltip rendering * fix: optimize re-rendering of the chart * vmui: memory leak fix	2023-06-20 11:29:24 +02:00
Aliaksandr Valialkin	b49d04b3dc	lib/promutils.ParseTime(): add support for timestamps in milliseconds See https://stackoverflow.com/questions/76437098/how-to-handle-time-unit-and-step-while-ingesting-or-querying-in-victoriametrics/76438405 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4459	2023-06-19 22:25:04 -07:00
Nikolay	5eb5df96e2	lib/storage: creates parts.json on start-up if it not exists. (#4450 ) * lib/storage: creates parts.json on start-up if it not exists. It fixes migrations from versions below v1.90.0. Previously parts.json was created only after successful merge. But if merge was interruped for some reason (OOM or shutdown), parts.json wasn't created and partitions left after interruped merge weren't properly deleted. Since VM cannot check if it must be removed or not. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4336 * Apply suggestions from code review Co-authored-by: Roman Khavronenko <roman@victoriametrics.com> * Update lib/storage/partition.go Co-authored-by: Roman Khavronenko <roman@victoriametrics.com> --------- Co-authored-by: Roman Khavronenko <roman@victoriametrics.com>	2023-06-15 11:19:22 +02:00
Dmytro Kozlov	7a92263459	vmctl: increase retry backoff policy delay (#4447 ) vmctl: update backoff policy on retries to reduce probability of overloading for `source` or `destination` databases	2023-06-14 09:47:44 +02:00
Dmytro Kozlov	ddb3ae0f00	vmctl: finish retries if context canceled (#4442 ) vmctl: interrupt backoff retries if import context is cancelled Co-authored-by: Roman Khavronenko <roman@victoriametrics.com>	2023-06-13 13:54:24 +02:00
Alexander Marshalov	40d12be607	fixed service name detection for consulagent service discovery in case of a difference in service name and service id (#4390 ) (#4439 ) Signed-off-by: Alexander Marshalov <_@marshalov.org>	2023-06-12 16:16:43 +02:00
Roman Khavronenko	ccaa9571ef	Dashboard upd (#4438 ) dashboards: update dashboard for single-node version * add anonymous mem usage panel; * add syscall rate panel; * add location to logs panel; * update legend for panels to reflect instance name; * update queries to aggregate per instance. dashboards: update dashboard for cluster version * add syscall rate panel; * add drilldown to logs panel. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-06-12 15:58:47 +02:00
Roman Khavronenko	476c7bdd6f	all: update Go builder from Go1.20.4 to Go1.20.5 (#4427 ) See https://github.com/golang/go/issues?q=milestone%3AGo1.20.5+label%3ACherryPickApproved Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-06-09 09:42:55 +02:00
Roman Khavronenko	d4c314d628	docs/changelog: mention `a6a7795b9e` change (#4425 ) docs/changelog: mention `a6a7795b9e` change `a6a7795b9e` Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-06-09 09:12:41 +02:00
Zakhar Bessarab	9a490d0b5c	doc: changelog followup for #4420 fix (#4421 ) Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2023-06-07 16:16:23 +02:00
Zakhar Bessarab	ce7141383d	app/vmagent/remotewrite: fix vmagent panic on shutdown (#4407 ) app/vmagent/remotewrite: fix vmagent panic on shutdown Currently, when vmagent is stopping it first flushes pending series in remote write context and proceeds to stop streaming aggregation. This leads to streaming aggregation being unable to write results into pending timeseries (since it is already nil) and panic. This can lead to losing some aggregation results being lost almost silently. The fix is reordering flow to first stop streaming aggregation and flush all pending time series after that. Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2023-06-07 15:45:43 +02:00
Roman Khavronenko	3305a6901c	app/vmagent: mention `enable_http2` in changelog (#4403 ) Follow-up after `72c3cd47eb` Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-06-05 16:31:58 +02:00
Roman Khavronenko	cc739e3f8d	docs/CHANGELOG.md: cut v1.91.2 (#4393 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-06-03 11:17:23 +02:00
Dmytro Kozlov	fc5292d8ed	app/vmctl: add verbose output for docker installations or when TTY isn't available (#4333 ) * app/vmctl: add verbose output for docker installations or when TTY isn't available * app/vmctl: fix tests * app/vmctl: make vmctl interactive if no tty * app/vmctl: cleanup * app/vmctl: add comment --------- Co-authored-by: Nikolay <nik@victoriametrics.com>	2023-06-02 14:57:08 +02:00
Dmytro Kozlov	c7884f8686	app/{graphite,netstorage,prometheus}: fix graphite search tags api limits, remove redudant limit from SeriesHandler handler (#4352 ) * app/{graphite,netstorage,prometheus}: fix graphite search tags api limits, remove unused limit from SeriesHandler handler, * app/{graphite,netstorage,prometheus}: use search.maxTagValues for Graphite * app/{graphite,netstorage,prometheus}: update CHANGELOG.md * app/{graphite,netstorage,prometheus}: use own flags for Graphite API * app/{graphite,netstorage,prometheus}: cleanup * app/{graphite,netstorage,prometheus}: cleanup * app/{graphite,netstorage,prometheus}: update docs --------- Co-authored-by: Nikolay <nik@victoriametrics.com>	2023-06-02 14:34:04 +02:00
Roman Khavronenko	de94812088	vmalert: fix nil map assignment (#4392 ) * vmalert: fix nil map assignment The storage instance with nil map params was created for remote-read purposes. And before change `7a9ae9de0d` this map was ignored in ApplyParams. Now, it started to be used and vmalert panics in runtime. The fix properly inits map for at `NewVMStorage` and verifies it is not nil on assignment in `ApplyParams`. Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: add to changelog Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: properly clone Storage params Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: properly clone Storage params Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: properly clone Storage params Signed-off-by: hagen1778 <roman@victoriametrics.com> --------- Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-06-02 11:38:55 +02:00
Roman Khavronenko	b771152039	docs/CHANGELOG.md: cut v1.91.1 (#4386 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-06-02 07:50:24 +02:00
Nikolay	2c876227e4	docs/changlelog: mention `6c84b61` (#4384 )	2023-06-01 13:45:12 +02:00
Roman Khavronenko	4b5faf7efb	docs: mention fix for https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4221 (#4382 ) https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4221 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-06-01 11:30:42 +02:00
Dmytro Kozlov	9843ec0e1d	app/vmui: fix behavior when changing url in global settings (#4332 ) * app/vmui: fix behavior when changing url in global settings * app/vmctl: minor fix * app/vmui: fix behavior when changing url in global settings	2023-06-01 12:19:03 +03:00
Nikolay	a0bf8f233f	docs: mention recent changes at changelog (#4379 )	2023-06-01 10:57:32 +02:00
Nikolay	f263031fe9	app/vmauth: properly handle LOCAL proxy protocol command (#4373 ) app/vmauth: properly handle LOCAL proxy protocol command It is required for handling health checks from load balancers https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3335	2023-05-31 15:37:59 +02:00
Roman Khavronenko	51cea6cad4	vmalert: properly form assets address if httpPrefix set (#4351 ) Properly form path to static assets in WEB UI if `http.pathPrefix` set. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4349 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-05-29 07:38:13 +02:00
Aliaksandr Valialkin	388ffec262	docs/CHANGELOG.md: document v1.79.13 LTS release	2023-05-18 19:56:28 -07:00
Aliaksandr Valialkin	9e21315315	docs/CHANGELOG.md: document v1.87.6 LTS release	2023-05-18 17:50:12 -07:00
Aliaksandr Valialkin	52b5498165	docs/CHANGELOG.md: cut v1.91.0	2023-05-18 12:37:12 -07:00
Aliaksandr Valialkin	1f28b46ae9	lib/storage: revert the migration from global to per-day index for (MetricName -> TSID) This reverts the following commits: - `e0e16a2d36` - `2ce02a7fe6` The reason for revert: the updated logic breaks assumptions made when fixing https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2698 . For example, if a time series stop receiving new samples during the first day after the indexdb rotation, there are chances that the time series won't be registered in the new indexdb. This is OK until the next indexdb rotation, since the time series is registered in the previous indexdb, so it can be found during queries. But the time series will become invisible for search after the next indexdb rotation, while its data is still there. There is also incompletely solved issue with the increased CPU and disk IO resource usage just after the indexdb rotation. There was an attempt to fix it, but it didn't fix it in full, while introducing the issue mentioned above. See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1401 TODO: to find out the solution, which simultaneously solves the following issues: - increased memory usage for setups high churn rate and long retention (e.g. what the reverted commit does) - increased CPU and disk IO usage during indexdb rotation ( https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1401 ) - https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2698 Possible solution - to create the new indexdb in one hour before the indexdb rotation and to gradually pre-populate it with the needed index data during the last hour before indexdb rotation. Then the new indexdb will contain all the needed data just after the rotation, so it won't trigger increased CPU and disk IO.	2023-05-18 11:30:49 -07:00
Aliaksandr Valialkin	4f7f750850	lib/handshake: do not pollute logs with `cannot read hello` messages on TCP health checks Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1762	2023-05-18 10:41:34 -07:00
Dmytro Kozlov	7cbda6796c	app/vmctl: set default value for --vm-native-step-interval flag (#4327 ) * app/vmctl: set default value for `--vm-native-step-interval` flag * app/vmctl: update CHANGELOG.md * app/vmctl: update CHANGELOG.md, fix docs * app/vmctl: fix typo * app/vmctl: fix typo	2023-05-18 13:43:35 +02:00
Denys Holius	c605d64a95	deployment/docker/Makefile: updated docker compose commands regarding migration from V1 to V2 (#4314 ) deployment/docker/Makefile: updated docker compose commands regarding migration from V1 to V2	2023-05-17 13:14:24 +02:00
Aliaksandr Valialkin	63b1cab454	app/vmauth: simplify the code after `4a1d29126c` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4242	2023-05-17 00:37:05 -07:00
Nikolay	4a1d29126c	app/vmauth: retry common network dial errors (#4280 ) with tracking request body read calls it allows us to retry POST and PUT requests	2023-05-17 00:19:33 -07:00
Nikolay	16df18ec14	app/vmauth: do not return invalid credentials (#4288 ) at http response by default https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4188 based on https://github.com/VictoriaMetrics/VictoriaMetrics/pull/4190 Thanks @raj-kumar-j for init implementation	2023-05-17 00:09:47 -07:00
Aliaksandr Valialkin	e0e16a2d36	lib/storage: follow-up after `2ce02a7fe6` - Document the change at docs/CHANGELOG.md - Clarify comments for non-trivial code touched by the commit - Improve the logic behind maybeCreateIndexes(): - Correctly create per-day indexes if the indexdb rotation is performed during the first hour or the last hour of the day by UTC. Previously there was a possibility of missing index entries on that day. - Increase the duration for creating new indexes in the current indexdb for up to 22 hours after indexdb rotation. This should reduce the increased resource usage after indexdb rotation. It is safe to postpone index creation for the current day until the last hour of the current day after indexdb rotation by UTC, since the corresponding (date, ...) entries exist in the previous indexdb. - Search for TSID by (date, MetricName) in both the current and the previous indexdb. Previously the search was performed only in the current indexdb. This could lead to excess creation of per-day indexes for the current day just after indexdb rotation. - Search for (date, metricID) entries in both the current and the previous indexdb. Previously the search was performed only in the current indexdb. This could lead to excess creation of per-day indexes for the current day just after indexdb rotation.	2023-05-16 23:19:27 -07:00
Aliaksandr Valialkin	278278af95	lib/storage: reduce the unimportant logging during Storage start / stop This should improve the visibility of potentially important logs	2023-05-16 15:14:21 -07:00
Aliaksandr Valialkin	664db964ca	vendor: update github.com/VictoriaMetrics/metrics from v1.23.1 to v1.24.0 This change adds process_* metrics to VictoriaMetrics components under Windows OS See https://github.com/VictoriaMetrics/metrics/pull/47	2023-05-16 11:37:07 -07:00
Alexander Marshalov	7b15834cbe	added backup locking/unlocking against retention policy to vmbackupmanager (#558 ) * added backup locking/unlocking against retention policy to vmbackupmanager Signed-off-by: Alexander Marshalov <_@marshalov.org> * added docs for new commands Signed-off-by: Alexander Marshalov <_@marshalov.org> * fix review comments Signed-off-by: Alexander Marshalov <_@marshalov.org> --------- Signed-off-by: Alexander Marshalov <_@marshalov.org>	2023-05-16 11:23:36 -07:00
Roman Khavronenko	f68d93cca2	vmalert: follow-up after `669becd011` (#4318 ) * vmalert: follow-up after `669becd011` Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: follow-up after `669becd011` Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: follow-up after `669becd011` Signed-off-by: hagen1778 <roman@victoriametrics.com> --------- Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-05-16 18:51:38 +02:00
Aliaksandr Valialkin	2613110f75	deployment/docker: update base docker image from 3.17.3 to 3.18.0 See https://www.alpinelinux.org/posts/Alpine-3.18.0-released.html	2023-05-12 17:31:21 -07:00
Aliaksandr Valialkin	616175b1ce	lib/promutils: properly return error when incorrect Prometheus label names are passed to NewLabelsFromString() Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4284 See also https://github.com/VictoriaMetrics/VictoriaMetrics/pull/4304	2023-05-12 16:52:29 -07:00
Aliaksandr Valialkin	318a87c36f	Revert "lib/promrelabel: show error message if labels not in prometheus exposition format (#4304 )" This reverts commit `193a9c3328`. Reason for revert: the commit doesn't fix the real issue with promutils.NewLabelsFromString() function, which must return error when improperly formatted Prometheus metric with labels is passed to it. See https://github.com/prometheus/docs/blob/main/content/docs/instrumenting/exposition_formats.md#text-format-example E.g. the promutils.NewLabelsFromString() must return error when the following strings are passed to it: - `{foo:"bar"}`, since `:` is disallowed in Prometheus text exposition format. The corect value is `{foo="bar"}` - `{"foo":"bar"}`, since label name shouldn't be quoted. The correct value is `{foo="bar"}`. The reverted commit introduces another set of bugs, which happily accept the following invalid input: - `{foo=~"bar"}` - `{foo!="bar"}` - `{foo!~"bar"}` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4284 See also https://github.com/VictoriaMetrics/VictoriaMetrics/pull/4304	2023-05-12 16:07:37 -07:00
Aliaksandr Valialkin	160453b86c	lib/protoparser/csvimport: properly parse the last empty column in CSV line Do not ignore the last empty column in CSV line. While at it, properly parse CSV columns in single quotes, e.g. `'foo,bar',baz` is parsed as two columns - `foo,bar` and `baz` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4048 See also https://github.com/VictoriaMetrics/VictoriaMetrics/pull/4298	2023-05-12 15:51:41 -07:00
Aliaksandr Valialkin	b7fe7b801c	Revert "lib/protoparser: fix skip csv line when metric can be collect from the line (#4298 )" This reverts commit `410ae99c2e`. Reason for revert: the commit masks the real issue instead of fixing it. The real issue is that the scanner.NextColumn() skips the last column if it is empty. The commit also introduces two bugs: - a panic if all the metric values in CSV line are empty - silent import of CSV lines with too small number of columns Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4048 See https://github.com/VictoriaMetrics/VictoriaMetrics/pull/4298	2023-05-12 15:22:27 -07:00
Yury Molodov	f0fad01e8a	vmui: add notification for non-matching queries (#4301 ) vmui: add notification for non-matching queries (#4211) https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4211	2023-05-12 13:55:47 +02:00
Roman Khavronenko	ad2d079ba5	docs: update docs about VMUI pages (#4305 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-05-12 10:51:44 +02:00
Dmytro Kozlov	193a9c3328	lib/promrelabel: show error message if labels not in prometheus exposition format (#4304 ) lib/promrelabel: show error message if labels not in prometheus exposition format https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4284	2023-05-12 10:42:56 +02:00
Dmytro Kozlov	410ae99c2e	lib/protoparser: fix skip csv line when metric can be collect from the line (#4298 ) * lib/protoparser: fix skip csv line when metric can be collect from the line * lib/protoparser: fix comment	2023-05-12 11:04:16 +03:00
Yury Molodov	1e4a9a8dfe	vmui: enhancements to top queries page (#4299 ) * feat: improvement of the top queries page * vmui/docs: enhancements to top queries page * Apply suggestions from code review --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-05-11 13:47:32 -07:00
Aliaksandr Valialkin	5d22c36904	docs/CHANGELOG.md: improve the description of the change at `7ea2531db0` Move the change description to the group of vmui changes. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4213	2023-05-11 13:30:33 -07:00
Aliaksandr Valialkin	73812c71a5	lib/promutils: properly parse time strings with timezones at ParseTime()	2023-05-11 13:24:00 -07:00
Yury Molodov	a55d3f6882	vmui: increase font-size and fix the text display (#4273 ) vmui: change default font size to 14px for better readability vmui: fix bug with missing text on buttons in safari --------- Co-authored-by: Roman Khavronenko <roman@victoriametrics.com>	2023-05-11 14:50:09 +02:00
Dmytro Kozlov	7ea2531db0	app/vmui: added table where Labels with the highest number of unique values show (#4271 ) * app/vmui: added Labels with the highest number of unique values * app/vmui: cleanup * app/vmui: cleanup * app/vmui: add table description * app/vmui: fix comment, updated CHANGELOG.md * app/vmui: disable links * app/vmui: added actions to the table, it will show values for selected label with the highest number of series * app/vmui: fix comment	2023-05-11 15:19:36 +03:00
Aliaksandr Valialkin	86424e079e	docs/CHANGELOG.md: fix typo after `2caf0b05c6`	2023-05-10 13:03:20 -07:00
Artem Navoiev	0ccb7f51dd	fix typo in changelog Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2023-05-10 19:23:24 +02:00
Roman Khavronenko	51196739af	Vmalert UI updates (#4276 ) * vmalert: expand rule groups on anchor click before, anchor click was only updating the URL. To expand the group, user had to click on rule's block. Now, group will toggle automatically. * vmalert: allow filtering group in web UI The new filter allows to filter groups and rules within groups by: errors only or noMatch only. The filtering supposed to help navigating big numbers of groups/rules. Filtering is reflected in URL, so can be shared as a link. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-05-10 14:38:13 +02:00
Aliaksandr Valialkin	4e0345a5ef	docs/CHANGELOG.md: add a link to docs about never-firing alerts Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4039	2023-05-09 21:58:36 -07:00
Aliaksandr Valialkin	78b23c9a83	docs/CHANGELOG.md: document `8f4de6fa47`	2023-05-09 21:39:25 -07:00
Roman Khavronenko	491831df49	vminsert: properly reset labels object on aggregation (#4278 ) Without reset, labels duplicates could have been added during stream aggregation. Since `ctx.Labels` is reused during processing of many series, each series will add its labels to the context. Even if the same labels were already addeded on prev iteration. Now, we reset `ctx.Labels` on each iteration to contain so labels from different series didn't interfere. This could have cause exceeding of the limit on number of labels per pushed time series. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4277 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-05-09 08:33:58 -07:00
Aliaksandr Valialkin	83a1c2484c	docs/CHANGELOG.md: group changelog lines for tip release according to VictoriaMetrics apps	2023-05-08 22:57:23 -07:00
Aliaksandr Valialkin	11eb94d3bc	docs/CHANGELOG.md: document `baf456978d` See https://github.com/VictoriaMetrics/VictoriaMetrics/pull/4223	2023-05-08 22:26:04 -07:00
Aliaksandr Valialkin	fc42617ecd	docs/CHANGELOG.md: refer to the author and the pull request of the notifier_headers feature at vmalert Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3260	2023-05-08 17:18:48 -07:00
Aliaksandr Valialkin	5a02bc56fb	docs/CHANGELOG.md: document `03150c8973` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4204	2023-05-08 16:30:28 -07:00
Aliaksandr Valialkin	3c0470f91e	docs/vmalert.md: clarify docs regarding the support of recursive globs Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4041	2023-05-08 16:21:44 -07:00
Aliaksandr Valialkin	74155afb71	docs: clarify docs after `5ee344824f` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4183	2023-05-08 16:11:44 -07:00
Aliaksandr Valialkin	185894fc5a	app/vmagent/remotewrite: make more user-friendly the warning message about too small -remoteWrite.maxdiskUsagePerURL value This is a follow-up for `bc17f4828c` . While at it, document the change at docs/CHANGELOG.md . Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4195	2023-05-08 15:42:30 -07:00
Aliaksandr Valialkin	d906e83e5e	app/vmauth: merge `default_url` example into multi-url example in order to reduce the amounts of text to read for the user Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4084 This is a follow-up for `041e188df8`	2023-05-08 15:12:23 -07:00
Aliaksandr Valialkin	1db9b78b88	app/vmselect: small cleanup after `68e31a6000` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3811	2023-05-08 14:34:37 -07:00
Aliaksandr Valialkin	80946f06c2	app/{vmselect,vmctl}: move ParseTime() to lib/promutils Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4091 This is a follow-up for `e2053baf32`	2023-05-08 14:17:57 -07:00
Aliaksandr Valialkin	6601fa3e9d	docs/CHANGELOG.md: typo fix after `45a551df9c`: 'this doc' -> 'this feature request'	2023-05-08 13:41:58 -07:00
Aliaksandr Valialkin	8f43f496d7	docs: document IP filters functionality in vmauth Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3491 This is a follow-up for `2f08ed3be2`	2023-05-08 12:12:16 -07:00
Aliaksandr Valialkin	1b288e0a05	all: update Go builder from Go1.20.3 to Go1.20.4 See https://github.com/golang/go/issues?q=milestone%3AGo1.20.4+label%3ACherryPickApproved	2023-05-08 09:40:55 -07:00
Aliaksandr Valialkin	24eeb8d9af	docs/CHANGELOG.md: document `c77385e78f`	2023-05-08 08:50:30 -07:00
Alexander Marshalov	8225a48b56	fixed `vm_promscrape_config_last_reload_successful` metric value recovery after successful reloading with unchanged content (#4260 ) (#4268 ) Signed-off-by: Alexander Marshalov <_@marshalov.org>	2023-05-08 13:32:51 +02:00
Roman Khavronenko	fa3a17938e	vmalert: follow-up after `cae87da` (#4269 ) * vmalert: follow-up after `cae87da` `cae87da4bb` Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: update struct comments Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: rm typo Signed-off-by: hagen1778 <roman@victoriametrics.com> --------- Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-05-08 13:31:54 +02:00
Roman Khavronenko	5b8450fc1b	app/vmalert: detect alerting rules which don't match any series at all (#4198 ) app/vmalert: detect alerting rules which don't match any series at all vmalert starts to understand /query responses which contain object: ``` "stats":{"seriesFetched": "42"} ``` If object is present, vmalert parses it and populates a new field `SeriesFetched`. This field is then used to populate the new metric `vmalert_alerting_rules_last_evaluation_series_fetched` and to display warnings in the vmalert's UI. If response doesn't contain the new object (Prometheus or VictoriaMetrics earlier than v1.90), then `SeriesFetched=nil`. In this case, UI will contain no additional warnings. And `vmalert_alerting_rules_last_evaluation_series_fetched` will be set to `-1`. Negative value of the metric will help to compile correct alerting rule in follow-up. Thanks for the initial implementation to @Haleygo See https://github.com/VictoriaMetrics/VictoriaMetrics/pull/4056 See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4039 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-05-08 09:36:39 +02:00
Roman Khavronenko	01520d3e5d	alerts: update TooHighMemoryUsage threshold (#4256 ) It appears that 90% usage for anonymous mem usage is already concerning. So we lowering the threshold to 80%. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-05-07 22:18:56 +02:00
Zakhar Bessarab	4e71003620	lib/promscrape/discovery/kubernetes: follow-up for `d5e94721db` (#4255 ) - add changelog reference to an author - fix tests - add metadata to match Prometheus behavior Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2023-05-05 14:41:17 +02:00
Zakhar Bessarab	aca256735c	lib/storage: fix indexdb rotation infinite loop (#4249 ) When using `retentionTimezoneOffset` and having local timezone being more than 4 hours different from UTC indexdb retention calculation could return negative value. This caused indexdb rotation to get in loop. Fix calculation of offset to use `retentionTimezoneOffset` value properly and add test to cover all legit timezone configs. See: - https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4207 - https://github.com/VictoriaMetrics/VictoriaMetrics/pull/4206 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> Co-authored-by: Nikolay <nik@victoriametrics.com>	2023-05-04 17:16:48 +02:00
Alexander Marshalov	56b84140a9	added new consulagent service discovery (#3953 ) (#4217 )	2023-05-04 11:36:21 +02:00
Alexander Marshalov	2eb27ddb22	max value for `memory.allowedPercent` changed from 200 to 100 (#4171 ) (#4251 ) Signed-off-by: Alexander Marshalov <_@marshalov.org>	2023-05-04 11:34:57 +02:00
Zakhar Bessarab	ddcc3b1e9d	docs: changelog follow-up for `49b77ec01a` (#4250 ) Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2023-05-03 16:42:22 +04:00
Nikolay	4786f036de	lib/backup: fixes path generation for windows (#4133 ) replaces custom fsync function with standard Fsync methods for files. fixes pattern matching for parts and properly generate backup path for local fs. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/70	2023-05-03 10:48:53 +02:00
Nikolay	73b6c23271	lib/fs: do not panic at windows at dir deletion (#4132 ) Windows doesn't allow to remove dir with opened files. Usually it's a case for snapshots, hard cannot be removed if file is openned. With this change, dir will be renamed and properly deleted at the next process start. It's recommended to restart vmstorage/vmsingle for snapshots deletion completion periodically. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/70	2023-05-03 10:47:02 +02:00
Zakhar Bessarab	bf3b6732bd	lib/promscrape/discovery/kubernetes: add common labels to all ports discovered from endpoints (#4235 ) * lib/promscrape/discovery/kubernetes: add common labels to all ports discovered from endpoints Sets `__meta_kubernetes_endpoints_name` and `__meta_kubernetes_namespace` labels to all ports of pod. Prometheus sets those labels to all ports in pod (`0ab9553611/discovery/kubernetes/endpoints.go (L267C15-L269)`) even if port is not matching any service. See: #4154 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * lib/promscrape/discovery/kubernetes: fix test for updated discovery logic Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> --------- Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2023-05-03 02:17:33 +02:00
Artem Navoiev	4ddfc67d54	remove information of releasing graphite render api from tip section as we released it in 1.90 Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2023-05-02 15:30:22 +02:00
Artem Navoiev	ee1da35071	prepare static docs to migration Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2023-04-30 12:33:36 +02:00
Roman Khavronenko	3383f12a4b	vmalert: fix API to return non-nil values (#4222 ) Properly return empty slices instead of nil for `/api/v1/rules` and `/api/v1/alerts` API handlers. This improves compatibility with Grafana. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4221 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-04-28 10:08:29 +02:00
Roman Khavronenko	eb746a4dab	Revert "http server: limit max concurrent requests (#4185 )" (#4215 ) This reverts commit `77f76371` Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-04-27 13:02:47 +02:00
Roman Khavronenko	29e059e49c	app/vmalert: follow-up after `6c322b4a00` (#4214 ) `6c322b4a00` Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-04-27 13:02:21 +02:00
Zakhar Bessarab	b21a55febf	app/vmalert: add support of recursive path globs for rules and templates (#4148 ) Supports using `` for `-rule` and `-rule.templates`: `dir//*.tpl` loads contents of dir and all subdirectories recursively. See: #4041 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> Co-authored-by: Artem Navoiev <tenmozes@gmail.com> Co-authored-by: Nikolay <nik@victoriametrics.com>	2023-04-26 19:20:22 +02:00
Nikolay	5ee344824f	lib/promscrape: adds filter for consul_sd_configs: (#4184 ) * lib/promscrape: adds filter for consul_sd_configs: it allows advanced filtering for consul service discovery requests https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4183 * typo fix * removes deprecation mentions since it's not relevant * Update docs/CHANGELOG.md Co-authored-by: Roman Khavronenko <roman@victoriametrics.com> --------- Co-authored-by: Roman Khavronenko <roman@victoriametrics.com>	2023-04-26 19:16:27 +02:00
Zakhar Bessarab	89a1c941c2	app/vmalert: return an error when using `query` function in `-external.alert.source` flag (#4191 ) Templating of `-external.alert.source` is not expected to have access to the query which was causing runtime error when query function was passed as nil. See: #4181 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2023-04-26 15:31:14 +02:00
Alexander Marshalov	041e188df8	added `default_url` field in vmauth users config (#4084 ) (#4156 ) * added default url field in vmauth users config (#4084) --------- Signed-off-by: Alexander Marshalov <_@marshalov.org>	2023-04-26 11:04:35 +02:00
Yury Molodov	4f3f9950d0	vmui: add metric relabel debug (#3889 ) * feat: add metric relabel debug (#3807) * fix: add link to relabeling cookbook * lib/promrelabel: merge, fix conflicts * lib/promrelabel: fix diff * docs/vmui: add metric relabel playground --------- Co-authored-by: dmitryk-dk <kozlovdmitriyy@gmail.com>	2023-04-26 11:53:29 +03:00
Alexander Marshalov	45a551df9c	changelog for issue #4083 (#4197 ) Signed-off-by: Alexander Marshalov <_@marshalov.org>	2023-04-26 10:50:44 +02:00
Yury Molodov	cf567badcf	vmui: display heatmap in the `Explore Metrics` (#4124 ) * feat: display heatmap in the explore metrics (#4111) * fix: correct calc step for heatmap * fix: remove spaces in the result of getDurationFromMilliseconds	2023-04-25 13:14:57 +03:00
Yury Molodov	a80b0aebe8	vmui: add a comparison of data to the `Cardinality Explorer` (#4123 ) * feat: add button "show today" to date picker * feat: add comparison with the prev day (#3967) * vmui/docs: add comparison of data to cardinality page	2023-04-25 12:21:57 +03:00
Yury Molodov	752895d1ee	docs/vmui: fix CHANGELOG.md about WITH templates (#4194 )	2023-04-25 12:05:14 +03:00
Yury Molodov	68e31a6000	vmui: Integrate WITH template playground (#3831 ) * feat: add WithTemplate page * app/vmselect/prometheus: enable json mode for expand with expr API * app/vmselect/prometheus: enable CORS and add content type * feat: add api for expand with templates * fix: remove console from useExpandWithExprs * app/vmselect/prometheus: fix escaping * vmui: integrate WITH template * app/vmctl: check content type instead of form param * fix: add content-type for fetch with-exprs * fix: add a header to the server's response that allows the "Content-Type" header * app/vmctl: added comment and cleanup * app/vmctl: use format query param --------- Co-authored-by: dmitryk-dk <kozlovdmitriyy@gmail.com>	2023-04-25 11:40:01 +03:00
Dmytro Kozlov	e2053baf32	app/vmctl: add support for the different time format in the native binary protocol (#4189 ) * app/vmctl: add support for the different time format in the native binary protocol * app/vmctl: update flag description, update CHANGELOG.md * app/vmctl: add comment to exported function	2023-04-24 18:33:30 +02:00
Roman Khavronenko	77f76371d0	http server: limit max concurrent requests (#4185 ) * lib/httpserver: introduce `-http.maxConcurrentRequests` command-line flag Introduce `-http.maxConcurrentRequests` command-line flag to protect VM components from resource exhaustion during unexpected spikes of HTTP requests. By default, the new flag's value is set to 0 which means no limits are applied. Signed-off-by: hagen1778 <roman@victoriametrics.com> * lib/httpserver: mention http.maxConcurrentRequests in docs Signed-off-by: hagen1778 <roman@victoriametrics.com> --------- Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-04-24 14:52:06 +02:00
Yury Molodov	05ab34f2c8	vmui: fix freeze when query regular with heatmap query (#4093 ) * fix: fix freeze when query regular with heatmap query * vmui/docs: fix freeze when query regular with heatmap query	2023-04-21 11:59:09 +03:00
Yury Molodov	3140aa34de	vmui: fix bug where tenant list was not displayed (#4162 ) * fix: modify the condition for querying tenants * fix: change getTenantIdFromUrl output to string	2023-04-21 11:56:08 +03:00
Artem Navoiev	87b925afb2	move note about opensource of graphite in v1.90 release note Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2023-04-19 16:34:00 +02:00
Roman Khavronenko	de61a73c63	vmalert: retry datasource requests with EOF or unexpected EOF errors (#4146 ) * vmalert: retry datasource requests with EOF or unexpected EOF errors Retry failed read request on the closed connection one more time. This may improve rules execution reliability when connection between vmalert and datasource closes unexpectedly. Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: fix old tests Signed-off-by: hagen1778 <roman@victoriametrics.com> --------- Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-04-19 10:18:32 +02:00
Zakhar Bessarab	472fe3fd03	lib/httpserver: add handler to serve `/robots.txt` and deny search indexing (#4143 ) This handler will instruct search engines that indexing is not allowed for the content exposed to the internet. This should help to address issues like #4128 when instances are exposed to the internet without authentication.	2023-04-18 16:47:26 +04:00
Artem Navoiev	413701454c	add graphite render api opensource to changelog Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2023-04-18 11:09:02 +02:00
Aliaksandr Valialkin	4b43c91f8c	vendor: update github.com/VictoriaMetrics/metricsql from v0.56.1 to v0.56.2 This fixes panic when the duration in the query contains `M` suffix. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4120 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3589	2023-04-14 14:05:56 -07:00
Aliaksandr Valialkin	5e87e03409	docs/CHANGELOG.md: move the bugfix description into the correct place This is a follow-up for `2a5b9ff782` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4092	2023-04-13 23:49:53 -07:00
Aliaksandr Valialkin	9f8209d593	docs/CHANGELOG.md: run at least 4 background mergers on systems with less than 4 CPU cores This reduces the probability of sudden spike in the number of small parts when all the background mergers are busy with big merges.	2023-04-13 23:43:17 -07:00
Dmytro Kozlov	2a5b9ff782	app/vmctl: fix performance degradation, add flag to disable backoff policy (#4097 ) * app/vmctl: change api for getting metric names * app/vmctl: fix tests * app/vmctl: add flag to enable backoff policy, fix test, performance improvements * app/vmctl: use one http client * app/vmctl: made linter happy * app/vmctl: updated documentation and CHANGELOG.md * app/vmctl: cleanup * app/vmctl: rename flag * app/vmctl: cleanup * app/vmctl: fix comments * app/vmctl: fix metrics parser problem, improve tests	2023-04-14 09:34:54 +03:00
Aliaksandr Valialkin	e1211a1187	app/vmstorage: deprecate -bigMergeConcurrency command-line flag Improperly configured -bigMergeConcurrency command-line flag usually leads to uncontrolled growth of unmerged parts, which, in turn, increases CPU usage and query durations. So it is better deprecating this flag. In rare cases -smallMergeConcurrency command-line flag can be used instead for controlling the concurrency of background merges.	2023-04-13 20:40:24 -07:00
Aliaksandr Valialkin	90b876cd1e	app/vmbackupmanager: sync with enterprise-single-node branch after 41a54c775891c87e3d5ed59ff0769c869dd2fe71	2023-04-13 19:29:06 -07:00
Zakhar Bessarab	81f28f0f1f	lib/backup/actions: store metadata(creation and completion time) in backup files (#4117 ) This makes it easier to understand exact point in time which is included in this backup. Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2023-04-12 18:51:27 +02:00
Aliaksandr Valialkin	05d44525b7	docs/CHANGELOG.md: formatting fix	2023-04-06 19:04:13 -07:00
Aliaksandr Valialkin	28975067c6	docs/CHANGELOG.md: cut v1.90.0 release	2023-04-06 16:16:42 -07:00
Dmytro Kozlov	244c18fa38	app/vmctl: add multiple filters defined in `--vm-native-filter-match` flag to discovered metric names (#4063 ) * app/vmctl: add multiple filters defined in `--vm-native-filter-match` flag to discovered metric names * app/vmctl: fix comments * app/vmctl: move function buildMatchWithFilter to the correct place * app/vmctl: update CHANGELOG.md * app/vmctl: fix CI, remove error wrapping * app/vmctl: fix CI, simplify `Set()`	2023-04-06 15:06:52 -07:00
Aliaksandr Valialkin	ee80e71d17	docs/CHANGELOG.md: document the bugfix, which remove unneeded logger.Errorf() call during stream aggregation with the enabled deduplication This is a follow-up for `ff72ca14b9`	2023-04-06 15:00:42 -07:00
Aliaksandr Valialkin	44aad84a53	docs/CHANGELOG.md: document that VictoriaMetrics for Windows cannot delete snapshots See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/70#issuecomment-1491529183	2023-04-06 03:16:06 -07:00
Aliaksandr Valialkin	608d87273d	docs/CHANGELOG.md: document v1.79.12	2023-04-06 03:10:01 -07:00
Aliaksandr Valialkin	7a65329e65	docs/CHANGELOG.md: document v1.87.5	2023-04-06 00:44:07 -07:00
Aliaksandr Valialkin	5074cc672a	all: update Go builder from Go1.20.2 to Go1.20.3 See https://github.com/golang/go/issues?q=milestone%3AGo1.20.3+label%3ACherryPickApproved	2023-04-05 13:37:22 -07:00
Artem Navoiev	59102db4cf	update changelog Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2023-04-05 15:45:38 +02:00
Aliaksandr Valialkin	55b5276b70	docs/CHANGELOG.md: document `edb45d7fc1` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4013	2023-04-02 21:26:12 -07:00
Dmytro Kozlov	cc0427897c	lib/promscrape: fix the problem with scrape work duplicates when file_sd_config can't be read (#4027 ) * lib/promscrape: fix the problem with scrape work duplicates when file_sd_config can't be read * lib/promscrape: clarified comment * lib/promscrape: made better approach to handle a problem with growing []ScrapeWork on each error when loading config lib/promscrape: added CHANGELOG.md * Update docs/CHANGELOG.md --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-04-02 20:26:13 -07:00
Yury Molodov	42087518ba	vmui: tips for working with the graph and legend (#4045 ) * feat: add tips for working with the graph and legend * feat: add the ability to collapse the legend * vmui/docs: add the ability to collapse the legend --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-04-01 00:38:18 -07:00
Roman Khavronenko	27b958ba8b	lib/storage: check for free disk space before opening tables (#4035 ) * lib/storage: check for free disk space before opening tables We check for free disk space before call to `openTable`, so `Storage` can be set to ReadOnly before mergeWorkers start. Before the change, there was a chance that merges will start even if Storage has to start in ReadOnly mode because of `-storage.minFreeDiskSpaceBytes` limit. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4023 Signed-off-by: hagen1778 <roman@victoriametrics.com> * lib/storage: chore Signed-off-by: hagen1778 <roman@victoriametrics.com> * Update lib/storage/storage.go --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-03-31 23:50:27 -07:00
Aliaksandr Valialkin	ffdf430be0	app/vmselect/graphite: open source Graphite Render API	2023-03-31 23:25:04 -07:00
Aliaksandr Valialkin	cddfc4d3f8	deployment/docker: update base Docker image from Alpine 3.17.2 to Alpine 3.17.3 This fixes security issues from https://alpinelinux.org/posts/Alpine-3.17.3-released.html This is a follow-up for `59c350d0d2`	2023-03-31 22:46:27 -07:00
Aliaksandr Valialkin	d577657fb7	lib/streamaggr: follow-up for `ff72ca14b9` - Make sure that the last successfully loaded config is used on hot-reload failure - Properly cleanup resources occupied by already initialized aggregators when the current aggregator fails to be initialized - Expose distinct vmagent_streamaggr_config_reload* metrics per each -remoteWrite.streamAggr.config This should simplify monitoring and debugging failed reloads - Remove race condition at app/vminsert/common.MustStopStreamAggr when calling sa.MustStop() while sa could be in use at realoadSaConfig() - Remove lib/streamaggr.aggregator.hasState global variable, since it may negatively impact scalability on system with big number of CPU cores at hasState.Store(true) call inside aggregator.Push(). - Remove fine-grained aggregator reload - reload all the aggregators on config change instead. This simplifies the code a bit. The fine-grained aggregator reload may be returned back if there will be demand from real users for it. - Check -relabelConfig and -streamAggr.config files when single-node VictoriaMetrics runs with -dryRun flag - Return back accidentally removed changelog for v1.87.4 at docs/CHANGELOG.md Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3639	2023-03-31 22:30:38 -07:00
Roman Khavronenko	4a49577028	vmalert: use `missingkey=zero` for templating (#4040 ) Replace empty labels with "" instead of "<no value>" during templating, as Prometheus does. See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4012 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-03-30 16:57:00 +04:00
Zakhar Bessarab	ec45f1bc5f	lib/fs: verify response code when reading configuration over HTTP (#4036 ) Verifying status code helps to avoid misleading errors caused by attempt to parse unsuccessful response. Related issue: #4034 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2023-03-30 13:18:00 +02:00
Alexander Marshalov	ff72ca14b9	added hot reload support for stream aggregation configs (#3969 ) (#3970 ) added hot reload support for stream aggregation configs (#3969) Signed-off-by: Alexander Marshalov <_@marshalov.org>	2023-03-29 18:05:58 +02:00
Aliaksandr Valialkin	94cabf29b0	lib/flagutil: ArrayString: support commas inside quoted strings and inside `[]`, `{}` and `()` braces Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3915	2023-03-28 21:22:55 -07:00
Aliaksandr Valialkin	aea6df8197	app/vmagent/remotewrite: cosmetic updates after `f3a51e8b1d` - Compare directory names instead of paths to directory when determining which persistent queues must be deleted This is less error-prone solution, since paths to the same directory can differ, which could lead to accidental directory removal for the existing -remoteWrite.url - Log the `removed %d dangling queues` message when at least a single queue has been removed - Consistently use filepath.Join() for creating paths to persistent queues. This is needed for Windows support (see https://github.com/VictoriaMetrics/VictoriaMetrics/issues/70 ) - Clarify the description of the change at docs/CHANGELOG.md Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4014	2023-03-27 18:33:07 -07:00
Zakhar Bessarab	f3a51e8b1d	app/vmagent: add `-remoteWrite.removeDanglingQueues` flag (#4017 ) * app/vmagent: add `-remoteWrite.removeDanglingQueues` flag which allows to automatically remove dangling persistent queue contents Related issue: #4014 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * app/vmagent: address review feedback - remove persistent queues files by default - rename `remoteWrite.removeDanglingQueues` to `remoteWrite.keepDanglingQueues` - update docs to reflect changed behaviour Related issue: #4014 * Apply suggestions from code review --------- Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-03-27 18:15:28 -07:00
Aliaksandr Valialkin	02ee4ffd4d	app/vmselect/promql: follow-up for `79e1c6a6fc` - Document the fix at docs/CHANGELOG.md - Add tests with multiple adjancent zero buckets - Simplify the fix a bit Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/296 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/4021	2023-03-27 18:03:36 -07:00
Yury Molodov	3214b1c315	vmui: heatmap (#3780 ) * fix: add stroke and font for all axes * feat: add util for generate gradient * feat: add heatmap plugin * feat: add heatmap legend * feat: add heatmap graph (#3384) * vmui: add heatmap graph (#3384) * feat: add convert Prometheus to VictoriaMetrics histogram * fix: prevent re-render graph * feat: reset step for heatmap * feat: normalize heatmap data * fix: format heatmap legend * wip * app/vmselect/vmui: run `make vmui-update` --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-03-26 00:30:02 -07:00
Aliaksandr Valialkin	72a0b49330	docs/CHANGELOG.md: document v1.87.4 LTS release	2023-03-25 22:43:59 -07:00
Aliaksandr Valialkin	811f4a9380	app/{vmbackup,vmrestore}: publish vmbackup and vmrestore binaries for Windows Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/70	2023-03-25 15:08:21 -07:00
Aliaksandr Valialkin	e7f46a0aab	app/vmselect/promql: follow-up for `7205c79c5a` - Allocate and initialize seriesByWorkerID slice in a single go instead of initializing every item in the list separately. This should reduce CPU usage a bit. - Properly set anti-false sharing padding at timeseriesWithPadding structure - Document the change at docs/CHANGELOG.md Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3966	2023-03-24 23:34:37 -07:00
Zakhar Bessarab	5ba347bd2c	app/vmbackup: delete created snapshot in case of error during backup (#4008 ) Related issue: #2055 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-03-24 21:49:58 -07:00
Aliaksandr Valialkin	27f9a1eda2	docs/CHANGELOG.md: cosmetic fixes: remove trailing whitespace and consistently use `-flag` instead of `--flag`	2023-03-24 15:44:33 -07:00
Alexander Marshalov	7c86dcc4fa	allowed using dashes and dots in environment variables names (#4009 ) * allowed using dashes and dots in environment variables names for templating config files with envtemplate (#3999) Signed-off-by: Alexander Marshalov <_@marshalov.org> * Apply suggestions from code review --------- Signed-off-by: Alexander Marshalov <_@marshalov.org> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-03-24 15:43:05 -07:00
Aliaksandr Valialkin	c1d871a45a	docs/vmauth.md: follow-up for `36edba9bfb` - Document `-configCheckInterval` command-line flag in `quick start` section - Clarify the addition of `-configCheckInterval` at docs/CHANGELOG.md Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3990	2023-03-24 13:22:37 -07:00
Dmytro Kozlov	ba505dd357	docs: follow up after `dc2c712a29` (#4001 )	2023-03-23 18:27:55 +01:00
Yury Molodov	023c65968f	vmui: display errors for each query individually (#3987 ) (#3994 )	2023-03-23 13:10:59 +01:00
Alexander Marshalov	36edba9bfb	added configCheckInterval flag for vmauth (#3990 ) (#3991 ) * added configCheckInterval flag for vmauth (#3990) Signed-off-by: Alexander Marshalov <_@marshalov.org>	2023-03-23 09:34:12 +01:00
Nikolay	a2f716b6cc	lib/netutil: log only parsing errors for proxy-protocol (#3985 ) * lib/netutil: log only parsing errors for proxy-protocol Previosly every error was logged. With configured TCP health checks at load-balancer or kubernetes, vmauth spams a lot of false positive error message into logs * Update docs/CHANGELOG.md Co-authored-by: Roman Khavronenko <roman@victoriametrics.com> * Update lib/netutil/tcplistener.go Co-authored-by: Roman Khavronenko <roman@victoriametrics.com> --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com> Co-authored-by: Roman Khavronenko <roman@victoriametrics.com>	2023-03-21 10:22:39 -07:00
Dmytro Kozlov	e79cd24807	lib/promrelabel: make target url from labels on target relabel page (#3882 ) * lib/promrelabel: make target url from labels on target relabel page * wip --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-03-20 22:07:52 -07:00
Aliaksandr Valialkin	8d709f3483	docs/CHANGELOG.md: cosmetic fixes	2023-03-20 14:14:20 -07:00
Dmytro Kozlov	8da9502df6	app/vmctl: automatically check tty (#3938 ) app/vmctl: automatically detect if TTY is available	2023-03-20 11:16:08 +01:00
Yury Molodov	d4525bd2d0	vmui: support for drag'n'drop in the "Trace analyzer" page (#3971 ) vmui: add drag-and-drop support for the trace analyzer page	2023-03-20 11:07:18 +01:00
Yury Molodov	a2af2e5a1b	vmui: improve usability of date/time picker (#3968 ) * vmui: allow manually set input date and time * vmui/docs: improve usability of date/time picker	2023-03-20 09:22:49 +01:00
Aliaksandr Valialkin	43b24164ef	all: add Windows build for VictoriaMetrics This commit changes background merge algorithm, so it becomes compatible with Windows file semantics. The previous algorithm for background merge: 1. Merge source parts into a destination part inside tmp directory. 2. Create a file in txn directory with instructions on how to atomically swap source parts with the destination part. 3. Perform instructions from the file. 4. Delete the file with instructions. This algorithm guarantees that either source parts or destination part is visible in the partition after unclean shutdown at any step above, since the remaining files with instructions is replayed on the next restart, after that the remaining contents of the tmp directory is deleted. Unfortunately this algorithm doesn't work under Windows because it disallows removing and moving files, which are in use. So the new algorithm for background merge has been implemented: 1. Merge source parts into a destination part inside the partition directory itself. E.g. now the partition directory may contain both complete and incomplete parts. 2. Atomically update the parts.json file with the new list of parts after the merge, e.g. remove the source parts from the list and add the destination part to the list before storing it to parts.json file. 3. Remove the source parts from disk when they are no longer used. This algorithm guarantees that either source parts or destination part is visible in the partition after unclean shutdown at any step above, since incomplete partitions from step 1 or old source parts from step 3 are removed on the next startup by inspecting parts.json file. This algorithm should work under Windows, since it doesn't remove or move files in use. This algorithm has also the following benefits: - It should work better for NFS. - It fits object storage semantics. The new algorithm changes data storage format, so it is impossible to downgrade to the previous versions of VictoriaMetrics after upgrading to this algorithm. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3236 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3821 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/70	2023-03-19 01:36:51 -07:00
Aliaksandr Valialkin	6460475e3b	lib/{mergeset,storage}: prevent from long wait time when creating a snapshot under high data ingestion rate Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3551 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/3873	2023-03-19 00:15:30 -07:00
Nikolay	91cbb9063d	Vmagent kafka updates (#535 ) * app/vmagent: allow vm proto for kafka consumer and producer it should reduce network usage up to 50%. According to benchmarks without any encoding at kafka topic, it reduces traffic up to 50%. With enabled zstd at kafka topic, it shows no diffence in traffic. So it doesn't make much sense to use it. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1225 * mention eb61a7dd68b834b08d01727a918f207700348ada at changelog * app/vmagent: bumps kafka lib version it allows compiling vmagent for arm64 machines fixes https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2271 * mention d19b1a888248c96cfd7ccee00ba6f596d89be1d7 at change log * app/vmagent: adds natural concurrency for kafka consumer it should improve performance for data consumption https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1957 * mention change 0c143bb22ca2e7e0b7eec9bc84a94ee2b41626ca * Update app/vmagent/kafka/consumer.go Co-authored-by: Roman Khavronenko <roman@victoriametrics.com> * Update app/vmagent/kafka/consumer_cgo.go Co-authored-by: Roman Khavronenko <roman@victoriametrics.com> --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com> Co-authored-by: Roman Khavronenko <roman@victoriametrics.com>	2023-03-15 13:03:44 -07:00
Zakhar Bessarab	6a5d236245	lib/storage: log original labels set when label value is truncated (#3952 ) lib/storage: log original labels set when label value is truncated	2023-03-14 10:59:40 +01:00
Aliaksandr Valialkin	3e7bfe1200	docs/CHANGELOG.md: document v1.87.3	2023-03-13 00:20:51 -07:00
Aliaksandr Valialkin	02ffe05750	docs/CHANGELOG.md: document v1.79.11 LTS release	2023-03-12 23:22:53 -07:00
Aliaksandr Valialkin	388d6ee16e	docs/CHANGELOG.md: cut v1.89.1	2023-03-12 19:14:19 -07:00
Aliaksandr Valialkin	e8225d7d6b	app/vmselect/promql: prevent from `cannot unmarshal timeseries from rollupResultCache` panic after the upgrade to v1.89.0 The issue has been introduced in `0af9e2b693`	2023-03-12 19:09:39 -07:00
Aliaksandr Valialkin	911bab4f6a	docs/CHANGELOG.md: cut v1.89.0	2023-03-12 17:29:44 -07:00
Aliaksandr Valialkin	468de76e9a	app/vmselect: remove data race on updating EvalConfig.IsPartialResponse from concurrently running goroutines This properly returns `is_partial: true` for partial responses.	2023-03-12 16:54:08 -07:00
Aliaksandr Valialkin	0af9e2b693	app/vmselect/promql: prevent from SIGBUS crash on architecures, which deny unaligned access to 8-byte words (e.g. ARM) Thanks to @oliverpool for nailing down the root cause of the issue and for the initial attempt to fix it at https://github.com/VictoriaMetrics/VictoriaMetrics/pull/3927	2023-03-12 16:32:08 -07:00
Yury Molodov	01367faa39	vmui: remove send step param for instant queries (#3931 ) * fix: remove step param for instant queries (#3896) * vmui: remove send step param for instant queries * Update docs/CHANGELOG.md --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-03-12 03:09:56 -07:00
Aliaksandr Valialkin	a52413ce0a	docs/CHANGELOG.md: document `113a89904d`	2023-03-12 01:58:18 -08:00
Aliaksandr Valialkin	b19de3fa12	docs/CHANGELOG.md: yet another typo fix	2023-03-12 01:06:40 -08:00
Aliaksandr Valialkin	2f1d24fccf	docs/CHANGELOG.md: typo fix	2023-03-12 01:04:14 -08:00
Aliaksandr Valialkin	b5db69fe05	app/vmselect/netstorage: do not intern string representation of MetricName for time series received from vmstorage It has been appeared that this interning may lead to increased memory usage and increased CPU usage when vmselect performs queries, which select big number of time series. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3692 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3863	2023-03-12 00:52:35 -08:00
Aliaksandr Valialkin	babc9e9815	docs/CHANGELOG.md: document `927d9da270`	2023-03-12 00:25:00 -08:00
Aliaksandr Valialkin	e3488c6cbc	docs/CHANGELOG.md: typo fixes	2023-03-12 00:09:26 -08:00
Aliaksandr Valialkin	48e32b325e	docs/CHANGELOG.md: document c9f44daaee8f4282d9ed41e3ba799c7a33841313	2023-03-11 23:55:13 -08:00
Roman Khavronenko	856c2db144	vmalert: support concurrent reading from object storage (#532 ) * vmalert: support concurrent reading from object storage Config reading from GCS or S3 can be slow if object storage contains a big number of files. Object storages are usually fast for downloading and are slow for individual operations. If there would be thousands of files to read, vmalert could spend significant time for retrieving those because it is done sequentially. The change introduces ability to read configs from object storage concurrently. By default, both GCS and S3 are now read with 50 concurrent readers. This significantly reduces the load time: * loading 500 files with concurrency=1 takes 27s * loading 500 files with concurrency=50 takes <1s * vmalert: add note to Changelog * vmalert: cleanup * vmalert: use ticker properly * app/vmalert: improve status reporting during config loading * vmalert: support concurrent reading from object storage Config reading from GCS or S3 can be slow if object storage contains a big number of files. Object storages are usually fast for downloading and are slow for individual operations. If there would be thousands of files to read, vmalert could spend significant time for retrieving those because it is done sequentially. The change introduces ability to read configs from object storage concurrently. By default, both GCS and S3 are now read with 50 concurrent readers. This significantly reduces the load time: * loading 500 files with concurrency=1 takes 27s * loading 500 files with concurrency=50 takes <1s * app/vmalert: make linter happy	2023-03-11 23:51:23 -08:00
Dmytro Kozlov	3c9058c168	app/vmctl: add support of basic auth and barer token (#3921 ) app/vmctl: add support of basic auth and bearer token	2023-03-09 14:53:29 +01:00
Roman Khavronenko	d66bae212b	app/vmalert: log number of configration files found for each specified `-rule` (#3936 ) The change also introduces `List` method to `FS` interface. The `List` method can be used for wildcard support in object storage FS. Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: Nikolay <nik@victoriametrics.com>	2023-03-09 14:46:19 +01:00
Dmytro Kozlov	7f54c181bb	app/vmctl: follow up after `09e3742a82` (#3937 ) app/vmctl: follow up after `09e3742a82`	2023-03-09 13:28:55 +01:00
Roman Khavronenko	3de7fc5c71	security: bump go version to 1.20.2 (#3935 ) upgrade Go builder from Go1.20.1 to Go1.20.2 See the list of issues addressed in Go1.20.2 here (https://github.com/golang/go/issues?q=milestone%3AGo1.20.2+label%3ACherryPickApproved). Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-03-09 13:20:54 +01:00
Aliaksandr Valialkin	1b5dc9f91d	all: follow-up for `7a3e16e774` - Sync the description for -httpListenAddr.useProxyProtocol command-line flag at vmagent and vmauth, so it is consistent with the description at vmauth and victoria-metrics - Add a sample of panic text to docs/CHANGELOG.md, so it could be googled - Mention the -httpListenAddr.useProxyProtocol command-line flag in the description for the bugfix Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3335	2023-03-08 01:26:55 -08:00
Aliaksandr Valialkin	ed73317622	docs/CHANGELOG.md: improve description for `4b136abff8`	2023-03-08 01:05:33 -08:00
Aliaksandr Valialkin	70b831e684	docs/CHANGELOG.md: improve the description of the bugfix at `62beea23f7` - Make the description easier to read by humans :) - Add a link to VictoriaMetrics datasource plugin for Grafana, so users could easily discover it	2023-03-08 00:59:42 -08:00
Aliaksandr Valialkin	6e8e64f695	docs/CHANGELOG.md: clarify the description for `6bfe9cc733` - Add the panic message to the description, so it is easier to google - Add a link to the corresponding bugreport Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3897	2023-03-08 00:39:34 -08:00
Aliaksandr Valialkin	884e58d58d	docs/CHANGELOG.md: clarify the description for the change at `8bab50dc29` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3600	2023-03-08 00:23:35 -08:00
Aliaksandr Valialkin	1bb529e23e	app/vmagent/remotewrite: follow-up for e3a756d82869f8c357b072f6e635ebfc7d65dd2c - Document the fix - Move the detection of VictoriaMetrics remoteWrite protocol from client.init() to newHTTPClient() This simplifies the fix to the following diff: diff --git a/app/vmagent/remotewrite/client.go b/app/vmagent/remotewrite/client.go index 099899c19..70b904af4 100644 --- a/app/vmagent/remotewrite/client.go +++ b/app/vmagent/remotewrite/client.go @@ -151,10 +151,6 @@ func newHTTPClient(argIdx int, remoteWriteURL, sanitizedURL string, fq persiste } c.sendBlock = c.sendBlockHTTP - return c -} - -func (c client) init(argIdx, concurrency int, sanitizedURL string) { useVMProto := forceVMProto.GetOptionalArg(argIdx) usePromProto := forcePromProto.GetOptionalArg(argIdx) if useVMProto && usePromProto { @@ -173,6 +169,10 @@ func (c client) init(argIdx, concurrency int, sanitizedURL string) { } c.useVMProto = useVMProto + return c +} + +func (c client) init(argIdx, concurrency int, sanitizedURL string) {	2023-03-07 23:54:24 -08:00
Aliaksandr Valialkin	202083f38c	docs/CHANGELOG.md: document `ec2abf9b69`	2023-03-07 23:33:19 -08:00
Nikolay	7a3e16e774	lib/netutil: fixes panic at proxy protocol (#3905 ) it may occur if non proxy protocol message received by tcp server. Listener Accept method must return only non-recoverable errors. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3335	2023-03-07 08:50:18 -08:00
Yury Molodov	bbf8e459a0	vmui: fix display of selected value in the selector (#3919 ) vmui: fix selected value in dropdowns for Explore page	2023-03-07 16:23:02 +01:00
Dmytro Kozlov	cc5b916237	docs: follow up after `4b136abff8` (#3918 ) docs: follow up after `4b136abff8`	2023-03-06 12:41:48 +01:00
Roman Khavronenko	95dc65e7b3	docs: follow-up after `62beea23f7` (#3907 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-03-03 22:45:19 +01:00
Nikolay	6bfe9cc733	lib{mergset,storage}: prevent possible race condition with logging st… (#3900 ) lib{mergset,storage}: prevent possible race condition with logging stats for merges Previously partwrapper could be release by background process and reference for part may be invalid during logging stats. It will lead to panic at vmstorage https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3897	2023-03-03 12:33:42 +01:00
Dmytro Kozlov	8bab50dc29	app/vmctl: add backoff retries to native protocol (#3859 ) app/vmctl: vm-native - split migration on per-metric basis `vm-native` mode now splits the migration process on per-metric basis. This allows to migrate metrics one-by-one according to the specified filter. This change allows to retry export/import requests for a specific metric and provides a better understanding of the migration progress. --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2023-03-02 13:19:45 +01:00
Aliaksandr Valialkin	95e1173423	docs/CHANGELOG.md: document v1.79.10 release	2023-02-27 17:35:48 -08:00
Aliaksandr Valialkin	8288e327ee	docs/CHANGELOG.md: cut v1.88.1	2023-02-27 15:28:18 -08:00
Aliaksandr Valialkin	dfe3939665	docs/CHANGELOG.md: link to the issue, which may benefit from `-internStringDisableCache` command-line flag Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3863 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3692	2023-02-27 14:55:40 -08:00
Aliaksandr Valialkin	46127b432d	lib/bytesutil: add `-internStringDisableCache` and `-internStringCacheExpireDuration` command-line flags This commit is based on https://github.com/VictoriaMetrics/VictoriaMetrics/pull/3872	2023-02-27 14:16:49 -08:00
Aliaksandr Valialkin	0d3f31f60e	lib/storage: follow-up for `39cdc546dd` - Use flag.Duration instead of flagutil.Duration for -snapshotCreateTimeout, since the flagutil.Duration is intended mostly for big durations, e.g. days, months and years, while the -snapshotCreateTimeout is usually smaller than one hour. - Add links to https://docs.victoriametrics.com/#how-to-work-with-snapshots in docs/CHANGELOG.md, so readers could easily find the corresponding docs when reading the changelog. - Properly remove all the created directories on unsuccessful attempt to create snapshot in Storage.CreateSnapshot(). Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3551	2023-02-27 13:07:38 -08:00
Zakhar Bessarab	39cdc546dd	lib/storage: enhancements for snapshots process (#3873 ) * lib/{fs,mergeset,storage}: skip `.must-remove.` dirs when creating snapshot (#3858) * lib/{mergeset,storage}: add timeout configuration for snapshots creation, remove incomplete snapshots from storage * docs: fix formatting * app/vmstorage: add metrics to track status of snapshots * app/vmstorage: use `vm_http_requests_total` metric for snapshot endpoints metrics, rename new flag to make name more clear Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * app/vmstorage: update flag name in docs Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * app/vmstorage: reflect new metrics names change in docs Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> --------- Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-02-27 12:12:03 -08:00
Zakhar Bessarab	5fadd58cf6	lib/promscrape: correctly register `vm_promscrape_config_` metrics (#3876 ) lib/promscrape: set `vm_promscrape_config_last_reload_successful` to 1 if there was no promscrape config provided Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * lib/promscrape: register `vm_promscrape_config_*` metrics only in case promscrape config is used Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> --------- Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-02-27 11:53:53 -08:00
Zakhar Bessarab	ac6d937372	doc: add changelog reference for vmgateway OpenID discovery (#3877 ) * doc: add changelog reference for vmgateway OpenID discovery * doc: add vmgateway docs --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-02-27 11:50:22 -08:00
Aliaksandr Valialkin	b3bb18d674	app/vmselect/promql: fix panic when calculating `aggr_func(rollup*())` The panic has been introduced in `dac21d874b`	2023-02-27 11:48:27 -08:00
Dmytro Kozlov	27c9446520	app/vmctl: skip series if measurement not found (#3869 ) app/vmctl: skip measurements with no fields for influxdb mode	2023-02-27 14:28:47 +01:00
Aliaksandr Valialkin	ffa327d6d1	app/vmagent: use the provided auth options when checking whether the remote storage supports VictoriaMetrics remote write protocol Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3847 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1225	2023-02-26 12:07:47 -08:00
Aliaksandr Valialkin	2d36dbcfa9	docs/CHANGELOG.md: cut v1.88.0	2023-02-24 17:54:21 -08:00
Roman Khavronenko	e1c3267e34	vmselect/promql: check for deadline in `count_values` fn (#3806 ) * vmselect/promql: check for deadline in `count_values` fn `count_values` could be very slow during the data processing. Checking for deadline between iterations supposed to reduce probability of exceeding `search.maxQueryDuration`. The change also adds a new trace record, which captures the time spent in aggregation function. Before that, the trace for aggr funcs could be confusing since it doesn't account for all the places where time was spent. Signed-off-by: hagen1778 <roman@victoriametrics.com> * wip --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-02-24 16:59:26 -08:00
Aliaksandr Valialkin	c33cc4322c	docs/CHANGELOG.md: document v1.87.2 release	2023-02-24 16:14:28 -08:00
Aliaksandr Valialkin	6a88d5402d	docs/CHANGELOG.md: document v1.79.9 release	2023-02-24 15:10:48 -08:00
Roman Khavronenko	dac21d874b	metricsql: support optional 2nd argument for rollup functions (#3841 ) * metricsql: support optional 2nd argument for rollup functions Support optional 2nd argument `min`, `max` or `avg` for rollup functions: * rollup * rollup_delta * rollup_deriv * rollup_increase * rollup_rate * rollup_scrape_interval If second argument is passed, then rollup function will return only the selected aggregation type. This change can be useful for situations where only one type of rollup calculation is needed. For example, `rollup_rate(requests_total[5m], "max")`. Signed-off-by: hagen1778 <roman@victoriametrics.com> * wip --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-02-24 13:47:52 -08:00
Aliaksandr Valialkin	87aeeec3e8	docs/CHANGELOG.md: document `d8eaa511b0`	2023-02-24 12:42:02 -08:00
Aliaksandr Valialkin	a2340e6c95	docs/CHANGELOG.md: typo fix: `scrape scrape` -> `scrape`	2023-02-24 12:33:29 -08:00
Aliaksandr Valialkin	c6ad3692ad	lib/promscrape: follow-up for `43e104a83f` - Return immediately on context cancel during the backoff sleep. This should help with https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3747 - Add a comment describing why the second attempt to obtain the response from remote side is perfromed immediately after the first attempt. - Remove fasthttp dependency from lib/promscrape/discoveryutils - Set context deadline before calling doRequestWithPossibleRetry(). This simplifies the doRequestWithPossibleRetry() a bit. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3293	2023-02-24 12:20:42 -08:00

... 3 4 5 6 7 ...

1660 Commits