VictoriaMetrics

mirror of https://github.com/VictoriaMetrics/VictoriaMetrics.git synced 2024-11-23 20:37:12 +01:00

Author	SHA1	Message	Date
Roman Khavronenko	c32a01c52e	docs: follow-up after `aec4b5db81` (#4638 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-07-19 10:10:51 +02:00
Roman Khavronenko	25317b4e70	vmalert: follow-up after `d4ac4b7813` (#4659 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-07-18 15:53:37 +02:00
venkatbvc	d4ac4b7813	vmalert: allow to blackhole alerting notifications (#4639 ) vmalert: support option to blackhole alerting notifications https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4122 --------- Co-authored-by: Rao, B V Chalapathi <b_v_chalapathi.rao@nokia.com>	2023-07-18 15:06:19 +02:00
Yury Molodov	7d1495056a	vmui: add tip to Explore Metrics page (#4615 ) * feat: add tip to Explore Metrics page (#4248) * fix: update description page	2023-07-18 11:44:31 +02:00
Aliaksandr Valialkin	8815080030	app/vmselect/promql: add the ability to copy all the labels from `one` side of group_left()/group_right() operation This is performed by specifying `*` inside group_left()/group_right(). Also allow specifying prefix for the copied labels via `group_left(...) prefix "..."` and `group_right(...) prefix "..."` syntax. For example, the following query adds all the namespace-related labels to pod info, and prefixes all the copied label names with "ns_" prefix: kube_pod_info * on(namespace) group_left(*) prefix "ns_" kube_namespace_labels This resolves the following StackOverflow questions: - https://stackoverflow.com/questions/76661818/how-to-add-namespace-labels-to-pod-labels-in-prometheus - https://stackoverflow.com/questions/76653997/how-can-i-make-a-new-copy-of-kube-namespace-labels-metric-with-a-different-name	2023-07-17 19:07:39 -07:00
Aliaksandr Valialkin	be31bdc88c	app/vmselect/promql: recommend to use `(a op b) keep_metric_names` instead of `a op b keep_metric_names` The `a op b keep_metric_names` is ambiguous with `a op (b keep_metric_names)` when `b` is a transform or rollup function. For example, `a + rate(b) keep_metric_names`. So it is better to use the clearer syntax: `(a op b) keep_metric_names` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3710	2023-07-16 23:46:34 -07:00
Zakhar Bessarab	e2367b6d1c	metricsql: add support of using keep_metric_names for binary operations (#4109 ) * metricsql: add support of using keep_metric_names for binary operations This should help to avoid confusion with queries like one in the issue #3710. Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * wip --------- Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-07-16 03:00:39 -07:00
Aliaksandr Valialkin	4cb024d8a3	all: add support for `or` filters in series selectors This commit adds ability to select series matching distinct filters via a single series selector. For example, the following selector selects series with either {env="prod",job="a"} or {env="dev",job="b"} labels: {env="prod",job="a" or env="dev",job="b"} The `or` filter is supported in all the VictoriaMetrics tools now. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3997 Uses https://github.com/VictoriaMetrics/metricsql/pull/14	2023-07-16 00:06:33 -07:00
Aliaksandr Valialkin	a47932f4a2	all: replace ElasticSearch -> Elasticsearch for the sake of consistency This is a follow-up for `7f6b5dc47b`	2023-07-14 10:52:13 -07:00
Haleygo	b002e2a743	vmalert: fix evalTS after modify group interval (#4629 )	2023-07-14 14:45:24 +02:00
Roman Khavronenko	adfc6539ca	vmctl: mention replicationFactor during migration (#4633 ) Addresses https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4624 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-07-14 10:52:39 +02:00
dependabot[bot]	9baecdcd33	build(deps): bump tough-cookie in /app/vmui/packages/vmui (#4603 ) Bumps [tough-cookie](https://github.com/salesforce/tough-cookie) from 4.1.2 to 4.1.3. - [Release notes](https://github.com/salesforce/tough-cookie/releases) - [Changelog](https://github.com/salesforce/tough-cookie/blob/master/CHANGELOG.md) - [Commits](https://github.com/salesforce/tough-cookie/compare/v4.1.2...v4.1.3) --- updated-dependencies: - dependency-name: tough-cookie dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2023-07-13 22:22:21 -07:00
Aliaksandr Valialkin	7094fa38bc	lib/storage: switch from global to per-day index for `MetricName -> TSID` mapping Previously all the newly ingested time series were registered in global `MetricName -> TSID` index. This index was used during data ingestion for locating the TSID (internal series id) for the given canonical metric name (the canonical metric name consists of metric name plus all its labels sorted by label names). The `MetricName -> TSID` index is stored on disk in order to make sure that the data isn't lost on VictoriaMetrics restart or unclean shutdown. The lookup in this index is relatively slow, since VictoriaMetrics needs to read the corresponding data block from disk, unpack it, put the unpacked block into `indexdb/dataBlocks` cache, and then search for the given `MetricName -> TSID` entry there. So VictoriaMetrics uses in-memory cache for speeding up the lookup for active time series. This cache is named `storage/tsid`. If this cache capacity is enough for all the currently ingested active time series, then VictoriaMetrics works fast, since it doesn't need to read the data from disk. VictoriaMetrics starts reading data from `MetricName -> TSID` on-disk index in the following cases: - If `storage/tsid` cache capacity isn't enough for active time series. Then just increase available memory for VictoriaMetrics or reduce the number of active time series ingested into VictoriaMetrics. - If a new time series is ingested into VictoriaMetrics. In this case it cannot find the needed entry in the `storage/tsid` cache, so it needs to consult on-disk `MetricName -> TSID` index, since it doesn't know that the index has no corresponding entry either. This is a typical event under high churn rate, when old time series are constantly substituted with new time series. 
Reading the data from `MetricName -> TSID` index is slow, so inserts, which lead to reading this index, are counted as slow inserts, and they can be monitored via `vm_slow_row_inserts_total` metric exposed by VictoriaMetrics. Prior to this commit the `MetricName -> TSID` index was global, i.e. it contained entries sorted by `MetricName` for all the time series ever ingested into VictoriaMetrics during the configured -retentionPeriod. This index can become very large under high churn rate and long retention. VictoriaMetrics caches data from this index in `indexdb/dataBlocks` in-memory cache for speeding up index lookups. The `indexdb/dataBlocks` cache may occupy significant share of available memory for storing recently accessed blocks at `MetricName -> TSID` index when searching for newly ingested time series. This commit switches from global `MetricName -> TSID` index to per-day index. This allows significantly reducing the amounts of data, which needs to be cached in `indexdb/dataBlocks`, since now VictoriaMetrics consults only the index for the current day when a new time series is ingested into it. The downside of this change is increased indexdb size on disk for workloads without high churn rate, e.g. with static time series, which do not change over time, since now VictoriaMetrics needs to store identical `MetricName -> TSID` entries for static time series for every day. This change removes an optimization for reducing CPU and disk IO spikes at indexdb rotation, since it didn't work correctly - see https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1401 . 
At the same time the change fixes the issue, which could result in lost access to time series, which stop receiving new samples during the first hour after indexdb rotation - see https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2698 The issue with the increased CPU and disk IO usage during indexdb rotation will be addressed in a separate commit according to https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1401#issuecomment-1553488685 This is a follow-up for `1f28b46ae9`	2023-07-13 16:07:30 -07:00
Aliaksandr Valialkin	4f6dc25c71	app/vmalert: silence golangci-lint at TestAlertingRule_Template Add a break if gotAlert is nil This removes the following golangci-lint warning: app/vmalert/alerting_test.go:868:8: SA5011(related information): this check suggests that the pointer can be nil (staticcheck) if gotAlert == nil { ^	2023-07-13 11:40:44 -07:00
Dmytro Kozlov	c76084b529	app/vmctl: fix panic `--remote-read-filter-time-start` flag not defined (#4605 ) * app/vmctl: fix panic `--remote-read-filter-time-start` flag not defined * app/vmctl: update CHANGELOG.md --------- Co-authored-by: Nikolay <nik@victoriametrics.com>	2023-07-13 17:14:43 +02:00
Yury Molodov	3bb3ad9991	feat: add link to MetricsQL (#4562 ) (#4608 )	2023-07-13 17:13:54 +02:00
Roman Khavronenko	cbc28ccdb2	vmalert: check for negative offset for missed rounds (#4628 ) With low evaluation intervals and irregular delays during execution, the evaluation time could get a negative offset. This could result in a cumulative discrepancy between the actual time and the evaluation time for rules. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-07-13 17:11:22 +02:00
Aliaksandr Valialkin	30cdcc751d	all: update Go builder from 1.20.5 to 1.20.6 See https://github.com/golang/go/issues?q=milestone%3AGo1.20.6+label%3ACherryPickApproved	2023-07-12 00:59:59 -07:00
Dmytro Kozlov	a0a76d7b48	app/vmctl: remove undefined flag from the documentation. See: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4552 . (#4606 )	2023-07-10 19:20:31 +04:00
Aliaksandr Valialkin	a360fd5f71	app/{vmselect,vlselect}: run `make vmui-update vmui-logs-update`	2023-07-09 12:43:48 -07:00
Zakhar Bessarab	51a9cc9783	docs: make `httpAuth.` flags description less ambiguous (#4588 ) docs: make `httpAuth.` flags description less ambiguous Currently, it may confuse users whether `httpAuth.` flags are used by HTTP client or server configuration(see https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4586 for example). Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * docs: fix a typo Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> --------- Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2023-07-07 13:50:13 +02:00
Haleygo	20e7db47ee	vmselect: fix result in Prometheus query when time is small (#4578 ) vmselect: fix result in Prometheus query when time is small Co-authored-by: Roman Khavronenko <roman@victoriametrics.com>	2023-07-07 11:48:05 +02:00
Haleygo	bca8ae034f	vmalert:fix query request using rfc3339 format (#4577 ) vmalert: consistently use time.RFC3339 format for time in queries Co-authored-by: hagen1778 <roman@victoriametrics.com>	2023-07-07 10:39:25 +02:00
Aliaksandr Valialkin	d32780de5c	docs/vmgateway.md: update `-help` output	2023-07-06 23:13:07 -07:00
Aliaksandr Valialkin	8a45a2e770	docs/vmbackupmanager.md: update `-help` output	2023-07-06 23:13:06 -07:00
Aliaksandr Valialkin	c6bf60e2cc	docs/vmrestore.md: update `-help` output	2023-07-06 23:13:06 -07:00
Aliaksandr Valialkin	02a81da3ca	docs/vmbackup.md: update `-help` output	2023-07-06 23:13:05 -07:00
Aliaksandr Valialkin	81f5f118ce	docs/vmauth.md: update `-help` output	2023-07-06 23:13:05 -07:00
Aliaksandr Valialkin	9fa8e3895a	docs/vmalert.md: update `-help` output	2023-07-06 23:13:04 -07:00
Aliaksandr Valialkin	009af1ce90	docs/vmagent.md: update `-help` output	2023-07-06 23:13:04 -07:00
Roman Khavronenko	7c8a215a7c	vmalert: allow disabling of `step` param attached to instant queries (#4574 ) vmalert: allow disabling of `step` param attached to instant queries This might be useful for using vmalert with datasources that do not support this param, unlike VictoriaMetrics. See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4573 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-07-07 07:44:34 +02:00
Aliaksandr Valialkin	7f3b5431a1	app/vmselect/graphite: follow-up after `c7884f8686` - Consistently use -search.maxGraphiteTagValues for limiting tag values from auto-complete API - Use -search.maxGraphiteSeries for limiting paths (aka series), which can be returned from Graphite series API - Clarify the change in docs/CHANGELOG.md Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4339 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2841	2023-07-06 15:21:56 -07:00
Aliaksandr Valialkin	45e345806c	app/vmselect/netstorage: remove runtime.Gosched() call from unpackWorker() This should improve scalability of unpackWorker() on systems with many CPU cores. This is a follow-up for `a2ecf4fa4a` and `16f3b279a2` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3966	2023-07-06 10:05:58 -07:00
Roman Khavronenko	2f710ec77d	vmctl: interrupt explore procedure in influx mode if no numeric fields were found (#4576 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-07-04 13:45:42 +02:00
Sergey	9bea7610f0	security: update base Alpine image to 3.18.2 to avoid security risks (#4571 ) libcrypto3 and libssl3 in Alpine 3.18.0 have version `3.1.0-r4`, which is affected by CVE-2023-2650: https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2023-2650 Use Alpine image 3.18.2, which contains fixed versions of libssl3 and libcrypto3: 3.1.1-r0 NB: In Openshift these containers are marked as vulnerable because of these CVEs.	2023-07-04 09:23:53 +02:00
Dmytro Kozlov	9bde95bfff	app/vmalert: show on UI groups error after reload config (#4543 ) show on UI groups error after reload config https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4076 Co-authored-by: hagen1778 <roman@victoriametrics.com>	2023-07-03 14:59:52 +02:00
Zakhar Bessarab	adc07b711e	app/vmagent/remotewrite: fix error message for auth config (#4545 ) Error message will be present for any auth error, but message claims an error is about OAuth2 configuration which is confusing. Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2023-07-03 13:12:40 +02:00
Yury Molodov	3cdba1b1c6	vmui: fix app routing issues (#4408 ) The change focuses on rectifying inconsistencies in the navigation behavior of the application and eliminating issues encountered when manually altering the URL. The key updates include: - Refactoring of the routing mechanism to handle all possible routes and their states. - Enhancement of the React Router usage to ensure a smoother navigation experience. - Handling application state when the URL is manually changed.	2023-06-30 10:13:10 +02:00
Roman Khavronenko	72edc31ffb	vmauth: expose latency metrics per user (#4525 ) expose `vmauth_user_request_duration_seconds` and `vmauth_unauthorized_user_request_duration_seconds` summary metrics for measuring requests latency per user. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-06-27 20:15:17 +02:00
Roman Khavronenko	c9ddfb118e	vmauth: rm ip filters from non-ent config example (#4526 ) It is impossible to run OS vmauth with the provided config. The example of using ip filters should be only a part of docs. All other examples should work seamlessly with OS version. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-06-27 20:14:52 +02:00
Haleygo	a97887a2d9	vmalert: add `vmalert_remotewrite_sent_duration_seconds_total` metric (#4517 ) add `vmalert_remotewrite_sent_duration_seconds_total` metric	2023-06-26 07:34:51 +02:00
Roman Khavronenko	37c9a631ca	vmalert: make linter happy (#4509 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-06-22 17:46:12 +02:00
Roman Khavronenko	5f9ad22884	vmalert: update retry policy for pushing data to `-remoteWrite.url` (#4504 ) By default, vmalert will make multiple retry attempts with exponential delay. The total time spent during retry attempts shouldn't exceed `-remoteWrite.retryMaxTime` (default is 30s). When retry time is exceeded vmalert drops the data dedicated for `-remoteWrite.url`. Before, vmalert dropped data after 5 retry attempts with 1s delay between attempts (not configurable). See `-remoteWrite.retryMinInterval` and `-remoteWrite.retryMaxTime` cmd-line flags. Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: Nikolay <nik@victoriametrics.com>	2023-06-22 15:14:23 +02:00
Roman Khavronenko	4aad7a43df	vmalert: properly interrupt remotewrite retries on shutdown (#4505 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-06-22 15:07:32 +02:00
Aliaksandr Valialkin	83aa78dfb4	app/vlstorage: export vl_active_merges and vl_merges_total metrics	2023-06-21 20:58:57 -07:00
Aliaksandr Valialkin	57541d5cea	Revert "app/vlselect/logsql: use buffered writer in order to save syscalls when sending big amounts of data to clients" This reverts commit `c19048dc13`. Reason for revert: it turned out that the net/http.ResponseWriter is already buffered, so there is no need for double buffering	2023-06-21 20:40:01 -07:00
Aliaksandr Valialkin	c19048dc13	app/vlselect/logsql: use buffered writer in order to save syscalls when sending big amounts of data to clients	2023-06-21 20:25:32 -07:00
Aliaksandr Valialkin	3ded68d4b8	app/vmui/Makefile: consistently use tabs instead of spaces in multi-line Makefile rules	2023-06-21 19:57:48 -07:00
Aliaksandr Valialkin	8be52ef217	app/vlselect: handle vmui at /select/vmui path instead of /vmui This simplifies routing at auth proxies such as vmauth to vlselect component, which serves VMUI - just route all the requests, which start with /select/, to vlselect.	2023-06-21 19:52:50 -07:00
Aliaksandr Valialkin	dde9ceed07	app/vlinsert/jsonline: code prettifying	2023-06-21 19:39:22 -07:00
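
The `or` filter described in commit `4cb024d8a3` can be read as "a series matches if it satisfies at least one comma-separated filter group". The Python sketch below illustrates only that semantics for plain `label="value"` equality filters; the `Selector` type, the `matches` function and the list-of-dicts representation are hypothetical and are not taken from the VictoriaMetrics or metricsql code.

```python
from typing import Dict, List

# An `or` selector modeled as a list of filter groups. Each group holds
# label="value" equality filters that must ALL match (hypothetical model).
Selector = List[Dict[str, str]]


def matches(labels: Dict[str, str], selector: Selector) -> bool:
    """Return True if the series labels satisfy at least one filter group."""
    return any(
        all(labels.get(name) == value for name, value in group.items())
        for group in selector
    )


# {env="prod",job="a" or env="dev",job="b"} from the commit message.
SELECTOR: Selector = [
    {"env": "prod", "job": "a"},
    {"env": "dev", "job": "b"},
]
```

For instance, `matches({"env": "prod", "job": "a"}, SELECTOR)` is true, while a series with `env="prod"` and `job="b"` satisfies neither group and is rejected.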


2530 Commits
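
The vmalert retry policy from commit `5f9ad22884` (retries with exponential delay, bounded in total by `-remoteWrite.retryMaxTime`) can be sketched as a delay schedule. This is a minimal Python illustration, not vmalert's implementation: the function name `retry_delays`, the doubling factor and the 1s starting interval are assumptions; only the 30s default for the total retry time comes from the commit message.

```python
from typing import List


def retry_delays(min_interval: float = 1.0, max_time: float = 30.0,
                 factor: float = 2.0) -> List[float]:
    """Return the sleep durations between retry attempts.

    The delay starts at min_interval and is multiplied by `factor` after
    each attempt (both values are assumptions). The last delay is cut so
    that the total time spent waiting never exceeds max_time.
    """
    if min_interval <= 0 or factor < 1:
        raise ValueError("min_interval must be > 0 and factor must be >= 1")
    delays: List[float] = []
    total = 0.0
    delay = min_interval
    while total < max_time:
        step = min(delay, max_time - total)  # do not overshoot the budget
        delays.append(step)
        total += step
        delay *= factor
    return delays
```

Under these assumptions the waits grow from attempt to attempt until the time budget is used up, after which the data would be dropped, as the commit message describes.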