VictoriaMetrics

mirror of https://github.com/VictoriaMetrics/VictoriaMetrics.git synced 2024-12-19 07:01:02 +01:00

Author	SHA1	Message	Date
Haleygo	4b0db17bec	vmalert: allow configuring custom notifier headers per group (#4088 ) vmalert: allow configuring custom notifier headers per group	2023-05-08 17:07:44 -07:00
Zakhar Bessarab	19eaf17e11	app/vmalert: add support of recursive path globs for rules and templates (#4148 ) Supports using `` for `-rule` and `-rule.templates`: `dir//*.tpl` loads contents of dir and all subdirectories recursively. See: #4041 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> Co-authored-by: Artem Navoiev <tenmozes@gmail.com> Co-authored-by: Nikolay <nik@victoriametrics.com>	2023-05-08 16:22:30 -07:00
Zakhar Bessarab	55d772ab39	app/vmalert: return an error when using `query` function in `-external.alert.source` flag (#4191 ) Templating of `-external.alert.source` is not expected to have access to the query which was causing runtime error when query function was passed as nil. See: #4181 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2023-05-08 15:48:16 -07:00
Roman Khavronenko	20b025dc88	http server: limit max concurrent requests (#4185 ) * lib/httpserver: introduce `-http.maxConcurrentRequests` command-line flag Introduce `-http.maxConcurrentRequests` command-line flag to protect VM components from resource exhaustion during unexpected spikes of HTTP requests. By default, the new flag's value is set to 0 which means no limits are applied. Signed-off-by: hagen1778 <roman@victoriametrics.com> * lib/httpserver: mention http.maxConcurrentRequests in docs Signed-off-by: hagen1778 <roman@victoriametrics.com> --------- Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-05-08 13:13:58 -07:00
Roman Khavronenko	e9ce67adb8	vmalert: retry datasource requests with EOF or unexpected EOF errors (#4146 ) * vmalert: retry datasource requests with EOF or unexpected EOF errors Retry failed read request on the closed connection one more time. This may improve rules execution reliability when connection between vmalert and datasource closes unexpectedly. Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: fix old tests Signed-off-by: hagen1778 <roman@victoriametrics.com> --------- Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-05-08 09:49:49 -07:00
Zakhar Bessarab	54edd6992a	app/vmalert: update Grafana URLs to match latest format (#4061 ) See: #4019 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2023-04-05 13:31:06 -07:00
Roman Khavronenko	0bde9722ed	vmalert: use `missingkey=zero` for templating (#4040 ) Replace empty labels with "" instead of "<no value>" during templating, as Prometheus does. See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4012 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-03-31 22:43:39 -07:00
Aliaksandr Valialkin	9387793f47	app/vmselect: follow-up for `10ab086366` - Expose stats.seriesFetched at `/api/v1/query_range` responses too for the sake of consistency. - Initialize QueryStats when it is needed and pass it to EvalConfig then. This guarantees that the QueryStats is properly collected when the query contains some subqueries.	2023-03-27 15:11:42 -07:00
Roman Khavronenko	a09dabc78f	vmalert: add anchor char to Group's link (#4006 ) This should help users to see that Group's name is clickable and used for anchoring. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-03-24 17:56:04 -07:00
Roman Khavronenko	ec6a20880c	vmalert: mention VMUI example for alert's source (#4005 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-03-24 17:55:30 -07:00
Roman Khavronenko	6a7de761f4	vmalert: support logs suppressing during config reloads (#3973 ) * vmalert: support logs suppressing during config reloads The change is mostly required for ENT version of vmalert, since it supports object-storage for config files. Reading data from object storage could be time-consuming, so vmalert emits logs to track the progress. However, these logs are mostly needed on start or on manual config reload. Printing these logs each time `rule.configCheckInterval` is triggered would too verbose. So the change allows to control logs emitting during config reloads. Now, logs are emitted during start up or when SIGHUP is receieved. For periodicall config checks logs emitted by config pkg are suppressed. Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: review fixes Signed-off-by: hagen1778 <roman@victoriametrics.com> --------- Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-03-20 14:25:26 -07:00
Roman Khavronenko	0ac57ef5b9	Vmalert tests (#3975 ) * vmalert: add tests for notifier pkg * vmalert: add tests for remotewrite pkg * vmalert: add tests for template functions * vmalert: add tests for web pages * vmalert: fix int overflow in tests Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-03-17 16:16:13 -07:00
Zakhar Bessarab	3b7152b1d8	docs: add a note about cache reset for vmalert backfilling docs (#3940 ) docs: add a note about cache reset for vmalert backfilling docs	2023-03-12 00:13:00 -08:00
Roman Khavronenko	310b380a03	app/vmalert: log number of configration files found for each specified `-rule` (#3936 ) The change also introduces `List` method to `FS` interface. The `List` method can be used for wildcard support in object storage FS. Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: Nikolay <nik@victoriametrics.com>	2023-03-11 23:40:40 -08:00
Aliaksandr Valialkin	54fe207cc0	all: follow-up for `7a3e16e774` - Sync the description for -httpListenAddr.useProxyProtocol command-line flag at vmagent and vmauth, so it is consistent with the description at vmauth and victoria-metrics - Add a sample of panic text to docs/CHANGELOG.md, so it could be googled - Mention the -httpListenAddr.useProxyProtocol command-line flag in the description for the bugfix Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3335	2023-03-08 01:42:58 -08:00
Roman Khavronenko	fa3b2bd205	app/vmalert: do not wait for group start on removal (#3891 ) Each group in vmalert starts with an artifical delay to avoid thundering herd problem. For some groups with high evaluation intervals, the delay could be significant. If during this delay user will remove the group from the config and hot-reload it - vmalert will have to wait until the delay ends. This results into slow config reloading and UI hang. The change moves the start-delay logic back to the group's `start` method. Now, group can immediately exit from the delay when `group.close()` method is called. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-03-08 01:10:41 -08:00
Roman Khavronenko	b176247e16	vmalert: cancel in-flight requests on group's update or close (#3886 ) When group's update() or close() method is called, the group still need to wait for its current evaluation to finish. Sometimes, evaluation could take a significant amount of time which slows configuration update or vmalert's graceful shutdown. The change interrupts current evaluation in order to speed up the graceful shutdown or config update procedures. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-03-08 00:10:11 -08:00
Aliaksandr Valialkin	bbd5914eb1	all: add makefile rules for GOARCH=s390x for all the VictoriaMetrics components This is a follow-up for `007530f882`	2023-02-26 12:38:48 -08:00
Aliaksandr Valialkin	0c60e4a30a	all: consistently use http.Method{Get,Post,Put} across the codebase This is a follow-up after `9dec3c8f80`	2023-02-22 19:01:09 -08:00
my-git9	7d86c5c94a	chore: Use http constants to replace numbers (#3846 ) Signed-off-by: xin.li <xin.li@daocloud.io>	2023-02-22 18:59:32 -08:00
Roman Khavronenko	a29c1d3a02	docs: mention rules replay blogpost in vmalert docs (#3851 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-02-21 17:44:32 -08:00
Roman Khavronenko	fd139b463b	docs: update vmalert docs (#3843 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-02-20 19:13:00 -08:00
Aliaksandr Valialkin	58779363b4	app/vmalert/README.md: sync with docs/vmalert.md after `6ef6f3a771`	2023-02-18 15:21:31 -08:00
Haleygo	9a274567f1	vmalert: fix maxResolveDuration flag note (#3827 ) Signed-off-by: Haleygo <hui.wang@daocloud.io>	2023-02-18 15:20:30 -08:00
Roman Khavronenko	c6251ec8aa	docs: improve troubleshooting docs for vmalert (#3812 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-02-13 09:42:18 -08:00
Oleksandr Redko	0e1c395609	app,lib: fix typos in comments (#3804 )	2023-02-13 09:32:35 -08:00
Aliaksandr Valialkin	ca61c276ca	app/vmalert: follow-up after d3c64aae8768d58781ee7e358bd7f3d8e0eb836d - Document the change at docs/CHANGELOG.md - Add `Reading rules from object storage` section to docs/vmalert.md - Add `s3` prefix to command-line flags related to the configuration of s3 and gcs clients - Explicitly mention that reading rules from object storage is supported only in enterprise version	2023-02-09 19:10:36 -08:00
Roman Khavronenko	2eb9ca1889	vmalert: support object storage for rules (#519 ) * vmalert: support object storage for rules Support loading of alerting and recording rules from object storages `gcs://`, `gs://`, `s3://`. * review fixes	2023-02-09 19:10:34 -08:00
Aliaksandr Valialkin	34379d4cf1	all: run `apk update && apk upgrade` in base Alpine Docker image in order to get all the recent security fixes	2023-02-09 14:03:02 -08:00
Roman Khavronenko	4e922eb93b	Vmalert fixes (#3788 ) * vmalert: use group's ID in UI to avoid collisions Identical group names are allowed. So we should used IDs for various groupings and aggregations in UI. Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: prevent disabling state updates tracking The minimum number of update states to track is now set to 1. Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: properly update `debug` and `update_entries_limit` params on hot-reload Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: display `debug` field for rule in UI Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: exclude `updates` field from json marhsaling This field isn't correctly marshaled right now. And implementing the correct marshaling for it doesn't seem right, since json representation is mostly used by systems like Grafana. And Grafana doesn't expect this field to be present. Signed-off-by: hagen1778 <roman@victoriametrics.com> * fix test for disabled state Signed-off-by: hagen1778 <roman@victoriametrics.com> * fix test for disabled state Signed-off-by: hagen1778 <roman@victoriametrics.com> --------- Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-02-08 08:45:25 -08:00
Roman Khavronenko	80bf0bcf8c	vmalert: update docs (#3770 ) vmalert: update flags description Signed-off-by: hagen1778 <roman@victoriametrics.com> --------- Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-02-07 09:28:59 -08:00
Roman Khavronenko	96db7ac52c	vmalert: speed up state restore procedure on start (#3758 ) * vmalert: speed up state restore procedure on start Alerts state restore procedure has been changed to become asynchronous. It doesn't block groups start anymore which significantly improves vmalert's startup time. Instead, state restore is called by each group in their goroutines after the first rules evaluation. While previously state restore attempt was made for all loaded alerting rules, now it is called only for alerts which became active after the first evaluation. This reduces the amount of API calls to the configured remote read URL. This also means that `remoteRead.ignoreRestoreErrors` command-line flag becomes deprecated now and will have no effect if configured. See relevant issue https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2608 Signed-off-by: hagen1778 <roman@victoriametrics.com> * make lint happy Signed-off-by: hagen1778 <roman@victoriametrics.com> * Apply suggestions from code review --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-02-03 19:46:41 -08:00
Roman Khavronenko	d93ac2b1ea	docs: mention `-vmalert.proxyURL` in vmalert docs (#3730 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-01-31 10:49:49 -08:00
Aliaksandr Valialkin	4cf4c307ea	docs: update command-line descriptions after `73256fe438`	2023-01-27 00:01:14 -08:00
Nikolay	ebebaecd94	lib/netutil: init implimentation of proxy protocol (#3687 ) * lib/netutil: init implimentation of proxy protocol https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3335 * wip Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-01-26 23:25:22 -08:00
Aliaksandr Valialkin	bd809db4d9	docs: update the list of command-line flags according to the latest changes	2023-01-25 09:22:23 -08:00
Aliaksandr Valialkin	ef7683f2e0	app/vmalert: use consistent randomizer in tests Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3683	2023-01-23 19:25:32 -08:00
Aliaksandr Valialkin	ac890b3081	docs: update `-help` outputs for vm* tools	2023-01-03 23:27:31 -08:00
Aliaksandr Valialkin	3369371636	app/{vmagent,vminsert}: add support for streaming aggregation See https://docs.victoriametrics.com/stream-aggregation.html Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3460	2023-01-03 22:22:07 -08:00
Roman Khavronenko	dde750e7c1	vmalert: mention specifics of Alertmanager HA mode (#3573 ) Stress the importance of specifying of all Alertmanager URLs in vmalert's `-notifier.url` or `notifier.config` if it runs in cluster mode. See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3547 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-01-03 21:46:00 -08:00
Roman Khavronenko	5cf2998af8	vmalert: allow configuring the default number of stored rule's update states (#3556 ) Allow configuring the default number of stored rule's update states in memory via global `-rule.updateEntriesLimit` command-line flag or per-rule via rule's `update_entries_limit` configuration param. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-12-29 10:41:51 -08:00
Artem Navoiev	393f4ab86f	update links to grafana dashboards (#3534 ) docs: update links to grafana dashboards Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2022-12-28 11:22:02 -08:00
Zakhar Bessarab	decf46d72b	app/vmbackupmanager: add metrics for better observability (#488 ) * app/vmbackupmanager: add metrics for better observability, include more information to `/api/v1/backups` API call response * app/vmbackupmanager: drop old metrics before creating new ones * app/vmbackupmanager: use `_total` postfix for counter metrics * app/vmbackupmanager: remove `_total` postfix for gauge-like metrics * app/vmbackupmanager: add `_last_run_failed` metrics for backups and retention * app/vmbackupmanager: address review feedback * app/vmbackupmanager: fix metric name * app/vmbackupmanager: address review feedback, remove background updates of metrics, add restoring state of `_last_run_failed` metric from remote storage * app/vmbackupmanager: improve performance for backup size calculation * app/vmbackupmanager: refactor backup and retention runs to deduplicate each run logic * {app/vmbackupmanager,lib/formatutil}: move HumanizeBytes into lib package * app/vmbackupmanager: fix creating new metrics instead of reusing existing ones * lit/formatutil: add comment to make linter happy * app/vmbackupmanager: address review feedback	2022-12-20 14:18:43 -08:00
Aliaksandr Valialkin	2a229a319e	docs/vmalert.md: mention `latency_offset` query arg, which has been added in `86dae56bd0` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3481	2022-12-16 17:20:51 -08:00
Aliaksandr Valialkin	3a28a52667	lib/flagutil: support for TB and TiB suffixes for command-line flags, which accept byte sizes	2022-12-14 17:53:18 -08:00
Roman Khavronenko	a44af871d3	vmalert: support `$for` or `.For` template variables (#3474 ) support `$for` or `.For` template variables in alert's annotations. See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3246 Signed-off-by: hagen1778 <roman@victoriametrics.com> Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-12-12 14:42:16 -08:00
Aliaksandr Valialkin	97b41e727c	lib/promscrape: implement target-level and metric-level relabel debugging Target-level debugging is performed by clicking the 'debug' link at the corresponding target on either http://vmagent:8429/targets page or on http://vmagent:8428/service-discovery page. Metric-level debugging is perfromed at http://vmagent:8429/metric-relabel-debug page. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3407 See https://docs.victoriametrics.com/vmagent.html#relabel-debug	2022-12-10 02:25:56 -08:00
Aliaksandr Valialkin	8fce069c7b	app/vmalert: properly handle nil req passed to requestToCurl() This fixes a panic in the TestAlertingRule_Exec_Negative test. The panic has been introduced in the commit `b97bd01605`	2022-12-10 02:05:20 -08:00
Aliaksandr Valialkin	650d1d1ae5	app/vmalert: do not show system links at http://vmalert:8880/ page when it is requested via proxy The system links are absolute, e.g. they start from `/`, so there are high chances they won't work as expected when requested via proxy such as vmselect with -vmalert.proxyURL command-line flag. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/3424	2022-12-09 11:49:53 -08:00
Roman Khavronenko	385d082bca	vmalert: do not hold pointer to http.Request (#3467 ) http.Request was used as a part of state struct for generating the curl command when viewing the rule's state changes. It appears, that holding a referencing is far more expensive than generating the curl command immediately. On the test with 40k rules, this change reduces memory and CPU usage by 50%. Signed-off-by: hagen1778 <roman@victoriametrics.com> Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-12-09 11:49:53 -08:00
Aliaksandr Valialkin	676de127aa	all: update Go builder from v1.19.3 to v1.19.4 See https://github.com/golang/go/issues?q=milestone%3AGo1.19.4+label%3ACherryPickApproved	2022-12-08 17:04:41 -08:00
Roman Khavronenko	5bbb88902e	vmalert: correctly return error for RW failures (#3452 ) * vmalert: correctly return error for RW failures By mistake, in `0989649ad0` the error for remote write failures weren't return to user. This change fixes it. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-12-06 16:31:12 -08:00
Roman Khavronenko	a922308438	vmalert: reduce allocations for Prometheus resp parse (#3435 ) Method `metrics()` now pre-allocates slices for labels and results from query responses. This reduces the number of allocations on the hot path for instant requests. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-12-05 00:18:11 -08:00
Roman Khavronenko	31ca22109e	vmalert: fix replay step param (#3428 ) The recent change in modifying default value of `datasource.queryStep` flag resulted in situation where replay mode was always running queries with step=`datasource.queryStep`. When it should always use rule's evaluation interval. The fix is related not to replay mode only, but for all Range requests. Now step param is set individually for each mode. Signed-off-by: hagen1778 <roman@victoriametrics.com> Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-12-02 19:09:30 -08:00
Zakhar Bessarab	59f889cd3f	app/vmalert: add `remoteWrite.sendTimeout` command-line flag to configure timeout for sending data to `remoteWrite.url` (#3423 ) * app/vmalert: add `remoteWrite.sendTimeout` command-line flag to configure timeout for sending data to `remoteWrite.url` * vmalert: remove WriteTimeout from clients Cfg No need to have it as a part of configuration struct: * the client isn't used by other packages; * there are no internal tests to check the WriteTimeout. * vmalert: remove DisablePathAppend from clients Cfg No need to have it as a part of configuration struct: * the client isn't used by other packages; * there are no internal tests to check the DisablePathAppend. Co-authored-by: hagen1778 <roman@victoriametrics.com>	2022-12-02 19:03:34 -08:00
Roman Khavronenko	435f6f3add	vmalert: properly pass headers during the restore procedure (#3420 ) See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3418 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-12-02 18:53:44 -08:00
Aliaksandr Valialkin	be6da5053f	lib/promscrape: optimize service discovery speed - Return meta-labels for the discovered targets via promutils.Labels instead of map[string]string. This improves the speed of generating meta-labels for discovered targets by up to 5x. - Remove memory allocations in hot paths during ScrapeWork generation. The ScrapeWork contains scrape settings for a single discovered target. This improves the service discovery speed by up to 2x.	2022-11-29 21:26:23 -08:00
Aliaksandr Valialkin	2a107cc8a7	app/vmalert: substitute -datasource.disablePathAppend with -remoteRead.disablePathAppend in the description for -datasource.url command-line flag This is a follow-up for `959f06d175`	2022-11-29 21:11:18 -08:00
Max Golionko	d272a8270b	vmalert: flag reference update (#3415 ) * flag reference update there is no flag `-datasource.disablePathAppend` and datasource actually checking for `-remoteRead.disablePathAppend` * update source for doc as well	2022-11-29 20:38:02 -08:00
Roman Khavronenko	0475f8a38e	vmalert: add default list of alerting rules (#3373 ) The default list of alerting rules contains the basic rules for checking vmalert's health state and is recommended to use for monitoring vmalert deployments. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-11-21 16:09:47 +02:00
Aliaksandr Valialkin	6fe8eec745	all: add a link to https://docs.victoriametrics.com/enterprise.html into description for enterprise flags	2022-11-21 15:44:54 +02:00
Roman Khavronenko	8ee464b22b	bump go version to 1.19.3 (#3327 ) Signed-off-by: hagen1778 <roman@victoriametrics.com> Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-11-09 11:56:38 +02:00
Aliaksandr Valialkin	7ae038766c	app/vmalert/templates: properly escape all the special chars in `quotesEscape` function Previously the `quotesEscape` function was escaping only double quotes. This wasn't enough, since the input string could contain other special chars, which must be escaped when put inside JSON string. For example, carriage return and line feed chars (\n\r), backslash char, etc. This led to the following issues, which were improperly fixed: - https://github.com/VictoriaMetrics/VictoriaMetrics/issues/890 - this issue was "fixed" by introducing the `crlfEscape` function, which led to unnecessary complications in user templates, while not fixing various corner cases such as backslash chars in the input string. See `1de15ad490` - https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3139 - this issue was "fixed" by urlencoding the whole string passed to -external.alert.source command-line flag. This led to invalid urls, which couldn't be parsed by Grafana. See `00c838353d` and `4bd0244599` This commit properly encodes the input string passed to `quotesEscape`, so it can be safely embedded inside JSON strings. This commit deprecates crlfEscape template function and adds the following new template functions: - strvalue and stripDomain - these functions are supported by Prometheus, so they were added for compatibility purposes. - jsonEscape and htmlEscape for converting the input string to valid quoted JSON string and for html-escaping the input string, so it could be safely embedded as a plaintext into html. This commit also documents all supported template functions at https://docs.victoriametrics.com/vmalert.html#template-functions The deprecated crlfEscape function isn't documented on purpose, since its usefulness is negative in general case.	2022-10-28 00:08:50 +03:00
Aliaksandr Valialkin	8a6898b625	Revert "vmalert: escape query params if external alert source defined (#3267 )" This reverts commit `00c838353d`. Reason for revert: it incorrectly fixes the issue https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3139 . Now `-external.alert.source=explore?orgId=1&left=...` is converted to the following invalid url, which cannot be handled by Grafana: https://grafana.example.com/explore%3ForgId%3D1%26left%3D... The next commit will contain the correct fix of the issue - the `quotesEscape` function must properly escape the string, so it could be embedded into JSON string. This function must properly escape \n\r chars too. In this case the `crlfEscape` function becomes unnecessary. Actually, the next commit makes the `crlfEscape` function deprecated.	2022-10-28 00:08:50 +03:00
Dmytro Kozlov	3123059407	vmalert: escape query params if external alert source defined (#3267 ) vmalert: escape query args if external alert source defined	2022-10-28 00:08:50 +03:00
Aliaksandr Valialkin	450a32970a	lib/envtemplate: allow referring env vars from other env vars via %{ENV_VAR} syntax This is a follow-up for `02096e06d0`	2022-10-26 14:51:02 +03:00
Aliaksandr Valialkin	8ea84432ef	docs/enterprise.md: describe all the enteprise features in a short doc at https://docs.victoriametrics.com/enterprise.html	2022-10-24 18:03:22 +03:00
Roman Khavronenko	f7d69c1735	vmalert: lower severity level for RW retries (#3237 ) The message about dropped data still remains at `error` level. The change supposed to make log message more clear about how serious it is. Signed-off-by: hagen1778 <roman@victoriametrics.com> Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-10-18 20:40:37 +03:00
Aliaksandr Valialkin	d0288ea417	all: log error when environment variables referred from `-promscrape.config` are missing This should prevent from using incorrect config files	2022-10-18 10:29:59 +03:00
Roman Khavronenko	895cb3e7c6	vmalert: update troubleshooting docs (#3228 ) The default value of `-datasource.queryStep` has changed, so we update the troubleshooting docs accordingly. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-10-13 10:14:40 +03:00
Roman Khavronenko	7fc812d3c4	vmalert: revert unexpected fileds rename during refactoring (#3222 ) Due to auto-refactoring, the filed `state` was automatically renamed to `ruleState` when the entity with the same name was renamed in other file. Reverting the change. https://github.com/VictoriaMetrics/helm-charts/issues/391 Signed-off-by: hagen1778 <roman@victoriametrics.com> Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-10-12 09:33:16 +03:00
Howie	d9abdc57d4	fix issue#3053 (#3182 ) vmalert: prevent duplicating label `alertname` for notifications The issue has no impact on alerting procedure. But still needs to be fixed for clarity. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3053 Signed-off-by: lihaowei <haoweili35@gmail.com>	2022-10-10 21:54:18 +03:00
Aliaksandr Valialkin	087393bcef	lib/promrelabel: remove unconditional sorting of the labels in ParsedConfigs.Apply(), since the sorting isnt needed in many places Sort labels explicitly after calling the ParsedConfigs.Apply() when needed. This reduces CPU usage when performing metric-level relabeling, where labels' sorting isn't needed.	2022-10-09 14:53:35 +03:00
Aliaksandr Valialkin	98a4ab796c	all: update the minimum required Go verson from 1.19.1 to 1.19.2 This is needed because of security vulnerabilities found in Go 1.19.1 See https://go.dev/doc/devel/release#go1.19.2	2022-10-07 22:46:44 +03:00
Roman Khavronenko	de92a8375c	vmalert: fix misleading line regarding multitenancy (#3206 ) Signed-off-by: hagen1778 <roman@victoriametrics.com> Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-10-06 15:10:52 +03:00
Aliaksandr Valialkin	9b1443bde5	app/vmalert: follow-up after f8ac55d70ada9ef8490b322abefb05f28f75e2e9 * Use vm_account_id and vm_project_id labels to be consistent with https://docs.victoriametrics.com/Cluster-VictoriaMetrics.html#multitenancy-via-labels * Document the feature that vmalert now exposes vm_account_id and vm_project_id labels if -clusterMode is set. * Use literal strings instead of string constants for vm_account_id and vm_project_id. This improves code readability.	2022-10-06 00:06:06 +03:00
Aliaksandr Valialkin	98d58fdb57	app/vmalert: update -external.alert.source command-line flag description after `61544e13ad`	2022-10-05 22:54:23 +03:00
Roman Khavronenko	6f6f6afae0	vmalert: allow using `{{$labels}}` for templating in `-external.alert.source` (#3194 ) The change is supposed to provide additional flexibility for generating alert's source link based on label values. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-10-05 22:53:02 +03:00
Aliaksandr Valialkin	6f9ce3f6d6	lib/flagutil: rename Array to ArrayString This makes the ArrayString more consistent with other Array* types. While at it, add ArrayBytes type, which will be used for https://github.com/VictoriaMetrics/VictoriaMetrics/pull/3071	2022-10-01 18:28:19 +03:00
Aliaksandr Valialkin	93e84a1c57	lib/httpserver: use 302 redirects instead of 301 redirects Incorrect 301 redirects can be cached by user agents such as web browsers. This can complicate recovery procedure after the incorrect redirect is fixed, e.g. web browser cache must be reset. The related issue - https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1752	2022-10-01 16:56:43 +03:00
Roman Khavronenko	408d7043a1	vmalert: support auth configs per static_target (#3188 ) Allow configuring authorization params per list of targets in vmalert's notifier config for `static_configs`. See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2690 Signed-off-by: hagen1778 <roman@victoriametrics.com> Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-09-30 18:38:11 +03:00
Roman Khavronenko	a2ded58600	vmalert: allow using extra labels in annotations (#3181 ) According to Ruler specification, only labels returned within time series should be available for use in annotations. For long time, vmalert didn't respect this rule. And in PR https://github.com/VictoriaMetrics/VictoriaMetrics/pull/2403 this was fixed for the sake of compatibility. However, this resulted into users confusion, as they expected all configured and extra labels to be available - https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3013 This fix allows to use extra labels in Annotations. But in the case of conflicts the original labels (extracted from time series) are preferred. Signed-off-by: hagen1778 <roman@victoriametrics.com> Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-09-30 07:48:59 +03:00
panguicai	156e7035c7	docs: fix typo for vmalert docs (#3173 ) Signed-off-by: panguicai008 <1121906548@qq.com>	2022-09-28 10:42:13 +03:00
Dmytro Kozlov	28dcff5791	lib/{httpserver,netutil}: allow to define min and max TLS version of the http server (#3109 ) * lib/{httpserver,netutil}: allow to define min and max TLS version of the http server * lib/httpserver: added descriptions about tls supported versions * lib/netutil: check minimal tls version, added supported tls versions to error * wip Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2022-09-26 17:38:43 +03:00
Aliaksandr Valialkin	594a4ab345	docs/vmalert.md: follow-up for `0c95f928ae` - Clarify the description for -datasource.queryStep command-line flag - Consistently use a single dash in front of -datasource.queryStep command-line flag - Update -help output at docs/vmalert.md	2022-09-26 08:49:47 +03:00
Aliaksandr Valialkin	a6a869c365	docs/vmalert.md: follow-up after `7748a9d629` - Consistently use single dash in front of command-line flags instead of double dashes. - Add a warning that too small -search.latencyOffset may lead to incomplete query results.	2022-09-26 08:49:47 +03:00
Roman Khavronenko	cffceba0f5	vmalert: set default value for `datasource.queryStep` to `5m` (#3149 ) Change default value for command-line flag `datasource.queryStep` from `0s` to `5m`. Param `step` is added by vmalert to every rule evaluation request sent to datasource. Before this change, `step` was equal to group's evaluation interval by default. Param `step` for instant queries defines how far VM can look back for the last written data point. The change supposed to improve reliability of the rules evaluation when evaluation interval is lower than scraping interval. Signed-off-by: hagen1778 <roman@victoriametrics.com> Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-09-26 08:40:17 +03:00
Roman Khavronenko	b86cf7d707	vmalert: add info about `search.latencyOffset` to Troubleshooting (#3151 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-09-26 08:38:23 +03:00
Dmytro Kozlov	ed842e7d3a	app/{vmctl,vmalert}: update progress bar library (make vendor-update) (#3138 ) * app/{vmctl,vmalert}: update progress bar library (make vendor-update) * app/{vmctl,vmalert}: make vendor-update	2022-09-21 11:11:40 +03:00
Roman Khavronenko	74e81d31a7	vmalert: prodvide more details on duplicates (#3136 ) Now vmalert will print the following messages on dupliсates: ``` "recording rule \"record\"; expr: \"up == 1\"; labels: summary={{ value\|query }}" is a duplicate within the group "test" "alerting rule \"alert\"; expr: \"up == 1\"; labels: description={{ value\|query }}" is a duplicate within the group "test" ``` https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3127 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-09-21 11:11:40 +03:00
Roman Khavronenko	360b022603	vmalert: always re-evaluate Annotations (#3119 ) * vmalert: always re-evaluate Annotations Previously, Annotations were evaluated only: 1. On alert creating. 2. On alert's value change. This is premature optimization. It was assumed that since annotations could contain only text with alert's labels or value - there is no need in spending resources to re-compile Annotations. Later, template function `query` was added, which can execute arbitrary queries and return different results on every evaluation. So if it was used in annotations, it would be executed only on init or value change. Another case when optimization caused an issue - annotations hot reload. In this case, annotations of the active alert won't change even if Rule's annotations were changed. This fix enables Annotations re-evaluation on each iteration to resolve issues above. It would have some impact on performance, but it is unlikely it will be noticeable. Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: add tp Changelog Signed-off-by: hagen1778 <roman@victoriametrics.com> Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-09-19 15:04:37 +03:00
Roman Khavronenko	1c13cce5ed	vmalert: add Troubleshooting section to docs (#3115 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-09-19 15:04:37 +03:00
Roman Khavronenko	09e211a05f	vmalert: print example of `curl` command for rule's state (#3112 ) The change adds an example of `curl` command to the Rule's page. The command is generated for each recorded state. It is supposed user can just copy&execute the command to see what was returned to vmalert. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-09-19 15:04:37 +03:00
Aliaksandr Valialkin	424bcfc17b	docs/vmalert.md: update `-help` output after explicit marking of enterprise flags	2022-09-15 13:19:02 +03:00
Roman Khavronenko	6ae4f3526b	vmalert: add experimental feature of storing Rule's evaluation state (#3106 ) vmalert: add experimental feature of storing Rule's evaluation state The new feature keeps last 20 state changes of each Rule in memory. The state are available for view on the Rule's view page. The page can be opened by clicking on `Details` link next to Rule's name on the `/groups` page. States change suppose to help in investigating cases when Rule doesn't generate alerts or records. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-09-14 15:08:40 +03:00
Roman Khavronenko	d071e39694	bump Go version to 1.19.1 (#3108 ) The reason is to cover vulnerability GO-2022-0969 Found in: net/http@go1.18.5 Fixed in: net/http@go1.19.1 More info: https://pkg.go.dev/vuln/GO-2022-0969 Signed-off-by: hagen1778 <roman@victoriametrics.com> Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-09-14 13:43:27 +03:00
Aliaksandr Valialkin	20834c1757	app/vmalert: follow-up after `8441375da2` - Rename logDebug() to logDebugf() and pass format string together with format args directly to logDebugf(). This eliminates fmt.Sprintf() overhead at logDebug() call site when debugging is disabled. - Format labels in debug message in Prometheus format, e.g. {label1="value1",...labelN="valueN"} Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3025	2022-09-13 16:36:31 +03:00
Roman Khavronenko	a887c1bc07	vmalert: add `debug` mode for alerting rules (#3055 ) * vmalert: add `debug` mode for alerting rules Debug information includes alerts state changes and requests sent to the datasource. Debug can be enabled only on rule's level. It might be useful for debugging unexpected behaviour of alerting rule. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3025 Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: review fixes Signed-off-by: hagen1778 <roman@victoriametrics.com> * Update app/vmalert/alerting.go Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com> * vmalert: go fmt Signed-off-by: hagen1778 <roman@victoriametrics.com> Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2022-09-13 16:36:30 +03:00
Aliaksandr Valialkin	e2d8916935	docs: mention that it is safe sharing the collected profiles from security PoV The collected profiles do not contain sensitive information	2022-08-24 14:08:30 +03:00
Roman Khavronenko	555a202d80	docs: follow-up after `88425bb285` (#3007 ) Signed-off-by: hagen1778 <roman@victoriametrics.com> Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-08-24 01:23:37 +03:00
laixintao	76a291a95b	vmalert: add $activeAt into template variables. (#3000 ) vmalert: add `$activeAt` template variable for annotations https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2999	2022-08-24 01:23:37 +03:00
Aliaksandr Valialkin	9ddd2699fd	all: remove the remaining bits of io/ioutil The io/ioutil package is deprecated since Go1.16 - see https://tip.golang.org/doc/go1.16#ioutil VictoriaMetrics requires at least Go1.18, so it is time to remove the io/ioutil from source code This is a follow-up for `02ca2342ab`	2022-08-22 00:22:41 +03:00
Aliaksandr Valialkin	1905618d10	all: subsitute ioutil.ReadAll with io.ReadAll ioutil.ReadAll is deprecated since Go1.16 - see https://tip.golang.org/doc/go1.16#ioutil VictoriaMetrics requires at least Go1.18, so it is OK to switch from ioutil.ReadAll to io.ReadAll. This is a follow-up for `02ca2342ab`	2022-08-22 00:16:04 +03:00
Aliaksandr Valialkin	06f6de6d47	all: use os.{Read\|Write}File instead of ioutil.{Read\|Write}File The ioutil.{Read\|Write}File is deprecated since Go1.16 - see https://tip.golang.org/doc/go1.16#ioutil VictoriaMetrics needs at least Go1.18, so it is safe to remove ioutil usage from source code. This is a follow-up for `02ca2342ab`	2022-08-21 23:55:20 +03:00
Aliaksandr Valialkin	6d2354e7a4	app/vmalert/README.md: sync with docs/vmalert.md after `a229182dbe`	2022-08-20 08:54:55 +03:00
Roman Khavronenko	2256d51418	docs: fix docs formatting related to vmalert (#2994 ) Signed-off-by: hagen1778 <roman@victoriametrics.com> Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-08-19 11:05:08 +03:00
Aliaksandr Valialkin	bbfa52bd75	docs: follow-up after `68e56b6fc5`	2022-08-17 21:27:24 +03:00
Roman Khavronenko	cfcb5ab15b	vmalert: set alert's source link to UI instead of JSON source (#2986 ) We switch default alert's source link to redirect user to vmalert's UI instead of previous JSON object. While it breaks compatibility, it also supposed to improve user's experience. The old behavior can be achieved by updating `-external.alert.source` command-line flag. Signed-off-by: hagen1778 <roman@victoriametrics.com> Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-08-17 21:27:24 +03:00
Roman Khavronenko	436b4d90af	docs: update vmalert docs (#2987 ) * mention recently added `$alertID` and `$groupID` variables in the changelog * properly escape template examples in the vmalert's README Signed-off-by: hagen1778 <roman@victoriametrics.com> Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-08-16 12:10:08 +03:00
Roman Khavronenko	0d5c403b6e	vmalert: support `$alertID` and `$groupID` in template variables (#2983 ) Support of these two variables allows building custom URLs with alert's ID and group ID params. See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/517#issuecomment-1207141432 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-08-16 11:24:01 +03:00
Aliaksandr Valialkin	198d8eeeaa	app/vmalert/templates: add `toTime()` template function in the same way as Prometheus 2.38 does See https://github.com/prometheus/prometheus/pull/10993	2022-08-15 00:49:52 +03:00
Roman Khavronenko	36660bcfe2	vmalert: follow-up after `28441711e6` (#2972 ) Signed-off-by: hagen1778 <roman@victoriametrics.com> Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-08-11 23:52:18 +03:00
Matthew Blewitt	b1b8c99b17	vmalert: mark some url flags as sensitive (#2965 ) Other components, such as `vmagent`, mark these flags as sensitive and hide them from the `/metrics` endpoint by default. This commit adds similar handling to the `vmalert` component, hiding them by default, to prevent logging of secrets inappropriately. Showing of these values is controlled by an additional flag. Follow up to https://github.com/VictoriaMetrics/VictoriaMetrics/pull/2947	2022-08-11 23:51:43 +03:00
Roman Khavronenko	f90f654cf2	vmalert: sort groups at `/alerts` page (#2968 ) Sorting will produce deterministic output of grops on the page. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-08-09 12:08:54 +03:00
Aliaksandr Valialkin	69d62d5736	docs: sync -help output with the latest changes	2022-08-08 14:05:36 +03:00
Aliaksandr Valialkin	221dd3a224	all: bump the minimum supported version of Go from 1.17 to 1.18 This is needed because some dependencies uses generics, which have been appeared in Go1.18 This is a follow-up for `caf3dd4fa2`	2022-08-08 13:45:39 +03:00
Roman Khavronenko	6af40f6275	vmalert: remove notions of vmalert being compatible with VM only (#2954 ) vmalert can be successfully used with datasources compatible with Prometheus HTTP API. So we remove comments or notes in Readme which are saying opposite. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-08-08 13:44:15 +03:00
laixintao	9f5fc040a7	bugfix: fix vmalert navbar url. (#2949 ) the doc url should not be joined by `prefix` because it's an abs url.	2022-08-08 00:29:48 +03:00
Aliaksandr Valialkin	98caffcfc4	docs: fix the recommended url for -vmalert.proxyURL accroding to `8667307d73`	2022-08-04 18:03:24 +03:00
Aliaksandr Valialkin	a1e49606ed	app/{vmselect,vmalert}: properly generate http redirects if `-http.pathPrefix` command-line flag is set Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2918	2022-08-02 13:01:13 +03:00
Aliaksandr Valialkin	9f1e558c58	all: rename -pushmetrics.extraLabels to -pushmetrics.extraLabel for the sake of consistency	2022-07-26 19:25:26 +03:00
Roman Khavronenko	da10962d4c	vmselect: cover special cases for vmalert's routing in single-node version (#2845 ) * vmselect: cover special cases for vmalert's routing in single-node version * remove trailing `/` from requests * redirect to vmalert's home page when `/vmalert` is requested. Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: fix review comments Signed-off-by: hagen1778 <roman@victoriametrics.com> * Update app/vmselect/main.go Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2022-07-25 09:43:44 +03:00
Aliaksandr Valialkin	5d62f5a324	app/vmalert/config: add missing docs for ValidateTplFn	2022-07-25 09:22:28 +03:00
Roman Khavronenko	970f36de17	vmalert: remove `notifier` dependency from `config` (#2906 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-07-25 09:22:28 +03:00
Aliaksandr Valialkin	c0c9f30870	lib/pushmetrics: properly handle errors when initializing pushmetrics	2022-07-22 13:38:25 +03:00
Roman Khavronenko	01755fac38	vmalert: remove dependency on datasource pkg from config (#2905 ) * vmalert: remove dependency on datasource pkg from config Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-07-22 13:38:25 +03:00
Aliaksandr Valialkin	b49fc2f9f3	app/vmalert/utils: add missing docs to WithHeaders func added at `70a822f3a0`	2022-07-21 20:49:38 +03:00
Roman Khavronenko	d0abdc2b5b	vmalert: allow configuring custom headers per group (#2901 ) See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2860 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-07-21 20:48:05 +03:00
Roman Khavronenko	356d8b99e0	vmalert: allow configuring custom headers for URLs (#2897 ) See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2860 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-07-21 20:45:12 +03:00
Aliaksandr Valialkin	fe68bb3ba7	all: follow-up after `46f803fa7a` Add -pushmetrics.* command-line flags to all the VictoriaMetrics apps	2022-07-21 20:18:25 +03:00
cui fliter	e0d30f6ec5	fix some typos (#2882 ) Signed-off-by: cui fliter <imcusg@gmail.com>	2022-07-18 12:10:40 +03:00
Roman Khavronenko	80de6face8	vmalert: drop support of deprecated `extra_filter_labels` param (#2870 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-07-14 16:10:46 +03:00
Aliaksandr Valialkin	70b9925bf7	app: fix `make publish-*` after `ed93330e66` Add missing `-linux` substring to built binary names for copying into Docker images	2022-07-14 11:01:34 +03:00
Aliaksandr Valialkin	da6c85a2f6	all: follow-up for `d99ba3481b`	2022-07-13 17:17:08 +03:00
Dmytro Kozlov	4e4def9df8	Rename release packages (#2810 ) * makefile: add os to each release file * makefile: update vmutils arm64 * makefile: update victoria-metrics release process * makefile: update publish with os * makefile: update publish with os * makefile: change tar library * update release logic * copy all releases * sort command by GOOS * rollback commands * rollback OSARCH * fix commands * cleanup * fix windows build * sort build by GOOS, update README.md	2022-07-13 17:11:01 +03:00
Aliaksandr Valialkin	eab8ebbe11	all: `make fmt` via the upcoming Go1.19	2022-07-11 19:23:25 +03:00
Aliaksandr Valialkin	64419aa97c	app/vmalert/utils/links.go: document Prefix function, which has been added in `b29fafa86b`	2022-07-08 13:25:23 +03:00
Roman Khavronenko	b29fafa86b	vmalert: deprecate alert's status link (#2840 ) * vmalert: deprecate alert's status link Deprecate alert's status link `/api/v1/<groupID>/<alertID>/status` in favour of `api/v1/alerts?group_id=<group_id>&alert_id=<alert_id>"`. The change was needed for simplifying logic in vmselect for proxying vmalert's requests. The old alert's status link will be still supported for a few versions but will be removed in the future. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2825 Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: fix review comments Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-07-08 13:05:11 +03:00
Roman Khavronenko	218dfe7956	docs: mention deduplication issue for HA vmalert topology (#2838 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-07-07 01:13:56 +03:00
Roman Khavronenko	56f4058fe3	vmalert: make UI and assets links relative (#2831 ) * make all links in vmalert relative, so links continue to work even if vmalert sits behind the proxy; * update vmalert's routing to always have component-unique path prefix, e.g. /vmalert; See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2825 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-07-06 12:47:53 +03:00
Roman Khavronenko	50c0eb4c4e	vmalert: make `__name__` available for templating in alerts (#2783 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-06-27 13:53:55 +03:00
Aliaksandr Valialkin	4b41a05ca7	app/vmalert: load static js and css from proper paths if `-http.pathPrefix` command-line flag is set This is a follow-up for `b104f67beb`	2022-06-27 13:12:57 +03:00
Roman Khavronenko	572db17857	vmalert: use absolute path for assets (#2784 ) Using relative path breaks assets loading on alert view page. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-06-27 00:47:36 +03:00
Aliaksandr Valialkin	3ae6300497	lib/promauth: add ability to send additional http headers in requests to scrape targets This solves https://stackoverflow.com/questions/66032498/prometheus-scrape-metric-with-custom-header	2022-06-22 20:40:50 +03:00
Roman Khavronenko	54f0f2d384	docs: follow-up for `197d3cdd74` (#2766 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-06-22 13:18:03 +03:00
云原生驿站	67e5833ced	docs: supplement vmalert downsampling docs (#2765 ) Co-authored-by: 吴典秋 <muti_kube@163.com>	2022-06-22 13:18:03 +03:00
Aliaksandr Valialkin	597bce4f55	docs: update docs after `e4d6b750f6` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2753	2022-06-21 14:01:25 +03:00
Roman Khavronenko	3ada676879	docs: reference links from key concepts (#2745 ) Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2022-06-19 23:14:30 +03:00
Aliaksandr Valialkin	fe9f59fcd6	all: replace `bash` with `console` blocks in all the *.md files This is a follow-up for `954a7a6fc6`	2022-06-19 23:02:02 +03:00
Roman Khavronenko	3e45e1ff63	Vmalert notifiers (#2744 ) * vmalert: remove head of line blocking for sending alerts This change makes sending alerts to notifiers concurrent instead of sequential. This eliminates head of line blocking, where first faulty notifier address prevents the rest of notifiers from receiving notifications. Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: make default timeout for sending alerts 10s Previous value of 1m was too high and was inconsistent with default timeout defined for notifiers via configuration file. Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: linter checks fix Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-06-19 22:49:10 +03:00
Roman Khavronenko	ba7ece02c4	docs: add multiple-remote-writes topology to vmalert (#2738 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-06-16 20:21:12 +03:00
Wataru Manji	0cd750fa5e	Add remote-write headers (#2701 ) Co-authored-by: Wataru Manji <wataru.manji@linecorp.com>	2022-06-13 10:07:19 +03:00
Roman Khavronenko	3c09d25039	vmalert: followup for `76f05f8670` (#2706 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-06-09 13:15:35 +03:00
Howie	4afd7aa695	feat: rule limit (#2676 ) vmalert: support `limit` param in groups definition `limit` param limits number of time series samples produced by a single rule during execution. On reaching the limit rule will return an err. Signed-off-by: lihaowei <haoweili35@gmail.com>	2022-06-09 13:15:33 +03:00
Aliaksandr Valialkin	c07f99cd4f	docs/CHANGELOG.md: document https://github.com/VictoriaMetrics/VictoriaMetrics/pull/2685	2022-06-07 15:39:53 +03:00
Wataru Manji	64f7095c3a	add Content-Encoding Header (#2685 ) Co-authored-by: Wataru Manji <wataru.manji@linecorp.com>	2022-06-07 15:34:25 +03:00
Aliaksandr Valialkin	68b6ddfb14	all: follow-up after `8edb390e21` - Remove unused js bloatware from /targets page. This strips down binary size by more than 100Kb - Add /service-discovery page for API compatibility with Prometheus - Properly load bootstrap.min.css from /prometheus/targets - Serve static contents for /targets page from app/vminsert instead of app/vmselect, because /targets page is served from there	2022-06-07 01:05:53 +03:00
Aliaksandr Valialkin	afced37c0b	all: add initial support for query tracing See https://docs.victoriametrics.com/Single-server-VictoriaMetrics.html#query-tracing Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1403	2022-06-01 02:31:44 +03:00
Aliaksandr Valialkin	e83b96366f	docs/CHANGELOG.md: follow-up after `11f91532c5` See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2594	2022-05-31 12:42:48 +03:00
Dmytro Kozlov	cd1fa2e4cd	issue-2594: use embedded for static files (#2650 ) embed static js and css files from CDN into vmalert, vmagent and vmsingle binaries. Co-authored-by: f41gh7 <nik@victoriametrics.com> https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2594	2022-05-31 12:42:48 +03:00
Howie	09fed19ba5	chore: remove duplicated code (#2657 ) Signed-off-by: lihaowei <haoweili35@gmail.com>	2022-05-30 12:27:47 +03:00
Howie	9c5f998438	fix: docs (#2658 ) Signed-off-by: lihaowei <haoweili35@gmail.com>	2022-05-30 12:27:19 +03:00
spectvtor	c5df5c9a95	fix alert relabeling (#2633 )	2022-05-25 15:05:10 +03:00
Roman Khavronenko	d597cdd06f	vmalert: mention how to build a custom image (#2626 ) Thanks to @f41gh7 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-05-23 10:55:42 +03:00
Roman Khavronenko	88c4c6f465	vmalert: add new metric `vmalert_iteration_interval_seconds` (#2623 ) The new metric shows the configured evaluation interval per group. Metric updates its value when group's interval is changed during hot reload. The new metric can be used to estimate how close group is to start missing evaluation rounds. The following query will show the % of used time by the group to evaluate all rules before the next round: ``` (max(vmalert_iteration_duration_seconds{quantile="0.99"}) / vmalert_iteration_interval_seconds) * 100 ``` https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2618 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-05-21 01:13:01 +03:00
Aliaksandr Valialkin	aba00b8cb2	docs: update the description for command-line flags according to recent changes	2022-05-20 15:11:37 +03:00
Roman Khavronenko	d814c83b21	vmalert: remove a line added for debug (#2611 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-05-20 14:08:57 +03:00
Roman Khavronenko	2aeb00f98f	vmalert: support `scalar` type in response (#2610 ) See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2607 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-05-20 14:08:19 +03:00
Roman Khavronenko	a07ddf9b65	vmalert: support strings in `humanize.*` templates (#2606 ) See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2569 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-05-20 14:05:53 +03:00
Yurii Kravets	40c614dc4e	Update vmalert.md (#2580 ) docs: update vmalert/README.md Co-authored-by: Roman Khavronenko <roman@victoriametrics.com>	2022-05-20 14:02:34 +03:00
Roman Khavronenko	87e4e76537	vmalert: support `/rules` path for Grafana's ngalert requests (#2593 ) Unexpectedly, Grafana makes an extra request to `/rules` handler in addition to `/api/v1/rules` calls in alerts UI. This happens only for Grafana versions older than 8.5.*. Apparently, this is related to support of other monitoring systems. Prometheus responds with `text/html` content for UI page `/rules` to such requests. Actually, returning just a blank page with SC=200 works as well. Returning actual response of `/api/v1/rules` results in error in Grafana since it expects a `yaml` (?) in response. So we add a placeholder to `vmalert`. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2583 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-05-20 13:53:55 +03:00
Roman Khavronenko	ee1f8bae2e	docs: fix liquid syntax errors (#2592 ) For liquid text processor double braces `{{` `}}` are special chars for templating. Since we use them in some of our docs with different purpose, we must escape them to avoid syntax errors from liquid. For escaping curly braces we use bult-in plugin which helps to enclose sections of text via `{% raw %}` and `{% endraw %}`. This approach prevents liquid syntax errors and makes render correct. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-05-20 13:42:47 +03:00
Roman Khavronenko	6e5ba1921d	vmalert: fix readme formatting (#2587 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-05-20 13:40:41 +03:00
Roman Khavronenko	0790ba5e26	vmalert: follow-up after `0ac1cdfff5` (#2586 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-05-20 12:25:13 +03:00
Andrii Chubatiuk	7789c47e41	added reusable templates support (#2532 ) Signed-off-by: Andrii Chubatiuk <andrew.chubatiuk@gmail.com>	2022-05-20 12:25:11 +03:00
Aliaksandr Valialkin	83ff4c411d	app/vmalert: apply `-remoteRead.disablePathAppend` to `-datasource.url` in the same way as for the `-remoteRead.url` This is a follow-up for `0e2486df56` The related pull requests: - https://github.com/VictoriaMetrics/VictoriaMetrics/pull/1536 - https://github.com/VictoriaMetrics/VictoriaMetrics/pull/1712	2022-05-13 16:59:16 +03:00
Roman Khavronenko	2ea625d5bf	vmalert: properly cleanup stale series tracker on rules update (#2577 ) Rules executor within group tracks series sent to remote write in order to mark them as stale if they had disappeared in next evaluation round. The executor uses rules ID as a key to identifies series which belong to rule. On config reload, executor remains active but the set of rules could change. Hence, we need to properly cleanup the tracker for rules which has been disappeared on config reload. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-05-13 16:59:16 +03:00
Dmytro Kozlov	1a8a24bcb3	vmctl: fix build for solaris os (#2555 ) * vmctl: fix build for solaris os * vmctl: updated dependency (using Syscall instead of Syscall6) * vmctl: updated dependency * vmctl: updated dependency	2022-05-11 14:30:45 +03:00
Roman Khavronenko	c9b5e6e8ca	Code check (#2558 ) * vmstorage: make gofmt happy Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: make linter happy Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-05-09 15:29:25 +03:00
Roman Khavronenko	ee93d003d3	Vmalert fix bugs in alerting evaluation (#2557 ) * vmalert: calculate time for firing alert based on the given timestamp Previously, current time was used for checking the `firing` threshold. This is not correct, since alerts are evaluated at specific timestamps. Hence, this specific timestamp supposed to be used in the calculation. Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: properly calculate evaluation timestamp for rules Timestamp for rules evaluation should be calculated after the artifical delay for groups start. Otherwise, evaluation timestamp can fall back too far in time. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-05-09 15:28:30 +03:00
Aliaksandr Valialkin	358fa99af2	app/vmalert: run `make quicktemplate-gen` from the root directory after the commit `f6dcfbcdd6`	2022-05-04 20:28:37 +03:00
Dmytro Kozlov	0aeefeb5f1	vmalert/tpl: fixed truncating alerts expression in table (#2494 ) vmalert: improve `/groups` UI visual The change also fixes truncated rules expressions in UI https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2484	2022-05-04 20:28:37 +03:00
Aliaksandr Valialkin	725cb64e81	app/vmalert: run `make quicktemplate-gen` from the repository root This is a follow-up after `b2294d1cf1`	2022-05-02 15:37:54 +03:00
Dmytro Kozlov	4764f6e522	vmalert: added disableProgressBar flag which disable progressbar (#2506 ) vmalert: added disableProgressBar flag which disable progressbar https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1761	2022-05-02 15:37:54 +03:00
Roman Khavronenko	d0d0be9031	vmalert: do not execute templates during validation (#2528 ) Function `ValidateTemplates`, used on the vmalert startup, is supposed to check whether used templates and functions in loaded rules are correct. The function was parsing and executing loaded templates. However, rules may contain functions which can't be executed without values (label values or query results), like `slice`. Because of this, validation for completely valid expression `{{ slice $labels.job 9 }}` will fail since `$labels.job` is empty during validation. This PR updates `ValidateTemplates` function to only parse templates without executing them. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2514 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-05-02 15:37:54 +03:00
Dmytro Kozlov	25e54d2b50	vmctl/vm: added datapoints collection bar (#2486 ) add progress bars to the VM importer The new progress bars supposed to display the processing speed per each VM importer worker. This info should help to identify if there is a bottleneck on the VM side during the import process, without waiting for its finish. The new progress bars can be disabled by passing `vm-disable-progress-bar` flag. Plotting multiple progress bars requires using experimental progress bar pool from github.com/cheggaaa/pb/v3. Switch to progress bar pool required changes in all import modes. The openTSDB mode wasn't changed due to its implementation, which implies individual progress bars per each series. Because of this, using the pool wasn't possible. Signed-off-by: dmitryk-dk <kozlovdmitriyy@gmail.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2022-05-02 10:58:06 +03:00
Aliaksandr Valialkin	7debf57ca6	lib/httpserver: clarify that `-tls` flag enables TLS for http requests to `-httpListenAddr`	2022-04-16 16:59:41 +03:00
Aliaksandr Valialkin	6bd032a6d3	docs: sync docs with the latest changes	2022-04-16 16:00:27 +03:00
Aliaksandr Valialkin	c50e48a74c	lib/promscrape: follow-up after `baa1c24b36`	2022-04-16 14:26:38 +03:00
Roman Khavronenko	56cd2b918a	vmalert: add DNS service discovery (#2465 ) https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2460 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-04-13 14:14:25 +03:00
Aliaksandr Valialkin	3c27bde77e	docs/CHANGELOG.md: document `ed364a42e3`	2022-04-11 12:12:07 +03:00
hagen1778	fa9601a0f1	vmalert: support relabeling for alert labels sent via notifier Before, relabeling for notifier configured via file was supported only for target labels discovered via SD. With this change, new config field `alert_relabel_configs` is introduced for applying relabeling to labels of sent alerts. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-04-11 12:12:04 +03:00
Roman Khavronenko	4de1b2b74a	vmalert: fix labels and annotations processing for alerts (#2403 ) To improve compatibility with Prometheus alerting the order of templates processing has changed. Before, vmalert did all labels processing beforehand. It meant all extra labels (such as `alertname`, `alertgroup` or rule labels) were available in templating. All collisions were resolved in favour of extra labels. In Prometheus, only labels from the received metric are available in templating, so no collisions are possible. This change makes vmalert's behaviour similar to Prometheus. For example, consider alerting rule which is triggered by time series with `alertname` label. In vmalert, this label would be overriden by alerting rule's name everywhere: for alert labels, for annotations, etc. In Prometheus, it would be overriden for alert's labels only, but in annotations the original label value would be available. See more details here https://github.com/prometheus/compliance/issues/80 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-04-07 15:24:06 +03:00
Roman Khavronenko	ce1629b70a	vmalert: add flag for disabling long-lived connections (keepalive) (#2395 ) The new flag `datasource.disableKeepAlive` allows disabling keepalive connections. This may be useful if there are multiple datasource replicas (e.g. vmselects) behind the HTTP balancer to avoid uneven load spread because of long-lived connections. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-04-04 13:08:06 +03:00
Roman Khavronenko	7aa9d0f5f6	vmalert: protect executor's field from concurrent access (#2387 ) Executor recently gain field for storing previously sent series. Since the same executor object can be used in multiple goroutines, the access to this field should be serialized. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-04-01 12:03:41 +03:00
Roman Khavronenko	ab10178c85	Vmalert compliance 2 (#2340 ) * vmalert: split alert's `Start` field into `ActiveAt` and `Start` The `ActiveAt` field identifies when alert becomes active for rules with `for > 0`. Previously, this value was stored in field `Start`. The field `Start` now identifies the moment alert became `FIRING`. The split is needed in order to distinguish these two moments in the API responses for alerts. Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: support specific moment of time for rules evaluation The Querier interface was extended to accept a new argument used as a timestamp at which evaluation should be made. It is needed to align rules execution time within the group. Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: mark disappeared series as stale Series generated by alerting rules, which were sent to remote write now will be marked as stale if they will disappear on the next evaluation. This would make ALERTS and ALERTS_FOR_TIME series more precise. Signed-off-by: hagen1778 <roman@victoriametrics.com> * wip Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: evaluate rules at fixed timestamp Before, time at which rules were evaluated was calculated right before rule execution. The change makes sure that timestamp is calculated only once per evalution round and all rules are using the same timestamp. It also updates the logic of resending of already resolved alert notification. Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: allow overridin `alertname` label value if it is present in response Previously, `alertname` was always equal to the Alerting Rule name. Now, its value can be overriden if series in response containt the different value for this label. The change is needed for improving compatibility with Prometheus. Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: align rules evaluation in time Now, evaluation timestamp for rules evaluates as if there was no delay in rules evaluation. It means, that rules will be evaluated at fixed timestamps+group_interval. This way provides more consistent evaluation results and improves compatibility with Prometheus, Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: add metric for missed iterations New metric `vmalert_iteration_missed_total` will show whether rules evaluation round was missed. Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: reduce delay before the initial rule evaluation in group Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: rollback alertname override According to the spec: ``` The alert name from the alerting rule (HighRequestLatency from the example above) MUST be added to the labels of the alert with the label name as alertname. It MUST override any existing alertname label. ``` https://github.com/prometheus/compliance/blob/main/alert_generator/specification.md#step-3 Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: throw err immediately on dedup detection ``` The execution of an alerting rule MUST error out immediately and MUST NOT send any alerts or add samples to samples receiver if there is more than one alert with the same labels ``` https://github.com/prometheus/compliance/blob/main/alert_generator/specification.md#step-4 Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: cleanup Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: use strings builder to reduce allocs Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-04-01 12:03:41 +03:00
Roman Khavronenko	d907e0d9f0	docs: fix typo in vmalert's API (#2380 ) The API handler was changed in 1.75 but docs still contain the old address. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2366 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-04-01 12:03:41 +03:00
Aliaksandr Valialkin	ba7cfd7b25	app: sync Markdown changes from `a8de1ab000`	2022-03-22 14:12:03 +02:00
Aliaksandr Valialkin	3b7aefade2	docs/vmalert.md: sync after `11ae1ae924`	2022-03-17 20:18:07 +02:00
Dmytro Kozlov	2f350c200a	Added resendDelay for alerts (#2296 ) * vmalert: add support of `resendDelay` flag for alerts Co-authored-by: dmitryk-dk <dmitry.kozlov@brightlocal.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2022-03-17 20:09:18 +02:00

... 2 3 4 5 6 ...

564 Commits