VictoriaMetrics

mirror of https://github.com/VictoriaMetrics/VictoriaMetrics.git synced 2024-12-25 11:50:13 +01:00

Author	SHA1	Message	Date
Aliaksandr Valialkin	b49b8020d6	docs: sync docs with the latest changes	2022-04-16 15:59:53 +03:00
Aliaksandr Valialkin	ebaa1c7ad5	lib/promscrape: follow-up after `baa1c24b36`	2022-04-16 14:25:54 +03:00
Roman Khavronenko	45fcaa33e8	vmalert: add DNS service discovery (#2465 ) https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2460 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-04-13 11:50:26 +03:00
Aliaksandr Valialkin	b89e846ce3	docs/CHANGELOG.md: document `ed364a42e3`	2022-04-11 12:11:32 +03:00
hagen1778	ed364a42e3	vmalert: support relabeling for alert labels sent via notifier Before, relabeling for notifier configured via file was supported only for target labels discovered via SD. With this change, new config field `alert_relabel_configs` is introduced for applying relabeling to labels of sent alerts. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-04-11 11:09:14 +03:00
Roman Khavronenko	2b59fff526	vmalert: fix labels and annotations processing for alerts (#2403 ) To improve compatibility with Prometheus alerting the order of templates processing has changed. Before, vmalert did all labels processing beforehand. It meant all extra labels (such as `alertname`, `alertgroup` or rule labels) were available in templating. All collisions were resolved in favour of extra labels. In Prometheus, only labels from the received metric are available in templating, so no collisions are possible. This change makes vmalert's behaviour similar to Prometheus. For example, consider alerting rule which is triggered by time series with `alertname` label. In vmalert, this label would be overriden by alerting rule's name everywhere: for alert labels, for annotations, etc. In Prometheus, it would be overriden for alert's labels only, but in annotations the original label value would be available. See more details here https://github.com/prometheus/compliance/issues/80 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-04-06 20:24:45 +02:00
Roman Khavronenko	70bb0d2708	vmalert: add flag for disabling long-lived connections (keepalive) (#2395 ) The new flag `datasource.disableKeepAlive` allows disabling keepalive connections. This may be useful if there are multiple datasource replicas (e.g. vmselects) behind the HTTP balancer to avoid uneven load spread because of long-lived connections. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-04-04 12:59:04 +03:00
Roman Khavronenko	1354e6d712	vmalert: protect executor's field from concurrent access (#2387 ) Executor recently gain field for storing previously sent series. Since the same executor object can be used in multiple goroutines, the access to this field should be serialized. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-03-30 12:37:27 +02:00
Roman Khavronenko	0989649ad0	Vmalert compliance 2 (#2340 ) * vmalert: split alert's `Start` field into `ActiveAt` and `Start` The `ActiveAt` field identifies when alert becomes active for rules with `for > 0`. Previously, this value was stored in field `Start`. The field `Start` now identifies the moment alert became `FIRING`. The split is needed in order to distinguish these two moments in the API responses for alerts. Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: support specific moment of time for rules evaluation The Querier interface was extended to accept a new argument used as a timestamp at which evaluation should be made. It is needed to align rules execution time within the group. Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: mark disappeared series as stale Series generated by alerting rules, which were sent to remote write now will be marked as stale if they will disappear on the next evaluation. This would make ALERTS and ALERTS_FOR_TIME series more precise. Signed-off-by: hagen1778 <roman@victoriametrics.com> * wip Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: evaluate rules at fixed timestamp Before, time at which rules were evaluated was calculated right before rule execution. The change makes sure that timestamp is calculated only once per evalution round and all rules are using the same timestamp. It also updates the logic of resending of already resolved alert notification. Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: allow overridin `alertname` label value if it is present in response Previously, `alertname` was always equal to the Alerting Rule name. Now, its value can be overriden if series in response containt the different value for this label. The change is needed for improving compatibility with Prometheus. Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: align rules evaluation in time Now, evaluation timestamp for rules evaluates as if there was no delay in rules evaluation. It means, that rules will be evaluated at fixed timestamps+group_interval. This way provides more consistent evaluation results and improves compatibility with Prometheus, Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: add metric for missed iterations New metric `vmalert_iteration_missed_total` will show whether rules evaluation round was missed. Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: reduce delay before the initial rule evaluation in group Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: rollback alertname override According to the spec: ``` The alert name from the alerting rule (HighRequestLatency from the example above) MUST be added to the labels of the alert with the label name as alertname. It MUST override any existing alertname label. ``` https://github.com/prometheus/compliance/blob/main/alert_generator/specification.md#step-3 Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: throw err immediately on dedup detection ``` The execution of an alerting rule MUST error out immediately and MUST NOT send any alerts or add samples to samples receiver if there is more than one alert with the same labels ``` https://github.com/prometheus/compliance/blob/main/alert_generator/specification.md#step-4 Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: cleanup Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: use strings builder to reduce allocs Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-03-29 15:09:07 +02:00
Roman Khavronenko	56de8f0356	docs: fix typo in vmalert's API (#2380 ) The API handler was changed in 1.75 but docs still contain the old address. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2366 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-03-28 12:07:02 +02:00
Aliaksandr Valialkin	c8f356a6a8	app: sync Markdown changes from `a8de1ab000`	2022-03-22 14:11:18 +02:00
Aliaksandr Valialkin	09c6c7350b	docs/vmalert.md: sync after `11ae1ae924`	2022-03-17 20:17:40 +02:00
Dmytro Kozlov	11ae1ae924	Added resendDelay for alerts (#2296 ) * vmalert: add support of `resendDelay` flag for alerts Co-authored-by: dmitryk-dk <dmitry.kozlov@brightlocal.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2022-03-16 15:26:33 +00:00
Roman Khavronenko	fb6eab03a2	Vmalert compliance improvements (#2320 ) * vmalert: add support for `sortByLabel` template function * vmalert: update API according to Prometheus conformance program The changes to the API, field names and URL path has been made according to the Prometheus specification for `alert_generator` https://github.com/prometheus/compliance/blob/main/alert_generator/specification.md * vmalert: fix the timestamp of the evaluated rules The timestamp used for alert's `EndsAt` was calculated before sending the notification. While the correct way is to use the timestamp taken right before rules evaluation. * vmalert: add `-datasource.queryTimeAlignment` flag The flag is supposed to provide ability to disable `time` param alignment when executing rules. By default, this flag is enabled, so it remains backward compatible. The flag was introduced to achieve better compatibility with Prometheus behaviour according to https://github.com/prometheus/compliance/blob/main/alert_generator/specification.md Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-03-15 11:54:53 +00:00
Roman Khavronenko	0fa7effc4b	docs: fix broken links (#2303 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-03-13 15:56:01 +02:00
Dmytro Kozlov	565bd08c43	Issue-1824: added flags and different auth types support (#2287 ) * vmalert/notifier: added flags and different auth types support Co-authored-by: hagen1778 <roman@victoriametrics.com>	2022-03-10 13:09:12 +02:00
Bastien Dronneau	8b21f40217	docs(vmalert): typo in path (#2278 )	2022-03-05 22:35:10 +02:00
Denys Holius	1685e181ae	Added minimal supported version of AlertManager (#2237 ) * added minimal supported version of supported AlertManager * docs: `make docs-sync` Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2022-02-22 20:07:04 +02:00
Roman Khavronenko	69d1893f4c	Consul SD - update services on the watcher's start (#2202 ) * lib/discovery/consul: update services on the watcher's start Previously, watcher's start was only initing goroutines for discovery but not waiting for the first iteration to end. It means first Consul discovery wasn't returning discovered targets until the next iteration. The change makes the watcher's start blocking until we get first discovery iteration done and all registries updated. Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: remove workarounds for consul SD Now when consul SD lib properly updates services on the first start, we don't need workarounds in vmalert. Signed-off-by: hagen1778 <roman@victoriametrics.com> * lib/discovery/consul: update after review Signed-off-by: hagen1778 <roman@victoriametrics.com> * wip Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2022-02-21 15:32:45 +02:00
Aliaksandr Valialkin	ee5da826e9	docs: update `-help` output for VictoriaMetrics components	2022-02-15 21:08:22 +02:00
hagen1778	2efa46a11c	vmalert: support `$externalLabels` and `$externalURL` in templates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2193 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-02-15 17:33:52 +03:00
Nikolay	75e84144c7	adds release build for macos darwin amd64 and arm64 (#2185 ) * adds release build for macos darwin amd64 and arm64 https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1896 https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1851 * wip Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2022-02-14 17:28:56 +02:00
Roman Khavronenko	e3adcbec6e	lib/promscrape: support prometheus-like duration in scrape configs (#2169 ) * lib/promscrape: support prometheus-like duration in scrape configs The change allows to specify duration values like `1d`, `1w` for fields `scrape_interval`, `scrape_timeout`, etc. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/817#issuecomment-1033384766 Signed-off-by: hagen1778 <roman@victoriametrics.com> * lib/blockcache: make linter happy Signed-off-by: hagen1778 <roman@victoriametrics.com> * lib/promscrape: support prometheus-like duration in scrape configs * add support for extra fields `scrape_align_interval` and `scrape_offset`; * support Prometheus duration parsing for `__scrape_interval__` and `__scrape_duration__` labels; Signed-off-by: hagen1778 <roman@victoriametrics.com> * wip * wip * docs/CHANGELOG.md: document the feature Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2022-02-11 16:17:00 +02:00
hagen1778	4e722c459b	vmalert: fix bug with relative links in UI https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2167 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-02-09 12:18:39 +03:00
Aliaksandr Valialkin	9bb60ab00f	lib/promscrape: set `-promscrape.config.strictParse` to true by default This allows detecting long-living silent errors in -promscrape.config	2022-02-08 15:41:43 +02:00
Aliaksandr Valialkin	ba1b3b8ef2	docs: cross-link downsampling docs from deduplication and vmalert docs	2022-02-04 11:57:05 +02:00
Aliaksandr Valialkin	6530bcedec	docs: updates after `5da71eb685` * Mention about the ability to configure vmalert notifiers via files in docs/CHANGELOG.md * Mention about the ability to use Consul service discovery for vmalert notifiers in docs/CHANGELOG.md * Run `make docs-sync` in order to sync app/vmalert/README.md to docs/vmalert.md	2022-02-02 23:26:18 +02:00
hagen1778	55e3bbd4cc	vmalert: add support of `-notifier.basicAuth.passwordFile` flag for notifiers https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1567 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-02-02 18:58:54 +03:00
hagen1778	f57982eddc	vmalert: remove trailing slash for static notifier addresses This would make addresses `http://localhost:9093` and `http://localhost:9093/` both to result into `http://localhost:9093/api/v2/alerts`. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2022-02-02 18:58:17 +03:00
Roman Khavronenko	5da71eb685	vmalert: support configuration file for notifiers (#2127 ) vmalert: support configuration file for notifiers * vmalert notifiers now can be configured via file see https://docs.victoriametrics.com/vmalert.html#notifier-configuration-file * add support of Consul service discovery for notifiers config see https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1947 * add UI section for currently loaded/discovered notifiers * deprecate `-rule.configCheckInterval` in favour of `-configCheckInterval` * add ability to suppress logs for duplicated targets for notifiers discovery * change behaviour of `vmalert_alerts_send_errors_total` - it now accounts for failed alerts, not HTTP calls.	2022-02-02 14:11:41 +02:00
Aliaksandr Valialkin	831b93a755	app/vmalert: add `parseDuration` function in the same way as Prometheus does See https://github.com/prometheus/prometheus/pull/8817	2022-01-13 23:30:41 +02:00
Aliaksandr Valialkin	80f966b80c	app/vmalert: add `stripPort` template function in the same way as Prometheus does See https://github.com/prometheus/prometheus/pull/10002	2022-01-13 22:53:42 +02:00
Andrey Afoninsky	77bfa8181d	chore: add vmalert_remotewrite_total metric (#2040 ) Co-authored-by: Andrey Afoninsky <andrey.afoninsky@booking.com>	2022-01-07 16:15:34 +02:00
Denys Holius	ae89b4e818	Old links replaced for newest (#2033 ) * replaced old links to the website * fixed deletion main README.md file * fix: added docs files after docs-sync	2022-01-05 16:30:13 +02:00
Roman Khavronenko	9bb7905d26	vmalert: check if remoteWrite is configured for replay mode (#1990 ) * vmalert: check if remoteWrite is configured for replay mode The purpose of `replay` mode is to backfill results of recording or alerting rules. So `remoteWrite.url` should be required. Otherwise, process can fail on attempt to send data. Signed-off-by: hagen1778 <roman@victoriametrics.com> * Update app/vmalert/main.go Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2021-12-21 20:25:47 +02:00
Roman Khavronenko	6814cc6809	vmalert: always convert `step` value to seconds for better compatibility (#1955 ) When using `vmalert` with older Prometheus versions, the passed `step=2m` may be parsed by Prometheus with an err: "cannot parse \"2m0s\" to a valid duration". In order to improve compatibility vmalert will always convert step duration to seconds. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1943 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2021-12-17 15:26:34 +02:00
Roman Khavronenko	2851709745	vmalert: update the order of service labels attaching (#1922 ) Service labels like `alertname` or `alertgroup` were attached after template expanding for `labels` section. Because of this, labels `alertname` or `alertgroup` weren't available for templating in `labels` section of alert's definition. This commit changes the order of labels attaching and adds a test for verifying these labels availability. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1921 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2021-12-10 12:10:26 +02:00
Aliaksandr Valialkin	896fa9bb7c	app/vmalert/config: sort extra_filter labels before passing them to query args in order to get consistent order of query args across runs This fixes TestGroupParams test - see https://github.com/VictoriaMetrics/VictoriaMetrics/runs/4432510244?check_suite_focus=true#step:5:288	2021-12-08 13:02:49 +02:00
Roman Khavronenko	0afd14a14a	vmalert: introduce additional HTTP URL params per-group configuration (#1892 ) * vmalert: introduce additional HTTP URL params per-group configuration The new group field `params` allows to configure custom HTTP URL params per each group. These params will be applied to every request before executing rule's expression. Hot config reload is also supported. Field `extra_filter_labels` was deprecated in favour of `params` field. vmalert will print deprecation log message if config file contains the deprecated field. `params` fields are supported by both Prometheus and Graphite datasource types. Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: provide more examples for `params` field Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmalert: set higher priority for `params` setting If there would be a conflict between URL params set in `datasource.url` flag and params in group definition the latter will have higher priority. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2021-12-02 14:45:08 +02:00
Roman Khavronenko	e5b451a66a	ci: bump go version to 1.17 (#1895 ) The bump was required for `vmalert` package. `vmalert` docs now also contain an updated description. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2021-12-02 14:42:25 +02:00
Roman Khavronenko	d052c8c81e	vmalert: adjust topologies docs in README (#1893 ) Commit changes images width and order in topologies section for better readability. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2021-12-02 11:27:46 +03:00
Roman Khavronenko	866b6a842b	Vmalert docs upd (#1890 ) * vmalert: add topology examples in docs * vmalert: docs typo fix	2021-12-01 18:33:06 +03:00
Roman Khavronenko	40fcf667b0	vmalert: continue to print errors for bad config during hot reload (#1871 ) Previously, vmalert would print an err message and set vmalert_config_last_reload_successful=0 only once during a hot reload of a bad config. Such behaviour may result into non noticed event of a bad config reload attempt Now, it continues to print error messages and keep vmalert_config_last_reload_successful state until successful attempt will be made or config state will be rolled back to prev state. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2021-11-30 01:23:49 +02:00
Aliaksandr Valialkin	10bd8b1d86	docs/CHANGELOG.md: document `852a895b70`	2021-11-30 01:22:37 +02:00
Roman Khavronenko	852a895b70	vmalert: make notifier.Addr optional (#1870 ) For a long time notifier.Addr flag was required. The assumption was that vmalert will be always used for alerting. However, practice shows that some users need only recording rules. In this case, requirement of notifier.Addr is ambigious. The change verifies if loaded config contains recording or alerting rules and if there are corresponding flags set. This is true for initial config load and hot reload. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2021-11-30 01:18:48 +02:00
Aliaksandr Valialkin	fc534a1e7f	app/vmalert/README.md: sync with docs/vmalert.md This is a follow-up after `d8c70903ec`	2021-11-17 00:56:04 +02:00
Aliaksandr Valialkin	e5ac9d8e57	all: consistently return `application/json` content-type without `charset=utf-8` The `application/json` content-type has utf-8 encoding by default. See https://stackoverflow.com/questions/9254891/what-does-content-type-application-json-charset-utf-8-really-mean Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/897	2021-11-09 18:04:44 +02:00
Aliaksandr Valialkin	5046efb94b	docs/vmalert.md: improve wording in `Multitenancy` chapter	2021-11-09 14:19:52 +02:00
Aliaksandr Valialkin	a67518fc6d	docs: mention that graphs on the official dashboards contain useful hints	2021-11-08 19:54:10 +02:00
Aliaksandr Valialkin	34b5414ba8	app/{vmalert,vmbackup}/README.md: sync with docs after the commit `47d1612bf8`	2021-11-05 20:45:38 +02:00
Aliaksandr Valialkin	237885e0d2	docs/vmalert.md: document the addition of -defaultTenant.prometheus and -defaultTenant.graphite command-line options to enterprise version of vmalert	2021-11-05 20:04:09 +02:00
Aliaksandr Valialkin	24dce03aaa	app/vmalert/datasource: use plain string literals instead of constants This removes the unneeded level of indirection and improves code readability. The "prometheus" and "graphite" constants aren't going to change in the future, so there is no sense in hiding them behind constants.	2021-11-05 19:57:47 +02:00
Aliaksandr Valialkin	bf814320b0	app/vmalert: remove `rule.type` config, since it doesnt play well with the upcoming default tenants for -clusterMode It is better from the consistency point of view to set up rule types at group level where tenant config is set up.	2021-11-05 19:52:32 +02:00
Aliaksandr Valialkin	cbfc7b7c92	app/{vminsert,vmagent}: hide passwords and auth tokens by default at `/config` page Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1764	2021-11-05 14:41:16 +02:00
Aliaksandr Valialkin	6608705652	app/{vmalert,vmagent}: improve the distribution of scrape offsets among targets / rules Previously only the lower part of 64-bit hash was used for calculating the offset. This may give uneven distribution in some cases. So let's use all the available 64 bits from the hash for calculating the offset.	2021-10-27 19:59:16 +03:00
Roman Khavronenko	3dbdf1632e	vmalert: allow groups with empty rules for compatibility reasons (#1742 ) Prometheus allows to have groups with no rules, so we should support it in vmalert as well for compatibility reasons. It is also allowed to hot-reload empty groups by adding or removing rules. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2021-10-25 12:15:02 +03:00
Roman Khavronenko	43a7984cd8	vmalert: correctly calculate alert ID including extra labels (#1734 ) Previously, ID for alert entity was generated without alertname or groupname. This led to collision, when multiple alerting rules within the same group producing same labelsets. E.g. expr: `sum(metric1) by (job) > 0` and expr: `sum(metric2) by (job) > 0` could result into same labelset `job: "job"`. The issue affects only UI and Web API parts of vmalert, because alert ID is used only for displaying and finding active alerts. It does not affect state restore procedure, since this label was added right before pushing to remote storage. The change now adds all extra labels right after receiving response from the datasource. And removes adding extra labels before pushing to remote storage. Additionally, change introduces a new flag `Restored` which will be displayed in UI for alerts which have been restored from remote storage on restart.	2021-10-22 12:30:38 +03:00
Aliaksandr Valialkin	8ad95f0db7	lib/httpserver: expose command-line flags at `/flags` page This should simplify debugging. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1695	2021-10-20 00:45:09 +03:00
Roman Khavronenko	bdfac4ff53	vmalert: make group.ID() thread-safe (#1726 ) Commit fixes potential race condition when group update and generating of ID() happens simultaneously. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2021-10-19 16:44:13 +03:00
Roman Khavronenko	dcd881bb7a	vmalert: properly init SIGHUP listener before starting group manager (#1725 ) Regression was introduced during code refactoring. It potentially could lead to situation when SIGHUP signals were ignored while vmalert was still busy with initing group manager. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2021-10-19 16:35:27 +03:00
Yury Molodov	a3e09a57c2	vmui: features (#1711 ) * feat: initial uPlot graph * feat: add zoom/pan for graph * fix: add zoom by ctrl/mac * fix: remove unused code * feat: add toggle cache for fetch * feat: add fix y-axis limits * fix: stop point events while panning * fix: change getting cursor position when scaling * feat: add cursor tooltip to graph * fix: uninstall chart.js * fix: change link for create an issue * fix: set default cache value to true * app/vmalert: follow-up after `0e2486df56` * docs/CHANGELOG.md: document `5416e18007` * app/vmui: `make vmui-update` Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2021-10-18 15:16:57 +03:00
Roman Khavronenko	146a5b504c	vmalert: remove extra `/` from path in WEB interface (#1717 ) The extra `/` may cause issues when additional path prefixes are configured. Also, removing it makes it consistent with the rest of declarations. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2021-10-18 15:12:47 +03:00
Alexander Rickardsson	0e2486df56	vmalert: add disablePathAppend to remote read (#1712 ) * vmalert: add disablePathAppend to remoteRead * docs: add docs for remoteRead.disablePathAppend	2021-10-18 10:24:52 +03:00
Alexander Rickardsson	c0e58ade45	vmalert: Redact passwords from error messages (#1713 )	2021-10-18 10:20:26 +03:00
Roman Khavronenko	7fcbd3fa4b	Adjust `http.Transport.MaxIdleConns` setting for vmauth/vmalert services (#1704 ) * vmalert: adjust `http.Transport.MaxIdleConns` value accordingly to `http.Transport.MaxIdleConnsPerHost` `http.Transport.MaxIdleConnsPerHost` setting is controlled by `datasource.maxIdleConnections` flag, while `http.Transport.MaxIdleConns` is inherited from DefaultTransport and is equal to `100`. The fix adjusts `http.Transport.MaxIdleConns` value if it is lower than `http.Transport.MaxIdleConnsPerHost`. Signed-off-by: hagen1778 <roman@victoriametrics.com> * vmauth: adjust `http.Transport.MaxIdleConns` value accordingly to `http.Transport.MaxIdleConnsPerHost` `http.Transport.MaxIdleConnsPerHost` setting is controlled by `maxIdleConnsPerBackend` flag, while `http.Transport.MaxIdleConns` is inherited from DefaultTransport and is equal to `100`. The fix adjusts `http.Transport.MaxIdleConns` value if it is lower than `http.Transport.MaxIdleConnsPerHost`. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2021-10-13 17:29:28 +03:00
Roman Khavronenko	8df3c569c7	vmalert: add Source link to alerts UI (#1701 ) The source link is controlled by `external.url` and `external.alert.source` flags, in the same way as for alertmanager notifications. The source link is added to Alerts list view, and specific Alert view.	2021-10-13 15:25:11 +03:00
Roman Khavronenko	0e35fc9538	app/vmalert: remove unnecessary `omitempty` tag for `interval` param (#1649 ) `omitempty` tag resulted into skipping this param on marshaling, which was used as a checksum for groups configuration. Since on config reload checksums are compared before applying changes, any change to `interval` only didn't trigger config reload. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1641 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2021-09-23 17:55:59 +03:00
Roman Khavronenko	ac1abe2faf	app/vmalert: support `http.pathPrefix` flag in UI (#1636 ) The change makes UI to respect `http.pathPrefix` flag for API or navigation items links.	2021-09-21 14:41:01 +03:00
Roman Khavronenko	b75455c650	vmalert: add new metric `vmalert_remotewrite_flush_duration_seconds` (#1622 )	2021-09-16 14:00:16 +03:00
Roman Khavronenko	ecd3069b6c	vmalert: create basic auth config only if args aren't empty (#1618 ) * vmalert: create basic auth config only if args aren't empty follow-up after `68721f6` * vmalert: make lint happy	2021-09-15 01:53:31 +03:00
Aliaksandr Valialkin	3e1683756b	docs/vmalert.md: follow-up after `68721f6e7d` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1608	2021-09-14 14:47:47 +03:00
Roman Khavronenko	68721f6e7d	vmalert: support bearer token for datasource, remotewrite and remoteread (#1614 ) https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1608	2021-09-14 14:32:06 +03:00
Aliaksandr Valialkin	c4f11a49f8	docs/CHANGELOG.md: document `5494bc02a6`	2021-09-13 17:11:23 +03:00
Roman Khavronenko	5494bc02a6	vmalert: add flag to limit the max value for auto-resovle duration for alerts (#1609 ) * vmalert: add flag to limit the max value for auto-resovle duration for alerts The new flag `rule.maxResolveDuration` suppose to limit max value for alert.End param, which is used by notifiers like Alertmanager for alerts auto resolve. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1586	2021-09-13 15:48:18 +03:00
Roman Khavronenko	75f35c3b11	vmalert: display extra filter labels in UI (#1613 )	2021-09-13 14:11:38 +03:00
Aliaksandr Valialkin	cfed015bb6	docs/vmalert.md: typo fix in `Multitenancy` chapter	2021-09-10 17:57:14 +03:00
Aliaksandr Valialkin	e84fa9eb38	app/vmalert: document GroupAlerts This makes golint happy	2021-09-07 22:50:08 +03:00
Aliaksandr Valialkin	e6c9869d86	app/vmalert: follow-up after `21f022e5f0`	2021-09-07 22:43:37 +03:00
Roman Khavronenko	21f022e5f0	vmalert: add initial UI implementation (#1602 ) New UI pages: / - welcome page with API handlers list; /groups - list of all rules per group; /alerts - list of all active alerts; /groupID/alertID/status - status of the active alert;	2021-09-07 22:39:22 +03:00
Roman Khavronenko	cfb6436be5	Vmalert extra params (#1587 ) * vmalert: allow extra GET params in datasource package ExtraParams will be added as GET params to every HTTP request made by datasource. The `roundDigits` param, for example, was substituted by corresponding extra param. * vmalert: add nocache=1 param for replay process The `nocache=1` param is VictoriaMetrics specific parameter which prevents it from caching and boundaries aligning for queries. We set it to avoid cache pollution in `replay` mode and also to avoid unnecessary time range boundaries alignment. * vmalert: mention nocache=1 in replay description * vmalert: fix bug with unused param	2021-08-31 14:57:47 +03:00
Nikolay	7c70dcbe3b	adds external_labels per group for vmalert (#1485 ) * adds external_label per group for vmalert https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1471	2021-08-31 14:52:34 +03:00
Roman Khavronenko	eff940aa76	Vmalert metrics update (#1580 ) * vmalert: remove `vmalert_execution_duration_seconds` metric The summary for `vmalert_execution_duration_seconds` metric gives no additional value comparing to `vmalert_iteration_duration_seconds` metric. * vmalert: update config reload success metric properly Previously, if there was unsuccessfull attempt to reload config and then rollback to previous version - the metric remained set to 0. * vmalert: add Grafana dashboard to overview application metrics * docker: include vmalert target into list for scraping * vmalert: extend notifier metrics with addr label The change adds an `addr` label to metrics for alerts_sent and alerts_send_errors to identify which exact address is having issues. The according change was made to vmalert dashboard. * vmalert: update documentation and docker environment for vmalert's dashboard Mention Grafana's dashboard in vmalert's README in a new section #Monitoring. Update docker-compose env to automatically add vmalert's dashboard. Update docker-compose README with additional info about services.	2021-08-31 12:28:02 +03:00
Aliaksandr Valialkin	2288e75f03	docs/vmalert.md: run `make docs-sync` after `9ee3d0378f`	2021-08-21 20:24:56 +03:00
Roman Khavronenko	9ee3d0378f	vmalert: add flag `disableAlertgroupLabel` for disabling extra label added to series (#1534 ) The new label added in https://github.com/VictoriaMetrics/VictoriaMetrics/issues/611 may negatively impact deduplication in Alertmanager. The new flag supposed to give an option to disable adding this label. To enable flag just add `-disableAlertgroupLabel` to binary execution command. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1532	2021-08-21 20:08:55 +03:00
Alexander Rickardsson	f4cecaf296	vmalert: accept http.StatusOK for remotewrite (#1550 )	2021-08-20 11:58:32 +03:00
Aliaksandr Valialkin	90434ba25b	app/vmalert: mention -remoteWrite.disablePathAppend in the description for -remoteWrite.url	2021-08-16 15:22:47 +03:00
Aliaksandr Valialkin	f37b963619	app/vmalert: follow-up for `2400f85761`	2021-08-16 15:20:22 +03:00
Alexander Rickardsson	2400f85761	vmalert: enable configuring explicit path (#1536 ) * vmalert: allow to disable automatically added path to remote write address via disablePathAppend flag * docs: update docs to include remoteWrite.disablePathAppend	2021-08-16 14:20:57 +03:00
Aliaksandr Valialkin	d375d9b878	lib/envflag: add a link to docs for -envflag.enable	2021-08-11 10:29:33 +03:00
Roman Khavronenko	7416fdaa8b	vmalert: expose new metrics for tracking number of produced samples during last evaluation (#1518 ) * vmalert: expose new metrics for tracking number of produced samples during last evaluation Two new metrics were added to track the number of samples produced during the last evaluation: * vmalert_recording_rules_last_evaluation_samples * vmalert_alerting_rules_last_evaluation_samples The gauge type is used to remain consistent with Prometheus metric `prometheus_rule_group_last_evaluation_samples` which is on the group level. However, the counter type was considered as well. Two metrics instead of one are used to make it easier to separate recording and alerting rules. It is likely, number of samples produced by recording rules is more important so people will refer to it more frequently. The expected usage of the new metric is the following: ``` - alert: RecordingRuleReturnsEmptyResults expr: sum(vmalert_recording_rules_last_evaluation_samples) by(recording) < 1 annotations: summary: Recording rule {{$labels.recording}} returns empty results. Please verify expression correctness. ``` Addresses https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1494 * vmalert: rename `vmalert_alerts_error` to `vmalert_alerting_rules_error` to remain consistent with recording rules metrics	2021-08-05 09:59:46 +03:00
Qifei Wan	fa9c5c5940	app/vmalert: update config state metrics if config parsed failed (#1507 )	2021-08-03 12:55:29 +03:00
assassins	a483044557	Performance optimization (#1481 ) There are redundant steps	2021-07-28 19:26:20 +03:00
Aliaksandr Valialkin	bfba4c28a4	app/vmalert: accept Prometheus-like durations in `interval` config option inside `group` section	2021-07-12 12:35:17 +03:00
Aliaksandr Valialkin	c5f0b454f0	app/vmselect: follow-up after `aa11ef6d3b` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1413	2021-07-07 17:43:35 +03:00
Aliaksandr Valialkin	766edbc421	lib/httpserver: print full requestURI in httpserver.Errorf This should simplify debugging.	2021-07-07 13:09:40 +03:00
Roman Khavronenko	6d5a8c28cd	Vmalert docs (#1372 ) * vmalert: mention what happens if `for` is set to 0 or omitted * vmalert: add more context to docs	2021-06-11 13:25:53 +03:00
Roman Khavronenko	7adfe878e1	vmalert: fix mistake with object reuse while parsing response (#1370 ) * vmalert: fix mistake with object reuse while parsing response During the refactoring, the wrong optimisations was applied in parse function which caused metric fields reset. The change removes optimisation. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1369 * vmalert: add test to cover multiple metrics in one response	2021-06-11 11:22:05 +03:00
Aliaksandr Valialkin	ab15bf8c90	docs: document rules replay feature for vmalert Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/836 This is a follow-up for `2a259ef5e7`	2021-06-09 12:27:34 +03:00
Roman Khavronenko	2a259ef5e7	vmalert: support rules backfilling (aka `replay`) (#1358 ) * vmalert: support rules backfilling (aka `replay`) vmalert can `replay` configured rules in the past and backfill results via remote write protocol. It supports MetricsQL/PromQL storage as data source, and can backfill data to remote write compatible storage. Supports recording and alerting rules `replay`. See more details in README. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/836 * vmalert: review fixes * vmalert: readme fixes	2021-06-09 12:20:38 +03:00
Roman Khavronenko	d210958fd0	vmalert: automatically reload configuration on file change (#1326 ) New flag `-rule.configCheckInterval` defines how often `vmalert` will re-read config file. If it detects any changes, the config will be reloaded. This behaviour is turned off by default. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/512	2021-05-25 14:27:22 +01:00

1 2 3 4 5 ...

296 Commits