VictoriaMetrics

mirror of https://github.com/VictoriaMetrics/VictoriaMetrics.git synced 2024-12-30 07:40:06 +01:00

Author	SHA1	Message	Date
hagen1778	aaf9e3d526	dashboards/vmalert: add new panel `Missed evaluations` The new panel supposed to indicate alerting groups that miss their evaluations. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-31 10:35:19 +01:00
hagen1778	9866974a53	deployment/alerts: add `TooManyMissedIterations` alerting rule The new rule for vmalert supposed to detect groups that miss their evaulations due to slow queries. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-31 10:35:18 +01:00
hagen1778	8874b525b7	dashboards: fix `Errors rate to Alertmanager` filter The panel `Errors rate to Alertmanager` had `group` label filter applied to the expression, while the metric `vmalert_alerts_send_errors_total` doesn't have that label. This resulted into always empty results. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-31 10:16:45 +01:00
Roman Khavronenko	ca7457d906	docs: explain motivation behind having `-downsampling.period` on vmselect (#5205 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-30 19:03:36 +01:00
Roman Khavronenko	23369321f1	docs: mention information loss when downsampling gauges (#5204 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-30 15:29:06 +01:00
Hui Wang	abcb21aa5e	vmalert: fix alert firing state in replay mode (#5192 ) fix possible missing firing states for alerting rules in replay mode Before if one firing stage is bigger than single query request range, like rule with a big `for`, alerting rule won't able to be detected as firing. Co-authored-by: hagen1778 <roman@victoriametrics.com>	2023-10-30 13:54:18 +01:00
hagen1778	e964df8039	docs/troubleshooting: mention issue with un-ordered labels See https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5219#issuecomment-1773441711 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-30 13:53:14 +01:00
hagen1778	a64b37cf24	docs: rm mention of default values for security HTTP headers The headers, their corresponding flags are mentioned at https://docs.victoriametrics.com/#security Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-30 11:46:17 +01:00
Dima Lazerka	ad839aa492	lib/httpserver: add flags to specify HSTS / Frame-Options / CSP headers for httpserver (#5111 ) support `Strict-Transport-Security`, `Content-Security-Policy` and `X-Frame-Options` HTTP headers in all VictoriaMetrics components. The values for headers can be specified by users via the following flags: `-http.header.hsts`, `-http.header.csp` and `-http.header.frameOptions`. Co-authored-by: hagen1778 <roman@victoriametrics.com>	2023-10-30 11:33:38 +01:00
Roman Khavronenko	29cebd82fb	lib/storage: log warning about RO mode only on state change (#5191 ) Before, vmstorage would log the same message each second producing excessive amount of logs. See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5159 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-30 10:52:57 +01:00
Aliaksandr Valialkin	632d788b63	lib/promscrape/discovery/kubernetes: stop all the url watchers, which belong to a particular groupWatcher, at once Previously url watchers for pod, service and node objects could be mistakenly closed when service discovery was set up only for endpoints and endpointslice roles, since watchers for these roles may start start pod, service and node url watchers with nil apiWatcher passed to groupWatcher.startWatchersForRole(). Now all the url watchers, which belong to a particular groupWatcher, are stopped at once when this groupWatcher has no apiWatcher subscribers. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5216 The issue has been introduced in v1.93.5 when addressing https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4850	2023-10-27 13:51:35 +02:00
Hui Wang	7c90ce39cb	do not print redundant error logs when failed to scrape consul or no… (#5239 ) * do not print redundant error logs when failed to scrape consul or nomad target prometheus performs the same because it uses consul lib which just drops the error(`1806bcb38c/api/api.go (L1134)`)	2023-10-27 13:31:55 +08:00
Daria Karavaieva	b60bb1d98a	model list - isolation forest (#5235 ) * model list - isolation forest * curse of dimensionality * isol forest definition change, minor fixes * blank line fix	2023-10-26 12:25:54 +02:00
Aliaksandr Valialkin	d5a599badc	lib/promauth: follow-up for `e16d3f5639` - Make sure that invalid/missing TLS CA file or TLS client certificate files at vmagent startup don't prevent from processing the corresponding scrape targets after the file becomes correct, without the need to restart vmagent. Previously scrape targets with invalid TLS CA file or TLS client certificate files were permanently dropped after the first attempt to initialize them, and they didn't appear until the next vmagent reload or the next change in other places of the loaded scrape configs. - Make sure that TLS CA is properly re-loaded from file after it changes without the need to restart vmagent. Previously the old TLS CA was used until vmagent restart. - Properly handle errors during http request creation for the second attempt to send data to remote system at vmagent and vmalert. Previously failed request creation could result in nil pointer dereferencing, since the returned request is nil on error. - Add more context to the logged error during AWS sigv4 request signing before sending the data to -remoteWrite.url at vmagent. Previously it could miss details on the source of the request. - Do not create a new HTTP client per second when generating OAuth2 token needed to put in Authorization header of every http request issued by vmagent during service discovery or target scraping. Re-use the HTTP client instead until the corresponding scrape config changes. - Cache error at lib/promauth.Config.GetAuthHeader() in the same way as the auth header is cached, e.g. the error is cached for a second now. This should reduce load on CPU and OAuth2 server when auth header cannot be obtained because of temporary error. - Share tls.Config.GetClientCertificate function among multiple scrape targets with the same tls_config. Cache the loaded certificate and the error for one second. This should significantly reduce CPU load when scraping big number of targets with the same tls_config. - Allow loading TLS certificates from HTTP and HTTPs urls by specifying these urls at `tls_config->cert_file` and `tls_config->key_file`. - Improve test coverage at lib/promauth - Skip unreachable or invalid files specified at `scrape_config_files` during vmagent startup, since these files may become valid later. Previously vmagent was exitting in this case. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4959	2023-10-25 23:19:37 +02:00
Aliaksandr Valialkin	eed5206376	lib/promauth: properly parse string contents for ca, cert and key fields at tls_config Previously yaml parser wasn't accepting string values for these fields, because it was mistakenly expecting a list of uint8 values instead.	2023-10-25 23:12:21 +02:00
hagen1778	a216fe6728	app/vmalert: follow-up after `c9375cac5e` `c9375cac5e` Descriptions were updated in attempt to make it more clear for readers, re-phrasing and linking missing docs. `eval_delay` was added to tests to verify it can be unmarshalled. `eval_delay` is now applied before timestamp alignment to make it more predictable. Before, if delay < interval the timestamp won't be aligned. `eval_delay` and `eval_offset` was added to API output. `PreviouslySentSeriesToRW` converted to private `previouslySentSeriesToRW`. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-25 13:07:13 +02:00
Hui Wang	c9375cac5e	vmalert: add `-rule.evalDelay` flag and `eval_delay` as group attribute (#5185 ) Also mark `-datasource.lookback` as will be deprecated, see https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5155.	2023-10-25 11:54:18 +02:00
hagen1778	003ef3a518	deployment/alerts: make `TooHighMemoryUsage` more tolerable to spikes Using `min_over_time` should reduce the amount of false positives when component is running in near-the-threshold state. Now it should trigger only if all collected samples were above the threshold on 10m interval. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-24 09:39:46 +02:00
Alexander Marshalov	33484d3365	lib/streamaggr: respect `streamAgg.dropInput` with empty stream aggr config (#5213 ) https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5207	2023-10-20 15:55:58 +02:00
krakazyabra	239849db5d	docs/case-studies: update Wedos info (#5211 )	2023-10-19 22:38:40 +02:00
Aliaksandr Valialkin	2f21c0c119	docs: use https://github.com/VictoriaMetrics/VictoriaMetrics/releases/latest instead of https://github.com/VictoriaMetrics/VictoriaMetrics/releases link where needed The https://github.com/VictoriaMetrics/VictoriaMetrics/releases link may show non-latest releases at the top, such as LTS releases or VictoriaLogs releases. So it is better to use https://github.com/VictoriaMetrics/VictoriaMetrics/releases/latest link, which always redirect to the latest available release of VictoriaMetrics.	2023-10-18 20:06:25 +02:00
Roman Khavronenko	b8b6e120ff	app/vmselect: limit the number of parallel workers by 32 (#5195 ) * app/vmselect: limit the number of parallel workers by 32 The change should improve performance and memory usage during query processing on machines with big number of CPU cores. The number of parallel workers for query processing is controlled via `-search.maxWorkersPerQuery` command-line flag. By default, the number of workers is limited by the number of available CPU cores, but not more than 32. The limit can be increased via `-search.maxWorkersPerQuery`. Signed-off-by: hagen1778 <roman@victoriametrics.com> * wip - The `-search.maxWorkersPerQuery` command-line flag doesn't limit resource usage, so move it from the `resource usage limits` to `troubleshooting` chapter at docs/Single-server-VictoriaMetrics.md - Make more clear the description for the `-search.maxWorkersPerQuery` command-line flag - Add the description of `-search.maxWorkersPerQuery` to docs/Cluster-VictoriaMetrics.md - Limit the maximum value, which can be passed to `-search.maxWorkersPerQuery`, to GOMAXPROCS, because bigger values may worsen query performance and increase CPU usage - Improve the the description of the change at docs/CHANGELOG.md. Mark it as FEATURE instead of BUGFIX, since it is closer to a feature than to a bugfix. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5087 --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-10-18 19:51:37 +02:00
Aliaksandr Valialkin	38b8872a47	docs/Articles.md: add an article https://rtfm.co.ua/en/victoriametrics-vmauth-proxy-authentication-and-authorization/	2023-10-18 18:29:26 +02:00
Aliaksandr Valialkin	ad871dd9ed	docs/FAQ.md: add questions on why VictoriaMetrics doesnt rebalance data and doesnt restore replication factor between vmstorage nodes	2023-10-18 18:06:26 +02:00
Aliaksandr Valialkin	ea4758f5cd	docs/FAQ.md: refresh the answer to the question about how does VictoriaMetrics compare to competing solutions - Mention Grafana Mimir - Fix broken links	2023-10-18 09:13:57 +02:00
Github Actions	0a6932acfe	Automatic update operator docs from VictoriaMetrics/operator@2c826bb (#5188 )	2023-10-17 15:57:11 +02:00
hagen1778	fd2d07ba33	lib/storage: follow-up after `188cfe3a85` `188cfe3a85` See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5159 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-17 15:45:14 +02:00
Hui Wang	e16d3f5639	fix inconsistent behaviors with prometheus when scraping (#5153 ) * fix inconsistent behaviors with prometheus when scraping 1. address https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4959. skip job with wrong syntax in `scrape_configs` with error logs instead of exiting; 2. show error messages on vmagent /targets ui if there are wrong auth configs in `scrape_configs`, previously will print error logs and do scrape without auth header; 3. don't send requests if there are wrong auth configs in: 1. vmagent remoteWrite; 2. vmalert datasource/remoteRead/remoteWrite/notifier. * add changelogs * address review comments * fix ut	2023-10-17 17:58:19 +08:00
hagen1778	3e2f09541e	docs: mention key concepts as querying example See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5169 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-17 11:01:17 +02:00
hagen1778	c2d252c045	dashboards/vmalert: respect job and instance filters in `No data errors` Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-17 09:40:39 +02:00
hagen1778	edba9f6266	dashboards/vmalert: use `desc` sorting for tooltips on panels Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-17 09:31:09 +02:00
Aliaksandr Valialkin	f89051f83f	docs/Articles.md: add newly appeared articles about VictoriaMetrics at medium.com - https://sarthak-acoustic.medium.com/solving-metrics-at-scale-with-victoriametrics-ac9c306826c3 - https://medium.com/@seifeddinerajhi/victoriametrics-a-comprehensive-guide-comparing-it-to-prometheus-and-implementing-kubernetes-03eb8feb0cc2	2023-10-17 00:55:35 +02:00
Aliaksandr Valialkin	14f3d844fe	docs/CHANGELOG.md: document v1.93.6 LTS release See https://github.com/VictoriaMetrics/VictoriaMetrics/releases/tag/v1.93.6	2023-10-17 00:53:18 +02:00
Aliaksandr Valialkin	daaf2b0e61	docs/CHANGELOG.md: document v1.87.10 release See https://github.com/VictoriaMetrics/VictoriaMetrics/releases/tag/v1.87.10	2023-10-16 23:25:38 +02:00
Aliaksandr Valialkin	da77f4deeb	app/vmselect/promql: add labels_equal(q, "label1", "label2", ...) function This function returns q series, which have identical values for the listed labels "label1", "label2", ... See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5148	2023-10-16 21:50:11 +02:00
Aliaksandr Valialkin	484b5ed12f	docs/MetricsQL.md: typo fix after `bdb743c88d` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5071	2023-10-16 21:09:36 +02:00
Aliaksandr Valialkin	6c3dd16a16	app/vmagent/remotewrite: move sas var initialization closer to the place where it is used This makes the code sligthtly easier to understand. This is a follow-up for `1d3d989be5` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5170	2023-10-16 20:52:56 +02:00
Aliaksandr Valialkin	bdb743c88d	app/vmselect/promql: add drop_empty_series() function for dropping empty series before performing additional calculations This can be useful in the following queries: drop_empty_series(temperature <= 30) default 40 This query drops temperature series with all the values bigger than 30 on the selected time range, while replacing gaps in the remaining series with 40. The query without drop_empty_series: (temperature <= 30) default 40 would leave all the temperature series with all the values bigger than 30 on the selected time range, and replace all their values with 40. This is not what could be epxected in some cases like here - https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5071	2023-10-16 20:44:56 +02:00
hagen1778	1d3d989be5	app/vmagent/remotewrite: follow-up after `4f102ff945` `4f102ff945` Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-16 16:00:24 +02:00
Aliaksandr Valialkin	b8c267075e	lib/promscrape: add a link to https://docs.victoriametrics.com/vmagent.html#scraping-big-number-of-targets in descriptions for -promscrape.cluster.* command-line flags This should help users figuring out the purpose of -promscrape.cluster.* command-line flags	2023-10-16 14:46:22 +02:00
Aliaksandr Valialkin	6f98b9c221	Revert "docs victorialogs use relative links" This reverts commit `3d7a77bf82`. Reason for revert: relative links do not work properly at GitHub code and at GitHub wiki. For example, the following page contains broken links before reverting this commit: https://github.com/VictoriaMetrics/VictoriaMetrics/blob/master/docs/VictoriaLogs/CHANGELOG.md It is always better to use absolute links thank relative links, since the page contents can be copy-n-pasted to other pages, which are located in vastly different directories, and all the links will remain working.	2023-10-16 13:40:43 +02:00
Aliaksandr Valialkin	56b9c0b717	docs/CaseStudies.md: typ fix: vmgent -> vmagent This is a follow-up for `f5c46b8176`	2023-10-16 13:33:04 +02:00
Aliaksandr Valialkin	07150359b2	docs/vmbackup.md: clarify documentation about -deleteAllObjectVersions command-line flag Updates `2fc7e9f47e` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5121	2023-10-16 12:09:46 +02:00
Alexander Marshalov	b248413a07	fixed error when creating a full backup using the `-origin` flag (#5180 ) * fixed error when creating a full backup using the `-origin` flag (#5144) * Update docs/CHANGELOG.md --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-10-16 12:02:51 +02:00
Github Actions	7abdbbc8e2	Automatic update operator docs from VictoriaMetrics/operator@79298bf (#5177 )	2023-10-16 10:45:46 +08:00
Aliaksandr Valialkin	2c334ed953	app/{vmagent,vminsert}: follow-up for NewRelic data ingestion protocol support This is a follow-up for `f60c08a7bd` Changes: - Make sure all the urls related to NewRelic protocol start from /newrelic . Previously some urls were started from /api/v1/newrelic - Remove /api/v1 part from NewRelic urls, since it has no sense - Remove automatic transformation from CamelCase to snake_case for NewRelic labels and metric names, since it may complicate the transition from NewRelic to VictoriaMetrics. Preserve all the metric names and label names, so users could query metrics and labels by the same names which are used in NewRelic. The automatic transformation from CamelCase to snake_case can be added later as a special action for relabeling rules if needed. - Properly update per-tenant data ingestion stats at app/vmagent/newrelic/request_handler.go . Previously it was always zero. - Fix NewRelic urls in vmagent when multitenant data ingestion is enabled. Previously they were mistakenly started from `/`. - Document NewRelic data ingestion url at docs/Cluster-VictoriaMetrics.md - Remove superflouos memory allocations at lib/protoparser/newrelic - Improve tests at lib/protoparser/newrelic/* Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3520 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/4712	2023-10-16 00:25:25 +02:00
Aliaksandr Valialkin	ddbe713470	docs/Single-server-VictoriaMetrics.md: add a link to the original issue, which describes how to run VictoriaMetrics as Windows service This is a follow-up for `cc7d5b7bab` The original issue https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3781 contains up-to-date information on how to run VictoriaMetrics components as Windows service, plus it may contain additional information about this case such as https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3781#issuecomment-1708092680 , so it is better to refer this issue from the docs.	2023-10-15 19:33:39 +02:00
Roman Khavronenko	3594214a16	lib/vmselect: bump maxSearchQuerySize to 5MB (#5158 ) See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5154#issuecomment-1757216612 https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5154 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-15 19:24:38 +02:00
Artem Navoiev	8dff1c696f	docs fix broken links Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2023-10-14 15:39:49 +02:00
Artem Navoiev	a200aaf0ef	docs fix broken links in operator Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2023-10-14 15:17:17 +02:00

1 2 3 4 5 ...

3226 Commits