VictoriaMetrics

mirror of https://github.com/VictoriaMetrics/VictoriaMetrics.git synced 2024-12-21 07:56:26 +01:00

Author	SHA1	Message	Date
Aliaksandr Valialkin	bc7cf4950b	lib/promscrape: use the standard net/http.Client instead of fasthttp.Client for scraping targets in non-streaming mode While fasthttp.Client uses less CPU and RAM when scraping targets with small responses (up to 10K metrics), it doesn't work well when scraping targets with big responses such as kube-state-metrics. In this case it could use big amounts of additional memory compared to net/http.Client, since fasthttp.Client reads the full response in memory and then tries re-using the large buffer for further scrapes. Additionally, fasthttp.Client-based scraping had various issues with proxying, redirects and scrape timeouts like the following ones: - https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1945 - https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5425 - https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2794 - https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1017 This should help reduce memory usage for the case when the target returns a big response and this response is scraped by fasthttp.Client at first before switching to stream parsing mode for subsequent scrapes. Now the switch to stream parsing mode is performed on the first scrape after reading the response body in memory and noticing that its size exceeds the value passed to -promscrape.minResponseSizeForStreamParse command-line flag. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5567 Overrides https://github.com/VictoriaMetrics/VictoriaMetrics/pull/4931	2024-01-30 18:39:10 +02:00
Artem Navoiev	a20c289228	docs: add alias for keyconcepts Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-01-30 17:05:58 +01:00
Aliaksandr Valialkin	e7844f2efd	docs/keyConcepts.md: clarify the information about which data is returned by instant and range queries Do not use the `raw samples` term there, since it adds more confusion than clarity: `raw samples` refers to real samples stored in the database, while neither range nor instant queries return raw samples - they both return calculated samples at the given timestamps. This is a follow-up for `b5978ed8f9` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5710 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5708	2024-01-29 18:19:46 +01:00
Fred Navruzov	b2434ec340	- fix link/version of helm chart in update request (#5716 )	2024-01-29 18:55:07 +02:00
Aliaksandr Valialkin	5d66ee88bd	lib/storage: do not check the limit for -search.maxUniqueTimeseries when performing /api/v1/labels and /api/v1/label/.../values requests This limit makes little sense for these APIs, since: - These APIs frequently result in scanning of all the time series on the given time range. For example, if extra_filters={datacenter="some_dc"} . - Users expect these APIs shouldn't hit the -search.maxUniqueTimeseries limit, which is intended for limiting resource usage at /api/v1/query and /api/v1/query_range requests. Also limit the concurrency for /api/v1/labels, /api/v1/label/.../values and /api/v1/series requests in order to limit the maximum memory usage and CPU usage for these APIs. This limit shouldn't affect typical use cases for these APIs: - Grafana dashboard load when dashboard labels should be loaded - Auto-suggestion list load when editing the query in Grafana or vmui Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5055	2024-01-29 16:45:12 +01:00
Artem Navoiev	b9b18b5fd8	docs: add backward compatible redirect for url examples page Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-01-29 16:01:32 +01:00
Fred Navruzov	b4aef0c141	- update versions to 1.9.2 (#5714 ) - update guide asset urls to flat	2024-01-29 15:47:27 +02:00
hagen1778	b5978ed8f9	docs: specify results of Instant and Range queries Mention explicitly what are value and timestamp field in returned results from Instant and Range queries. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5710 https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5708 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-01-29 14:00:14 +01:00
Fred Navruzov	9b555a0034	update guide and changelog to 1.9.1 (#5706 )	2024-01-28 09:43:28 +02:00
hagen1778	6c6c2c185f	docs: follow-up after `491287ed15` * port un-synced changes from docs/readme to readme * consistently use `sh` instead of `console` highlight, as it looks like a more appropriate syntax highlight * consistently use `sh` instead of `bash`, as it is shorter * consistently use `yaml` instead of `yml` See syntax codes here https://gohugo.io/content-management/syntax-highlighting/ Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-01-27 19:29:11 +01:00
hagen1778	c20d68e28d	docs: follow-up after `491287ed15` `491287ed15` Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-01-27 19:11:38 +01:00
Artem Navoiev	491287ed15	docs: remove width from images, remove <p>, remove <div> (#5705 ) * docs: remove width from images, remove <p>, remove <div> Signed-off-by: Artem Navoiev <tenmozes@gmail.com> * docs: remove <div> clarify language in code blocks Signed-off-by: Artem Navoiev <tenmozes@gmail.com> --------- Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-01-27 10:08:07 -08:00
Daria Karavaieva	4a9f8f4cb0	version 1.9.1 update, dashboard viz flag (#5704 )	2024-01-27 14:16:02 +01:00
Roman Khavronenko	aaa526e8ff	lib/streamaggr: skip unfinished aggregation state on shutdown by default (#5689 ) Sending unfinished aggregate states tend to produce unexpected anomalies with lower values than expected. The old behavior can be restored by specifying `flush_on_shutdown: true` setting in streaming aggregation config Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-01-26 22:45:23 +01:00
Roman Khavronenko	df59ac7f0e	app/vmalert: fix data race during hot-config reload (#5698 ) * app/vmalert: fix data race during hot-config reload During hot-reload, the logic invokes the group update and rules evaluation interruption simultaneously, falsely assuming that the interruption happens before the update. However, it could happen that the group is updated first and only afterwards the rules evaluation is cancelled, which results in permanent interruption for all rules within the group. The fix caches the cancel context function into a local variable first and only then performs the group update. With the cached cancel function we can safely call it without worrying that we cancel the evaluation for an already updated group. Signed-off-by: hagen1778 <roman@victoriametrics.com> * Revert "app/vmalert: fix data race during hot-config reload" This reverts commit `a4bb7e8932`. * app/vmalert: fix data race during hot-config reload During hot-reload, the logic invokes the group update and rules evaluation interruption simultaneously, falsely assuming that the interruption happens before the update. However, it could happen that the group is updated first and only afterwards the rules evaluation is cancelled, which results in permanent interruption for all rules within the group. The fix cancels the evaluation context before applying the update, making sure that the context is always cancelled for the old group. Signed-off-by: hagen1778 <roman@victoriametrics.com> * wip Signed-off-by: hagen1778 <roman@victoriametrics.com> --------- Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-01-26 22:42:21 +01:00
Yury Molodov	a7b11eff7c	vmui: fix `Enter` key in query field (#5667 ) (#5681 )	2024-01-26 22:38:32 +01:00
Aliaksandr Valialkin	3bce55be0c	docs: update -help output after `bb7a419cc3`	2024-01-26 22:28:40 +01:00
Aliaksandr Valialkin	bb7a419cc3	lib/{mergeset,storage}: make background merge more responsive and scalable - Maintain a separate worker pool per each part type (in-memory, file, big and small). Previously a shared pool was used for merging all the part types. A single merge worker could merge parts with mixed types at once. For example, it could merge simultaneously an in-memory part plus a big file part. Such a merge could take hours for a big file part. During this merge the in-memory part was pinned in memory and couldn't be persisted to disk under the configured -inmemoryDataFlushInterval . Another common issue, which could happen when parts with mixed types are merged, is uncontrolled growth of in-memory parts or small parts when all the merge workers were busy with merging big files. Such growth could lead to significant performance degradation for queries, since every query needs to check an ever-growing list of parts. This could also slow down the registration of new time series, since VictoriaMetrics searches for the internal series_id in the indexdb for every new time series. The third issue is graceful shutdown duration, which could be very long when a background merge is running on in-memory parts plus big file parts. This merge couldn't be interrupted, since it merges in-memory parts. A separate pool of merge workers per every part type elegantly resolves these issues: - In-memory parts are merged to file-based parts in a timely manner, since the maximum size of in-memory parts is limited. - Long-running merges for big parts do not block merges for in-memory parts and small parts. - Graceful shutdown duration is now limited by the time needed for flushing in-memory parts to files. Merging for file parts is instantly canceled on graceful shutdown now. - Deprecate -smallMergeConcurrency command-line flag, since the new background merge algorithm should automatically self-tune according to the number of available CPU cores.
- Deprecate -finalMergeDelay command-line flag, since it wasn't working correctly. It is better to run forced merge when needed - https://docs.victoriametrics.com/#forced-merge - Tune the number of shards for pending rows and items before the data goes to in-memory parts and becomes visible for search. This improves the maximum data ingestion rate and the maximum rate for registration of new time series. This should reduce the duration of data ingestion slowdown in VictoriaMetrics cluster on e.g. re-routing events, when some of vmstorage nodes become temporarily unavailable. - Prevent a possible "sync: WaitGroup misuse" panic on graceful shutdown. This is a follow-up for `fa566c68a6` . Thanks to @misutoth for the inspiration at https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5212 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5190 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3790 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3551 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3337 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3425 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3647 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3641 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/648 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/291	2024-01-26 22:27:47 +01:00
Artem Navoiev	97937d58c4	docs: remove <p> for images (#5702 ) Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-01-26 13:06:48 -08:00
Artem Navoiev	3e0a117ddf	remove all <div> as they are obsolete and can break markdown (#5701 ) Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-01-26 12:52:21 -08:00
Artem Navoiev	7de19c3748	docs: delete docs/provision_datasources.png as we support webp Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-01-26 21:25:39 +01:00
Github Actions	5a8daa725e	Automatic update Grafana datasource docs from VictoriaMetrics/grafana-datasource@ef5cfe6 (#5700 )	2024-01-26 12:22:17 -08:00
Aliaksandr Valialkin	4ac7e3a355	docs/Makefile: mention that the Makefile rules must be run from VictoriaMetrics repository root	2024-01-26 21:01:40 +01:00
Artem Navoiev	60fc2da6c1	docs: fix key concepts image and links Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-01-26 20:30:29 +01:00
Artem Navoiev	25165656bb	docs: change [image] to img since we support it in release guide Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-01-26 19:01:20 +01:00
Artem Navoiev	41e99765cc	docs: remove vmanomaly since we have a dedicated section with already existing redirects Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-01-26 18:38:01 +01:00
Artem Navoiev	bc033a2b30	docs: vmanomaly fix images Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-01-26 17:59:37 +01:00
Daria Karavaieva	105cb44884	Vmanomaly Guide dashboard provisioning (#5679 ) * dashboard provisioning * delete dashboard filter, new query * dashboard screens, guide fixes	2024-01-26 17:12:58 +01:00
Artem Navoiev	9ded04e643	docs: remove raw and endraw tags as they are not needed for the new v… (#5696 ) * docs: remove raw and endraw tags as they are not needed for the new version of site Signed-off-by: Artem Navoiev <tenmozes@gmail.com> * revert formatting in vmalert Signed-off-by: Artem Navoiev <tenmozes@gmail.com> --------- Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-01-26 07:30:45 -08:00
Github Actions	fae801edd3	Automatic update operator docs from VictoriaMetrics/operator@0628def (#5694 )	2024-01-26 10:21:52 +01:00
Github Actions	2582b1e15a	Automatic update Grafana datasource docs from VictoriaMetrics/grafana-datasource@c644bec (#5691 )	2024-01-26 11:44:02 +04:00
Roman Khavronenko	b11f4ef5ea	app/vmalert: autogenerate `ALERTS_FOR_STATE` time series for alerting rules with `for: 0` (#5680 ) * app/vmalert: autogenerate `ALERTS_FOR_STATE` time series for alerting rules with `for: 0` Previously, `ALERTS_FOR_STATE` was generated only for alerts with `for > 0`. This behavior differs from Prometheus behavior - it generates ALERTS_FOR_STATE time series for alerting rules with `for: 0` as well. Such time series can be useful for tracking the moment when alerting rule became active. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5648 https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3056 Signed-off-by: hagen1778 <roman@victoriametrics.com> * app/vmalert: support ALERTS_FOR_STATE in `replay` mode Signed-off-by: hagen1778 <roman@victoriametrics.com> --------- Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-01-25 15:42:57 +01:00
Github Actions	a95246d885	Automatic update operator docs from VictoriaMetrics/operator@e75a096 (#5690 )	2024-01-25 15:33:01 +01:00
Github Actions	e29fe0933b	Automatic update operator docs from VictoriaMetrics/operator@f6b9c08 (#5676 )	2024-01-25 15:10:23 +01:00
hagen1778	56310ffb47	docs: fix the issue link Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-01-25 13:35:41 +01:00
Alexander Marshalov	806c07ddd5	vmsingle/vmselect returns http status 429 (TooManyRequests) instead of 503 (ServiceUnavailable) when max concurrent requests limit is reached. (#5682 )	2024-01-24 17:55:06 +01:00
Aliaksandr Valialkin	ef12598ad4	lib/promscrape/discovery/kubernetes: do not generate targets for already terminated pods and containers Already terminated pods and containers cannot be scraped and will never resurrect, so there is zero sense in creating scrape targets for them.	2024-01-24 14:57:53 +02:00
Aliaksandr Valialkin	4d961c70f7	app/{vmselect,vmstorage}: restore compression of the data passed from vmstorage to vmselect This reverts `cd4f641d32` , since it turned out that disabling compression for vmstorage->vmselect data increases network bandwidth usage by more than 10x on typical production workloads, while it decreases CPU usage at vmstorage by up to 10% and improves query latency by up to 10%. The 10x increase in network usage is too high a price for 10% improvements in query latency and vmstorage CPU usage. This may result in network bandwidth bottlenecks, which can reduce the overall performance and stability of VictoriaMetrics cluster. That's why the vmstorage->vmselect data compression is enabled by default again. The vmstorage->vmselect compression can be disabled by passing -rpc.disableCompression command-line flag to vmstorage. The vmselect->vmselect compression in multi-level cluster setup can be disabled by passing -clusternative.disableCompression command-line flag.	2024-01-24 13:39:28 +02:00
Aliaksandr Valialkin	f888a019fe	lib/streamaggr: expand `%{ENV}` placeholders in stream aggregation configs	2024-01-24 12:31:27 +02:00
Aliaksandr Valialkin	fa566c68a6	lib/mergeset: really limit the number of in-memory parts to 15 It turned out that the registration of new time series slows down linearly with the number of indexdb parts, since VictoriaMetrics needs to check every indexdb part when it searches for TSID by newly ingested metric name. The number of in-memory parts grows when new time series are registered at a high rate. The number of in-memory parts grows faster on systems with a big number of CPU cores, because the mergeset maintains per-CPU buffers with newly added entries for the indexdb, and every such entry is transformed eventually into a separate in-memory part. The solution has been suggested in https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5212 by @misutoth - to limit the number of in-memory parts with a buffered channel. This solution is implemented in this commit. Additionally, this commit merges per-CPU parts into a single part before adding it to the list of in-memory parts. This reduces CPU load when searching for TSID by newly ingested metric name. The https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5212 recommends setting the limit on the number of in-memory parts to 100, but my internal testing shows that a much lower limit of 15 works with the same efficiency on a system with 16 CPU cores while reducing memory usage for `indexdb/dataBlocks` cache by up to 50%. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5190	2024-01-24 03:38:12 +02:00
Aliaksandr Valialkin	5543c04061	docs/Cluster-VictoriaMetrics.md: document that `vmstorage` doesn't compress data it sends to `vmselect` by default This is a follow-up for `cd4f641d32`	2024-01-23 23:22:18 +02:00
Github Actions	05c9a4d7ce	Automatic update operator docs from VictoriaMetrics/operator@1470569 (#5668 )	2024-01-23 16:22:16 +01:00
Aliaksandr Valialkin	fac7c30f4e	docs/vmagent.md: clarify how `-promscrape.seriesLimitPerTarget` command-line flag, `series_limit` config option and `__series_limit__` label interact with each other This is a follow-up for https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5663 See also `89e3c70ccd`	2024-01-23 13:14:50 +02:00
Roman Khavronenko	89e3c70ccd	lib/promscrape: respect `0` value for `series_limit` param (#5663 ) * lib/promscrape: respect `0` value for `series_limit` param Respect `0` value for `series_limit` param in `scrape_config` even if global limit was set via `-promscrape.seriesLimitPerTarget`. Previously, `0` value was ignored in favor of `-promscrape.seriesLimitPerTarget`. This behavior aligns with the possibility to override `series_limit` value via relabeling with `__series_limit__` label. Signed-off-by: hagen1778 <roman@victoriametrics.com> * Update docs/CHANGELOG.md --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-01-23 13:09:14 +02:00
Fred Navruzov	2adb38a9c4	- fix 404 errors after page renaming (#5664 ) - slight text fixes	2024-01-23 01:56:42 -08:00
Aliaksandr Valialkin	114822d585	app/{vmstorage,vmselect}: disable vmstorage->vmselect RPC compression by default in order to improve query performance	2024-01-23 04:24:57 +02:00
Zakhar Bessarab	bf4742526d	lib/storage: print tenant ID in log when discarding or truncating labels (#5658 ) Previously, it was not possible to determine which tenant sends metrics with an excessive number of labels or label values. Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-01-23 04:24:56 +02:00
Yury Molodov	38231d5994	vmui: query report (#5497 ) * vmui: add query analyzer page * vmui: fix tabs for query analyzer * vmui: add help to export query * vmui: add time params to query analyzer * docs/vmui: add query analyzer * vmui: fix validation JSON form --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-01-23 04:23:26 +02:00
Yury Molodov	eb6def0695	vmui: add flag for default timezone setting (#5611 ) * vmui: add flag for default timezone setting #5375 * vmui: validate timezone before client return * Update app/vmselect/vmui.go --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-01-23 04:11:19 +02:00
Yury Molodov	633e6b48ad	vmui: fix cache autocomplete (#5591 ) * vmui: fix the logic of closing the popper #5470 * vmui: fix the logic of caching autocomplete results #5472 --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-01-23 04:06:14 +02:00


3583 Commits