VictoriaMetrics

mirror of https://github.com/VictoriaMetrics/VictoriaMetrics.git synced 2024-12-15 08:23:34 +01:00

Author	SHA1	Message	Date
Artem Navoiev	17a4d310d0	docs: add alias for keyconcepts Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-01-30 18:39:59 +02:00
Aliaksandr Valialkin	09c388a8e4	lib/promscrape: use the standard net/http.Client instead of fasthttp.Client for scraping targets in non-streaming mode While fasthttp.Client uses less CPU and RAM when scraping targets with small responses (up to 10K metrics), it doesn't work well when scraping targets with big responses such as kube-state-metrics. In this case it could use big amounts of additional memory comparing to net/http.Client, since fasthttp.Client reads the full response in memory and then tries re-using the large buffer for further scrapes. Additionally, fasthttp.Client-based scraping had various issues with proxying, redirects and scrape timeouts like the following ones: - https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1945 - https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5425 - https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2794 - https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1017 This should help reducing memory usage for the case when target returns big response and this response is scraped by fasthttp.Client at first before switching to stream parsing mode for subsequent scrapes. Now the switch to stream parsing mode is performed on the first scrape after reading the response body in memory and noticing that its size exceeds the value passed to -promscrape.minResponseSizeForStreamParse command-line flag. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5567 Overrides https://github.com/VictoriaMetrics/VictoriaMetrics/pull/4931	2024-01-30 18:39:55 +02:00
Aliaksandr Valialkin	36be93f53c	docs/keyConcepts.md: clarify the information about which data is returned by instant and range queries Do not use `raw samples` term there, since it adds more confusion than clarity: the `raw samples` refers to real samples stored in the database, while neither range nor instant queries do not return raw samples - they both return calculated samples at the given timestamps. This is a follow-up for `b5978ed8f9` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5710 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5708	2024-01-29 18:22:27 +01:00
Fred Navruzov	ff67f4d3fb	- fix link/version of helm chart in update request (#5716 )	2024-01-29 18:22:11 +01:00
Artem Navoiev	00db646851	docs: add backward compaitble redicrt for url examples page Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-01-29 18:21:03 +01:00
Fred Navruzov	8043cf35d8	- update versions to 1.9.2 (#5714 ) - update guide asset urls to flat	2024-01-29 18:20:38 +01:00
hagen1778	131afea8ab	docs: specify results of Instant and Range queries Mention explicitly what are value and timestamp field in returned results from Instant and Range queries. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5710 https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5708 Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-01-29 18:20:02 +01:00
Fred Navruzov	790ee9af6d	update guide and changelog to 1.9.1 (#5706 )	2024-01-29 17:23:06 +01:00
hagen1778	216f268c1a	docs: follow-up after `491287ed15` * port un-synced changed from docs/readme to readme * consistently use `sh` instead of `console` highlight, as it looks like a more appropriate syntax highlight * consistently use `sh` instead of `bash`, as it is shorter * consistently use `yaml` instead of `yml` See syntax codes here https://gohugo.io/content-management/syntax-highlighting/ Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-01-29 17:06:26 +01:00
hagen1778	3baeface89	docs: follow-up after `491287ed15` `491287ed15` Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-01-29 16:59:13 +01:00
Artem Navoiev	0d210308c3	docs: remove witdh from images, remove <p>, remove <div> (#5705 ) * docs: remove witdh from images, remove <p>, remove <div> Signed-off-by: Artem Navoiev <tenmozes@gmail.com> * docs: remove <div> clarify language in code blocks Signed-off-by: Artem Navoiev <tenmozes@gmail.com> --------- Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-01-29 16:50:51 +01:00
Daria Karavaieva	469bbbb2fa	version 1.9.1 update, dashboard viz flag (#5704 )	2024-01-29 16:48:55 +01:00
Aliaksandr Valialkin	f5559c038c	lib/storage: do not check the limit for -search.maxUniqueTimeseries when performing /api/v1/labels and /api/v1/label/.../values requests This limit has little sense for these APIs, since: - Thses APIs frequently result in scanning of all the time series on the given time range. For example, if extra_filters={datacenter="some_dc"} . - Users expect these APIs shouldn't hit the -search.maxUniqueTimeseries limit, which is intended for limiting resource usage at /api/v1/query and /api/v1/query_range requests. Also limit the concurrency for /api/v1/labels, /api/v1/label/.../values and /api/v1/series requests in order to limit the maximum memory usage and CPU usage for these API. This limit shouldn't affect typical use cases for these APIs: - Grafana dashboard load when dashboard labels should be loaded - Auto-suggestion list load when editing the query in Grafana or vmui Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5055	2024-01-29 16:44:46 +01:00
Roman Khavronenko	9e9f170fe7	lib/streamaggr: skip unfinished aggregation state on shutdown by default (#5689 ) Sending unfinished aggregate states tend to produce unexpected anomalies with lower values than expected. The old behavior can be restored by specifying `flush_on_shutdown: true` setting in streaming aggregation config Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-01-26 22:45:45 +01:00
Roman Khavronenko	562edb72ea	app/vmalert: fix data race during hot-config reload (#5698 ) * app/vmalert: fix data race during hot-config reload During hot-reload, the logic evokes the group update and rules evaluation interruption simultaneously. Falsely assuming that interruption happens before the update. However, it could happen that group will be updated first and only after the rules evaluation will be cancelled. Which will result in permanent interruption for all rules within the group. The fix caches the cancel context function into local variable first. And only after performs the group update. With cached cancel function we can safely call it without worrying that we cancel the evaluation for already updated group. Signed-off-by: hagen1778 <roman@victoriametrics.com> * Revert "app/vmalert: fix data race during hot-config reload" This reverts commit `a4bb7e8932`. * app/vmalert: fix data race during hot-config reload During hot-reload, the logic evokes the group update and rules evaluation interruption simultaneously. Falsely assuming that interruption happens before the update. However, it could happen that group will be updated first and only after the rules evaluation will be cancelled. Which will result in permanent interruption for all rules within the group. The fix cancels the evaulation context before applying the update, making sure that the context will be cancelled for old group always. Signed-off-by: hagen1778 <roman@victoriametrics.com> * wip Signed-off-by: hagen1778 <roman@victoriametrics.com> --------- Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-01-26 22:43:02 +01:00
Yury Molodov	551f48466c	vmui: fix `Enter` key in query field (#5667 ) (#5681 )	2024-01-26 22:38:51 +01:00
Artem Navoiev	d42908133c	docs: remove <p> for imanges (#5702 ) Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-01-26 22:34:40 +01:00
Artem Navoiev	36fa314161	remove all <div> as far they obsolete and can break markdown (#5701 ) Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-01-26 22:32:54 +01:00
Aliaksandr Valialkin	2b0123058a	docs: update -help output after `bb7a419cc3`	2024-01-26 22:29:22 +01:00
Aliaksandr Valialkin	7a8b92b590	lib/{mergeset,storage}: make background merge more responsive and scalable - Maintain a separate worker pool per each part type (in-memory, file, big and small). Previously a shared pool was used for merging all the part types. A single merge worker could merge parts with mixed types at once. For example, it could merge simultaneously an in-memory part plus a big file part. Such a merge could take hours for big file part. During the duration of this merge the in-memory part was pinned in memory and couldn't be persisted to disk under the configured -inmemoryDataFlushInterval . Another common issue, which could happen when parts with mixed types are merged, is uncontrolled growth of in-memory parts or small parts when all the merge workers were busy with merging big files. Such growth could lead to significant performance degradataion for queries, since every query needs to check ever growing list of parts. This could also slow down the registration of new time series, since VictoriaMetrics searches for the internal series_id in the indexdb for every new time series. The third issue is graceful shutdown duration, which could be very long when a background merge is running on in-memory parts plus big file parts. This merge couldn't be interrupted, since it merges in-memory parts. A separate pool of merge workers per every part type elegantly resolves both issues: - In-memory parts are merged to file-based parts in a timely manner, since the maximum size of in-memory parts is limited. - Long-running merges for big parts do not block merges for in-memory parts and small parts. - Graceful shutdown duration is now limited by the time needed for flushing in-memory parts to files. Merging for file parts is instantly canceled on graceful shutdown now. - Deprecate -smallMergeConcurrency command-line flag, since the new background merge algorithm should automatically self-tune according to the number of available CPU cores. - Deprecate -finalMergeDelay command-line flag, since it wasn't working correctly. It is better to run forced merge when needed - https://docs.victoriametrics.com/#forced-merge - Tune the number of shards for pending rows and items before the data goes to in-memory parts and becomes visible for search. This improves the maximum data ingestion rate and the maximum rate for registration of new time series. This should reduce the duration of data ingestion slowdown in VictoriaMetrics cluster on e.g. re-routing events, when some of vmstorage nodes become temporarily unavailable. - Prevent from possible "sync: WaitGroup misuse" panic on graceful shutdown. This is a follow-up for `fa566c68a6` . Thanks @misutoth to for the inspiration at https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5212 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5190 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3790 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3551 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3337 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3425 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3647 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3641 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/648 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/291	2024-01-26 22:19:52 +01:00
Artem Navoiev	9e0416c666	docs: delete docs/provision_datasources.png as we support webp Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-01-26 21:36:19 +01:00
Github Actions	41c48a4c59	Automatic update Grafana datasource docs from VictoriaMetrics/grafana-datasource@ef5cfe6 (#5700 )	2024-01-26 21:35:53 +01:00
Aliaksandr Valialkin	9e70d9ab47	docs/Makefile: mention that the Makefile rules must be run from VictoriaMetrics repository root	2024-01-26 21:11:14 +01:00
Artem Navoiev	2b870b1116	docs: fix key concepts image and links Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-01-26 21:10:36 +01:00
Artem Navoiev	bb5a0719a5	docs: change [image] to img as far we support it in release guide Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-01-26 21:09:22 +01:00
Artem Navoiev	89fbefefdb	docs: remoev vmanomaly as far we have dedicated section with alredy exists redirects Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-01-26 21:08:54 +01:00
Artem Navoiev	6ed9a05a08	docs: vmanomaly fix images Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-01-26 21:08:28 +01:00
Daria Karavaieva	b042982339	Vmanomaly Guide dashboard provisioning (#5679 ) * dashboard provisioning * delete dashboard filter, new query * dashboard screens, guide fixes	2024-01-26 21:07:44 +01:00
Artem Navoiev	aee3e51315	docs: remove raw and endraw tags as they are not needed for the new v… (#5696 ) * docs: remove raw and endraw tags as they are not needed for the new version of site Signed-off-by: Artem Navoiev <tenmozes@gmail.com> * revert formating in vmaler Signed-off-by: Artem Navoiev <tenmozes@gmail.com> --------- Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-01-26 21:06:10 +01:00
Github Actions	cd287b2e4c	Automatic update operator docs from VictoriaMetrics/operator@0628def (#5694 )	2024-01-26 21:05:13 +01:00
Github Actions	b8a4a78fef	Automatic update Grafana datasource docs from VictoriaMetrics/grafana-datasource@c644bec (#5691 )	2024-01-26 20:52:47 +01:00
Roman Khavronenko	a2f83115ae	app/vmalert: autogenerate `ALERTS_FOR_STATE` time series for alerting rules with `for: 0` (#5680 ) * app/vmalert: autogenerate `ALERTS_FOR_STATE` time series for alerting rules with `for: 0` Previously, `ALERTS_FOR_STATE` was generated only for alerts with `for > 0`. This behavior differs from Prometheus behavior - it generates ALERTS_FOR_STATE time series for alerting rules with `for: 0` as well. Such time series can be useful for tracking the moment when alerting rule became active. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5648 https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3056 Signed-off-by: hagen1778 <roman@victoriametrics.com> * app/vmalert: support ALERTS_FOR_STATE in `replay` mode Signed-off-by: hagen1778 <roman@victoriametrics.com> --------- Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-01-26 20:51:50 +01:00
Github Actions	362e52f880	Automatic update operator docs from VictoriaMetrics/operator@e75a096 (#5690 )	2024-01-26 20:50:57 +01:00
Github Actions	8fa6c42396	Automatic update operator docs from VictoriaMetrics/operator@f6b9c08 (#5676 )	2024-01-26 20:50:00 +01:00
hagen1778	bd7ebb41b2	docs: fix the issue link Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-01-26 20:49:01 +01:00
Alexander Marshalov	14712e3b99	vmsingle/vmselect returns http status 429 (TooManyRequests) instead of 503 (ServiceUnavailable) when max concurrent requests limit is reached. (#5682 )	2024-01-25 10:21:09 +02:00
Aliaksandr Valialkin	1e364c992d	lib/promscrape/discovery/kubernetes: do not generate targets for already terminated pods and containers Already terminated pods and containers cannot be scraped and will never resurrect, so there is zero sense in creating scrape targets for them.	2024-01-24 14:58:51 +02:00
Aliaksandr Valialkin	0dca3c4025	app/{vmselect,vmstorage}: return compression of the data passed from vmstorage to vmselect This reverts `cd4f641d32` , since it has been appeared that the disabled compression for vmstorage->vmselect data increase network bandwidth usage by more than 10x on typical production workloads, while it decreases CPU usage at vmstorage by up to 10% and improves query latency by up to 10%. The 10x increase in network usage is too high price for 10% improvements on query latency and vmstorage CPU usage. This may result in network bandwidth bottlenecks, which can reduce the overall performance and stability of VictoriaMetrics cluster. That's why return back the vmstorage->vmselect data compression by default. The vmstorage->vmselect compression can be disabled by passing -rpc.disableCompression command-line flag to vmstorage. The vmselect->vmselect compression in multi-level cluster setup can be disabled by passing -clusternative.disableCompression command-line flag.	2024-01-24 13:37:05 +02:00
Aliaksandr Valialkin	e6e5b97e1e	lib/streamaggr: expand `%{ENV}` placeholders in stream aggregation configs	2024-01-24 12:31:42 +02:00
Aliaksandr Valialkin	12698b9136	lib/mergeset: really limit the number of in-memory parts to 15 It has been appeared that the registration of new time series slows down linearly with the number of indexdb parts, since VictoriaMetrics needs to check every indexdb part when it searches for TSID by newly ingested metric name. The number of in-memory parts grows when new time series are registered at high rate. The number of in-memory parts grows faster on systems with big number of CPU cores, because the mergeset maintains per-CPU buffers with newly added entries for the indexdb, and every such entry is transformed eventually into a separate in-memory part. The solution has been suggested in https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5212 by @misutoth - to limit the number of in-memory parts with buffered channel. This solution is implemented in this commit. Additionally, this commit merges per-CPU parts into a single part before adding it to the list of in-memory parts. This reduces CPU load when searching for TSID by newly ingested metric name. The https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5212 recommends setting the limit on the number of in-memory parts to 100, but my internal testing shows that much lower limit 15 works with the same efficiency on a system with 16 CPU cores while reducing memory usage for `indexdb/dataBlocks` cache by up to 50%. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5190	2024-01-24 03:41:19 +02:00
Aliaksandr Valialkin	5205f1c6a6	docs/Cluster-VictoriaMetrics.md: document that `vmstorage` doesnt compress data it sends to `vmselect` by default This is a follow-up for `cd4f641d32`	2024-01-23 23:21:31 +02:00
Github Actions	4ccf3f41c6	Automatic update operator docs from VictoriaMetrics/operator@1470569 (#5668 )	2024-01-23 17:47:36 +02:00
Aliaksandr Valialkin	47cb79198e	docs/vmagent.md: clarify how `-promscrape.seriesLimitPerTarget` command-line flag, `series_limit` config option and `__series_limit__` label interact with each other This is a follow-up for https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5663 See also `89e3c70ccd`	2024-01-23 13:15:45 +02:00
Roman Khavronenko	8461add541	lib/promscrape: respect `0` value for `series_limit` param (#5663 ) * lib/promscrape: respect `0` value for `series_limit` param Respect `0` value for `series_limit` param in `scrape_config` even if global limit was set via `-promscrape.seriesLimitPerTarget`. Previously, `0` value will be ignored in favor of `-promscrape.seriesLimitPerTarget`. This behavior aligns with possibility to override `series_limit` value via relabeling with `__series_limit__` label. Signed-off-by: hagen1778 <roman@victoriametrics.com> * Update docs/CHANGELOG.md --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-01-23 13:09:36 +02:00
Fred Navruzov	7b1325c3a3	- fix 404 errors after page remaning (#5664 ) - slight text fixes	2024-01-23 12:59:01 +02:00
Yury Molodov	1db2b991b7	vmui: query report (#5497 ) * vmui: add query analyzer page * vmui: fix tabs for query analyzer * vmui: add help to export query * vmui: add time params to query analyzer * docs/vmui: add query analyzer * vmui: fix validation JSON form --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-01-23 04:26:04 +02:00
Yury Molodov	3a26e4d6ec	vmui: add flag for default timezone setting (#5611 ) * vmui: add flag for default timezone setting #5375 * vmui: validate timezone before client return * Update app/vmselect/vmui.go --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-01-23 04:15:14 +02:00
Yury Molodov	574d69775e	vmui: fix cache autocomplete (#5591 ) * vmui: fix the logic of closing the popper #5470 * vmui: fix the logic of caching autocomplete results #5472 --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-01-23 04:06:39 +02:00
Daria Karavaieva	7ad16c8f17	add 1.8.0 notes to changelog (#5616 ) * add 1.8.0 notes to changelog * added release date * MAD internal link * monitoring health deprecation	2024-01-23 02:31:21 +02:00
Jaskeerat Singh Randhawa	291802ea84	custom-resources: fix link text for alertmanager (#5660 )	2024-01-23 02:30:51 +02:00

1 2 3 4 5 ...

3543 Commits