VictoriaMetrics

mirror of https://github.com/VictoriaMetrics/VictoriaMetrics.git synced 2024-12-21 07:56:26 +01:00

Author	SHA1	Message	Date
Aliaksandr Valialkin	4921ec5604	docs/CHANGELOG.md: use new link to VictoriaMetrics cluster docs instead of old link The old link was changed globally to the new link in the commit `f4b1cbfef0` . Unfortunately, old links are still posted in new commits :( This is a follow-up for `680b8c25c8` . While at it, remove duplicate 'len(*remoteWriteURLs) > 0' check in the remotewrite.Init() functions, since this check is already made at the beginning of the function. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6253	2024-07-13 03:04:20 +02:00
Aliaksandr Valialkin	bc1f92d7f5	app/vmagent/remotewrite: follow-up for `87fd400dfc` - Drop samples and return true from remotewrite.TryPush() at fast path when all the remote storage systems are configured with the disabled on-disk queue, every in-memory queue is full and -remoteWrite.dropSamplesOnOverload is set to true. This case is quite common, so it should be optimized. Previously additional CPU time was spent on per-remoteWriteCtx relabeling and other processing in this case. - Properly count the number of dropped samples inside remoteWriteCtx.pushInternalTrackDropped(). Previously dropped samples were counted only if -remoteWrite.dropSamplesOnOverload flag is set. In reality, the samples are dropped when they couldn't be sent to the queue because in-memory queue is full and on-disk queue is disabled. The remoteWriteCtx.pushInternalTrackDropped() function is called by streaming aggregation for pushing the aggregated data to the remote storage. Streaming aggregation cannot wait until the remote storage processes pending data, so it drops aggregated samples in this case. - Clarify the description for -remoteWrite.disableOnDiskQueue command-line flag at -help output, so it is clear that this flag can be set individually per each -remoteWrite.url. - Make the -remoteWrite.dropSamplesOnOverload flag global. If some of the remote storage systems are configured with the disabled on-disk queue, then there is no sense in keeping samples on some of these systems, while dropping samples on the remaining systems, since this will result in global stall on the remote storage system with the disabled on-disk queue and with the -remoteWrite.dropSamplesOnOverload=false flag. vmagent will always return false from remotewrite.TryPush() in this case. This will result in infinite duplicate samples written to the remaining remote storage systems. That's why the -remoteWrite.dropSamplesOnOverload is forcibly set to true if more than one -remoteWrite.disableOnDiskQueue flag is set. This allows proceeding with newly scraped / pushed samples by sending them to the remaining remote storage systems, while dropping them on overloaded systems with the -remoteWrite.disableOnDiskQueue flag set. - Verify that the remoteWriteCtx.TryPush() returns true in the TestRemoteWriteContext_TryPush_ImmutableTimeseries test. - Mention in vmagent docs that the -remoteWrite.disableOnDiskQueue command-line flag can be set individually per each -remoteWrite.url. See https://docs.victoriametrics.com/vmagent/#disabling-on-disk-persistence Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6248 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6065	2024-07-13 02:30:10 +02:00
Aliaksandr Valialkin	5c7345b8ce	app/victoria-logs/Makefile: add `make victoria-logs-linux-loong64` build rule This is a follow-up for `80f3644ee3` The https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6222 missed build rule for VictoriaLogs.	2024-07-12 23:13:19 +02:00
Aliaksandr Valialkin	43fc1183b9	app/vmalert: switch from table-driven tests to f-tests This makes test code more clear and reduces the number of code lines by 500. This also simplifies debugging tests. See https://itnext.io/f-tests-as-a-replacement-for-table-driven-tests-in-go-8814a8b19e9e While at it, consistently use t.Fatal* instead of t.Error* across tests, since t.Error* requires more boilerplate code, which can result in additional bugs inside tests. While t.Error* allows writing logging errors for the same, this doesn't simplify fixing broken tests most of the time. This is a follow-up for `a9525da8a4`	2024-07-12 22:45:50 +02:00
Aliaksandr Valialkin	04a304fd39	app/vmctl: switch from table-driven tests to f-tests This simplifies debugging tests and makes the test code more clear and concise. See https://itnext.io/f-tests-as-a-replacement-for-table-driven-tests-in-go-8814a8b19e9e While at is, consistently use t.Fatal* instead of t.Error* across tests, since t.Error* requires more boilerplate code, which can result in additional bugs inside tests. While t.Error* allows writing logging errors for the same, this doesn't simplify fixing broken tests most of the time. This is a follow-up for `a9525da8a4`	2024-07-12 22:45:49 +02:00
Aliaksandr Valialkin	7c97cef95c	app: consistently use t.Fatal* instead of t.Error* (except of app/vmalert and app/vmctl - these packages will be processed in a separate commit) Consistently using t.Fatal* simplifies the test code and makes it less fragile, since it is common error to forget to make proper cleanup after t.Error* call. Also t.Error* calls do not provide any practical benefits when some tests fail. They just clutter test output with additional noise information, which do not help in fixing failing tests most of the time. This is a follow-up for `a9525da8a4`	2024-07-11 16:01:25 +02:00
Zhu Jiekun	2ea575e776	vmalert: [bug] fixed System hyperlink 404 redirect (#6620 ) ### Describe Your Changes As mentioned in https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6603, some hyperlinks under `vmalert` -> `System` section is not working as expected. Pages and redirection: - For page `http://127.0.0.1:8880/`: `flags` button will redirect to `http://127.0.0.1:8880/flags` - For page `http://127.0.0.1:8880/vmalert`: `http://127.0.0.1:8880/flags` - For page `http://127.0.0.1:8880/vmalert/`: `http://127.0.0.1:8880/vmalert/flags` (page not exists) - Similar redirection could be observed with `-http.pathPrefix` Two potential ways to avoid 404 redirection: 1. avoid visiting `/vmalert/` (I'm trying to do this). 2. provide support for `/vmalert/flags`. `/vmalert/` could be visit only when user click other navigator (e.g. Group) and click vmalert again: ![Peek 2024-07-10 10-07](https://github.com/VictoriaMetrics/VictoriaMetrics/assets/30280396/13d7b147-a1b6-4e93-9ee0-26f881a16bef) Because: `http://127.0.0.1:8880/vmalert/groups?search=` + `<a class="nav-link" href=".">` = `http://127.0.0.1:8880/vmalert/` So I'm trying to change the `href="."` to `href="../vmalert"`. ### Checklist The following checks are mandatory: - [X] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). (cherry picked from commit `cadf1eb5ab`)	2024-07-11 12:40:23 +02:00
Zakhar Bessarab	401ae72587	app/vmselect/promql: propagate lower bucket values when fixing a histogram (#6547 ) ### Describe Your Changes In most cases histograms are exposed in sorted manner with lower buckets being first. This means that during scraping buckets with lower bounds have higher chance of being updated earlier than upper ones. Previously, values were propagated from upper to lower bounds, which means that in most cases that would produce results higher than expected once all buckets will become updated. Propagating from upper bound effectively limits highest value of histogram to the value of previous scrape. Once the data will become consistent in the subsequent evaluation this causes spikes in the result. Changing propagation to be from lower to higher buckets reduces value spikes in most cases due to nature of the original inconsistency. See: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4580 An example histogram with previous(red) and updated(blue) versions: ![1719565540](https://github.com/VictoriaMetrics/VictoriaMetrics/assets/1367798/605c5e60-6abe-45b5-89b2-d470b60127b8) This also makes logic of filling nan values with lower buckets values: [1 2 3 nan nan nan] => [1 2 3 3 3 3] obsolete. Since buckets are now fixed from lower ones to upper this happens in the main loop, so there is no need in a second one. --------- Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: Andrii Chubatiuk <andrew.chubatiuk@gmail.com> Co-authored-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `6a4bd5049b`)	2024-07-10 15:17:08 +02:00
Aliaksandr Valialkin	a1decb5ca1	app/vlinsert/loki: use easyproto instead for parsing Loki protobuf messages	2024-07-10 03:05:55 +02:00
Aliaksandr Valialkin	32ae40410c	app/vlselect/vmui: run `make vmui-logs-update` after `662e026279`	2024-07-10 03:05:55 +02:00
Aliaksandr Valialkin	b8a8d3d6f1	lib/logstorage: drop all the pipes from the query when calculating the number of matching logs at /select/logsql/hits API	2024-07-10 00:39:16 +02:00
Aliaksandr Valialkin	d6415b2572	all: consistently use 'any' instead of 'interface{}' 'any' type is supported starting from Go1.18. Let's consistently use it instead of 'interface{}' type across the code base, since `any` is easier to read than 'interface{}'.	2024-07-10 00:23:26 +02:00
Aliaksandr Valialkin	73ca22bb7d	app/vlinsert/loki: remove unused functions from the generated protobuf code	2024-07-10 00:22:10 +02:00
Yury Molodov	33bd5ccbab	vmui/logs: add spinner to bar chart (#6577 ) Add a spinner to the bar chart https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6558 --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `662e026279`)	2024-07-09 18:27:23 +02:00
Hui Wang	6f602a4ef5	security: upgrade base docker image (Alpine) from 3.20.0 to 3.20.1 See https://www.alpinelinux.org/posts/Alpine-3.20.1-released.html >including security fixes for: OPENSSL [CVE-2024-4741](https://security.alpinelinux.org/vuln/CVE-2024-4741) BUSYBOX [CVE-2023-42364](https://security.alpinelinux.org/vuln/CVE-2023-42364) [CVE-2023-42365](https://security.alpinelinux.org/vuln/CVE-2023-42365) (cherry picked from commit `8e9f98e725`)	2024-07-09 11:38:44 +02:00
Artem Navoiev	7b508a9334	fix typo Signed-off-by: Artem Navoiev <tenmozes@gmail.com> (cherry picked from commit `4527020a68`)	2024-07-09 10:52:50 +02:00
Yury Molodov	7fc9912d15	vmui: add compact JSON display (#6582 ) ### Describe Your Changes If a JSON element has only one field, it will be displayed on a single line. #6559 \| Old Display \| New Display \| \|-------------\|-------------\| \| ![image](https://github.com/VictoriaMetrics/VictoriaMetrics/assets/29711459/8866517b-a49d-450f-904c-19117397a078) \| ![image](https://github.com/VictoriaMetrics/VictoriaMetrics/assets/29711459/8e222b43-a4cb-4f32-9a79-6199778404d3) \| ### Checklist The following checks are mandatory: - [ ] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `959a4383c5`)	2024-07-05 09:49:12 +02:00
Hui Wang	bbd49a1a61	vmalert: allow omitting `-replay.timeTo` in replay mode, default valu… (#6575 ) …e is the current timestamp address https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6492 --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `3169524fb7`)	2024-07-05 09:49:06 +02:00
Roman Khavronenko	b13c363f12	app/vmalert: add examples for `source` override (#6561 ) The change adds a new docs section with examples on how source can be overridden. It should address questions like https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6536 While there, fix the example in `external.alert.source` cmd-line flag and docker-compose examples. ### Checklist The following checks are mandatory: - [x] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `c429bbf889`)	2024-07-05 09:49:03 +02:00
Aliaksandr Valialkin	172ae1adf7	Revert `c6c5a5a186` and `b2765c45d0` Reason for revert: There are many statsd servers exist: - https://github.com/statsd/statsd - classical statsd server - https://docs.datadoghq.com/developers/dogstatsd/ - statsd server from DataDog built into DatDog Agent ( https://docs.datadoghq.com/agent/ ) - https://github.com/avito-tech/bioyino - high-performance statsd server - https://github.com/atlassian/gostatsd - statsd server in Go - https://github.com/prometheus/statsd_exporter - statsd server, which exposes the aggregated data as Prometheus metrics These servers can be used for efficient aggregating of statsd data and sending it to VictoriaMetrics according to https://docs.victoriametrics.com/#how-to-send-data-from-graphite-compatible-agents-such-as-statsd ( the https://github.com/prometheus/statsd_exporter can be scraped as usual Prometheus target according to https://docs.victoriametrics.com/#how-to-scrape-prometheus-exporters-such-as-node-exporter ). Adding support for statsd data ingestion protocol into VictoriaMetrics makes sense only if it provides significant advantages over the existing statsd servers, while has no significant drawbacks comparing to existing statsd servers. The main advantage of statsd server built into VictoriaMetrics and vmagent - getting rid of additional statsd server. The main drawback is non-trivial and inconvenient streaming aggregation configs, which must be used for the ingested statsd metrics ( see https://docs.victoriametrics.com/stream-aggregation/ ). These configs are incompatible with the configs for standalone statsd servers. So you need to manually translate configs of the used statsd server to stream aggregation configs when migrating from standalone statsd server to statsd server built into VictoriaMetrics (or vmagent). Another important drawback is that it is very easy to shoot yourself in the foot when using built-in statsd server with the -statsd.disableAggregationEnforcement command-line flag or with improperly configured streaming aggregation. In this case the ingested statsd metrics will be stored to VictoriaMetrics as is without any aggregation. This may result in high CPU usage during data ingestion, high disk space usage for storing all the unaggregated statsd metrics and high CPU usage during querying, since all the unaggregated metrics must be read, unpacked and processed during querying. P.S. Built-in statsd server can be added to VictoriaMetrics and vmagent after figuring out more ergonomic specialized configuration for aggregating of statsd metrics. The main requirements for this configuration: - easy to write, read and update (ideally it should work out of the box for most cases without additional configuration) - hard to misconfigure (e.g. hard to shoot yourself in the foot) It would be great if this configuration will be compatible with the configuration of the most widely used statsd server. In the mean time it is recommended continue using external statsd server. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6265 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5053 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5052 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/206 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4600	2024-07-03 23:57:49 +02:00
Aliaksandr Valialkin	cd152693c6	Revert "Exemplar support (#5982 )" This reverts commit `5a3abfa041`. Reason for revert: exemplars aren't in wide use because they have numerous issues which prevent their adoption (see below). Adding support for examplars into VictoriaMetrics introduces non-trivial code changes. These code changes need to be supported forever once the release of VictoriaMetrics with exemplar support is published. That's why I don't think this is a good feature despite that the source code of the reverted commit has an excellent quality. See https://docs.victoriametrics.com/goals/ . Issues with Prometheus exemplars: - Prometheus still has only experimental support for exemplars after more than three years since they were introduced. It stores exemplars in memory, so they are lost after Prometheus restart. This doesn't look like production-ready feature. See `0a2f3b3794/content/docs/instrumenting/exposition_formats.md (L153-L159)` and https://prometheus.io/docs/prometheus/latest/feature_flags/#exemplars-storage - It is very non-trivial to expose exemplars alongside metrics in your application, since the official Prometheus SDKs for metrics' exposition ( https://prometheus.io/docs/instrumenting/clientlibs/ ) either have very hard-to-use API for exposing histograms or do not have this API at all. For example, try figuring out how to expose exemplars via https://pkg.go.dev/github.com/prometheus/client_golang@v1.19.1/prometheus . - It looks like exemplars are supported for Histogram metric types only - see https://pkg.go.dev/github.com/prometheus/client_golang@v1.19.1/prometheus#Timer.ObserveDurationWithExemplar . Exemplars aren't supported for Counter, Gauge and Summary metric types. - Grafana has very poor support for Prometheus exemplars. It looks like it supports exemplars only when the query contains histogram_quantile() function. It queries exemplars via special Prometheus API - https://prometheus.io/docs/prometheus/latest/querying/api/#querying-exemplars - (which is still marked as experimental, btw.) and then displays all the returned exemplars on the graph as special dots. The issue is that this doesn't work in production in most cases when the histogram_quantile() is calculated over thousands of histogram buckets exposed by big number of application instances. Every histogram bucket may expose an exemplar on every timestamp shown on the graph. This makes the graph unusable, since it is litterally filled with thousands of exemplar dots. Neither Prometheus API nor Grafana doesn't provide the ability to filter out unneeded exemplars. - Exemplars are usually connected to traces. While traces are good for some I doubt exemplars will become production-ready in the near future because of the issues outlined above. Alternative to exemplars: Exemplars are marketed as a silver bullet for the correlation between metrics, traces and logs - just click the exemplar dot on some graph in Grafana and instantly see the corresponding trace or log entry! This doesn't work as expected in production as shown above. Are there better solutions, which work in production? Yes - just use time-based and label-based correlation between metrics, traces and logs. Assign the same `job` and `instance` labels to metrics, logs and traces, so you can quickly find the needed trace or log entry by these labes on the time range with the anomaly on metrics' graph. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5982	2024-07-03 16:09:18 +02:00
Aliaksandr Valialkin	a5d60ad78e	app/vmagent/remotewrite,lib/streamaggr: re-use common code in tests after `879771808b` - Export streamaggr.LoadFromData() function, so it could be used in tests outside the lib/streamaggr package. This allows removing a hack with creation of temporary files at TestRemoteWriteContext_TryPush_ImmutableTimeseries. - Move common code for mustParsePromMetrics() function into lib/prompbmarshal package, so it could be used in tests for building []prompbmarshal.TimeSeries from string. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6205 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6206	2024-07-03 15:22:51 +02:00
Aliaksandr Valialkin	4268a310c1	app/vmagent/remotewrite/remotewrite.go: make remoteWriteCtx.TryPush code easier to follow Move the code responsible for relabelCtx clearing into deferred function. This allows making more clear the remoteWriteCtx.TryPush code. This is a follow-up for `879771808b` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6205 While at it, clarify the description of the bugfix at docs/CHANGELOG.md	2024-07-03 14:18:51 +02:00
Aliaksandr Valialkin	f406764ccc	app/vmagent/remotewrite/streamaggr.go: clarify the description for -remoteWrite.streamAggr.* command-line flags, so they are applied to the corresponding -remoteWrite.url	2024-07-03 14:18:51 +02:00
Aliaksandr Valialkin	bb7406e9c0	app/vmselect/promql: follow-up for `dd0d2c77c8` and `6149adbe10` Use metricsql.IsLikelyInvalid() function for determining whether the given query is likely invalid, e.g. there is high change the query is incorrectly written, so it will return unexpected results. The query is invalid most of the time if it passes something other than series selector into rollup function. For example: - rate(sum(foo)) - rate(foo + bar) - rate(foo > bar) Improtant note: the query is considered valid if it misses the lookbehind window in square brackes inside rollup function, e.g. rate(foo), since this is very convenient MetricsQL extention to PromQL, and this query returns the expected results most of the time. Other unsafe query types can be added in the future into metricsql.IsLikelyInvalid(). TODO: probably, the -search.disableImplicitConversion command-line flag must be set by default in the future releases of VictoriaMetrics. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4338 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6180 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6450	2024-07-03 00:46:56 +02:00
Aliaksandr Valialkin	82748b2b9d	deployment/docker: update Go builder from Go1.22.4 to Go1.22.5 See https://github.com/golang/go/issues?q=milestone%3AGo1.22.5+label%3ACherryPickApproved	2024-07-03 00:07:55 +02:00
LHHDZ	c8431c8e4d	app/vmauth: reader pool to reduce gc & mem alloc (#6533 ) follow up https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6446 issue: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6445 --------- Signed-off-by: f41gh7 <nik@victoriametrics.com> Co-authored-by: f41gh7 <nik@victoriametrics.com> (cherry picked from commit `4d66e042e3`)	2024-07-02 14:37:15 +02:00
Aliaksandr Valialkin	0912a652d5	app/vlinsert/insertutils: flush the ingested logs from in-memory buffer to storage every second Previously the in-memory buffer could remain unflushed for long periods of time under low ingestion rate. The ingested logs weren't visible for search during this time.	2024-07-02 01:39:45 +02:00
Aliaksandr Valialkin	ab28a1f93e	app/vlinsert/syslog: add an ability to use log ingestion time as the _time field	2024-07-02 01:39:45 +02:00
Hui Wang	085bc1f15c	vmui: increase max query tab from 4 to 10 (#6546 ) (cherry picked from commit `9da78f1e0e`)	2024-07-01 16:40:42 +02:00
Hui Wang	87cb132f53	app/vmselect/netstorage: do not retry request when complexity limit i… (#6469 ) …s already exceeded --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2024-07-01 16:38:15 +02:00
Andrii Chubatiuk	937ae2ca90	lib/streamaggr: added stale samples metric, added metrics labels (#6462 ) ### Describe Your Changes - added stale metrics counters for input and output samples - added labels for aggregator metrics => `name="{rwctx}:{aggrId}:{aggrSuffix}"` - rwctx - global or number starting from 1 - aggrid - aggregator id starting from 1 - aggrSuffix - <interval>_(by\|without)_label1_label2_labeln e.g: `name="global:1:1m_without_instance_pod"` ### Checklist The following checks are mandatory: - [ ] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `861852f262`) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-07-01 15:01:49 +02:00
Aliaksandr Valialkin	4b3477e62b	lib/logstorage: add `stream_context` pipe, which allows selecting surrounding logs for the matching logs	2024-06-28 19:15:19 +02:00
Aliaksandr Valialkin	c9fc8079c4	app/vlinsert/syslog: properly skip empty lines in Syslog protocol Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6548	2024-06-28 14:09:45 +02:00
Aliaksandr Valialkin	bb6424aeca	app/vlselect/logsql: add optional fields_limit query arg to /select/logsql/hits HTTP endpoint This query arg is needed for https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6545 in order to return top N groups with the biggest number of hits.	2024-06-28 03:10:05 +02:00
Aliaksandr Valialkin	b26acec9a8	app/vlselect: properly return live tailing results	2024-06-27 15:06:15 +02:00
Aliaksandr Valialkin	dd62a2b9d6	lib/logstorage: work-in-progress	2024-06-27 14:21:03 +02:00
Andrii Chubatiuk	f79df2aa8b	app/vmauth: allow dropping host header (#6525 ) ### Describe Your Changes Fixes #6453 ### Checklist The following checks are mandatory: - [ ] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/).	2024-06-26 19:12:35 +02:00
Yury Molodov	6bde0196d8	vmui/logs: fix the update of the relative time range (#6517 ) ### Describe Your Changes - Fixed the update of the relative time range when `Execute Query` is clicked - Optimized server requests: now, if an error occurs in the `/query` request, the `/hits` request will not be executed. #6345 (duplicates: #6440, #6312) (cherry picked from commit `43342745ac`)	2024-06-26 11:26:08 +02:00
Yury Molodov	904ec020ed	vmui: fix input cursor position reset (#6530 ) ### Describe Your Changes This PR addresses the issue where the cursor jumps to the end of the input fields in the modal settings window after each keystroke. ### Before fix: ![ezgif-7-4c69805cea](https://github.com/VictoriaMetrics/VictoriaMetrics/assets/29711459/2e99e833-09e3-4b44-89aa-fc1bd3c4346d) ### Checklist The following checks are mandatory: - [x] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). (cherry picked from commit `e9b71a2883`)	2024-06-26 11:25:47 +02:00
Yury Molodov	25f3e700a6	vmui: update package-lock.json (#6532 ) 1. Updated `package-lock.json` to resolve [Dependabot alerts](https://github.com/VictoriaMetrics/VictoriaMetrics/security/dependabot). 2. Updated types to align with the latest `Preact` update. (cherry picked from commit `6cab811134`)	2024-06-26 11:25:45 +02:00
Aliaksandr Valialkin	d5cbda3424	app/vlstorage: add -retention.maxDiskSpaceUsageBytes command-line flag for limiting the retention at VictoriaLogs by disk space usage	2024-06-25 17:30:46 +02:00
Aliaksandr Valialkin	f24123a776	lib/logstorage: parse syslog structured data into separate fields in order to simplify further querying of this data	2024-06-25 14:54:25 +02:00
Aliaksandr Valialkin	1716c4e609	lib/logstorage: properly parse timezone offset at TryParseTimestampRFC3339Nano() The TryParseTimestampRFC3339Nano() must properly parse RFC3339 timestamps with timezone offsets. While at it, make tryParseTimestampISO8601 function private in order to prevent from improper usage of this function from outside the lib/logstorage package. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6508	2024-06-25 14:54:24 +02:00
Aliaksandr Valialkin	30d1f0711f	app/vmselect/netstorage: add a comment explaining why all the samples in block are taken into account when checking the -search.maxSamplesPerQuery limit Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5851 This is a follow-up for `b07a02c516`	2024-06-25 03:06:42 +02:00
Aliaksandr Valialkin	a5445e09c2	Revert "app/vmselect: fix the way of counting raw samples in single query (#6464 )" This reverts commit `5ecf439078`. Reason for revert: the previous logic was correct. The purpose of `-search.maxSamplesPerQuery` command-line flag is to limit the amounts of CPU resources, which could be taken by a single query - see https://docs.victoriametrics.com/#resource-usage-limits . VictoriaMetrics processes samples in blocks during querying - it reads the block, then unpacks it, then filters out samples outside the selected time range. This means that it _spends CPU time_ on reading and unpacking of _all the samples_ in every block on the requested time range, even if only a single sample per each block matches the given time range. The previous logic was effectively limiting CPU time a single query could take. The new logic fails limiting CPU time a single query could take in some pathological cases when only a small fraction of samples per each requested block fit the requested time range. This allows performing multiplication DoS-attacks by querying very narrow time ranges over historical blocks, which tend to be full. For example, if the `-search.maxSamplesPerQuery` equals to a billion, and the query requests a single sample out of 8K samples per each block, this means that the query may unpack a billion of such blocks without exceeding the limit, e.g. it may unpack and process 8K*1e9=8e12 samples. This is not what the resource usage limits were created for originally - see https://docs.victoriametrics.com/#resource-usage-limits Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5851 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6464	2024-06-25 02:55:43 +02:00
Aliaksandr Valialkin	f8ff09cd8d	app/vmui: run `make vmui-update` after `65f414acee`	2024-06-24 23:21:19 +02:00
Aliaksandr Valialkin	faed1394d9	app/vmctl/prometheus/prometheus.go: add missing arg to tsdb.OpenDBReadOnly() function after updating github.com/prometheus/prometheus dependency from v0.52.1 to v0.53.0 in `5c55722db4` See `c5a1cc9148`	2024-06-24 23:16:30 +02:00
Andrii Chubatiuk	516848783e	deployment: build image for vmagent streamaggr benchmark (#6515 ) ### Describe Your Changes optionally build vmagent image for benchmark needed for https://github.com/VictoriaMetrics/ops/pull/1297 ### Checklist The following checks are mandatory: - [ ] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). (cherry picked from commit `6b128da811`)	2024-06-24 16:29:14 +02:00
hagen1778	1aadf2b267	app/vmalert: fix typo in replay error handling Signed-off-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `279815818c`)	2024-06-20 15:15:59 +02:00
hagen1778	d98577ae37	app/vmalert: follow-up `bc37b279aa` * rm extra interface method for rw Client, as it has low applicability and doesn't fit multitenancy well * add `GetDroppedRows` method instead Signed-off-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `4ef76eed7b`)	2024-06-20 15:15:58 +02:00
Hui Wang	a393b993d6	vmalert: exit replay mode with non-zero code if generated samples are… (#6513 ) … not successfully written into remoteWrite url address https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6512 (cherry picked from commit `bc37b279aa`)	2024-06-20 14:00:19 +02:00
Aliaksandr Valialkin	d5224f3363	lib/logstorage: work-in-progress	2024-06-20 03:10:37 +02:00
Yury Molodov	5e8e89f22d	vmui/logs: update footer links (#6498 ) ### Describe Your Changes Update the links in the footer for logs: [LogsQL](https://docs.victoriametrics.com/victorialogs/logsql/) and [Documentation](https://docs.victoriametrics.com/victorialogs/) (cherry picked from commit `13e3bb88a9`)	2024-06-18 15:29:14 +02:00
Yury Molodov	88650abf97	vmui/logs: add bar chart (#6461 ) - Added a bar chart displaying the number of log entries over a time range. #6404 - When `_msg` is empty, all fields are displayed in a single line. - Added double quotes when copying pairs: `key: "value"`. - Minor style adjustments. (cherry picked from commit `32fbffedd9`)	2024-06-18 15:28:56 +02:00
Hui Wang	5be2f2c4e4	vmalert-tool: support file path with hierarchical patterns and regexp… (#6501 ) …es, and http url in unittest cmd-line flag `-files` (cherry picked from commit `3b8970802e`) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-06-18 14:17:42 +02:00
Aliaksandr Valialkin	ee114ca59c	app/vlinsert: properly parse timestamps with nanosecond precision at /insert/jsonline HTTP endpoint This has been broken in `2b6a634ec0`	2024-06-18 00:24:11 +02:00
Aliaksandr Valialkin	c10a646d19	app/vlinsert/syslog: allow accepting syslog messages with different configs at different ports	2024-06-17 23:16:58 +02:00
Aliaksandr Valialkin	f74e6b0674	app/vlinsert: properly parse length-delimited syslog messages sent over TCP according to RFC5425	2024-06-17 22:30:28 +02:00
jackyin	4a6bf7f218	app/vmui: copy button shows undefined (#6495 ) ### Describe Your Changes fix #6421 some aggregation func don't return \_\_name\_\_ value	2024-06-17 22:30:28 +02:00
Roman Khavronenko	df7e300071	app/vmselect/promql: check for ranged vectors in aggr funcs if implicit conversions are disabled (#6450 ) Check for ranged vector arguments in aggregate expressions when `-search.disableImplicitConversion` or `-search.logImplicitConversion` are enabled. For example, `sum(up[5m])` will fail to execute if these flags are set. ### Describe Your Changes Please provide a brief description of the changes you made. Be as specific as possible to help others understand the purpose and impact of your modifications. ### Checklist The following checks are mandatory: - [*] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `6149adbe10`)	2024-06-17 14:25:43 +02:00
Aliaksandr Valialkin	1750991119	lib/logstorage: work-in-progress	2024-06-17 12:13:25 +02:00
Hui Wang	5ecf439078	app/vmselect: fix the way of counting raw samples in single query (#6464 ) The limit is specified with command-line flag `-search.maxSamplesPerQuery`. Previously, samples might be over-counted and query can't be fixed by reducing time range. address https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5851 (cherry picked from commit `6e395048d3`) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-06-14 16:24:31 +02:00
jackyin	f69495cd5f	app/vmalert: fix VMAlert oauth2 error (#6478 ) Properly set ClientSecret param for notifier. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6471 --------- Co-authored-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `5223981fed`)	2024-06-14 15:21:30 +02:00
Andrii Chubatiuk	779436bd9c	app/vmalert: fixed path prefixes for system routes (#6435 ) Fixes https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6433 --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: Roman Khavronenko <roman@victoriametrics.com> (cherry picked from commit `eea361defb`)	2024-06-14 14:14:54 +02:00
LHHDZ	41e4135371	app/vmauth: fix discovering backend IPs when `url_prefix` contains hostname with `srv+` prefix (#6401 ) This change fixes the following panic: ``` 2024-06-04T11:16:52.899Z warn app/vmauth/auth_config.go:353 cannot discover backend SRV records for http://srv+localhost:8080: lookup localhost on 10.100.10.4:53: server misbehaving; use it literally panic: runtime error: integer divide by zero goroutine 9 [running]: github.com/VictoriaMetrics/VictoriaMetrics/lib/httpserver.handlerWrapper.func1() /Users/lhhdz/wd/projects/go/VictoriaMetrics/lib/httpserver/httpserver.go:291 +0x58 panic({0x103115100?, 0x10338d700?}) /Users/lhhdz/go/pkg/mod/golang.org/toolchain@v0.0.1-go1.22.3.darwin-arm64/src/runtime/panic.go:770 +0x124 main.getLeastLoadedBackendURL({0x0?, 0x22?, 0x1400014757b?}, 0x1400013c120?) /Users/lhhdz/wd/projects/go/VictoriaMetrics/app/vmauth/auth_config.go:473 +0x210 main.(*URLPrefix).getBackendURL(0x140000aa080) /Users/lhhdz/wd/projects/go/VictoriaMetrics/app/vmauth/auth_config.go:312 +0xb8 ``` --------- Co-authored-by: Haley Wang <haley@victoriametrics.com>	2024-06-12 11:47:44 +02:00
Aliaksandr Valialkin	9135b404d9	lib/logstorage: work-in-progress	2024-06-11 17:51:01 +02:00
Nikolay	66fbea70a5	follow-up after `77f22fdb8d` (#6458 ) * fixes linter error * simplify code a bit * fixes bug with incorrectly set configSuccess metric. It was not set to 1 in case of config rollback Signed-off-by: f41gh7 <nik@victoriametrics.com>	2024-06-11 12:08:00 +02:00
noodles2hg	77f22fdb8d	[cluster/vminsert]:add reload -relabelConfig on the request to /-/reload (#3923 ) When I use vminsert's `relabelConfig`, I found that now there is no reloaded api. However, `vminsert` under `VM-Single` has it. So, I hope to add it to the `cluster/vminster`. --------- Signed-off-by: z-anshun <1179798460@qq.com> Co-authored-by: Nikolay <nik@victoriametrics.com>	2024-06-10 19:36:41 +02:00
Yury Molodov	2300e30ff3	vmui/logs: add markdown support (#6292 ) Add support for markdown format and emoji for the `_msg` field in the "Group" view. Add markdown rendering toggle. Disabled by default. Value is stored in `localStorage`.	2024-06-10 16:39:25 +02:00
hagen1778	fdf0a936f0	vmctl: rm `--vm-disable-progress-bar` flag It is better to remove deprecated flag completely, so vmctl will fail if this flag is used and user can immediately fix the issue. Before, flag was ignored and it is worse then fail fast. follow-up after `8b46bb0c41 (diff-2bfab3db5cc1baf4c6d3ff6b19901926e3bdf4411ec685dac973e5fcff1c723b)` Signed-off-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `8d95522529`)	2024-06-10 14:05:58 +02:00
Nikolay	0ce7f38e1c	app/vmauth: adds idleConnTimeout flag, retry trivial errors (#6388 ) * adds idleConnTimeout flag, which must reduce probability of `broken pipe` and `connection reset` errors. * one-time retry trivial network requests for the same backend --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `d44058bcd6`)	2024-06-10 12:41:51 +02:00
Dmytro Kozlov	a4bdc14bc5	vmctl: disable progress bar for prometheus snapshot migrations (#6385 ) * deprecate `--vm-disable-progress-bar` in favour of `--disable-progress-bar` * new `--disable-progress-bar` consistently disables usage of progress bar for all migration modes. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6367 --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `8b46bb0c41`) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-06-10 12:41:44 +02:00
Hui Wang	028a80613f	lib/httpserver: allow reloadAuthKey and configAuthKey to override htt… (#6338 ) …pAuth.* address https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6329, makes `reloadAuthKey`, `configAuthKey`, `flagsAuthKey`, `pprofAuthKey` behavior the same way, but keys like `-snapshotAuthKey`, `-forceMergeAuthKey` are still protected by httpAuth.*. All the available key are listed in https://docs.victoriametrics.com/single-server-victoriametrics/#security. --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `61dce6f2a1`) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-06-10 12:41:29 +02:00
Aliaksandr Valialkin	3492f4e1fe	app/vmselect/vmui: run `make vmui-update` after c236e3c03c1bf8ca00292b800a839fcb300e7e51 and 04744c274c269f6b6efb45f68df11abe0fb0ce25	2024-06-07 16:39:06 +02:00
Aliaksandr Valialkin	9dfc7190fe	app/vlselect/vmui: run `make vmui-logs-update` after `a68c2c0f17` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6419 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6408 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6405 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6406 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6407	2024-06-06 12:21:47 +02:00
Yury Molodov	8cf417e1c7	vmui/logs: improve log display for group view (#6419 ) ### Describe Your Changes 1) Set the default limit to `50`. #6408 2) Configure the default search to cover the `last 5 minutes` and include all messages (``). #6405 3) In the header, display only streams and group by stream. #6406 4) Add log processing, without the fields `msg`, `time`, and `stream`. 5) When clicking on logs, display a list of all fields. #6407 <img width="400" alt="image" src="https://github.com/VictoriaMetrics/VictoriaMetrics/assets/29711459/666dcaa3-20fb-4828-b77b-1d849dd9a8ed"> ### Checklist The following checks are mandatory*: - [ ] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/).	2024-06-06 12:21:46 +02:00
Dima Lazerka	362ee240cd	vmui: Improve DownloadConfig button interaction with VMAnomaly (#6397 ) Co-authored-by: Dzmitry Lazerka <dlazerka@gmail.com>	2024-06-06 12:12:58 +02:00
Aliaksandr Valialkin	b45e466a1b	lib/logstorage: work-in-progress	2024-06-05 03:18:25 +02:00
Aliaksandr Valialkin	b7b3a9e9a3	lib/logstorage: work-in-progress	2024-06-04 01:50:55 +02:00
hagen1778	d6096a477f	app/vmalert: rm extra response for unsupported path Unsupported path is already handled by `lib/httpserver`. This prevents from misleading errors in logs caused by double-writing response headers. Signed-off-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `a5f81f67fd`)	2024-06-03 12:53:38 +02:00
hagen1778	73c9981335	chore: follow-up after `c740a8042e` Signed-off-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `6d8e02f278`)	2024-06-03 11:53:37 +02:00
Nikolay	908a50f79d	app/vmalert: adds idleConnTimeout flags and retry trivial network errors (#6382 ) * ".idleConnTimeout" flags must reduce probability of `write: broken pipe` and `read: connection reset by peer` errors Those errors may occur if remote server closes TCP socket for connection, while it's still exist at client. single time retries for `write: broken pipe` and `read: connection reset by peer` must handle a case for incorrectly configured timeouts at middleware proxies, mitigate minor network issues. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5661 ### Describe Your Changes Please provide a brief description of the changes you made. Be as specific as possible to help others understand the purpose and impact of your modifications. --------- Co-authored-by: Roman Khavronenko <roman@victoriametrics.com> (cherry picked from commit `b97916276f`)	2024-06-03 11:52:58 +02:00
yumeiyin	95b8cf76f8	chore: remove redundant words (#6348 ) (cherry picked from commit `9289c7512d`)	2024-05-29 14:37:04 +02:00
Andrii Chubatiuk	2c4a42554a	app/vmagent: fixed streamaggr args (#6374 ) use GetOptionalArg instead of index to fallback to a first argument if index is absent for remotewrite.streamaggr.config (cherry picked from commit `7e5a206057`)	2024-05-29 14:04:24 +02:00
Alexander Marshalov	7d532a31fb	Update base Alpine image to 3.20.0 to avoid security risks (#6370 ) fixes: CVE-2023-42366, CVE-2023-42363, CVE-2024-4603, CVE-2024-2511, CVE-2024-24788, CVE-2024-24787	2024-05-28 22:16:29 +02:00
Aliaksandr Valialkin	03fe4c8963	lib/logstorage: work-in-progress	2024-05-25 21:36:24 +02:00
Aliaksandr Valialkin	3152df2bce	lib/logstorage: work-in-progress	2024-05-25 00:31:55 +02:00
Nikolay	5025ede7bc	lib/mergeset: adds tracking for indexdb records drop (#6297 ) It allows to create alert for possible item drops at indexdb. It may happen, if ingested metric size exceeds max indexdb item size. --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `69d244e6fb`) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-05-24 16:08:34 +02:00
Zakhar Bessarab	85eee7de0a	app/vmselect: update flag description (#6347 ) Update wording to highlight that cache is not persistent if flag is value is empty. Previously, it was not clear if cache is not used at all or just not persistent.	2024-05-24 15:58:54 +02:00
Aliaksandr Valialkin	7a2a2f173e	lib/logstorage: work-in-progress	2024-05-24 03:07:07 +02:00
Aliaksandr Valialkin	ae8b2bcf2e	app/vlselect: fix loading web UI	2024-05-22 23:25:00 +02:00
Aliaksandr Valialkin	6addc79bdb	app/vlselect/vmui: run `make vmui-logs-update`	2024-05-22 22:06:28 +02:00
Nikolay	dfbd2f8ff7	lib/storage: change default value for maxLabelValueLen to 1024 (#6313 ) * It must reduce memory usage for misbehaving clients. Since VictoriaMetrics stores sparse index inmemory. * Reduce disk space usage for indexdb. * Prevent possible indexDB items drops. * It may trigger slow insert and new timeseries registration due to default value for flag change https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6176 --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-05-22 21:55:21 +02:00
Alexander Marshalov	0b70c4c1f1	[vmlogs] fixed time parsing with millisecond precision time (#6293 ) (#6295 ) fix for #6293 Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-05-22 21:54:50 +02:00
Yury Molodov	252a196405	vmui/logs: fix parsing long `_msg` values (#6310 ) This PR fixes an issue where parsing long `_msg` values caused errors, resulting in some log records not being displayed. The error occurred due to partial processing of strings. In some cases, a long record could be split into multiple chunks, causing only part of the record to be processed instead of the entire entry. #6281 Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-05-22 21:44:49 +02:00
Aliaksandr Valialkin	04d0dd2542	lib/logstorage: work-in-progress	2024-05-22 21:01:28 +02:00
Hui Wang	5b8c3fc9d0	app/vmalert: support DNS SRV record in `-remoteWrite.url` (#6299 ) part of https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6053, supports [DNS SRV](https://en.wikipedia.org/wiki/SRV_record) address in `-remoteWrite.url` command-line option. (cherry picked from commit `d7b5062917`)	2024-05-22 10:53:22 +02:00
Yury Molodov	33eaa18c14	vmui: fix URL params handling for navigation (#6284 ) This PR fixes the handling of URL parameters to ensure correct browser navigation using the back and forward buttons. #6126 https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5516#issuecomment-1867507232 (cherry picked from commit `f14497f1cd`)	2024-05-20 14:46:41 +02:00
Yury Molodov	97c3c946a7	vmui/logs: change time range to `start` and `end` query args (#6296 ) change time range limitation from `_time` in the expression to `start` and `end` query args. (cherry picked from commit `a6a599cbdc`)	2024-05-20 14:46:39 +02:00
Roman Khavronenko	3e8b5e74d5	lib/streamaggr: skip empty aggregators (#6307 ) Prevent excessive resource usage when stream aggregation config file contains no matchers by prevent pushing data into Aggregators object. Before this change a lot of extra work was invoked without reason. Signed-off-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `7ce052b32d`) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-05-20 14:46:36 +02:00
Roman Khavronenko	8daa1d9505	app/vmagent: fix panic on shutdown when no global deduplication is co… (#6308 ) …nfigured Follow-up for `f153f54d11` Signed-off-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `7dc18bf67a`)	2024-05-20 14:46:10 +02:00
Aliaksandr Valialkin	582e7d5439	lib/logstorage: work-in-progress	2024-05-20 04:09:15 +02:00
viperstars	ab78f3c89d	app/vmagent/remotewrite: skip sending empty block to downstream server (#6241 ) Occasionally, vmagent sends empty blocks to downstream servers. If a downstream server returns an unexpected response, vmagent gets stuck in a retry loop. While vmagent handles 400 and 409 errors, there are various prometheus remote write implementations that return different error codes. For example, vector returns a 422 error. To mitigate the risk of vmagent getting stuck in a retry loop, it is advisable to skip sending empty blocks to downstream servers. Co-authored-by: hao.peng <hao.peng@smartx.com> Co-authored-by: Zhu Jiekun <jiekun.dev@gmail.com> Co-authored-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `3661373cc2`)	2024-05-17 14:57:07 +02:00
Yury Molodov	5bfbfe6ad2	vmui: remove redundant requests on the `Explore Cardinality` page (#6263 ) Remove redundant requests on the Explore Cardinality page. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6240 (cherry picked from commit `be291c36f7`)	2024-05-17 14:56:55 +02:00
Yury Molodov	0edef9105b	vmui: fix calendar display (#6255 ) Fix the calendar display issue occurring with the `UTC+00:00` timezone https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6239 (cherry picked from commit `4ad577cc6f`)	2024-05-17 14:56:53 +02:00
Andrii Chubatiuk	fe332c3419	app/vmagent: add global aggregator (#6268 ) Add global stream aggregation for VMAgent https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5467 (cherry picked from commit `f153f54d11`)	2024-05-17 14:01:31 +02:00
Nikolay	ee4a94a371	follow-up for `c6c5a5a186` (#6265 ) * adds datadog extensions for statsd: - multiple packed values (v1.1) - additional types distribution, histogram * adds type check and append metric type to the labels with special tag name `__statsd_metric_type__`. It simplifies streaming aggregation config. * remove statsd support from cluster, since cluster doesn't support stream aggregation. --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `b2765c45d0`) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-05-17 13:49:24 +02:00
Roman Khavronenko	a5c427bac4	app/vmalert/datasource: reduce number of allocations when parsing instant responses (#6272 ) Allocations are reduced by implementing custom json parser via fastjson lib. The change also re-uses `promInstant` object in attempt to reduce number of allocations when parsing big responses, as usually happens with heavy recording rules. ``` name old allocs/op new allocs/op delta ParsePrometheusResponse/Instant-10 9.65k ± 0% 5.60k ± 0% ~ (p=1.000 n=1+1) ``` Signed-off-by: hagen1778 <roman@victoriametrics.com> --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `4f0525852f`)	2024-05-16 09:35:58 +02:00
Aliaksandr Valialkin	28626db066	lib/logstorage: work-in-progress (cherry picked from commit `0aa19a2837`)	2024-05-16 09:35:55 +02:00
Roman Khavronenko	955d36357c	app/vmalert/rule: reduce number of allocations for getStaleSeries fn (#6269 ) Allocations are reduced by re-using the byte buffer when converting labels to string keys. ``` name old allocs/op new allocs/op delta GetStaleSeries-10 703 ± 0% 203 ± 0% ~ (p=1.000 n=1+1) ``` Signed-off-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `b0c1f3d819`)	2024-05-16 09:35:51 +02:00
Nikolay	2b2fdffd77	app/vmauth: explicitly unregister metrics set for auth config (#6252 ) it's needed to remove Summary metric type from the global state of metrics package. metrics package tracks each bucket of summary and periodically swaps old buckets with new. Simple set unregister is not enough to release memory used by Set https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6247 (cherry picked from commit `6a6e34ab8e`)	2024-05-14 09:28:37 +02:00
Aliaksandr Valialkin	b1ee7bca1a	lib/logstorage: work-in-progress	2024-05-14 03:06:02 +02:00
Andrii Chubatiuk	ec2273b247	app/vmagent: removed deprecated -remoteWrite.multitenantURL flag support (#6253 ) Removed deprecated `-remoteWrite.multitenantURL` flag to simplify global stream aggregation --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `680b8c25c8`)	2024-05-13 16:49:33 +02:00
Yury Molodov	f18ae015de	vmui/vmanomaly: add download config button (#6231 ) This pull request adds a button to the vmanomaly ui that opens a modal window for viewing and downloading the config file. <img width="610" alt="button" src="https://github.com/VictoriaMetrics/VictoriaMetrics/assets/29711459/0132b178-eb73-4272-8144-be7ed2a8dcaf"> <img height="300" alt="error" src="https://github.com/VictoriaMetrics/VictoriaMetrics/assets/29711459/6d9f2627-77d7-4ce6-b73b-542ce1bbc999"> <img height="300" alt="modal" src="https://github.com/VictoriaMetrics/VictoriaMetrics/assets/29711459/680bffdd-d6a3-445e-bd48-8f0feb30016e"> (cherry picked from commit `37c22ee053`)	2024-05-13 16:49:31 +02:00
Yury Molodov	e430ab1999	vmui/vmanomaly: fix default server url (#6178 ) This PR for ui vmanomaly eliminates URL parameters to automatically use the default server URL, simplifying URLs like: From http://localhost:3000/#/?g0.expr=vm_blocks... to http://localhost:3000 From http://localhost:3000/select/0/vmui/#/?g0.expr=vm_blocks... to http://localhost:3000/select/0/vmui/ etc. (cherry picked from commit `29bd120126`)	2024-05-13 16:49:29 +02:00
Aliaksandr Valialkin	147704aab0	lib/logstorage: initial implementation of pipes in LogsQL See https://docs.victoriametrics.com/victorialogs/logsql/#pipes	2024-05-12 16:36:01 +02:00
Aliaksandr Valialkin	87338633b1	lib/slicesutil: add helper functions for setting slice length and extending its capacity The added helper functions - SetLength() and ExtendCapacity() - replace error-prone code with simple function calls.	2024-05-12 11:33:49 +02:00
Aliaksandr Valialkin	6b81441ed0	app/vmselect: use strings.EqualFold instead of strings.ToLower where appropriate Strings.EqualFold doesn't allocate memory contrary to strings.ToLower if the input string contains uppercase chars	2024-05-12 10:21:24 +02:00
Aliaksandr Valialkin	536d87cd51	app/vmselect/promql: properly estimate the needed amounts of memory for executing aggregate function over rollup function in incremental mode Incremental aggregation processes only GOMAXPROCS time series at a time, so its' memory usage doesn't depend on the number of input time series. The issue has been introduced in `5138eaeea0` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3203	2024-05-12 10:14:27 +02:00
Roman Khavronenko	0bed453737	Feature allow configuring disableOnDiskQueue and dropSamplesOnOverload per url (#6248 ) * FEATURE: [vmagent](https://docs.victoriametrics.com/vmagent.html): allow configuring `-remoteWrite.disableOnDiskQueue` and `-remoteWrite.dropSamplesOnOverload` cmd-line flags per each `-remoteWrite.url`. See this [pull request](https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6065). Thanks to @rbizos for implementaion! * FEATURE: [vmagent](https://docs.victoriametrics.com/vmagent.html): add labels `path` and `url` to metrics `vmagent_remotewrite_push_failures_total` and `vmagent_remotewrite_samples_dropped_total`. Now number of failed pushes and dropped samples can be tracked per `-remoteWrite.url`. --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: Raphael Bizos <r.bizos@criteo.com> (cherry picked from commit `87fd400dfc`)	2024-05-10 14:32:23 +02:00
qiangxuhui	885fc4122a	Add build support for loong64 (#6222 ) ### Describe Your Changes Added makefile rule for `GOARCH=loong64` to support building all VictoriaMetrics components on the `loongarch64` platform. ### Checklist The following checks are mandatory: * [X] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). Signed-off-by: qiangxuhui <qiangxuhui@loongson.cn> (cherry picked from commit `80f3644ee3`) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-05-10 14:32:05 +02:00
hagen1778	879170221c	app/vmselect/vmui: add missing static files These files weren't added to the git after `make vmui-build vmui-update` command in commit `7fd9325e62 (diff-50d9a4b91bdad190f2db92553736267103ab4225dfb6642b675fb4b8196e6560)` Related to https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6224 Signed-off-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `56531abd56`)	2024-05-10 14:29:20 +02:00
Zhu Jiekun	139f909cdb	chore: [deployment] upgrade from go 1.22.2 to 1.22.3 to include security fixes (#6238 ) ### Describe Your Changes upgrade from go 1.22.2 to 1.22.3 to include security fixes. Also see: - https://go.dev/doc/devel/release - https://github.com/golang/go/issues?q=milestone%3AGo1.22.3+label%3ACherryPickApproved ### Checklist The following checks are mandatory: - [X] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). Signed-off-by: Jiekun <jiekun.dev@gmail.com> (cherry picked from commit `02851d7800`)	2024-05-10 14:28:56 +02:00
Oleg	76af930e4a	Statsd protocol compatibility (#5053 ) In this PR I added compatibility with [statsd protocol](https://github.com/b/statsd_spec) with tags to be able to send metrics directly from statsd clients to vmagent or directly to VM. For example its compatible with [statsd-instrument](https://github.com/Shopify/statsd-instrument) and [dogstatsd-ruby](https://github.com/DataDog/dogstatsd-ruby) gems Related issues: #5052, #206, #4600 (cherry picked from commit `c6c5a5a186`) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-05-10 14:27:31 +02:00
Ted Possible	0206a01d03	Exemplar support (#5982 ) This code adds Exemplars to VMagent and the promscrape parser adhering to OpenMetrics Specifications. This will allow forwarding of exemplars to Prometheus and other third party apps that support OpenMetrics specs. --------- Signed-off-by: Ted Possible <ted_possible@cable.comcast.com> (cherry picked from commit `5a3abfa041`)	2024-05-10 13:14:17 +02:00
Andrii Chubatiuk	e26b55db1e	app/vmagent/remotewrite: do not cleanup timeseries which are used in multiple remote write contexts (#6206 ) When at least one remote write has deduplication configured it cleans up timeseries while they can be in use by another remote write without deduplication https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6205 --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `879771808b`)	2024-05-06 12:10:45 +02:00
Yury Molodov	75af52c1d0	vmui: fix issue preventing first query trace expansion (#6197 ) https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6186 (cherry picked from commit `046a4a5ecf`)	2024-04-30 18:39:22 +02:00
Hui Wang	abd29c15ab	docs: update vmalert and vmagent docs (#6207 ) * restore and actualize doc section explaining duplicated labels error * rm misleading comment about post-aggregation in stream aggregation (cherry picked from commit `e3c226cf92`)	2024-04-30 10:30:19 +02:00
Roman Khavronenko	fc28390618	app/vmauth: add test for LeastLoaded balance policy (#6144 ) Check if least-loaded works correctly. related to https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6136 Signed-off-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `e2590b339d`)	2024-04-30 10:30:14 +02:00
hagen1778	dfad598092	app/vmselect: run make vmui-update Signed-off-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `7fd9325e62`)	2024-04-25 16:02:59 +02:00
Hui Wang	7fdea4b31c	app/vmselect: implement cmd-line flags `-search.disableImplicitConversions` and `-search.logImplicitConversions` (#6180 ) address https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4338 support disable or log [implicit conversions](https://docs.victoriametrics.com/metricsql/#implicit-query-conversions) for subquery with cmd-line flags `-search.disableImplicitConversion` and `-search.logImplicitConversion` Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `dd0d2c77c8`)	2024-04-25 13:08:05 +02:00
Yury Molodov	14c0c06526	vmui: improve error message for server response issues (#6177 ) Updates error messages for better clarity and guidance on server response issues. (cherry picked from commit `57b7d16259`)	2024-04-25 13:08:02 +02:00
Yury Molodov	669cbcb92e	vmui: trigger auto-suggestion at any cursor position (#6155 ) - Implemented auto-suggestion triggers for mid-string cursor positions in vmui. - Improved the suggestion list positioning to appear directly beneath the active text editing area. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5864 (cherry picked from commit `6193fa3dcf`)	2024-04-25 13:08:00 +02:00
hagen1778	59b3f21708	Revert "app/vmbackup: introduce new flag type URL (#6152 )" This reverts commit `029060af60`. (cherry picked from commit `679844feaf`)	2024-04-24 17:08:26 +02:00
Roman Khavronenko	ff73b66182	app/vmbackup: introduce new flag type URL (#6152 ) The new flag type is supposed to be used for specifying URL values which could contain sensitive information such as auth tokens in GET params or HTTP basic authentication. The URL flag also allows loading its value from files if `file://` prefix is specified. As example, the new flag type was used in app/vmbackup as it requires specifying `authKey` param for making the snapshot. See related issue https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5973 Thanks to @wasim-nihal for initial implementation https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6060 --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `029060af60`)	2024-04-24 17:08:24 +02:00
hagen1778	57c841669c	app/vmagent: mention corner case with dangling queues and identical URLs See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6140 We don't cover this corner case as it has low chance for reproduction. Precisely, the requirements are following: 1. vmagent need to be configured with multiple identical `remoteWrite.url` flags; 2. At least one of the persistent queues need to be non-empty, which already signalizes about issues with setup; 3. vmagent need to be restarted with removing of one of `remoteWrite.url` flags. We do not document this case in vmagent.md as it seems to be a rare corner case and its explanation will require too much of explanation and confuse users. Signed-off-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `4251292708`)	2024-04-23 14:52:35 +02:00
Roman Khavronenko	2566e19306	app/vmalert: fix links with anchors in vmalert's UI (#6146 ) Starting from v1.99.0 vmalert could ignore anchors pointing to specific rule groups if `search` param was present in URL. This change makes anchors compatible with `search` param in UI. Signed-off-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `5f487c7090`)	2024-04-22 15:05:23 +02:00
hagen1778	342290275e	app/streamaggr: follow-up after `c0e4ccb7b5` * rm vmagent mentions from vminsert flags * improve documentation wording, add links to related sections * mention `ignore_first_intervals` in the stream aggr options * update flags description * add basic test for config parsing validation Signed-off-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `bae3874e6a`) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-04-22 14:39:23 +02:00
Andrii Chubatiuk	131367fb59	lib/streamaggr: add option to ignore first N aggregation intervals (#6137 ) Stream aggregation may yield inaccurate results if it processes incomplete data. This issue can arise when data is sourced from clients that maintain a queue of unsent data, such as Prometheus or vmagent. If the queue isn't fully cleared within the aggregation interval, only a portion of the time series may be included in that period, leading to distorted calculations. To mitigate this we add an option to ignore first N aggregation intervals. It is expected, that client queues will be cleared during the time while aggregation ignores first N intervals and all subsequent aggregations will be correct. (cherry picked from commit `c0e4ccb7b5`) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-04-22 14:34:36 +02:00
Aliaksandr Valialkin	c146b24196	app/vminsert: replace hybrid sync.Pool+channel-based pool scheme for poolCtx with plain sync.Pool This simplifies the code, while doesn't increase memory usage under low and high data ingestion rate. This is a follow-up for `1decbcf6eb`	2024-04-20 21:46:11 +02:00
Aliaksandr Valialkin	71d0020c2f	app/vminsert/influx: replace hybrid channel-based pool+sync.Pool with plain sync.Pool for pushCtx The memory usage for plain sync.Pool doesn't increase comparing to the memory usage for the hybrid scheme, so it is better to use plain sync.Pool in order to simplify the code and make it more readable and maintainable. This is a follow-up for `c22da2f917`	2024-04-20 21:41:18 +02:00
Aliaksandr Valialkin	a249ab96b4	app/vmagent/influx: replace hybrid channel-based pool + sync.Pool with plain sync.Pool for pushCtx Data ingestion benchmark doesn't show memory usage difference between two approaches, so let's use simpler approach in order to improve code readability and maintainability. This is a follow-up for `77c597738c`	2024-04-20 21:38:25 +02:00
Aliaksandr Valialkin	d6da68ee90	app/vmagent/common: use plain sync.Pool instead of a mix of sync.Pool with channel-based pool for PushCtx This scheme was used for reducing memory usage when vmagent runs on a machine with big number of CPU cores and the ingestion rate isn't too big. The scheme with channel-based pool could reduce memory usage, since it minimizes the number of PushCtx structs in the pool in this case. Performance tests didn't reveal significant difference in memory usage under both low and high ingestion rate between plain sync.Pool and the current hybrid scheme, so replace the scheme with plain sync.Pool in order to simplify the code.	2024-04-20 21:31:14 +02:00
Aliaksandr Valialkin	9004bc098e	all: use clear() built-in Go function for clearing []prompbmarshal.TimeSeries and []prompbmarshal.Label slices This makes the code a bit clear.	2024-04-20 21:00:24 +02:00
Aliaksandr Valialkin	498fe1cfa5	app/vminsert/common: remove obsolete optimization for reducing memory usage for InsertCtx pool This optimization is no longer needed according to benchmarks with ingestion rate. This simplifies the code a bit.	2024-04-20 20:51:38 +02:00
Aliaksandr Valialkin	fba3c10ed1	app/vmselect/promql: add support for matching against multiple numeric constants via `q == (c1,...,cN)` and `q != (c1,...,cN)` syntax	2024-04-19 17:57:09 +02:00
Aliaksandr Valialkin	5b29be1f4d	app/vmagent/remotewrite: add support for replication additionally to sharding when both -remoteWrite.shardByURL and -remoteWrite.shardByURLReplicas=RF command-line flags are set This allows setting up data replication among failure domains if the replication factor is smaller than the number of failure domains. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6054 See https://docs.victoriametrics.com/vmagent/#sharding-among-remote-storages	2024-04-19 11:37:04 +02:00
Hui Wang	e0d47ab6af	vmalert: avoid blocking APIs when alerting rule uses template functio… (#6129 ) * vmalert: avoid blocking APIs when alerting rule uses template function `query` * app/vmalert: small refactoring * simplify labels and templates expanding * simplify `newAlert` interface * fix `TestGroupStart` which mistakenly skipped annotations and response labels check Signed-off-by: hagen1778 <roman@victoriametrics.com> * reduce alerts lock time when restore --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2024-04-19 11:30:40 +02:00
Roman Khavronenko	95b0f82c9b	app/vmalert: make `TestGroupStart` more reliable (#6130 ) There was a sleep statement in the test, waiting for Group to perform a couple of evaluation. But looks like it worked unreliable for some CI tests like the one below https://github.com/VictoriaMetrics/VictoriaMetrics/actions/runs/8718213844/job/23915007958?pr=6115 This commit changes the sleep statement on a function that waits for a specific number of evaluations. It should make this test faster in general case, and more reliable for slow environemnts.	2024-04-19 11:28:30 +02:00

1 2 3 4 5 ...

3456 Commits