VictoriaMetrics

mirror of https://github.com/VictoriaMetrics/VictoriaMetrics.git synced 2024-12-21 07:56:26 +01:00

Author	SHA1	Message	Date
Aliaksandr Valialkin	3a9cf13aaa	app/{vmagent,vmalert}: add the ability to set OAuth2 endpoint params via the corresponding *.oauth2.endpointParams command-line flags This is a follow-up for `5ebd5a0d7b` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5427	2023-12-20 21:38:16 +02:00
Morgan	64e96fccd9	Expose OAuth2 Endpoint Parameters to cli (#5427 ) The user may which to control the endpoint parameters for instance to set the audience when requesting an access token. Exposing the parameters as a map allows for additional use cases without requiring modification.	2023-12-20 21:38:13 +02:00
Aliaksandr Valialkin	c888d76c4b	app/vmselect/netstorage: make sure that at least a single result is collected from every storage group before deciding whether it is OK to skip results from the remaining storage nodes	2023-12-20 19:53:49 +02:00
Aliaksandr Valialkin	261c173f4b	all: use Gauge instead of Counter for `*_config_last_reload_successful` metrics This allows exposing the correct TYPE metadata for these labels when the app runs with -metrics.exposeMetadata command-line flag. See https://github.com/VictoriaMetrics/metrics/pull/61#issuecomment-1860085508 for more details. This is follow-up for `326a77c697`	2023-12-20 14:25:44 +02:00
Yury Molodov	d0b047a2bf	vmui: add vmanomaly explorer (#5401 )	2023-12-20 14:15:25 +02:00
Roman Khavronenko	bcb2b8247c	vmctl: rename `vm-native-disable-retries` to `vm-native-disable-per-metric-migration` (#5476 ) The change supposed to better reflect the meaning of this flag. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-12-17 19:19:22 +02:00
Roman Khavronenko	5f14fa94dd	vmctl: retry requests that failed in the very end for `vm-native` (#5475 ) Before, retries happened only on writes into a network connection between source and destination. But errors returned by server after all the data was transmitted were logged, but not retried. Signed-off-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `664fa5cb78`)	2023-12-15 11:54:08 +01:00
Hui Wang	ed4f77575f	vmalert: validate schema for `-external.url` (#5450 ) Requests with wrong or no schema in `-external.url` could be rejected by alertmanager. So we validate schema on start up. (cherry picked from commit `9253c24dd6`)	2023-12-15 11:54:07 +01:00
Dima Lazerka	44c113f829	VMUI: Handle unknown query error response type (#5451 ) * VMUI: Handle unknown query error response type * vmui: add error text for unknown error type * Simplify nested `if`s for unknown error Accepting @Loori-R's suggestion Co-authored-by: Yury Molodov <yurymolodov@gmail.com> --------- Co-authored-by: Yury Moladau <yurymolodov@gmail.com> (cherry picked from commit `cd277e3f84`)	2023-12-15 11:54:07 +01:00
Aliaksandr Valialkin	42629dead1	app/vmstorage: addd missing -inmemoryDataFlushInterval command-line flag Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3337	2023-12-14 20:47:49 +02:00
Anton Tykhyy	51af1dfff7	Fix sum(aggr_over_time) 'got 1 args' error (#3028 ) (#5414 ) app/vmselect/promql/eval.go:evalAggrFunc shunts evaluation of AggrFuncExpr over rollupFunc over MetricsExpr to an optimized path. tryGetArgRollupFuncWithMetricExpr() checks whether expression can be shunted, but it mangles the AggrFuncExpr when the aggregation function has more than one argument. This results in queries like `sum(aggr_over_time("avg_over_time",m))` failing with error message 'expecting at least 2 args to "aggr_over_time"; got 1 args' while the analogous query `sum(avg_over_time(m))` executes successfully. This fix removes the unnecessary mangling. Signed-off-by: Anton Tykhyy <atykhyy@gmail.com>	2023-12-14 12:49:01 +02:00
Aliaksandr Valialkin	4ee42e9e73	app/vmauth: allow specifying an empty retry_status_codes and and zero drop_src_path_prefix_parts in order to override user-level setting Previously `retry_status_codes: []` and `drop_src_path_prefix_parts: 0` at `url_map` were equivalent to missing values. This was resulting in using the user-level values instead.	2023-12-14 01:06:50 +02:00
Aliaksandr Valialkin	51acf0179c	app/vmauth: add ability to route requests to different backends depending on the request host	2023-12-14 00:47:00 +02:00
Yury Molodov	e76c44c5b4	vmui: autocomplete usability improvements (#5422 ) * vmui: add show quick tip for autocomplete * vmui: auto-completion usability improvements #5348 * vmui: add const for min symbols in autocomplete * Use proper queries to VictoriaMetrics * vmui: fix comments for autocomplete * app/vmselect: run `make vmui-update` --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-12-13 00:33:27 +02:00
Aliaksandr Valialkin	e4bb2808f1	app/vmselect: add support for vmstorage groups with independent -replicationFactor per group Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5197 See https://docs.victoriametrics.com/Cluster-VictoriaMetrics.html#vmstorage-groups-at-vmselect Thanks to @zekker6 for the initial pull request at https://github.com/VictoriaMetrics/VictoriaMetrics-enterprise/pull/718	2023-12-13 00:14:34 +02:00
hagen1778	3b841fe9ce	app/vmctl: follow-up after `6af732b6f7` Make docs more clear about new feature. Signed-off-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `242472086b`)	2023-12-12 13:45:35 +01:00
Dmytro Kozlov	63fc200f16	app/vmctl: enable range steps in reverse order (#5444 ) See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5376 (cherry picked from commit `6af732b6f7`)	2023-12-12 13:45:35 +01:00
hagen1778	307bcb8d4d	app/vmctl: follow-up after `27668c9d01` * remove duplications in error messages * mention the change in CHANGELOG.md `27668c9d01` Signed-off-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `39c405ed4d`)	2023-12-11 15:38:22 +01:00
wozz	61f940591f	vmctl: check error in response from influxdb (#5446 ) (cherry picked from commit `27668c9d01`)	2023-12-11 15:38:20 +01:00
Aliaksandr Valialkin	842aba3f46	deployment/docker: update base Docker image from alpine:3.18.5 to alpine:3.19.0 See https://www.alpinelinux.org/posts/Alpine-3.19.0-released.html	2023-12-10 02:28:31 +02:00
Aliaksandr Valialkin	3d6517b05e	app/vmselect: add -search.maxResponseSeries command-line flag for limiting the number of time series a single response can return This limit can be used for preventing from high memory usage at Grafana when the response returns too many series. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5372	2023-12-10 00:54:32 +02:00
Aliaksandr Valialkin	55eb48f5ee	app: make more clear that -tls enables https at -httpListenAddr	2023-12-10 00:25:23 +02:00
Aliaksandr Valialkin	49552eaa15	app/vmauth: add support for `hot standby` mode via `first_available` load balancing policy vmauth in `hot standby` mode sends requests to the first url_prefix while it is available. If the first url_prefix becomes unavailable, then vmauth falls back to the next url_prefix. This allows building highly available setup as described at https://docs.victoriametrics.com/vmauth.html#high-availability Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4893 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4792	2023-12-08 23:32:10 +02:00
Roman Khavronenko	276e9301f4	app/vmalert: sanitize label names before sending to Alertmanager (#5442 ) Before, vmalert would send notifications with labels containing characters not supported by Alertmanager validator, resulting into validation errors like `msg="Failed to validate alerts" err="invalid label set: invalid name "foo.bar"` Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-12-08 18:09:07 +02:00
Alexander Marshalov	e9cf39f519	added field `version` to the response for `/api/v1/status/buildinfo` API for using more efficient API in Grafana for receiving label values, added additional info about setup Grafana datasource (#5370 ) (#5437 )	2023-12-07 16:41:56 +02:00
Aliaksandr Valialkin	32aea90847	app/vmselect/prometheus: go fmt after `b39e9257eb`	2023-12-07 16:05:01 +02:00
Aliaksandr Valialkin	9f79342e6a	app/vmselect/prometheus: properly encode Prometheus label values at /federate endpoint Prometheus spec says that only \, \n and " must be escaped inside label values. See `995743836e/content/docs/instrumenting/exposition_formats.md (L90)` See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5431	2023-12-07 15:36:50 +02:00
Aliaksandr Valialkin	12e94f10cc	deployment/docker: update Go builder from Go1.21.4 to Go1.21.5 See https://github.com/golang/go/issues?q=milestone%3AGo1.21.5+label%3ACherryPickApproved	2023-12-06 22:33:27 +02:00
Dmytro Kozlov	6a41e1ec0c	app/vmalert: replace error metrics for gauges with counter metrics (#5217 ) See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5160 Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `935bec447b`)	2023-12-06 19:41:34 +01:00
Aliaksandr Valialkin	509339bf63	app/vmselect: properly adjust the lower bound for the time range where raw samples must be selected for default_rollup() function Previously the lower bound could be too small, which could result in missing values at the beginning of the graph for default_rollup() function. This function is automatically applied to all the series selectors if they aren't explicitly wrapped into a rollup function - see https://docs.victoriametrics.com/MetricsQL.html#implicit-query-conversions While at it, properly take into account `-search.minStalenessInterval` command-line flag when adjusting the lower bound for the selected time range. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5388	2023-12-06 14:46:18 +02:00
Aliaksandr Valialkin	559e4db512	Revert "add datadog /api/v2/series and /api/beta/sketches support (#5094 )" This reverts commit `d6b4c8e4ef`. Reason for revert: https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5094#issuecomment-1839789080	2023-12-05 02:30:40 +02:00
Aliaksandr Valialkin	bf187b2dc9	app/vmagent: add `-enableMultitenantHandlers` command-line flag This flag allows converting tenant id to (vm_account_id, vm_project_id) labels. this flag deprecates `-remoteWrite.multitenantURL` command-line flag, because `-enableMultitenantHandlers` is easier to use and combine with multitenant url at vminsert - https://docs.victoriametrics.com/Cluster-VictoriaMetrics.html#multitenancy-via-labels See https://docs.victoriametrics.com/vmagent.html#multitenancy Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/1505	2023-12-05 01:35:59 +02:00
Aliaksandr Valialkin	85fcefaa34	app/vmagent: code cleanup for Kafka and Google PubSub consumers / producers - Add links to relevant docs into descriptions for every -kafka.* and -gcp.pubsub.* command-line flags. - Wait until message processing goroutines are stopped before returning from gcppubsub.Stop(). - Prevent from multiple calls to Init() without Stop(). - Drop message if tenantID cannot be parsed properly. - Take into account tenantID for all the supported message formats. - Support gzip-compressed messages for graphite format. - Use exponential backoff sleep when the message cannot be pushed to remote storage systems because of disabled on-disk persistence - https://docs.victoriametrics.com/vmagent.html#disabling-on-disk-persistence - Unblock from sleep as soon as Stop() is called. Previously the sleep could take up to 2 seconds after Stop() is called. - Remove unused globalCtx and initContext from app/vmagent/remotewrite/gcppubsub - Mention Google PubSub support at docs/enterprise.md - Make Google PubSub docs more clear at docs/vmagent.md This is a follow-up for commits 115245924a5f096c5a3383d6cc8e8b6fbd421984 and e6eab781ce42285a6a1750dc01eba6801dd35516 . Updates https://github.com/VictoriaMetrics/VictoriaMetrics-enterprise/pull/717 Updates https://github.com/VictoriaMetrics/VictoriaMetrics-enterprise/pull/713	2023-12-04 22:51:04 +02:00
Dmytro Kozlov	6770bad207	app/vmalert: expose `/vmalert/api/v1/rule` and `/api/v1/rule` API which returns rule status in JSON format (#5397 ) * app/vmalert: expose `/vmalert/api/v1/rule` and `/api/v1/rule` API which returns rule status in JSON format * app/vmalert: hide updates if query param not set * app/vmalert: fix panic (recursion call) * app/vmalert: add needed group name and file name * app/vmalert: fix comment, update behavior * app/vmalert: fix description * app/vmalert: simplify API for /api/v1/rule Signed-off-by: hagen1778 <roman@victoriametrics.com> * app/vmalert: simplify API for /api/v1/rule Signed-off-by: hagen1778 <roman@victoriametrics.com> * app/vmalert: simplify API for /api/v1/rule Signed-off-by: hagen1778 <roman@victoriametrics.com> * app/vmalert: simplify API for /api/v1/rule Signed-off-by: hagen1778 <roman@victoriametrics.com> * app/vmalert: simplify API for /api/v1/rule Signed-off-by: hagen1778 <roman@victoriametrics.com> --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2023-12-04 22:49:39 +02:00
Aliaksandr Valialkin	d868155751	app/vmselect: do not limit concurrency for static and fast queries Previously concurrency for static and fast queries was limited with the -search.maxConcurrentRequests command-line flag. This could complicate identifying heavy queries via `vmui` at `Top queries` and `Active queries` pages, since `vmui` and these pages couldn't be opened on overloaded vmselect. Thanks to @f41gh7 for the idea.	2023-12-04 18:14:29 +02:00
Aliaksandr Valialkin	9f352f1b93	app/vminsert/newrelic: simplify the code a bit after `1fb8dc0092` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5416 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5421	2023-12-04 16:26:52 +02:00
Dmytro Kozlov	1fb8dc0092	app/vminsert: fix newrelic ingestion in cluster version (#5421 ) Properly pass tenant ID to ingested data from newrelic. Before tenant ID was mistakenly skipped.	2023-12-04 09:38:32 +01:00
Aliaksandr Valialkin	e017176f45	docs/vmauth.md: add typical use cases (cherry picked from commit `837f6f0975`)	2023-12-01 14:00:23 +01:00
Andrii Chubatiuk	d6b4c8e4ef	add datadog /api/v2/series and /api/beta/sketches support (#5094 ) Co-authored-by: Andrew Chubatiuk <andrew.chubatiuk@motional.com> Co-authored-by: Nikolay <https://github.com/f41gh7> Co-authored-by: Roman Khavronenko <roman@victoriametrics.com> (cherry picked from commit `543f218fe9`) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-12-01 13:55:32 +01:00
luckyxiaoqiang	8ce82c5400	app/vmselect/promql: add day_of_year() function (#5368 ) Co-authored-by: dingxiaoqiang <dingxiaoqiang@bytedance.com> Co-authored-by: Roman Khavronenko <roman@victoriametrics.com> (cherry picked from commit `d7897e0d70`)	2023-11-28 12:49:48 +01:00
Aliaksandr Valialkin	5ccc22d66d	app/vmagent: properly increase vmagent_remotewrite_samples_dropped_total when scraped samples cannot be sent to the remote storage and -remoteWrite.dropSamplesOnOverload is set This is a follow-up for `5034aa0773` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2110	2023-11-25 14:44:42 +02:00
Aliaksandr Valialkin	2f14394335	app/vmagent: follow-up for `090cb2c9de` - Add Try* prefix to functions, which return bool result in order to improve readability and reduce the probability of missing check for the result returned from these functions. - Call the adjustSampleValues() only once on input samples. Previously it was called on every attempt to flush data to peristent queue. - Properly restore the initial state of WriteRequest passed to tryPushWriteRequest() before returning from this function after unsuccessful push to persistent queue. Previously a part of WriteRequest samples may be lost in such case. - Add -remoteWrite.dropSamplesOnOverload command-line flag, which can be used for dropping incoming samples instead of returning 429 Too Many Requests error to the client when -remoteWrite.disableOnDiskQueue is set and the remote storage cannot keep up with the data ingestion rate. - Add vmagent_remotewrite_samples_dropped_total metric, which counts the number of dropped samples. - Add vmagent_remotewrite_push_failures_total metric, which counts the number of unsuccessful attempts to push data to persistent queue when -remoteWrite.disableOnDiskQueue is set. - Remove vmagent_remotewrite_aggregation_metrics_dropped_total and vm_promscrape_push_samples_dropped_total metrics, because they are replaced with vmagent_remotewrite_samples_dropped_total metric. - Update 'Disabling on-disk persistence' docs at docs/vmagent.md - Update stale comments in the code Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5088 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2110	2023-11-25 12:13:39 +02:00
Nikolay	25ac2aac31	app/vmagent: allow to disabled on-disk persistence (#5088 ) * app/vmagent: allow to disabled on-disk queue Previously, it wasn't possible to build data processing pipeline with a chain of vmagents. In case when remoteWrite for the last vmagent in the chain wasn't accessible, it persisted data only when it has enough disk capacity. If disk queue is full, it started to silently drop ingested metrics. New flags allows to disable on-disk persistent and immediatly return an error if remoteWrite is not accessible anymore. It blocks any writes and notify client, that data ingestion isn't possible. Main use case for this feature - use external queue such as kafka for data persistence. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2110 * adds test, updates readme * apply review suggestions * update docs for vmagent * makes linter happy --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-11-25 12:12:29 +02:00
Roman Khavronenko	26242f526e	lib/protoparser: decrease `import.maxLineLen` from 100MB to 10MB (#5364 ) Tests showed that importing a single line with 70MB size takes 5.3GiB RSS memory for VictoriaMetrics single-node. In the scenario when user exports and imports data from one VM to another, it could possibly lead to OOM exception for destination VM. Importing a single line with 16MB size taks 1.3GiB RSS memory. Hence, the limit for `import.maxLineLen` was decreased from 100MB to 10MB to improve reliability of VictoriaMetrics during imports. Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-11-24 13:13:33 +02:00
Aliaksandr Valialkin	a906a7d85c	app/vmagent/remotewrite: do not drop persistent queues when -remoteWrite.multitenantURL is set It is unsafe to drop persistent queues when -remoteWrite.multitenantURL command-line flag is set, since these queues are created on demand when a new sample for the given tenant is pushed to the remote storage. This addresses https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5357 The issue has been appeared in the commit `f3a51e8b1d` when implementing https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4014	2023-11-23 20:43:21 +02:00
Aliaksandr Valialkin	10b4dfbbf9	app/vmalert/notifier: remove backticks from the description for -notifier.blackhole command-line flag Backticks in flag description are automatically converted to flag type. See https://pkg.go.dev/flag#PrintDefaults This is a follow-up for `20025d4fd6` and `25317b4e70`	2023-11-22 20:17:45 +02:00
Aliaksandr Valialkin	db6dadf1f7	docs: convert png images to webp in all the docs except of docs/operator/* This reduces the size of docs/* folder from 33MB to 18MB Images inside docs/operator/* must be converted at the https://github.com/VictoriaMetrics/operator/tree/master/docs and then the updated images must be automatically propagated to the docs/operator/* This is a follow-up for `d3f919df3e` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5206	2023-11-22 19:29:47 +02:00
Aliaksandr Valialkin	46e58f3669	app/vmagent/README.md: sync with docs/vmagent.md after `cbe4a5c251` , so `make docs-sync` properly works	2023-11-20 22:43:28 +02:00
Nikolay	c06044ef52	app/vmagent: adds google pubsub as remoteWrite dst and ingest consumer (#713 ) it allows to push and receive metrics from google pubsub queue Adds needed documentation and examples for it	2023-11-20 22:43:26 +02:00
hagen1778	0dbbffbdd5	docs: typo after `3f5a41e35e` Signed-off-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `20025d4fd6`)	2023-11-20 17:06:21 +01:00

1 2 3 4 5 ...

2966 Commits