VictoriaMetrics

mirror of https://github.com/VictoriaMetrics/VictoriaMetrics.git synced 2024-12-26 20:30:10 +01:00

Author	SHA1	Message	Date
Roman Khavronenko	7ce052b32d	lib/streamaggr: skip empty aggregators (#6307 ) Prevent excessive resource usage when stream aggregation config file contains no matchers by prevent pushing data into Aggregators object. Before this change a lot of extra work was invoked without reason. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-05-20 14:03:28 +02:00
Roman Khavronenko	7dc18bf67a	app/vmagent: fix panic on shutdown when no global deduplication is co… (#6308 ) …nfigured Follow-up for `f153f54d11` Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-05-20 13:23:09 +02:00
viperstars	3661373cc2	app/vmagent/remotewrite: skip sending empty block to downstream server (#6241 ) Occasionally, vmagent sends empty blocks to downstream servers. If a downstream server returns an unexpected response, vmagent gets stuck in a retry loop. While vmagent handles 400 and 409 errors, there are various prometheus remote write implementations that return different error codes. For example, vector returns a 422 error. To mitigate the risk of vmagent getting stuck in a retry loop, it is advisable to skip sending empty blocks to downstream servers. Co-authored-by: hao.peng <hao.peng@smartx.com> Co-authored-by: Zhu Jiekun <jiekun.dev@gmail.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2024-05-17 14:55:17 +02:00
Andrii Chubatiuk	f153f54d11	app/vmagent: add global aggregator (#6268 ) Add global stream aggregation for VMAgent https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5467	2024-05-17 14:00:47 +02:00
Nikolay	b2765c45d0	follow-up for `c6c5a5a186` (#6265 ) * adds datadog extensions for statsd: - multiple packed values (v1.1) - additional types distribution, histogram * adds type check and append metric type to the labels with special tag name `__statsd_metric_type__`. It simplifies streaming aggregation config. --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2024-05-16 09:25:42 +02:00
Andrii Chubatiuk	680b8c25c8	app/vmagent: removed deprecated -remoteWrite.multitenantURL flag support (#6253 ) Removed deprecated `-remoteWrite.multitenantURL` flag to simplify global stream aggregation --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2024-05-13 15:22:37 +02:00
Roman Khavronenko	87fd400dfc	Feature allow configuring disableOnDiskQueue and dropSamplesOnOverload per url (#6248 ) * FEATURE: [vmagent](https://docs.victoriametrics.com/vmagent.html): allow configuring `-remoteWrite.disableOnDiskQueue` and `-remoteWrite.dropSamplesOnOverload` cmd-line flags per each `-remoteWrite.url`. See this [pull request](https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6065). Thanks to @rbizos for implementaion! * FEATURE: [vmagent](https://docs.victoriametrics.com/vmagent.html): add labels `path` and `url` to metrics `vmagent_remotewrite_push_failures_total` and `vmagent_remotewrite_samples_dropped_total`. Now number of failed pushes and dropped samples can be tracked per `-remoteWrite.url`. --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: Raphael Bizos <r.bizos@criteo.com>	2024-05-10 12:09:21 +02:00
qiangxuhui	80f3644ee3	Add build support for loong64 (#6222 ) ### Describe Your Changes Added makefile rule for `GOARCH=loong64` to support building all VictoriaMetrics components on the `loongarch64` platform. ### Checklist The following checks are mandatory: * [X] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/). Signed-off-by: qiangxuhui <qiangxuhui@loongson.cn>	2024-05-09 14:22:03 +02:00
Oleg	c6c5a5a186	Statsd protocol compatibility (#5053 ) In this PR I added compatibility with [statsd protocol](https://github.com/b/statsd_spec) with tags to be able to send metrics directly from statsd clients to vmagent or directly to VM. For example its compatible with [statsd-instrument](https://github.com/Shopify/statsd-instrument) and [dogstatsd-ruby](https://github.com/DataDog/dogstatsd-ruby) gems Related issues: #5052, #206, #4600	2024-05-07 21:46:08 +02:00
Ted Possible	5a3abfa041	Exemplar support (#5982 ) This code adds Exemplars to VMagent and the promscrape parser adhering to OpenMetrics Specifications. This will allow forwarding of exemplars to Prometheus and other third party apps that support OpenMetrics specs. --------- Signed-off-by: Ted Possible <ted_possible@cable.comcast.com>	2024-05-07 12:09:44 +02:00
Andrii Chubatiuk	879771808b	app/vmagent/remotewrite: do not cleanup timeseries which are used in multiple remote write contexts (#6206 ) When at least one remote write has deduplication configured it cleans up timeseries while they can be in use by another remote write without deduplication https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6205 --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2024-05-06 12:09:51 +02:00
hagen1778	4251292708	app/vmagent: mention corner case with dangling queues and identical URLs See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6140 We don't cover this corner case as it has low chance for reproduction. Precisely, the requirements are following: 1. vmagent need to be configured with multiple identical `remoteWrite.url` flags; 2. At least one of the persistent queues need to be non-empty, which already signalizes about issues with setup; 3. vmagent need to be restarted with removing of one of `remoteWrite.url` flags. We do not document this case in vmagent.md as it seems to be a rare corner case and its explanation will require too much of explanation and confuse users. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-04-23 14:49:45 +02:00
hagen1778	bae3874e6a	app/streamaggr: follow-up after `c0e4ccb7b5` * rm vmagent mentions from vminsert flags * improve documentation wording, add links to related sections * mention `ignore_first_intervals` in the stream aggr options * update flags description * add basic test for config parsing validation Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-04-22 14:22:59 +02:00
Andrii Chubatiuk	c0e4ccb7b5	lib/streamaggr: add option to ignore first N aggregation intervals (#6137 ) Stream aggregation may yield inaccurate results if it processes incomplete data. This issue can arise when data is sourced from clients that maintain a queue of unsent data, such as Prometheus or vmagent. If the queue isn't fully cleared within the aggregation interval, only a portion of the time series may be included in that period, leading to distorted calculations. To mitigate this we add an option to ignore first N aggregation intervals. It is expected, that client queues will be cleared during the time while aggregation ignores first N intervals and all subsequent aggregations will be correct.	2024-04-22 13:52:04 +02:00
Aliaksandr Valialkin	c22da2f917	app/vmagent/influx: replace hybrid channel-based pool + sync.Pool with plain sync.Pool for pushCtx Data ingestion benchmark doesn't show memory usage difference between two approaches, so let's use simpler approach in order to improve code readability and maintainability. This is a follow-up for `77c597738c`	2024-04-20 21:38:11 +02:00
Aliaksandr Valialkin	77c597738c	app/vmagent/common: use plain sync.Pool instead of a mix of sync.Pool with channel-based pool for PushCtx This scheme was used for reducing memory usage when vmagent runs on a machine with big number of CPU cores and the ingestion rate isn't too big. The scheme with channel-based pool could reduce memory usage, since it minimizes the number of PushCtx structs in the pool in this case. Performance tests didn't reveal significant difference in memory usage under both low and high ingestion rate between plain sync.Pool and the current hybrid scheme, so replace the scheme with plain sync.Pool in order to simplify the code.	2024-04-20 21:27:05 +02:00
Aliaksandr Valialkin	7531e9084a	all: use clear() built-in Go function for clearing []prompbmarshal.TimeSeries and []prompbmarshal.Label slices This makes the code a bit clear.	2024-04-20 21:00:03 +02:00
Aliaksandr Valialkin	8f535f9e76	app/vmagent/remotewrite: add support for replication additionally to sharding when both -remoteWrite.shardByURL and -remoteWrite.shardByURLReplicas=RF command-line flags are set This allows setting up data replication among failure domains if the replication factor is smaller than the number of failure domains. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6054 See https://docs.victoriametrics.com/vmagent/#sharding-among-remote-storages	2024-04-19 11:27:33 +02:00
Aliaksandr Valialkin	f4b1cbfef0	all: replace old https://docs.victoriametrics.com/Cluster-VictoriaMetrics.html url with the new one - https://docs.victoriametrics.com/cluster-victoriametrics/	2024-04-18 02:54:20 +02:00
Aliaksandr Valialkin	4d2b9fe6b2	all: replace old https://docs.victoriametrics.com/stream-aggregation.html url with the new one - https://docs.victoriametrics.com/stream-aggregation/	2024-04-18 02:19:11 +02:00
Aliaksandr Valialkin	c81a633b02	all: replace the outdated url https://docs.victoriametrics.com/vmagent.html with the new one - https://docs.victoriametrics.com/vmagent/	2024-04-18 01:31:37 +02:00
Aliaksandr Valialkin	dc326f70b4	app/vmagent: support for DNS SRV urls at -remoteWrite.url, scrape target urls and service discovery urls Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6053	2024-04-17 20:54:39 +02:00
Aliaksandr Valialkin	931dd3f320	app/{vmagent,vminsert}: accept Prometheus remote write protocol requests at /prometheus/api/v1/push additionally to /api/v1/push This is a follow-up for `7ccdb57ea4` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5990	2024-04-04 02:15:52 +03:00
Eugene Ma	7ccdb57ea4	add "/api/v1/push" to request handler (#5990 ) Co-authored-by: Eugene Ma <eugene.ma@airbnb.com> Co-authored-by: Roman Khavronenko <roman@victoriametrics.com> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-04-04 02:10:24 +03:00
Aliaksandr Valialkin	967d5496cf	app/vmagent: follow-up for `b3b29ba6ac` - Automatically reload changed TLS root CA pointed by -remoteWrite.tlsCAFile command-line flag - Automatically reload changed TLS root CA configured via oauth2.tsl_config.ca_file option at -promscrape.config - Document the change as a feature instead of a bug at docs/CHANGELOG.md - Simplify the code at lib/promauth, which is responsible for reloading changed TLS root CA files. - Simplify the usage of lib/promauth.Config.NewRoundTripper() - now it accepts the base http.Transport instead of a callback, which can change the internal http.Transport. - Reuse the default tls config if lib/promauth.Config doesn't contain tls-specific configs. This should reduce memory usage a bit when tls isn't used for scraping big number of targets. - Do not re-read TLS root CA files on every processed request. Re-read them once per second. This should reduce CPU usage when scraping big number of targets over https. - Do not store cert.pem and key.pem files in TestTLSConfigWithCertificatesFilesUpdate, since they can be loaded from byte slices via crypto/tls.X509KeyPair(). - Remove obsolete comparisons of string representations for authConfig and proxyAuthConfig at areEqualScrapeConfigs(). Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5725 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5526 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2171	2024-04-04 01:27:35 +03:00
Aliaksandr Valialkin	3de8656551	app/vmagent/remotewrite: follow-up for `166b97b8d0` and `b6bd9a97a3` - Make the configuration more clear by accepting the list of ignored labels during sharding via a dedicated command-line flag - -remoteWrite.shardByURL.ignoreLabels. This prevents from overloading the meaning of -remoteWrite.shardByURL.labels command-line flag. - Removed superfluous memory allocation per each processed sample if sharding by remote storage is enabled. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5938	2024-04-03 00:54:01 +03:00
Aliaksandr Valialkin	918cccaddf	all: fix golangci-lint(revive) warnings after `0c0ed61ce7` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6001	2024-04-02 23:16:29 +03:00
Artem Navoiev	9bd3cadce6	app/{vmagent/insert} fix typo in Firehose Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2024-04-02 17:41:21 +02:00
Aliaksandr Valialkin	904e95fc69	app/vmagent: simplify code after `509df44d03` - Simplify the code in order to improve its maintenance - Properly pass tenant ID when processing multi-tenant opentelemetry request at vmagent Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6016	2024-04-02 17:58:13 +03:00
Aliaksandr Valialkin	830b871baf	app/vmagent: properly shutdown when -maxIngestionRate limit is reached The remotewrite.Stop() expects that there are no pending calls to TryPush(). This means that the ingestionRateLimiter.Register() must be unblocked inside TryPush() when calling remotewrite.Stop(). Provide remotewrite.StopIngestionRateLimiter() function for unblocking the rate limiter before calling the remotewrite.Stop(). While at it, move the rate limiter into lib/ratelimiter package, since it has two users. Also move the description of the feature to the correct place at docs/CHANGELOG.md. Also cross-reference -remoteWrite.rateLimit and -maxIngestionRate command-line flags. This is a follow-up for `02bccd1eb9` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5900	2024-03-30 06:43:48 +02:00
hagen1778	b6bd9a97a3	app/vmagent: follow-up `166b97b8d0` * add tests for sharding function * update flags description * add changelog note `166b97b8d0` Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-03-29 14:08:08 +01:00
Eugene Ma	166b97b8d0	vmagent: support sharding by excluded labels (#5938 ) To horizontally scale streaming aggregation, you might want to deploy a separate hashing tier of vmagents that route to a separate aggregation tier. The hashing tier should shard by all labels except the instance-level labels, to ensure the input metrics are routed correctly to the aggregator instance responsible for those labels. For this to achieve we introduce `remoteWrite.shardByURL.inverseLabels` flag to inverse logic of `remoteWrite.shardByURL.labels` --------- Co-authored-by: Eugene Ma <eugene.ma@airbnb.com> Co-authored-by: Roman Khavronenko <roman@victoriametrics.com>	2024-03-29 13:26:02 +01:00
Andrii Chubatiuk	509df44d03	app/{vmagent,vminsert}: fixed firehose response (#6016 )	2024-03-26 13:20:41 +01:00
Alexander Marshalov	02bccd1eb9	[vmagent] added ingestion rate limiting with new flag `-maxIngestionRate` (#5900 ) * [vmagent] added ingestion rate limiting with new flag `-maxIngestionRate`. This flag can be used to limit the number of samples ingested by vmagent per second. If the limit is exceeded, the ingestion rate will be throttled. * fix changelog * fix review comment	2024-03-21 17:14:49 +01:00
Aliaksandr Valialkin	1cedaf61cb	app/{vmagent,vminsert}: add an ability to ignore input samples outside the current aggregation interval for stream aggregation See https://docs.victoriametrics.com/stream-aggregation.html#ignoring-old-samples	2024-03-17 23:03:47 +02:00
Aliaksandr Valialkin	b4b38f782c	app/vmagent/remotewrite: clarify the reason behind the default value for -remoteWrite.queues in the same way as the reason for -maxConcurrentInserts is defined at `73f5fb0f0c`	2024-03-06 13:43:08 +02:00
Aliaksandr Valialkin	da611ad628	app/{vmagent,vminsert}: add `-streamAggr.dropInputSamples` command-line flag for dropping the specified labels from input samples before deduplication and streaming aggregation	2024-03-05 02:15:01 +02:00
Aliaksandr Valialkin	ed523b5bbc	app/{vminsert,vmagent}: allow using -streamAggr.dedupInterval without -streamAggr.config This allows performing online de-duplication of incoming samples	2024-03-05 00:45:30 +02:00
Aliaksandr Valialkin	ac3cf3f357	lib/streamaggr: enable time alignment for aggregate flushed to multiples of interval For example, if `interval: 1m`, then data flush occurs at the end of every minute, while `interval: 1h` leads to data flush at the end of every hour. Add `no_align_flush_to_interval` option, which can be used for disabling the alignment.	2024-03-04 05:42:58 +02:00
Aliaksandr Valialkin	cfe774ab50	app/{vmagent,vminsert}: follow-up for `434a5803e7` Document the /opentelemetry/v1/metrics endpoint instead of /opentelemetry/api/v1/push, since the /v1/metrics suffix is hardcoded in OpenTelemetry protocol specification. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5871	2024-02-29 18:02:50 +02:00
Nikolay	434a5803e7	app/{vmagent,vminsert}: adds /v1/metrics suffix for opentelemetry route path (#5871 ) * app/{vmagent,vminsert}: adds /v1/metrics suffix for opentelemetry route path it must fix compatibility with opentemetry-collector [spec](https://opentelemetry.io/docs/specs/otlp/\#otlphttp-request) this suffix is hard-coded and cannot be changed with collector configuration * Apply suggestions from code review --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-02-29 17:58:49 +02:00
Aliaksandr Valialkin	04d13f6149	app/{vminsert,vmagent}: follow-up after `67a55b89a4` - Document the ability to read OpenTelemetry data from Amazon Firehose at docs/CHANGELOG.md - Simplify parsing Firehose data. There is no need in trying to optimize the parsing with fastjson and byte slice tricks, since OpenTelemetry protocol is really slooow because of over-engineering. It is better to write clear code for better maintanability in the future. - Move Firehose parser from /lib/protoparser/firehose to lib/protoparser/opentelemetry/firehose, since it is used only by opentelemetry parser. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5893	2024-02-29 14:38:23 +02:00
Andrii Chubatiuk	67a55b89a4	{vmagent,vminsert}: added firehose http destination opentelemetry data ingestion support (#5893 ) Co-authored-by: Andrii Chubatiuk <wachy@Andriis-MBP-2.lan> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-02-29 14:03:24 +02:00
Aliaksandr Valialkin	6697da73e5	app: consistently use atomic.* types instead of atomic.* functions See `ea9e2b19a5`	2024-02-24 02:44:24 +02:00
Aliaksandr Valialkin	7e1dd8ab9d	lib: consistently use atomic.* types instead of atomic.* functions See `ea9e2b19a5`	2024-02-24 02:07:53 +02:00
Anton L	d68bb658ce	#5833 Fix Deadlock when using shardByURL of VMAgent (#5834 )	2024-02-23 00:59:47 +02:00
Aliaksandr Valialkin	e963d6c789	app/vmagent/remotewrite: add -remoteWrite.tlsHandshakeTimeout command-line flag for tuning tls handshake timeout to -remoteWrite.url Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1699	2024-02-13 02:46:33 +02:00
Aliaksandr Valialkin	ae8a867924	all: add support for specifying multiple -httpListenAddr options	2024-02-09 03:15:04 +02:00
Aliaksandr Valialkin	541b644d3d	app/{vmagent,vminsert}: follow-up after `a1d1ccd6f2` - Document the change at docs/CHANGELOG.md - Copy changes from docs/Single-server-VictoriaMetrics.md to README.md - Add missing handler for processing multitenant requests ( https://docs.victoriametrics.com/vmagent/#multitenancy ) - Substitute github.com/stretchr/testify dependency with 3 lines of code in the added tests - Comment unclear code at lib/protoparser/datadogsketches/parser.go , so @AndrewChubatiuk could update it and add permalinks to the original source code there. - Various code cleanups Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5584 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3091	2024-02-07 01:28:05 +02:00
Andrii Chubatiuk	a1d1ccd6f2	support datadog /api/beta/sketches API (#5584 ) Co-authored-by: Andrew Chubatiuk <andrew.chubatiuk@motional.com> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-02-06 20:58:11 +00:00

1 2 3 4 5 ...

627 Commits