VictoriaMetrics

mirror of https://github.com/VictoriaMetrics/VictoriaMetrics.git synced 2024-12-26 20:30:10 +01:00

Author	SHA1	Message	Date
Roman Khavronenko	9eb71dda3d	vmagent: add grafana dashboard (#629 ) `vmagent` Grafana dashboard suppose to provide basic observability over multiple `vmagent` instances. Dashboard is saved in Grafana export format so it can be easily imported. It was also integrated into docker-compose environment.	2020-07-15 13:56:06 +03:00
Aliaksandr Valialkin	328814ee60	docs/vmagent.md: make filtering rules for init container pods less confusing	2020-07-14 20:32:47 +03:00
Aliaksandr Valialkin	7398e5701b	vendor: `make vendor-update`	2020-07-14 20:31:42 +03:00
Aliaksandr Valialkin	4e770e9120	docs/Single-server-VictoriaMetrics.md: remove `Roadmap` chapter, since it became outdated	2020-07-14 19:06:33 +03:00
Aliaksandr Valialkin	b442a42d8e	app/vmagent/remotewrite: return proper value from `tssRelabelPool.New` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/599	2020-07-14 14:29:20 +03:00
Aliaksandr Valialkin	6d77bfae4f	docs/Single-server-VictoriaMetrics.md: sync with README.md	2020-07-14 14:19:14 +03:00
Aliaksandr Valialkin	4081e2295e	app/{vminsert,vmagent}: add `-influxSkipMeasurement` command-line flag for using field name as metric name See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/626	2020-07-14 14:17:24 +03:00
Aliaksandr Valialkin	e1107fec10	lib/storage: reset `MetricName->TSID` cache after marking metricIDs as deleted This is a follow-up commit after `12b16077c4` , which didn't reset the `tsidCache` in all the required places. This could result in indefinite errors like: missing metricName by metricID ...; this could be the case after unclean shutdown; deleting the metricID, so it could be re-created next time Fix this by resetting the cache inside deleteMetricIDs function.	2020-07-14 14:06:32 +03:00
Aliaksandr Valialkin	25f80d320b	app/vmselect/prometheus: do not adjust last points in time series with timestamps exceeding the current time Such timestamps usually mean that the query contains `offset`. See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/625	2020-07-14 12:52:16 +03:00
Aliaksandr Valialkin	cde18d1f43	lib/protoparser: properly update `vm_protoparser_rows_read_total{type="promscrape"}` metric	2020-07-14 12:16:35 +03:00
Seva Poliakov	457e61900d	add vm_protoparser_rows_read_total metrics to promscrape (#624 ) * add vm_protoparser_rows_read_total metrics to promscrape move vm_protoparser_rows_read_total for promscrape to better place move vm_protoparser_rows_read_total for promscrape to better place * remove possibility of infinity loop at prometheus parser	2020-07-14 12:16:34 +03:00
Roman Khavronenko	7e347972c4	lib/flagutil: specify additional description for all Array type flags (#620 ) Array type flag is now defined as `value` type in flag description when printed. This change adds additional description to every Array type flag so it would be clear what exact type is used: ``` -remoteWrite.urlRelabelConfig array Optional path to relabel config for the corresponding -remoteWrite.url Supports array of values separated by comma or specified via multiple flags. ```	2020-07-13 21:56:37 +03:00
Roman Khavronenko	19dd121968	lib/persistentqueue: add `vm_persistentqueue_bytes_pending` metric (#619 ) Metric `vm_persistentqueue_bytes_pending` is a gauge that shows current amount of bytes in persistentqueue flushed on disk as a difference between write and read offsets. This metric is very similar to `vmagent_remotewrite_pending_data_bytes` except of accounting for bytes in-memory.	2020-07-13 21:54:09 +03:00
Roman Khavronenko	829ec4f9cf	Extend metric `vm_promscrape_targets` with `status` label (#615 ) The change to `vm_promscrape_targets` metric suppose to improve observability for `vmagent` so it will be possible to track how many targets are up or down for every specific scrape group: ``` vm_promscrape_targets{type="static_configs", status="down"} 1 vm_promscrape_targets{type="static_configs", status="up"} 2 ```	2020-07-13 21:52:03 +03:00
Aliaksandr Valialkin	55d83e777d	app/vmselect/prometheus: minimize the diff for the change `1033dc7e2a` over `619b0a25c9`	2020-07-13 21:40:38 +03:00
faceair	1033dc7e2a	fix empty response template (#617 )	2020-07-13 21:31:19 +03:00
Aliaksandr Valialkin	619b0a25c9	docs/vmagent.md: sync with app/vmagent/README.md	2020-07-13 21:25:11 +03:00
ofen	666c795b98	Update README.md (#621 ) Troubleshooting section updated to help out with duplicate targets detection	2020-07-13 21:18:54 +03:00
Aliaksandr Valialkin	a730b3f6a1	app/vmagent: fix data race when multiple `-remoteWrite.urlRelabelConfig` options are set Previously multiple goroutines could access remoteWriteCtx.tss concurrently, which could lead to data race and improper relabeling. Now each goroutine has its own copy of tss during relabeling. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/599	2020-07-10 15:16:59 +03:00
Aliaksandr Valialkin	508ad46e0e	app/vmagent/remotewrite: typo fix in `-remoteWrite.showURL` help message	2020-07-10 14:07:08 +03:00
Aliaksandr Valialkin	e5b9f47623	vendor: update github.com/valyala/quicktemplate from v1.5.0 to v1.5.1 This should fix incorrect encoding for json strings with char codes below 0x20 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/613	2020-07-10 12:59:15 +03:00
Aliaksandr Valialkin	ca74b80f10	docs/Cluster-VictoriaMetrics.md: sync with the original README.md	2020-07-10 12:15:31 +03:00
Aliaksandr Valialkin	cba820e390	app/{vminsert,vmagent}: add ability to import data in Prometheus exposition format via `/api/v1/import/prometheus`	2020-07-10 12:14:07 +03:00
Aliaksandr Valialkin	6fe3c48a6e	properly calculate readCalls	2020-07-10 12:00:58 +03:00
Aliaksandr Valialkin	9c350bc20d	app/vmselect/promql: add missing tests for `ifnot` binary operation	2020-07-09 13:24:06 +03:00
Aliaksandr Valialkin	256fd9a87e	app/vmselect/promql: refactor implementations for `and` and `unless` binary operations, so they are closer to `or` implementation	2020-07-09 13:05:55 +03:00
Aliaksandr Valialkin	2d9b3ad5b3	app/vmselect/promql/active_queries.go: simplify code a bit by inlining getNextActiveQueryID function	2020-07-09 11:18:30 +03:00
Aliaksandr Valialkin	b66c7c13ac	docs: add a link to the `The CMS monitoring infrastructure and applications` publication from CERN	2020-07-08 20:16:43 +03:00
Aliaksandr Valialkin	3e1d7d8489	lib/promscrape: send `Accept` header similar to Prometheus when scraping targets This should fix scraping Spring Boot servers, which return incorrect response unless `Accept: text/plain` request header is set. See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/608	2020-07-08 19:48:22 +03:00
Aliaksandr Valialkin	47c7ea5c60	vendor: `make vendor-update`	2020-07-08 19:25:38 +03:00
Aliaksandr Valialkin	4f737d1cbd	docs/Cluster-VictoriaMetrics.md: mention about `api/v1/status/active_queries` page Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/528	2020-07-08 19:18:26 +03:00
Aliaksandr Valialkin	742da690f4	app/vmselect: add `/api/v1/status/active_queries` page with the list of currently running queries This is a follow-up for https://github.com/VictoriaMetrics/VictoriaMetrics/pull/598 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/575	2020-07-08 18:55:38 +03:00
DexterZhang	99f54e44ff	feat(vmselect): add current running query list, add ability for getting the running query info and killing running query for master branch (#598 )	2020-07-08 18:52:55 +03:00
Aliaksandr Valialkin	cb92113632	lib/storage: limit the maximum concurrency for data ingestion to GOMAXPROCS Previously the concurrency has been limited to GOMAXPROCS*2. This had little sense, since every call to Storage.AddRows is bound to CPU, so the maximum ingestion bandwidth is achieved when the number of concurrent calls to Storage.AddRows is limited to the number of CPUs, i.e. to GOMAXPROCS.	2020-07-08 17:32:18 +03:00
Roman Khavronenko	e7557e0252	lib/protoparser: fix metric name of unmarshal errors in promremotewrite (#607 ) The change fixes the typo in metric name `vm_protoparser_unmarshal_errors` to respect the naming standard.	2020-07-08 14:18:41 +03:00
Aliaksandr Valialkin	e59b9916aa	lib/protoparser/graphite: go fmt	2020-07-08 14:12:10 +03:00
Aliaksandr Valialkin	d0b694c5c8	lib/protoparser/graphite: add more tests after `eb45185eef` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/610	2020-07-08 14:10:35 +03:00
Seva Poliakov	eb45185eef	Fix graphite minus one timestamp (#609 ) * fix graphite -1 timestamp * format the graphite fix -1 timestamp	2020-07-08 13:59:19 +03:00
Aliaksandr Valialkin	32b9fb58b8	lib/storage: clarify `out of retention period` error message by mentioning `-retentionPeriod` command-line flag	2020-07-08 13:54:26 +03:00
Aliaksandr Valialkin	12b16077c4	lib/storage: reset MetricName->TSID cache after deleting time series This should prevent from adding new data points to deleted time series without the need to check for the deleted time series. This improves ingestion performance a bit when the `deleted time series ids` aka `dmis` set contains big number of time series. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/596 Based on the idea from @n4mine at https://github.com/VictoriaMetrics/VictoriaMetrics/pull/604	2020-07-06 22:01:08 +03:00
Aliaksandr Valialkin	a23806f486	lib/fs: clarify description for `-fs.disableMmap` command-line flag	2020-07-06 14:28:34 +03:00
Aliaksandr Valialkin	6daa5f7500	lib/storage: prioritize data ingestion over heavy queries Heavy queries could result in the lack of CPU resources for processing the current data ingestion stream. Prevent this by delaying queries' execution until free resources are available for data ingestion. Expose `vm_search_delays_total` metric, which may be used in for alerting when there is no enough CPU resources for data ingestion and/or for executing heavy queries. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/291	2020-07-05 19:42:05 +03:00
Roman Khavronenko	703def4b2e	app/vmalert: add retries to remotewrite (#605 ) * app/vmalert: add retries to remotewrite Remotewrite pkg now does limited number of retries if write request failed. This suppose to make vmalert state persisting more reliable. New metrics were added to remotewrite in order to track rows/bytes sent/dropped. defaultFlushInterval was increased from 1s to 5s for sanity reasons. * fix * wip * wip * wip * fix bits alignment bug for 32-bit systems * fix mistakenly dropped field	2020-07-05 18:46:52 +03:00
Aliaksandr Valialkin	de137aef98	app/victoria-metrics: fix tests after the commit `acf828a759`	2020-07-05 18:24:41 +03:00
Aliaksandr Valialkin	acf828a759	app/vmselect/prometheus: small fixes on top of `8bb762124a`	2020-07-05 18:17:06 +03:00
faceair	8bb762124a	fix adjust last points avoid influence earlier value (#606 )	2020-07-05 17:56:54 +03:00
Aliaksandr Valialkin	ff6a0955eb	lib/promscrape: use HostClient.DoDeadline instead of HostClient.Do in order to guarantee strict deadline across multiple scrape attempts	2020-07-03 21:33:22 +03:00
Aliaksandr Valialkin	8b133e40d5	lib/promscrape: prevent from too big deadline misses on scrape retries The maximum deadline miss duration is reduced to 2x scrape_interval in the worst case. By default it is limited to scrape_interval configured for the given scrape target.	2020-07-03 20:41:36 +03:00
Aliaksandr Valialkin	44a54b8b3d	lib/promscrape: check for nil error before checking for the returned status code when scraping targets	2020-07-03 18:37:14 +03:00
Ween	d59cdbe90c	[VMAlert] Fix error log when remoteWrite queue size is full (#602 ) * Fix Auto metrics relabeled errors * Finalize auto-genenated Labels * Fix Test Errors * fix error logs when queue is full Co-authored-by: xinyulong <xinyulong@kuaishou.com>	2020-07-03 16:49:37 +03:00

... 8 9 10 11 12 ...

1854 Commits