Aliaksandr Valialkin
113f0a8a07
app/vmselect/promql: drop staleness marks before calling rollupConfig.Do
...
This allows dropping staleness marks only once and then calculating multiple rollup functions on the result.
Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1526
2021-08-15 13:21:10 +03:00
Aliaksandr Valialkin
25997a70f1
Revert "app/vmselect/promql: properly handle Prometheus staleness marks in removeCounterResets functions"
...
This reverts commit 94dfcb6747a3b29a11d14e71bea21a2312bb6346.
It is better to remove staleness marks (decimal.StaleNaN) before calling rollupConfig.Do, e.g. in preFunc
2021-08-15 13:19:16 +03:00
Aliaksandr Valialkin
73d7b568da
app/vmselect/promql: properly handle Prometheus staleness marks in removeCounterResets functions
...
Prometheus staleness marks shouldn't be changed in removeCounterResets. Otherwise they will be converted to ordinary NaN values,
which cannot be removed later in the dropStaleNaNs() function. This may result in incorrect calculations for rollup functions.
Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1526
2021-08-14 12:45:57 +03:00
Aliaksandr Valialkin
6d53620adf
vendor: make vendor-update
2021-08-13 13:15:27 +03:00
Aliaksandr Valialkin
2ae2c1dd09
app/victoria-metrics/testdata: fix tests after 4401464c22
2021-08-13 12:21:54 +03:00
Aliaksandr Valialkin
4401464c22
all: add support for Prometheus staleness markers
...
Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1526
Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/748
Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1509
Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1530
Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/845
2021-08-13 12:10:17 +03:00
Aliaksandr Valialkin
9a8d1bcec5
docs/Cluster-VictoriaMetrics.md: mention that vmagent can be used for replicating the data among multiple clusters
...
Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1491
2021-08-12 12:47:32 +03:00
Aliaksandr Valialkin
556c1b36e5
vendor: update github.com/klauspost/compress from v1.13.1 to v1.13.4
2021-08-12 12:40:13 +03:00
Aliaksandr Valialkin
95dd5a48bb
app/vmselect: make vmui-update after the commit 4ae14df864a7e327955f44941295a286175423b3
2021-08-11 13:41:41 +03:00
Aliaksandr Valialkin
860b272a95
app/vmui: actualize Dockerfiles
2021-08-11 13:41:41 +03:00
Denys Holius
81e4d644dd
added guide for HA monitoring setup in K8s via VM Cluster ( #1523 )
...
* added guide for HA monitoring setup in K8s via VM Cluster
* fixed missed divs
* fixed different typos
* Update docs/guides/k8s-ha-monitoring-via-vm-cluster.md
Co-authored-by: Roman Khavronenko <hagen1778@gmail.com>
Co-authored-by: Roman Khavronenko <hagen1778@gmail.com>
2021-08-11 11:45:42 +03:00
Aliaksandr Valialkin
755f65f4bc
app/vminsert: add vm_rpc_send_duration_seconds_total metric per each vminsert->vmstorage link
...
This metric is useful for determining high link saturation with the following alerting rule:
rate(vm_rpc_send_duration_seconds_total) > 0.9s
2021-08-11 11:44:40 +03:00
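The reasoning behind the alerting rule above: the per-second rate of a counter that accumulates seconds spent sending is the fraction of wall-clock time the link was busy, so a value above 0.9 means the link was busy more than 90% of the time. A small worked sketch of that arithmetic, with a hypothetical `saturation` helper and made-up readings:

```go
package main

// saturation returns the fraction of wall-clock time spent sending data
// between two readings of a "seconds spent sending" counter.
// The function name and parameters are hypothetical.
func saturation(prevSeconds, curSeconds, intervalSeconds float64) float64 {
	if intervalSeconds <= 0 {
		return 0
	}
	return (curSeconds - prevSeconds) / intervalSeconds
}
```

For example, if the counter grows from 100 to 127 seconds over a 30-second interval, the link spent 27 of 30 seconds sending, which is a saturation of about 0.9.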
Aliaksandr Valialkin
869ff25392
docs/Cluster-VictoriaMetrics.md: update -help output for cluster components after the d375d9b878
2021-08-11 11:44:39 +03:00
Aliaksandr Valialkin
bcffd04e3a
docs: make docs-sync after e0ee69797d
2021-08-11 10:53:49 +03:00
Roman Khavronenko
e0ee69797d
docs: update "number of open files" tuning recommendation ( #1527 )
...
* docs: update "number of open files" tuning recommendation
Make the "number of open files" recommendation not Prometheus-specific to avoid
confusion for users who do not use Prometheus.
* docs: mention fstrim in Tuning section
2021-08-11 10:51:02 +03:00
Aliaksandr Valialkin
d375d9b878
lib/envflag: add a link to docs for -envflag.enable
2021-08-11 10:29:33 +03:00
Aliaksandr Valialkin
5716af4636
deployment/dm: update Go builder from Go1.16.6 to Go1.16.7
...
See https://github.com/golang/go/issues?q=milestone%3AGo1.16.7+label%3ACherryPickApproved
2021-08-06 12:12:03 +03:00
Yury Molodov
236fc7d739
vmui: fix layout and add server url by default ( #1519 )
...
* fix: change layout to correctly display big queries
* fix: set default server from url
* fix: change get default server url
2021-08-06 12:06:08 +03:00
Aliaksandr Valialkin
5ce531027f
docs/CHANGELOG.md: document new metrics added to vmalert at 7416fdaa8b
2021-08-05 10:13:08 +03:00
Aliaksandr Valialkin
c1185363ca
app/vmagent: typo fix in the description for -remoteWrite.queues
2021-08-05 10:01:35 +03:00
Roman Khavronenko
7416fdaa8b
vmalert: expose new metrics for tracking number of produced samples during last evaluation ( #1518 )
...
* vmalert: expose new metrics for tracking number of produced samples during last evaluation
Two new metrics were added to track the number of samples produced during the last evaluation:
* vmalert_recording_rules_last_evaluation_samples
* vmalert_alerting_rules_last_evaluation_samples
The gauge type is used to remain consistent with Prometheus metric
`prometheus_rule_group_last_evaluation_samples` which is on the group level.
However, the counter type was considered as well.
Two metrics instead of one are used to make it easier to separate recording and
alerting rules. The number of samples produced by recording rules is likely
more important, so people will refer to it more frequently.
The expected usage of the new metric is the following:
```
- alert: RecordingRuleReturnsEmptyResults
  expr: sum(vmalert_recording_rules_last_evaluation_samples) by(recording) < 1
  annotations:
    summary: Recording rule {{$labels.recording}} returns empty results.
      Please verify expression correctness.
```
Addresses https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1494
* vmalert: rename `vmalert_alerts_error` to `vmalert_alerting_rules_error` to remain consistent with recording rules metrics
2021-08-05 09:59:46 +03:00
Aliaksandr Valialkin
d826352688
app/vmagent: follow-up after fe445f753b
...
Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1491
2021-08-05 09:52:32 +03:00
Omar Ghader
46e27d60a6
feature: Add multitenant for vmagent ( #1505 )
...
* feature: Add multitenant for vmagent
* Minor fix
* Fix rcs index out of range
* Minor fix
* Fix multi Init
* Fix multi Init
* Fix multi Init
* Add default multi
* Adjust naming
* Add TenantInserted metrics
* Add TenantInserted metrics
* fix: remove unused metrics for vmagent
* fix: remove unused metrics for vmagent
Co-authored-by: mghader <marc.ghader@ubisoft.com>
Co-authored-by: Sebastian YEPES <syepes@gmail.com>
2021-08-05 09:52:31 +03:00
Aliaksandr Valialkin
f07165977a
docs/Articles.md: actualize links and re-order some links
2021-08-03 16:11:49 +03:00
Aliaksandr Valialkin
50663ba41f
lib/promscrape/discovery/gce: add __meta_gce_interface_ipv4_<name> labels as in Prometheus 2.29
...
See https://github.com/prometheus/prometheus/pull/8978
2021-08-03 16:11:49 +03:00
Aliaksandr Valialkin
3cad8b4564
lib/promscrape/discovery/ec2: add __meta_ec2_availability_zone_id label as Prometheus 2.29 does
2021-08-03 16:11:49 +03:00
Aliaksandr Valialkin
e92fde7945
app/vmselect/promql: add present_over_time(m[d]) function, which will be available starting from Prometheus 2.29.0
...
See https://github.com/prometheus/prometheus/releases/tag/v2.29.0-rc.0 and https://github.com/prometheus/prometheus/pull/9097
2021-08-03 16:11:49 +03:00
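In Prometheus, `present_over_time(m[d])` returns 1 for every series that has at least one sample in the lookbehind window `d`. A rough sketch of that semantics is shown below; the `sample` type, the function signature and the exact window boundaries are assumptions for illustration and are not taken from the commit.

```go
package main

// sample is a hypothetical raw sample: a timestamp in milliseconds and a value.
type sample struct {
	timestamp int64
	value     float64
}

// presentOverTime returns (1, true) if at least one sample lies in the
// window (end-window, end]. Otherwise it returns (0, false), so that the
// caller can emit no point for this timestamp.
// The half-open window boundary is an assumption.
func presentOverTime(samples []sample, end, window int64) (float64, bool) {
	for _, s := range samples {
		if s.timestamp > end-window && s.timestamp <= end {
			return 1, true
		}
	}
	return 0, false
}
```

The sample values are never inspected: only the presence of a sample in the window matters.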
Qifei Wan
fa9c5c5940
app/vmalert: update config state metrics if config parsing failed ( #1507 )
2021-08-03 12:55:29 +03:00
Roman Khavronenko
370fe9fa2a
docs: add "Scaling to trillions of metric data points" to articles ( #1517 )
2021-08-03 11:07:45 +03:00
wusphinx
c1ed7b77aa
Update TimeSelector.tsx ( #1515 )
...
delete garbled code
2021-08-03 10:01:01 +03:00
Nikolay
7bbff7fb86
adds /rules and /alerts api for grafana ( #1504 )
...
Co-authored-by: Aliaksandr Valialkin <valyala@gmail.com>
2021-08-02 17:28:09 +03:00
Roman Khavronenko
b385fa622b
docs: mention "Push Prometheus metrics to VictoriaMetrics or other exporters" article ( #1511 )
2021-08-02 17:23:10 +03:00
Roman Khavronenko
a641102ec2
docs: fix indentation for guide articles ( #1512 )
2021-08-02 17:16:58 +03:00
Roman Khavronenko
408ba43092
Alerts single update ( #1510 )
...
* alerts: move `ProcessNearFDLimits` to `vm-health` group since it is relevant for all services
* alerts: add new `TooHighMemoryUsage` alerting rule
2021-08-02 15:51:24 +03:00
Aliaksandr Valialkin
66eb60f20d
docs/CaseStudies.md: typo fix: hed->had
2021-07-30 18:32:12 +03:00
Aliaksandr Valialkin
8a3c13fd53
docs/CHANGELOG.md: typo fix
2021-07-30 12:35:57 +03:00
Aliaksandr Valialkin
a3b4fc0474
docs/CHANGELOG.md: document d05cac6c98
...
Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1486
2021-07-30 12:19:53 +03:00
Aliaksandr Valialkin
a1911e1330
app/vmselect/netstorage: unpack time series data in mostly local big chunks
...
This should improve performance on multi-CPU systems for queries selecting time series with a big number of raw samples
2021-07-30 12:03:17 +03:00
Aliaksandr Valialkin
d05cac6c98
lib/storage: re-use the per-day inverted index search code for searching in global index
...
This allows removing a big pile of outdated code for global index search.
This may help https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1486
2021-07-30 10:31:37 +03:00
Aliaksandr Valialkin
74ffaa45d9
app/vmselect/netstorage: do not query Go maps with unsafe string keys, since this breaks in Go 1.17
2021-07-30 09:57:53 +03:00
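The commit above does not say what replaced the unsafe string keys. One common safe alternative, shown here only as an illustrative sketch with a hypothetical `lookup` helper, is to convert the byte slice directly inside the map index expression. The Go compiler recognizes this pattern and performs the lookup without allocating a copy of the bytes.

```go
package main

// lookup finds a []byte key in a map with string keys.
// The conversion is written directly inside the index expression,
// which lets the compiler skip allocating a temporary string.
// No unsafe package is involved, so the key cannot change
// while the map is reading it.
func lookup(m map[string]int, key []byte) (int, bool) {
	v, ok := m[string(key)]
	return v, ok
}
```

When a key must be stored in the map rather than just looked up, a real copy such as `string(key)` assigned to a variable is still required.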
Aliaksandr Valialkin
192dfbfd90
app/vmselect: follow-up for ed95bc9531
...
Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1493
2021-07-29 09:53:28 +03:00
arnoldyahad
00af4ff5a4
Add case prometheus/rules for grafana 8 ( #1502 )
2021-07-29 09:53:27 +03:00
assassins
a483044557
Performance optimization ( #1481 )
...
There are redundant steps
2021-07-28 19:26:20 +03:00
Aliaksandr Valialkin
e20ec090b2
docs: remove SampleSizeCalculations.md, since it is outdated and no longer used
...
There was a reference to this doc from the old victoriametrics.com site
2021-07-28 19:25:16 +03:00
Aliaksandr Valialkin
8ee8660ac4
app/vmselect: follow-up for 626073bca8
...
* Rename -search.maxMetricsPointSearch to -search.maxSamplesPerQuery, so it is more consistent with the existing -search.maxSamplesPerSeries
* Move the -search.maxSamplesPerQuery from vmstorage to vmselect, so it could effectively limit the number of raw samples obtained from all the vmstorage nodes
* Document the -search.maxSamplesPerQuery in docs/CHANGELOG.md
2021-07-28 18:00:23 +03:00
Denys Holius
9ffd70a921
Added new guide for monitoring k8s via VictoriaMetrics cluster ( #1476 )
...
* renamed and moved screenshots
* fixed cluster guide, updated helm chart versions, added values.yaml for vm single
* renamed guide files
* fixed typo
* add some fixes
* fixed typos, added guide k8s-monitoring-via-vm-cluster
* added fixes for yamls
2021-07-27 18:01:12 +03:00
Aliaksandr Valialkin
8481f4f004
docs/CHANGELOG.md: document 9d45b46f4c
2021-07-27 12:38:31 +03:00
Nikolay
9d45b46f4c
adds check for region with custom s3 endpoint ( #1465 )
2021-07-27 12:35:38 +03:00
Aliaksandr Valialkin
c2deee9911
lib/storage: yet another attempt to properly determine disk space shortage, which prevents optimal merges
...
Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1373
2021-07-27 12:04:50 +03:00
Aliaksandr Valialkin
bb31117555
lib/promrelabel: add tests for verifying that regex works as expected in single quotes and double quotes
2021-07-27 10:50:55 +03:00