VictoriaMetrics

mirror of https://github.com/VictoriaMetrics/VictoriaMetrics.git synced 2024-12-19 15:06:25 +01:00

Author	SHA1	Message	Date
Aliaksandr Valialkin	7fc5178a4b	app/vmselect/promql: add missing trace message in rollupResultCache.GetSeries()	2023-11-02 09:17:13 +01:00
Aliaksandr Valialkin	ece7024f11	app/vmselect/promql: reduce the minimum lookbehind window for enabling SLO/SLI optimizations from 24 hours to 6 hours This reduction is based on production testing. Also expose -search.minWindowForInstantRollupOptimization command-line flag, so users could fine-tune this arg for their needs	2023-11-01 20:19:19 +01:00
Aliaksandr Valialkin	e4365dbe3e	app/vmselect: run `make quicktemplate-gen` after `b8739bc00b`	2023-11-01 17:53:30 +01:00
Aliaksandr Valialkin	ae9b4c94bc	app/vmselect: return stats.seriesFetched as string instead of number vmalert expects string value for stats.seriesFetched, so it is impossible switching to number without breaking compatibility with old vmalert releases :( It is still unclear why stats.seriesFetched has string type in the first place...	2023-11-01 17:49:28 +01:00
Aliaksandr Valialkin	6a98f9df54	app/vmui: show query execution duration in the header of query input field This should simplify the process of query optimization	2023-11-01 16:46:42 +01:00
Aliaksandr Valialkin	c5e3b11762	app/vmselect/promql: apply SLO-like optimization to all the `count_*_over_time()` functions This is a follow-up for `41a0fdaf39`	2023-11-01 09:58:50 +01:00
Aliaksandr Valialkin	b96d55e1e4	app/vmselect/promql: typo fix, which could lead to panic during range query execution The panic is: BUG: unexpected values after merging new values This is a follow-up for `41a0fdaf39`	2023-11-01 09:58:50 +01:00
Aliaksandr Valialkin	28f0610e14	app/vmui: fix non-working `Disable cache` checkbox at `JSON` and `Table` views	2023-10-31 22:58:15 +01:00
Aliaksandr Valialkin	7b7ad44e84	app/vmselect/promql: properly calculate rollup result if lookbehind window isn't set This is a follow-up for `41a0fdaf39`	2023-10-31 22:23:04 +01:00
Aliaksandr Valialkin	744f8c3fe7	app/vmselect/promql: add outliers_iqr(q) and outlier_iqr_over_time(m[d]) functions These functions allow detecting anomalies in series and samples using Interquartile range method. See Outliers section at https://en.wikipedia.org/wiki/Interquartile_range for more details.	2023-10-31 22:14:14 +01:00
Aliaksandr Valialkin	9661918bb4	app/vmselect/promql: optimize repeated SLI-like instant queries with lookbehind windows >= 1d Repeated instant queries with long lookbehind windows, which contain one of the following rollup functions, are optimized via partial result caching: - sum_over_time() - count_over_time() - avg_over_time() - increase() - rate() The basic idea of optimization is to calculate rf(m[d] @ t) as rf(m[offset] @ t) + rf(m[d] @ (t-offset)) - rf(m[offset] @ (t-d)) where rf(m[d] @ (t-offset)) is cached query result, which was calculated previously The offset may be in the range of up to 1 hour.	2023-10-31 20:08:38 +01:00
Aliaksandr Valialkin	9ba007a636	app/vmselect/promql: wrap too long line after `a950873fff`	2023-10-31 19:11:05 +01:00
Roman Khavronenko	9d8f93050c	app/vmselect: expose `vm_memory_intensive_queries_total` counter metric (#5208 ) The new metric gets increased each time `-search.logQueryMemoryUsage` memory limit is exceeded by a query. This metric should help to identify expensive and heavy queries without inspecting the logs. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-10-31 19:02:22 +01:00
Aliaksandr Valialkin	a66c261b55	app/vmui: change the order of tables at `Top queries` tab Move the most interesting table - queries with the most summary time to execute - to the top	2023-10-28 11:57:08 +02:00
Aliaksandr Valialkin	36a1fdca6c	all: consistently use %w instead of %s when an error is passed to fmt.Errorf() This allows consistently using errors.Is() for verifying whether the given error wraps some other known error.	2023-10-26 09:44:40 +02:00
Roman Khavronenko	cd2247b24a	app/vmselect: limit the number of parallel workers by 32 (#5195 ) * app/vmselect: limit the number of parallel workers by 32 The change should improve performance and memory usage during query processing on machines with big number of CPU cores. The number of parallel workers for query processing is controlled via `-search.maxWorkersPerQuery` command-line flag. By default, the number of workers is limited by the number of available CPU cores, but not more than 32. The limit can be increased via `-search.maxWorkersPerQuery`. Signed-off-by: hagen1778 <roman@victoriametrics.com> * wip - The `-search.maxWorkersPerQuery` command-line flag doesn't limit resource usage, so move it from the `resource usage limits` to `troubleshooting` chapter at docs/Single-server-VictoriaMetrics.md - Make more clear the description for the `-search.maxWorkersPerQuery` command-line flag - Add the description of `-search.maxWorkersPerQuery` to docs/Cluster-VictoriaMetrics.md - Limit the maximum value, which can be passed to `-search.maxWorkersPerQuery`, to GOMAXPROCS, because bigger values may worsen query performance and increase CPU usage - Improve the description of the change at docs/CHANGELOG.md. Mark it as FEATURE instead of BUGFIX, since it is closer to a feature than to a bugfix. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5087 --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-10-26 09:15:27 +02:00
Aliaksandr Valialkin	8642418e5a	app/vmselect: follow-up for `348c1bcec0`: cache static contents served from /select/tenantID/prometheus/vmui/static/...	2023-10-16 23:27:06 +02:00
Aliaksandr Valialkin	627a4e9330	app/vmselect/promql: add labels_equal(q, "label1", "label2", ...) function This function returns q series, which have identical values for the listed labels "label1", "label2", ... See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5148	2023-10-16 21:51:13 +02:00
Aliaksandr Valialkin	b2f9b9d634	app/vmselect/promql: add drop_empty_series() function for dropping empty series before performing additional calculations This can be useful in the following queries: drop_empty_series(temperature <= 30) default 40 This query drops temperature series with all the values bigger than 30 on the selected time range, while replacing gaps in the remaining series with 40. The query without drop_empty_series: (temperature <= 30) default 40 would leave all the temperature series with all the values bigger than 30 on the selected time range, and replace all their values with 40. This is not what could be expected in some cases like here - https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5071	2023-10-16 20:59:21 +02:00
Aliaksandr Valialkin	4278b00a66	app/vmselect/promql: do not use unsafe conversion from bytes slice to string when storing a value by map key The assigned map key shouldn't change over time, otherwise the map won't work properly. This is a follow-up for `1f91f22b5f` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5087	2023-10-16 13:55:40 +02:00
Aliaksandr Valialkin	b86bec8109	app/vmui: small UX enhancements - Reduce vertical space usage, so more information is available on the screen without the need to scroll. - Show information for lines with higher values at the top of the legend under the graph. This should simplify graph analysis when it contains many lines.	2023-10-16 12:39:32 +02:00
Aliaksandr Valialkin	348c1bcec0	app/{vmselect,vlselect}: enable caching of static contents from /vmui/static/ folder at client side This should improve repeated VMUI page load times on slow networks See https://developer.chrome.com/docs/lighthouse/performance/uses-long-cache-ttl/	2023-10-16 12:36:34 +02:00
Nikolay	4a50e9400c	app/vmselect: reduce lock contention for heavy aggregation requests (#5119 ) reduce lock contention for heavy aggregation requests Previously lock contention could happen on machines with a big number of CPU cores due to enabled string interning. sync.Map was a choke point for all aggregation requests. Now a new string is created instead of interning. It may increase CPU and memory usage in some cases. https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5087	2023-10-10 13:44:02 +02:00
Aliaksandr Valialkin	b5812e2457	app/{vmselect,vlselect}: run `make vmui-update vmui-logs-update`	2023-10-02 21:44:21 +02:00
Aliaksandr Valialkin	5fd79f47f1	app/vmselect/promql: follow-up for `896c85a4a4` - Clarify the description of the change at docs/CHANGELOG.md - Make sure that bitmap_*(X, NaN) returns NaN Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4996 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/5021	2023-10-02 21:07:46 +02:00
Dmytro Kozlov	90b189dab8	app/vmselect: fix bitmap_*() functions behavior (#5021 ) Related issue: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4996 Signed-off-by: dmitryk-dk d.kozlov@victoriametrics.com Signed-off-by: dmitryk-dk d.kozlov@victoriametrics.com Co-authored-by: Nikolay <nik@victoriametrics.com>	2023-10-02 20:13:27 +02:00
Aliaksandr Valialkin	538dc6058d	app/vmselect/promql: run `make fmt` after `3b9605dba5`	2023-09-25 16:15:58 +02:00
Aliaksandr Valialkin	b43ff80d21	app/vmselect/promql: do not sort `q1 or q2` results This makes sure that `q2` series are returned after `q1` series in the same way as Prometheus does See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4763	2023-09-25 16:15:02 +02:00
Aliaksandr Valialkin	c954019e43	app/vmselect/promql: completely substitute median_over_time() WITH template with regular median_over_time() rollup function This is a follow-up for `34d7a670d0` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5034	2023-09-25 15:31:25 +02:00
Zakhar Bessarab	fd6ca57c14	app/vmselect/promql: add implementation of median_over_time for rollup functions list (#5042 ) `median_over_time` is handled by predefined WITH template in MetricsQL library which translates it to `quantile_over_time(0.5)` This makes it impossible to use `median_over_time` as a usual rollup function for `aggr_over_time`. See: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5034 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2023-09-25 15:31:25 +02:00
Konstantin	c1a8a2d54c	app/vmselect: return +Inf as null in graphite render api (#5009 ) Signed-off-by: Konstantin Kulikov <k.kulikov2@gmail.com>	2023-09-18 16:41:39 +02:00
Dmytro Kozlov	5477b52991	vmagent: add validation of MetricsQL functions (#4991 ) Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-09-15 13:16:22 +02:00
Aliaksandr Valialkin	9c3a37597c	app/vmselect/netstorage: run `make fmt` after `58326dbf25`	2023-09-10 15:18:15 +02:00
Aliaksandr Valialkin	58326dbf25	app/vmselect: return 503 status code when partial responses are denied and some of vmstorage nodes are temporarily unavailable This should help detecting this case and automatic retrying the query at healthy cluster replica in another availability zone. This commit is needed as a preparation for automatic query retry at another backend at vmauth on 5xx errors as described at https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4792#issuecomment-1674338561	2023-09-07 16:07:06 +02:00
Aliaksandr Valialkin	4d3c24492c	app/vmselect: run `make vmui-update`	2023-09-06 10:29:59 +02:00
Aliaksandr Valialkin	b9b2fbc7cd	app/vmselect: run `make vmui-update` after `c112dd7367`	2023-09-01 10:54:22 +02:00
Aliaksandr Valialkin	d8afd7fe98	Makefile: update golangci-lint from v1.51.2 to v1.54.2 See https://github.com/golangci/golangci-lint/releases/tag/v1.54.2	2023-09-01 10:25:49 +02:00
Aliaksandr Valialkin	1ca3b660f0	app/vmselect/promql: add support for `_` delimiters in numeric values For example, 1_234_567_890 is equivalent to 1234567890, while 1.234_567_890 is equivalent to 1.234567890	2023-08-30 14:35:58 +02:00
Aliaksandr Valialkin	3a2d035283	lib/auth: add NewTokenPossibleMultitenant() for parsing auth token, which can be multitenant Disallow parsing multitenant token at auth.NewToken(). Use auth.NewTokenPossibleMultitenant() at vminsert only. All the other callers should call auth.NewToken(), since they do not support multitenant token. This is a follow-up for `f0c06b428e` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4910	2023-08-30 14:13:51 +02:00
hagen1778	bda9699657	app/vmselect: follow-up after `f0c06b428e` Remove extra error message when auth token is nil. The default message about unsupported path should be more clear to the user who mistakenly requested /multitenant path. `f0c06b428e` Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-08-29 21:52:50 +02:00
Zakhar Bessarab	f0c06b428e	app/vmselect: fix panic when using `/select/multitenant` endpoint (#4912 ) app/vmselect: fix panic when using `/select/multitenant` endpoint Such requests must be rejected as not found since vmselect does not support multitenant endpoint. See: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4910 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2023-08-29 21:48:12 +02:00
Aliaksandr Valialkin	5e8dfcf65e	app/{vmselect,vlselect}: run `make vmui-update vmui-logs-update` after recent changes to app/vmui	2023-08-29 12:58:58 +02:00
Aliaksandr Valialkin	19d61737c1	app/{vminsert,vmselect}: follow-up after `2b7b3293c1` - Document the change at docs/CHANGELOG.md - Set the default value for -vmstorageUserTimeout to 3 seconds. This is much better than the 0 value, which means that TCP connection to unreachable vmstorage could block for up to 16 minutes. - Document -vmstorageUserTimeout at docs/Cluster-VictoriaMetrics.md	2023-08-29 12:17:39 +02:00
Will Jordan	2b7b3293c1	Add `vmstorageUserTimeout` flags to configure TCP user timeout (Linux) (#4423 ) `TCP_USER_TIMEOUT` (since Linux 2.6.37) specifies the maximum amount of time that transmitted data may remain unacknowledged before TCP will forcibly close the connection and return `ETIMEDOUT` to the application. Setting a low TCP user timeout allows RPC connections to quickly reroute around unavailable storage nodes during network interruptions.	2023-08-29 11:46:39 +02:00
hagen1778	f48962e834	vmselect: follow-up after `7349f18c55` `7349f18c55` Signed-off-by: hagen1778 <roman@victoriametrics.com> (cherry picked from commit `ea2fbcf0e6`)	2023-08-21 15:50:19 +02:00
Tamara Vashchuk	6a59737e96	vmui: Add button to prettify query (#4694 ) * Add button to prettify query Just capitalizes query text for now * Add /prettify-query API handler * Replace UI prettifier using prettifier API * Add showing server errors Had to pass setQueryErrors from useFetchQuery.ts * Use serverUrl from global AppState * Change icon to AutoAwesome icon + added style change color when button is active * Add sync/await to prettifyQuery function * Doc public function for lint * Minor async fix * Removed extra blank lines * Extract usePrettifyQuery hook * Made more generic style for :active button * Refactor usePrettifyQuery However, prettify errors don't clean up query errors, but should * Add prettyQuery functionality to CHANGELOG.md * Reuse queryErrors * Unhide errors on start --------- Co-authored-by: Tamara <toma.vashchuk@gmail.com> (cherry picked from commit `7349f18c55`) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-08-21 15:50:17 +02:00
Aliaksandr Valialkin	5c80b11c15	app/vmselect: prevent from panic when lookbehind window inside rollup function is parsed into negative value Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4795	2023-08-12 04:49:56 -07:00
Aliaksandr Valialkin	37af7d4ed3	app/{vmselect,vlselect}: run `make vmui-update vmui-logs-update` after `86f1459ca6`	2023-08-11 07:01:15 -07:00
Damon07	4c509c0b89	{app/vmselect,docs}: support share_eq_over_time#4441 (#4725 ) https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4441 Co-authored-by: wangm <wangmm@tuya.com>	2023-07-31 07:51:09 -07:00
Aliaksandr Valialkin	16c343f882	app/{vmselect,vlselect}/vmui: run `make vmui-update vmui-logs-update` after `b6ae325763`	2023-07-24 17:15:26 -07:00
Aliaksandr Valialkin	c921bc0833	app/{vmselect,vlselect}: run `make vmui-update vmui-logs-update` after recent changes to VMUI Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4604 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/4676 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/4294	2023-07-20 21:53:51 -07:00
Aliaksandr Valialkin	a0b7def89d	app/vmselect/promql: fix tests after `781947a7e2`	2023-07-20 21:25:30 -07:00
Aliaksandr Valialkin	0cbe5ccb4a	app/vmselect: rename promql.WriteActiveQueries() to promql.ActiveQueriesHandler() This makes it more consistent with the rest of handlers inside app/vmselect/main.go This is a follow-up for `6a96fd8ed5` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4598	2023-07-20 11:30:40 -07:00
Aliaksandr Valialkin	992c300ce9	all: replace atomic.Value with atomic.Pointer[T] This eliminates the need in .(*T) casting for results obtained from Load() Leave atomic.Value for map, since atomic.Pointer[map[...]...] makes double pointer to map, because map is already a pointer type.	2023-07-19 17:48:26 -07:00
Yury Molodov	3ad80e281f	vmui: add Active Queries page (#4653 ) * feat: add page to display a list of active queries (#4598) * app/vmagent: code formatting * fix: remove console --------- Co-authored-by: dmitryk-dk <kozlovdmitriyy@gmail.com>	2023-07-19 16:02:58 -07:00
Aliaksandr Valialkin	5ace0701d3	app/vmselect/promql: add the ability to copy all the labels from `one` side of group_left()/group_right() operation This is performed by specifying `*` inside group_left()/group_right(). Also allow specifying prefix for the copied labels via `group_left(...) prefix "..."` and `group_right(...) prefix "..."` syntax. For example, the following query adds all the namespace-related labels to pod info, and prefixes all the copied label names with "ns_" prefix: kube_pod_info on(namespace) group_left(*) prefix "ns_" kube_namespace_labels This resolves the following StackOverflow questions: - https://stackoverflow.com/questions/76661818/how-to-add-namespace-labels-to-pod-labels-in-prometheus - https://stackoverflow.com/questions/76653997/how-can-i-make-a-new-copy-of-kube-namespace-labels-metric-with-a-different-name	2023-07-17 16:58:30 -07:00
Aliaksandr Valialkin	cc54fa2a56	app/vmselect/promql: recommend to use `(a op b) keep_metric_names` instead of `a op b keep_metric_names` The `a op b keep_metric_names` is ambiguous to `a op (b keep_metric_names)` when `b` is a transform or rollup function. For example, `a + rate(b) keep_metric_names`. So it is better to use more clear syntax: `(a op b) keep_metric_names` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3710	2023-07-16 23:47:15 -07:00
Zakhar Bessarab	781947a7e2	metricsql: add support of using keep_metric_names for binary operations (#4109 ) * metricsql: add support of using keep_metric_names for binary operations This should help to avoid confusion with queries like one in the issue #3710. Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * wip --------- Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-07-16 03:01:27 -07:00
Aliaksandr Valialkin	a7fdc3fcc7	all: add support for `or` filters in series selectors This commit adds ability to select series matching distinct filters via a single series selector. For example, the following selector selects series with either {env="prod",job="a"} or {env="dev",job="b"} labels: {env="prod",job="a" or env="dev",job="b"} The `or` filter is supported in all the VictoriaMetrics tools now. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3997 Uses https://github.com/VictoriaMetrics/metricsql/pull/14	2023-07-15 23:56:18 -07:00
Aliaksandr Valialkin	f65153018b	app/{vmselect,vlselect}: run `make vmui-update vmui-logs-update`	2023-07-09 12:44:04 -07:00
Haleygo	ef8e3eb9b3	vmselect: fix result in Prometheus query when time is small (#4578 ) vmselect: fix result in Prometheus query when time is small Co-authored-by: Roman Khavronenko <roman@victoriametrics.com>	2023-07-09 12:33:29 -07:00
Aliaksandr Valialkin	e1a2404db5	app/vmselect/netstorage: follow-up after `173ccf4333` - Clarify docs about -replicationFactor command-line flag at vmselect - Clarify description for -replicationFactor and -search.skipSlowReplicas command-line flags - Fix the logic for returning responses if -search.skipSlowReplicas command-line flag is enabled. The logic was broken in the `173ccf4333`, so it could return responses only if some of vmstorage nodes return error, while it should return when query results are successfully collected from more than (len(storageNodes) - replicationFactor) vmstorage nodes. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1207 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/711	2023-07-09 11:58:22 -07:00
Haleygo	14e242d0b9	vmselect: fix result collect count (#4599 )	2023-07-08 08:21:27 +02:00
Roman Khavronenko	173ccf4333	vmselect: introduce `search.skipSlowReplicas` cmd-line flag (#4538 ) * vmselect: introduce `search.skipSlowReplicas` cmd-line flag vmselect has two logical conditions during request processing when `-replicationFactor` cmd-line flag is set: 1. If at least `len(storageNodes) - replicationFactor` responded, it could skip waiting for the rest of nodes to respond. This could lead to problems described here https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1207. 2. Mark response as partial if less than `len(storageNodes) - replicationFactor` responded without an error. The P1 showed itself error-prone and became the main reason why `-replicationFactor` wasn't recommended to use at vmselect level. However, this optimization could be still very useful in situations when there are slow and fast replicas in cluster. But P2 remains viable and important conditionless. Hiding P1 behind the feature-flag `search.skipSlowReplicas` should make `-replicationFactor` flag usable again. And let users choose whether they want P1 to be respected. Related issues https://github.com/VictoriaMetrics/VictoriaMetrics/issues/1207 https://github.com/VictoriaMetrics/VictoriaMetrics/issues/711 Signed-off-by: hagen1778 <roman@victoriametrics.com> * docs: update changelog Signed-off-by: hagen1778 <roman@victoriametrics.com> --------- Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-07-07 11:50:26 +02:00
Aliaksandr Valialkin	4b10432435	app/vlselect: handle vmui at /select/vmui path instead of /vmui This simplifies routing at auth proxies such as vmauth to vlselect component, which serves VMUI - just route all the requests, which start with /select/, to vlselect.	2023-07-06 21:36:28 -07:00
Aliaksandr Valialkin	427ce69426	app/vmselect: move common http functionality from app/vmselect/searchutils to lib/httputils While at it, move app/vmselect/bufferedwriter to lib/bufferedwriter, since it is going to be used in VictoriaLogs	2023-07-06 17:22:23 -07:00
Aliaksandr Valialkin	dff199a745	app/vmselect/graphite: follow-up after `c7884f8686` - Consistently use -search.maxGraphiteTagValues for limiting tag values from auto-complete API - Use -search.maxGraphiteSeries for limiting paths (aka series), which can be returned from Graphite series API - Clarify the change in docs/CHANGELOG.md Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4339 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2841	2023-07-06 15:19:07 -07:00
Aliaksandr Valialkin	eb47ad4b69	app/vmselect/netstorage: remove runtime.Gosched() call from unpackWorker() This should improve scalability of unpackWorker() on systems with many CPU cores. This is a follow-up for `a2ecf4fa4a` and `16f3b279a2` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3966	2023-07-06 10:07:42 -07:00
Aliaksandr Valialkin	ec75d9097d	app/vmselect/netstorage: follow-up after `11ac551d52` - Clarify the scope of the fix at docs/CHANGELOG.md - Handle the case when -search.maxSamplesPerSeries limit is exceeded in the same way as the -search.maxSamplesPerQuery limit. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/4472	2023-07-05 21:13:34 -07:00
Aliaksandr Valialkin	643e99a157	app/vmselect/netstorage: improve code readability a bit after `6c84b61893` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4364	2023-07-05 20:48:38 -07:00
Roman Khavronenko	11ac551d52	app/vmselect/netstorage: properly process `-search.maxSamplesPerQuery` limit (#4472 ) Properly return the error to user when `-search.maxSamplesPerQuery` limit is exceeded. Before, user could have received a partial response instead. Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-06-23 13:17:34 +02:00
Dmytro Kozlov	c5debee3f4	app/{graphite,netstorage,prometheus}: fix graphite search tags api limits, remove redundant limit from SeriesHandler handler (#4352 ) * app/{graphite,netstorage,prometheus}: fix graphite search tags api limits, remove unused limit from SeriesHandler handler, * app/{graphite,netstorage,prometheus}: use search.maxTagValues for Graphite * app/{graphite,netstorage,prometheus}: update CHANGELOG.md * app/{graphite,netstorage,prometheus}: use own flags for Graphite API * app/{graphite,netstorage,prometheus}: cleanup * app/{graphite,netstorage,prometheus}: cleanup * app/{graphite,netstorage,prometheus}: update docs --------- Co-authored-by: Nikolay <nik@victoriametrics.com> (cherry picked from commit `c7884f8686`) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-06-09 10:39:12 +02:00
Nikolay	e3ce736ce2	app/vmselect/graphite: fixes tests for arm (#4348 ) On arm-based CPUs only 9 digits after the decimal point match in tests, especially for holtWinters functions. Since this affects tests only, it makes no sense to change float precision in the actual functions (cherry picked from commit `228ea03bda`)	2023-06-02 13:19:34 +02:00
Roman Khavronenko	576e59d82c	cluster: standardize default HTTP responses (#4368 ) Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-06-01 10:26:52 +02:00
Haleygo	6c84b61893	vmselect:fix init sn take too much time (#4366 ) * vmselect: decrease start time for vmselect https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4364	2023-05-30 13:04:31 +02:00
Aliaksandr Valialkin	934a7f485c	app/vmselect: log locations of sendPrometheusError() calls Previously the location inside the sendPrometheusError() was logged. This could make hard investigating error locations via `vm_log_messages_total` metric.	2023-05-18 20:39:50 -07:00
Aliaksandr Valialkin	1ff67bb036	app/vmselect/vmui: run `make vmui-update` after `39c1b0f8d1`	2023-05-18 12:15:22 -07:00
Alexander Marshalov	d321ea91f2	fixed typos in documentation and commandline flags descriptions (#4275 )	2023-05-10 02:22:06 -07:00
Aliaksandr Valialkin	8703b2fa87	app/vmselect: small cleanup after `4f3f9950d0` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3807	2023-05-09 22:45:02 -07:00
Aliaksandr Valialkin	fbc28810b1	app/vmselect: small cleanup after `68e31a6000` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3811	2023-05-09 22:43:59 -07:00
Aliaksandr Valialkin	5dbaffe2c6	app/{vmselect,vmctl}: move ParseTime() to lib/promutils Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4091 This is a follow-up for `e2053baf32`	2023-05-09 22:42:35 -07:00
Roman Khavronenko	5bc8d8f290	vmselect: exit early from queue on context cancel (#4223 ) * vmselect: exit early from queue on context cancel When `-search.maxConcurrentRequests` is reached, vmselect puts request in the queue. It is expected that requests in the queue will be processed as soon as there is enough capacity to do so. However, it could happen that while the request was waiting its turn, the client has already canceled it (closed the connection, or just closed the tab with UI). In this case, we should de-queue such requests to avoid spending extra resources on them. Signed-off-by: hagen1778 <roman@victoriametrics.com> * app/vmselect: address review comments Signed-off-by: hagen1778 <roman@victoriametrics.com> --------- Signed-off-by: hagen1778 <roman@victoriametrics.com>	2023-05-08 22:58:05 -07:00
Aliaksandr Valialkin	1a7794735e	app/vmselect: fix the build after fb8889820aba710508033cbf6826eb63a357532a	2023-05-08 17:32:18 -07:00
Yury Molodov	de35cbf251	vmui: Integrate WITH template playground (#3831 ) * feat: add WithTemplate page * app/vmselect/prometheus: enable json mode for expand with expr API * app/vmselect/prometheus: enable CORS and add content type * feat: add api for expand with templates * fix: remove console from useExpandWithExprs * app/vmselect/prometheus: fix escaping * vmui: integrate WITH template * app/vmctl: check content type instead of form param * fix: add content-type for fetch with-exprs * fix: add a header to the server's response that allows the "Content-Type" header * app/vmctl: added comment and cleanup * app/vmctl: use format query param --------- Co-authored-by: dmitryk-dk <kozlovdmitriyy@gmail.com>	2023-05-08 14:35:35 -07:00
Aliaksandr Valialkin	cf4701db65	lib/fs: add MustReadDir() function Use fs.MustReadDir() instead of os.ReadDir() across the code in order to reduce the code verbosity. The fs.MustReadDir() logs the error with the directory name and the call stack on error before exit. This information should be enough for debugging the cause of the error.	2023-04-14 22:11:40 -07:00
Aliaksandr Valialkin	c4638553a3	lib/fs: rename WriteFileAtomically to MustWriteAtomic Callers of this function log the returned error and exit. So let's just log the error with the given filepath and the call stack inside the function itself and then exit. This simplifies the code at callers' place while leaves the same level of debuggability in case of errors.	2023-04-13 22:43:30 -07:00
Aliaksandr Valialkin	aac3dccfd1	lib/fs: replace MkdirAllIfNotExist->MustMkdirIfNotExist and MkdirAllFailIfExist->MustMkdirFailIfExist Callers of these functions log the returned error and then exit. The returned error already contains the path to directory, which was failed to be created. So let's just log the error together with the call stack inside these functions. This leaves the debuggability of the returned error at the same level while allows simplifying the code at callers' side. While at it, properly use MustMkdirFailIfExist instead of MustMkdirIfNotExist inside inmemoryPart.MustStoreToDisk(). It is expected that the inmemoryPart.MustStoreToDisk() must fail if there is already a directory under the given path.	2023-04-13 22:22:08 -07:00
Aliaksandr Valialkin	26b361f4c3	app/vmselect/vmui: run `make vmui-update` after `01fc228fb0`	2023-04-06 15:11:54 -07:00
Aliaksandr Valialkin	a241485262	app/vmselect/vmui: run `make vmui-update` after `a1601929ec`	2023-04-06 03:20:16 -07:00
Yury Molodov	7871ee0e43	vmui: implement heatmap improvements (#4078 ) * fix: disabled limits for histogram * fix: add sorted buckets by upper bound * refactor: move line chart components to folder * feat: implement heatmap improvements (https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3384#issuecomment-1484023162) * app/vmselect/vmui: `make vmui-update` --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-04-05 22:15:23 -07:00
Aliaksandr Valialkin	fa2ba7b07b	app/vmselect/vmui: run `make vmui-update` after `edb45d7fc1`	2023-04-02 21:22:17 -07:00
Aliaksandr Valialkin	7b10af4846	app/vmselect/vmui: run `make vmui-update` after `42087518ba`	2023-04-01 00:41:03 -07:00
Aliaksandr Valialkin	db8fda4ec6	app/vmselect/graphite: open source Graphite Render API	2023-03-31 23:37:40 -07:00
Nikolay	b38a145cfd	app/vmselect: properly remove temp files at windows system (#4020 ) With non-posix compliant systems it's not possible to remove unclosed files. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/70	2023-03-27 18:10:44 -07:00
Aliaksandr Valialkin	54b9537a76	app/vmselect/promql: follow-up for `79e1c6a6fc` - Document the fix at docs/CHANGELOG.md - Add tests with multiple adjacent zero buckets - Simplify the fix a bit Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/296 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/4021	2023-03-27 18:04:30 -07:00
Ze'ev Klapow	680a661ec0	fix le buckets when adjacent vmrange is empty (#4021) There is a bug here where if you have a single bucket like: foo{vmrange="4.084e+02...4.642e+02"} 2 123 The expected output is three le encoded buckets like: foo{le="4.084e+02"} 0 123 foo{le="4.642e+02"} 2 123 foo{le="+Inf"} 2 123 This correctly encodes the start and end of the vmrange. If however, the input contains the previous bucket, and that bucket is empty then you only get the end le and +Inf out currently, i.e: foo{vmrange="7.743e+05...8.799e+05"} 5 123 foo{vmrange="6.813e+05...7.743e+05"} 0 123 results in: foo{le="8.799e+05"} 5 123 foo{le="+Inf"} 5 123 This causes issues when you go to compute a quantile because this means that the assumed lower bound of the buckets is 0 and thus we interpolate between 0->end rather than the vmrange start->end as expected.	2023-03-27 18:04:29 -07:00
Aliaksandr Valialkin	9387793f47	app/vmselect: follow-up for `10ab086366` - Expose stats.seriesFetched at `/api/v1/query_range` responses too for the sake of consistency. - Initialize QueryStats when it is needed and pass it to EvalConfig then. This guarantees that the QueryStats is properly collected when the query contains some subqueries.	2023-03-27 15:11:42 -07:00
Roman Khavronenko	10ab086366	app/vmselect: export `seriesFetched` stat for /query responses (#3925 ) The change adds a new field `seriesFetched` to EvalConfig object. Since EvalConfig object can be copied inside `Exec`, `seriesFetched` is a pointer which can be updated by all copied objects. The reason for having stats is that other components, like vmalert, could benefit from this information. Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-03-27 08:51:33 -07:00
Yury Molodov	86a98fa131	vmui: heatmap (#3780 ) * fix: add stroke and font for all axes * feat: add util for generate gradient * feat: add heatmap plugin * feat: add heatmap legend * feat: add heatmap graph (#3384) * vmui: add heatmap graph (#3384) * feat: add convert Prometheus to VictoriaMetrics histogram * fix: prevent re-render graph * feat: reset step for heatmap * feat: normalize heatmap data * fix: format heatmap legend * wip * app/vmselect/vmui: run `make vmui-update` --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-03-26 00:31:21 -07:00
Aliaksandr Valialkin	db3bcbe56a	app/vmselect/netstorage: reduce the contention at fs.ReaderAt stats collection on systems with big number of CPU cores This optimization is based on the profile provided at https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3966#issuecomment-1483208419	2023-03-25 16:38:39 -07:00
Aliaksandr Valialkin	a2ecf4fa4a	app/vmselect/netstorage: document why runtime.Gosched() is removed at `28f054bb00` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3966	2023-03-25 16:38:28 -07:00
Zakhar Bessarab	16f3b279a2	vmselect/netstorage: remove direct calls to `Gosched` to reduce amount of locks for global scope using `runtime.Gosched` requires acquiring global lock to check if there are any other goroutines to perform tasks. with the latest versions of runtime it can pause running goroutines automatically without requiring to call `Gosched` directly. Updates #3966 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2023-03-25 16:37:58 -07:00
Aliaksandr Valialkin	740fa57fdc	app/vmselect/promql: typo fix after `e7f46a0aab`	2023-03-24 23:47:11 -07:00
Aliaksandr Valialkin	7aff6f872f	app/vmselect/promql: follow-up for `7205c79c5a` - Allocate and initialize seriesByWorkerID slice in a single go instead of initializing every item in the list separately. This should reduce CPU usage a bit. - Properly set anti-false sharing padding at timeseriesWithPadding structure - Document the change at docs/CHANGELOG.md Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3966	2023-03-24 23:39:43 -07:00
Zakhar Bessarab	fec87e3ada	app/vmselect/promql: use lock-less approach to gather results of parallel processing for `evalRollup` funcs (#4004 ) vmselect/promql: refactor `evalRollupNoIncrementalAggregate` to use lock-less approach for parallel workers computation Locking there is causing issues when running on highly multi-core system as it introduces lock contention during results merge. New implementation uses lock less approach to store results per workerID and merges final result in the end, this is expected to significantly reduce lock contention and CPU usage for systems with high number of cores. Related: #3966 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * vmselect/promql: add pooling for `timeseriesWithPadding` to reduce allocations Related: #3966 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * vmselect/promql: refactor `evalRollupFuncWithSubquery` to avoid using locks Uses same approach as `evalRollupNoIncrementalAggregate` to remove locking between workers and reduce lock contention. Related: #3966 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> --------- Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2023-03-24 23:39:41 -07:00
Aliaksandr Valialkin	b9632023c4	app/vmselect/vmui: run `make vmui-update` after `dc2c712a29`	2023-03-24 18:08:51 -07:00
Aliaksandr Valialkin	79d8f0e7c6	app/vmselect/promql: pass workerID to the callback inside doParallel() This opens the possibility to remove tssLock from evalRollupFuncWithSubquery() in the follow-up commit from @zekker6 in order to speed up the code for systems with many CPU cores. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3966	2023-03-20 20:57:34 -07:00
Aliaksandr Valialkin	e749a015a9	app/vmselect/promql: fix TestIncrementalAggr test on systems with less than 3 CPU cores This is a follow-up for `4856a4cf5a`	2023-03-20 20:37:44 -07:00
Aliaksandr Valialkin	08da383eac	app/vmselect/netstorage: reduce the number of calls to runtime.Gosched() at timeseriesWorker() and unpackWorker() Call runtime.Gosched() only when there is work to steal from other workers. Simplify the timeseriesWorker() and unpackWorker() code a bit by inlining stealTimeseriesWork() and stealUnpackWork(). This should reduce CPU usage when processing queries on systems with big number of CPU cores. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3966	2023-03-20 20:32:56 -07:00
Aliaksandr Valialkin	18af01c387	app/vmselect: optimize incremental aggregates a bit Substitute sync.Map with an ordinary slice indexed by workerID. This should reduce the overhead when updating the incremental aggregate state	2023-03-20 15:42:13 -07:00
Aliaksandr Valialkin	7a1e2f49cc	app/vmselect/vmui: `make vmui-update` after `d4525bd2d0`	2023-03-20 14:35:17 -07:00
Aliaksandr Valialkin	fc3d826d7f	all: add Windows build for VictoriaMetrics This commit changes background merge algorithm, so it becomes compatible with Windows file semantics. The previous algorithm for background merge: 1. Merge source parts into a destination part inside tmp directory. 2. Create a file in txn directory with instructions on how to atomically swap source parts with the destination part. 3. Perform instructions from the file. 4. Delete the file with instructions. This algorithm guarantees that either source parts or destination part is visible in the partition after unclean shutdown at any step above, since the remaining files with instructions is replayed on the next restart, after that the remaining contents of the tmp directory is deleted. Unfortunately this algorithm doesn't work under Windows because it disallows removing and moving files, which are in use. So the new algorithm for background merge has been implemented: 1. Merge source parts into a destination part inside the partition directory itself. E.g. now the partition directory may contain both complete and incomplete parts. 2. Atomically update the parts.json file with the new list of parts after the merge, e.g. remove the source parts from the list and add the destination part to the list before storing it to parts.json file. 3. Remove the source parts from disk when they are no longer used. This algorithm guarantees that either source parts or destination part is visible in the partition after unclean shutdown at any step above, since incomplete partitions from step 1 or old source parts from step 3 are removed on the next startup by inspecting parts.json file. This algorithm should work under Windows, since it doesn't remove or move files in use. This algorithm has also the following benefits: - It should work better for NFS. - It fits object storage semantics. 
The new algorithm changes data storage format, so it is impossible to downgrade to the previous versions of VictoriaMetrics after upgrading to this algorithm. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3236 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3821 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/70	2023-03-19 23:28:26 -07:00
oliverpool	8c708ca1e9	app/vmselect/promql: add test to ensure 8-byte alignment (#3948 ) See `0af9e2b693`	2023-03-16 22:07:13 -07:00
Aliaksandr Valialkin	3b4a3583bc	app/vmselect/promql: prevent from `cannot unmarshal timeseries from rollupResultCache` panic after the upgrade to v1.89.0	2023-03-12 19:09:11 -07:00
Aliaksandr Valialkin	cf7d8811f6	app/vmselect/vmui: `make vmui-update` after `00a0816ab1`	2023-03-12 17:22:28 -07:00
Aliaksandr Valialkin	a6a4beb89a	app/vmselect: remove data race on updating EvalConfig.IsPartialResponse from concurrently running goroutines This properly returns `is_partial: true` for partial responses.	2023-03-12 16:53:03 -07:00
Aliaksandr Valialkin	5cd60c54d3	app/vmselect/promql: prevent from SIGBUS crash on architectures, which deny unaligned access to 8-byte words (e.g. ARM) Thanks to @oliverpool for nailing down the root cause of the issue and for the initial attempt to fix it at https://github.com/VictoriaMetrics/VictoriaMetrics/pull/3927	2023-03-12 16:29:18 -07:00
Aliaksandr Valialkin	e491fee1f4	app/vmselect/netstorage: do not intern string representation of MetricName for time series received from vmstorage It turned out that this interning may lead to increased memory usage and increased CPU usage when vmselect performs queries, which select big number of time series. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3692 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3863	2023-03-12 00:44:08 -08:00
Aliaksandr Valialkin	54fe207cc0	all: follow-up for `7a3e16e774` - Sync the description for -httpListenAddr.useProxyProtocol command-line flag at vmagent and vmauth, so it is consistent with the description at vmauth and victoria-metrics - Add a sample of panic text to docs/CHANGELOG.md, so it could be googled - Mention the -httpListenAddr.useProxyProtocol command-line flag in the description for the bugfix Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3335	2023-03-08 01:42:58 -08:00
Aliaksandr Valialkin	bc24d35153	app/vmselect/vmui: `make vmui-update` after `bbf8e459a0`	2023-03-08 01:40:18 -08:00
Aliaksandr Valialkin	d3605ad072	app/vmselect/promql: fix panic when calculating `aggr_func(rollup*())` The panic has been introduced in `dac21d874b`	2023-02-27 11:48:38 -08:00
Aliaksandr Valialkin	bbd5914eb1	all: add makefile rules for GOARCH=s390x for all the VictoriaMetrics components This is a follow-up for `007530f882`	2023-02-26 12:38:48 -08:00
Aliaksandr Valialkin	18dd0d1dbf	.golangci.yml: properly enable `revive` linter and fix all the warnings it detects	2023-02-26 12:19:58 -08:00
Roman Khavronenko	66d0b45651	vmselect/promql: check for deadline in `count_values` fn (#3806) * vmselect/promql: check for deadline in `count_values` fn `count_values` could be very slow during the data processing. Checking for deadline between iterations is supposed to reduce probability of exceeding `search.maxQueryDuration`. The change also adds a new trace record, which captures the time spent in aggregation function. Before that, the trace for aggr funcs could be confusing since it doesn't account for all the places where time was spent. Signed-off-by: hagen1778 <roman@victoriametrics.com> * wip --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-02-24 17:10:38 -08:00
Roman Khavronenko	79eb33556e	metricsql: support optional 2nd argument for rollup functions (#3841 ) * metricsql: support optional 2nd argument for rollup functions Support optional 2nd argument `min`, `max` or `avg` for rollup functions: * rollup * rollup_delta * rollup_deriv * rollup_increase * rollup_rate * rollup_scrape_interval If second argument is passed, then rollup function will return only the selected aggregation type. This change can be useful for situations where only one type of rollup calculation is needed. For example, `rollup_rate(requests_total[5m], "max")`. Signed-off-by: hagen1778 <roman@victoriametrics.com> * wip --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-02-24 13:48:30 -08:00
Aliaksandr Valialkin	8efa9159cf	app/vmselect/promql: measure the time required for calculating the aggregate function from the prepared source time series	2023-02-23 20:06:02 -08:00
Aliaksandr Valialkin	dce8afa4c4	app/vmselect/vmui: `make vmui-update` after `d4fc0ed874`	2023-02-23 19:26:08 -08:00
Aliaksandr Valialkin	6369c88a68	app/vmselect: add -search.logQueryMemoryUsage command-line flag for logging queries, which take big amounts of memory Thanks to @michal-kralik for initial attempts for this feature: - https://github.com/VictoriaMetrics/VictoriaMetrics/pull/3651 - https://github.com/VictoriaMetrics/VictoriaMetrics/pull/3715 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3553	2023-02-23 18:52:44 -08:00
Aliaksandr Valialkin	0c60e4a30a	all: consistently use http.Method{Get,Post,Put} across the codebase This is a follow-up after `9dec3c8f80`	2023-02-22 19:01:09 -08:00
my-git9	7d86c5c94a	chore: Use http constants to replace numbers (#3846 ) Signed-off-by: xin.li <xin.li@daocloud.io>	2023-02-22 18:59:32 -08:00
Alexander Marshalov	9bb9bd266b	fix interpolate function for filling only intermediate gaps (#3816 ) (#3857 ) * fix interpolate function for filling only intermediate gaps (#3816) * Update docs/CHANGELOG.md --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-02-22 18:41:29 -08:00
Aliaksandr Valialkin	ff8c57a964	app/vmselect: allow zero value for `-search.latencyOffset` command-line flag See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/2061#issuecomment-1299109836 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/218	2023-02-21 18:07:27 -08:00
Aliaksandr Valialkin	a15da5ff73	app/vmselect/promql: add `share(q)` aggregate function for normalizing results across multiple time series in [0..1] value range per each timestamp and aggregation group	2023-02-18 22:43:54 -08:00
Aliaksandr Valialkin	84b5532bc1	app/vmselect/promql: add range_zscore(q) and range_trim_zscore(z, q) functions These functions may be useful for dropping outliers at https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3759	2023-02-18 22:43:53 -08:00
Aliaksandr Valialkin	450b6f6d39	app/vmselect/promql: add range_mad(q) and range_trim_outliers(k, q) functions These functions may help trimming outliers during query time for the use case described at https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3759	2023-02-18 15:18:47 -08:00
Aliaksandr Valialkin	7274424252	app/vmui: tooltip formatting enhancements according to https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3706#issuecomment-1429980038	2023-02-14 23:38:05 -08:00
Oleksandr Redko	0e1c395609	app,lib: fix typos in comments (#3804 )	2023-02-13 09:32:35 -08:00
Aliaksandr Valialkin	27b3209816	app/vmui: show `median` instead of `avg` on graph tooltip and line legend Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3706	2023-02-11 12:52:50 -08:00
Aliaksandr Valialkin	db7f237da9	app/vmselect/promql: add `mad_over_time(m[d])` function See https://github.com/prometheus/prometheus/issues/5514	2023-02-11 01:06:39 -08:00
Aliaksandr Valialkin	34379d4cf1	all: run `apk update && apk upgrade` in base Alpine Docker image in order to get all the recent security fixes	2023-02-09 14:03:02 -08:00
Aliaksandr Valialkin	82b5fa2fd0	app/vmui: UX enhancements for https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3706 - Display `min` value additionally to `avg`, `max` and `last` - Allow copy-n-pasting metric name with its labels from both legend and tooltip	2023-02-09 11:05:57 -08:00
Yury Molodov	13f55fc21e	vmui: lazy loading predefined panels (#3795 ) * fix: change logic lazy loading predefined panels * app/vmselect/vmui: `make vmui-update` --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-02-09 00:12:29 -08:00
Yury Molodov	54bfd22ec5	vmui: add last/max/avg values (#3789 ) * feat: add last/max/avg values (#3706) * fix: change filter exclude values * app/vmui: wip - improve the visualization for avg/max/last values - make getAvgFromArray() function resilient against inf/undefined/nil - export getLastFromArray() function, which is resilient against inf/undefined/nil - run `make vmui-update` --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-02-08 22:42:05 -08:00
Aliaksandr Valialkin	c9a32ebaf7	app/vmselect/vmui: `make vmui-update` after `e4c04b6dbe`	2023-02-03 19:34:20 -08:00
Zakhar Bessarab	626bd22157	fix: vmselect multi-level setup panic (#3738 ) * app/vmselect/netstorage: fix panic for multi-level cluster setup when `replicationFactor` was set and request contained `trace` parameter (#3734) Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> * app/vmselect/netstorage: use correct context for retry Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> --------- Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2023-02-01 08:56:36 -08:00
Aliaksandr Valialkin	2049114e1f	app/vmselect/vmui: `make vmui-update` after `dcc5616126`	2023-01-31 13:24:54 -08:00
Yury Molodov	730025d1dc	vmui: add select of Tenant ID (#3673 ) * feat: add select of tenantID * feat: replace tenantID to default url * fix: move the tenantID selector to the top header * fix: hide tenantID selector by condition * fix: correct z-index * app/vmselect/vmui: `make vmui-update` --------- Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-01-27 15:54:09 -08:00
Aliaksandr Valialkin	bccbe07c33	lib/netutil: move IsTrivialNetworkError() function there, since it is used in multiple places across the code	2023-01-27 13:24:44 -08:00
Nikolay	ebebaecd94	lib/netutil: init implementation of proxy protocol (#3687) * lib/netutil: init implementation of proxy protocol https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3335 * wip Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-01-26 23:25:22 -08:00
Yury Molodov	29fd95d426	vmui: include fonts in its bundle (#3705 ) * feat: include fonts in the build * fix: reduce size fonts * wip - Document the change at docs/CHANGELOG.md - Run `make vmui-update` Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-01-24 09:31:37 -08:00
Aliaksandr Valialkin	6ff15ca135	app/vmselect: use consistent randomizer in tests Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3683	2023-01-23 19:27:40 -08:00
Aliaksandr Valialkin	b046af8a4d	app/vmselect: `make vmui-update` after `df7b81b44d`	2023-01-20 12:07:29 -08:00
Aliaksandr Valialkin	46645f5d94	app/vmui: increase perceived performance by 2.5x by reducing the delay before the query execution from 0.8s to 0.3s The delay cannot be removed, since it is used for limiting the rate of queries sent to VictoriaMetrics during graph scrolling.	2023-01-18 01:33:51 -08:00
Aliaksandr Valialkin	e01f52d517	app/vmselect/promql: updates tests for https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3664	2023-01-17 23:26:06 -08:00
Yury Molodov	0561ba3557	vmui: correctly display range results in Table view (#3657 ) * fix: properly display range results * fix: set range values to empty array * wip Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-01-17 21:04:50 -08:00
Yury Molodov	060780af69	vmui: make the step input field global across all the tabs and views (#3644 ) * feat: make the step input field global * fix: correct get step from url * fix: set minimumSignificantDigits to 1 * app/vmselect/vmui: `make vmui-update` Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-01-15 13:47:48 -08:00
Aliaksandr Valialkin	ce47faf102	app/vmselect/promql: reduce memory allocations when searching for time series pairs with identical labelsets in `q1 op q2` queries Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3641	2023-01-15 13:00:28 -08:00
Aliaksandr Valialkin	fe8802bbc8	app/vmselect/promql: reduce the number of memory allocations inside getCommonLabelFilters() This should improve performance a bit for `q1 op q2` queries	2023-01-15 12:56:21 -08:00
Aliaksandr Valialkin	26f6cfd3b2	app/vmselect/netstorage: tune the number of blocks per series which should be unpacked by a single goroutine instead of spinning up multiple goroutines This reduces overhead on time series data unpacking for typical cases, thus reducing CPU usage at vmselect Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3641	2023-01-12 09:35:15 -08:00
Aliaksandr Valialkin	41b0b951f3	app/vmselect/netstorage: unpack series blocks in the current goroutine if their count doesn't exceed 100 This should improve performance a bit for common case	2023-01-12 01:31:38 -08:00
Aliaksandr Valialkin	d33a65e401	app/vmselect/promql: reduce memory allocations at getCommonLabelFilters() function Intern tag keys and values there	2023-01-12 01:27:34 -08:00
Aliaksandr Valialkin	820357f434	app/vmselect: follow-up after `820312a2b1` - Move the feature description at the correct place at docs/CHANGELOG.md - Run `make vmui-update` - Various cosmetic fixes Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3322	2023-01-11 23:37:51 -08:00
Dmytro Kozlov	f2862f088b	app/vmui: define custom path for dashboards json file (#3545 ) * app/vmui: define custom path for dashboards json file * app/vmui: remove unneeded code * app/vmui: move handler to own file, fix show dashboards, * app/vmui: move flag to handler, add flag description * app/vmauth: fix part of the comments * feat: add store for dashboards * fix: prevent fetch dashboards for app mode * app/vmauth: use simple cache for predefined dashboards * app/vmauth: update dashboards doc * app/vmauth: fix ci * app/vmui: decrease timeout * app/vmselect: removed cache, fix comments * app/vmselect: remove unused const * app/vmselect: fix error log, use slice byte instead of struct Co-authored-by: Yury Moladau <yurymolodov@gmail.com>	2023-01-11 23:33:30 -08:00
Yury Molodov	4295ca2ce2	vmui: small changes on explore metrics page (#3634 ) * fix: change issue link * fix: remove legend toggle * fix: move select graph size * feat: save url params on explore metrics page * wip Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-01-11 22:17:06 -08:00
Aliaksandr Valialkin	f7130d571d	app/vmselect: improve logging when the incoming query cannot be executed because of timeout in the wait queue	2023-01-11 01:12:25 -08:00
Aliaksandr Valialkin	675e0fa0ee	app/vmselect/promql: typo fix after `0771d57860`	2023-01-11 01:06:18 -08:00
Aliaksandr Valialkin	3d22532bb8	app/vmselect/promql: make a copy of per-series timestamps before their modification The per-series timestamps are usually shared among series, so it is unsafe modifying them. The issue appeared after the optimization at `2f3ddd4884`	2023-01-11 00:59:02 -08:00
Aliaksandr Valialkin	8a35377cf3	app/vmselect/promql: move the `eval function args in parallel` query trace outside the loop	2023-01-10 22:23:43 -08:00
Aliaksandr Valialkin	aa027529eb	lib/httpserver: directly pass flag value to CheckAuthFlag() There is no sense in passing a pointer to flag value there. This is a follow-up for `4225a0bd75`	2023-01-10 15:59:55 -08:00
Zakhar Bessarab	10f314cdbd	Use `httpAuth.` flags as a fallback for endpoints protected by `AuthKey` flags (#3582) * {lib/server, app/}: use `httpAuth.` flag as fallback for `AuthKey` if it is not set * lib/ingestserver/opentsdbhttp: fix OpenTSDB HTTP handler not respecting `httpAuth.` flags Apply suggestions from code review Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-01-10 15:57:55 -08:00
Aliaksandr Valialkin	98931449c1	app/vmselect/netstorage: reduce tail latency during query processing Previously the selected time series were split evenly among available CPU cores for further processing - e.g unpacking the data and applying the given rollup function to the unpacked data. Some time series could be processed slower than others. This could result in uneven work distribution among available CPU cores, e.g. some CPU cores could complete their work sooner than others. This could slow down query execution. The new algorithm allows stealing time series to process from other CPU cores when all the local work is done. This should reduce the maximum time needed for query execution (aka tail latency). The new algorithm should also scale better on systems with many CPU cores, since every CPU processes locally assigned time series without inter-CPU communications. The inter-CPU communications are used only when all the local work is finished and the pending work from other CPUs needs to be stolen.	2023-01-10 13:42:26 -08:00
Aliaksandr Valialkin	158a280822	app/vmselect/netstorage: reduce memory allocations when unpacking time series Unpack time series with less than 4M samples in the currently running goroutine. Previously a new goroutine was being started for unpacking the samples. This was requiring additional memory allocations.	2023-01-09 23:17:34 -08:00
Aliaksandr Valialkin	c8bd3534cb	app/vmselect/promql: eliminate memory allocation when sorting values inside float64s	2023-01-09 23:06:57 -08:00
Aliaksandr Valialkin	7956b0d974	app/vmselect/promql: pre-allocate memory for values to be merged in mergeTimeseries() This should reduce the number of memory re-allocations	2023-01-09 22:52:19 -08:00
Aliaksandr Valialkin	895d5d9d22	app/vmselect/promql: consistently intern series names obtained from marshalMetricNameSorted This reduces memory allocations when the returned series names are used as map keys later	2023-01-09 22:46:30 -08:00
Aliaksandr Valialkin	12e2bcdf81	app/vmselect/promql: avoid memory allocations and copying from source timeseries to the returned result at timeseriesToResult()	2023-01-09 22:39:15 -08:00
Aliaksandr Valialkin	dd92e2050f	app/vmselect/promql: remove memory allocations from sortMetricTags()	2023-01-09 22:22:58 -08:00
Aliaksandr Valialkin	8050f5a18c	app/vmselect/promql: intern output series names inside timeseriesToResult() This reduces the number of memory allocations for repeated queries, which return (almost) the same set of time series.	2023-01-09 22:20:34 -08:00
Aliaksandr Valialkin	efb3c630fe	app/vmselect/promql: intern output series names during normal aggregation	2023-01-09 22:15:31 -08:00
Aliaksandr Valialkin	c0de651558	app/vmselect/promql: intern output series names during incremental aggregation This should reduce the number of memory allocations for repeated queries	2023-01-09 22:12:05 -08:00
Aliaksandr Valialkin	abbac2c27c	app/vmselect/netstorage: pre-allocate 4 block references per each time series during querying Usually the number of blocks returned per each time series during queries is around 4. So it is a good idea to pre-allocate 4 block references per time series in order to reduce the number of memory allocations.	2023-01-09 22:08:30 -08:00
Aliaksandr Valialkin	2483c67579	app/vmselect/netstorage: cache canonical MetricName for time series returned from the storage This reduces memory allocations for repeated queries, which return (almost) the same set of time series.	2023-01-09 21:56:27 -08:00
Aliaksandr Valialkin	b7a4650ab0	all: use metricsql.CompileRegexp instead of regexp.Compile for compiling regexps used in graphite queries This should speed up repeated queries, since metricsql.CompileRegexp returns regexps from the cache on subsequent calls for the same input regexp.	2023-01-09 21:45:34 -08:00
Aliaksandr Valialkin	9f02f5a05a	app/vmselect/netstorage: eliminate memory allocation for sortBlocksHeap arg when calling mergeSortBlocks()	2023-01-09 21:29:01 -08:00
Aliaksandr Valialkin	96f04c9863	app/vmselect/netstorage: consistently select the sample with the biggest value out of samples with identical timestamps Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3333 This fix is based on https://github.com/VictoriaMetrics/VictoriaMetrics/pull/3620 , but doesn't slow down the common case with merging replicated data blocks so significantly. Benchmark results: Before the change: BenchmarkMergeSortBlocks/replicationFactor-1-4 13968 85643 ns/op 956.53 MB/s 1700 B/op 1 allocs/op BenchmarkMergeSortBlocks/replicationFactor-2-4 10806 109171 ns/op 1500.77 MB/s 2191 B/op 1 allocs/op BenchmarkMergeSortBlocks/replicationFactor-3-4 8887 130623 ns/op 1881.45 MB/s 2660 B/op 1 allocs/op BenchmarkMergeSortBlocks/replicationFactor-4-4 7440 157348 ns/op 2082.52 MB/s 3174 B/op 1 allocs/op BenchmarkMergeSortBlocks/replicationFactor-5-4 6534 184473 ns/op 2220.38 MB/s 3612 B/op 1 allocs/op BenchmarkMergeSortBlocks/overlapped-blocks-bestcase-4 13419 85205 ns/op 961.44 MB/s 2213 B/op 1 allocs/op BenchmarkMergeSortBlocks/overlapped-blocks-worstcase-4 579 1894900 ns/op 43.23 MB/s 46760 B/op 1 allocs/op After the change: BenchmarkMergeSortBlocks/replicationFactor-1-4 13832 85298 ns/op 960.40 MB/s 1716 B/op 1 allocs/op BenchmarkMergeSortBlocks/replicationFactor-2-4 8833 134222 ns/op 1220.66 MB/s 2675 B/op 1 allocs/op BenchmarkMergeSortBlocks/replicationFactor-3-4 6487 184830 ns/op 1329.65 MB/s 3636 B/op 1 allocs/op BenchmarkMergeSortBlocks/replicationFactor-4-4 4977 236318 ns/op 1386.61 MB/s 4733 B/op 1 allocs/op BenchmarkMergeSortBlocks/replicationFactor-5-4 4088 296734 ns/op 1380.36 MB/s 5761 B/op 1 allocs/op BenchmarkMergeSortBlocks/overlapped-blocks-bestcase-4 14083 84067 ns/op 974.47 MB/s 2110 B/op 1 allocs/op BenchmarkMergeSortBlocks/overlapped-blocks-worstcase-4 536 2043534 ns/op 40.09 MB/s 50511 B/op 1 allocs/op	2023-01-09 12:58:18 -08:00
Aliaksandr Valialkin	5876821a16	all: small improvements in error messages and command-line flag descriptions related to concurrency limiters	2023-01-07 00:12:24 -08:00
Aliaksandr Valialkin	b275983403	lib/writeconcurrencylimiter: improve the logic behind -maxConcurrentInserts limit Previously the -maxConcurrentInserts was limiting the number of established client connections, which write data to VictoriaMetrics. Some of these connections could be idle. Such connections do not consume big amounts of CPU and RAM, so there is little sense in limiting the number of such connections. So now the -maxConcurrentInserts command-line option limits the number of concurrently executed insert requests, not including idle connections. It is recommended removing -maxConcurrentInserts command-line option, since the default value for this option should work well for most cases.	2023-01-06 22:07:16 -08:00
Aliaksandr Valialkin	20e9598254	lib/vmselectapi: limit the number of concurrently executed requests This should prevent out of memory errors when big number of vmselect nodes send many concurrent requests to vmstorage The limit can be controlled at vmstorage via the following command-line flags: - search.maxConcurrentRequests - search.maxQueueDuration See https://docs.victoriametrics.com/Cluster-VictoriaMetrics.html#resource-usage-limits	2023-01-06 18:39:46 -08:00
Aliaksandr Valialkin	cd705b0f69	app/vmselect: improve error message when the request cannot be started because too many concurrent requests are already executed	2023-01-06 18:19:05 -08:00
Aliaksandr Valialkin	1d16cc9349	lib/promscrape: pre-fetch metric_relabel_configs rules when debugging metric relabeling for a particular target Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3407	2023-01-05 03:28:14 -08:00
Yury Molodov	802febab74	vmui: improve `Explore metrics` (#3598) * feat: add multiple select * feat: improve explore interface * app/vmselect/vmui: `make vmui-update` Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2023-01-05 02:24:05 -08:00
Aliaksandr Valialkin	78114e85d6	vendor: update github.com/VictoriaMetrics/metricsql from v0.50.0 to v0.51.0 Updates https://github.com/VictoriaMetrics/metricsql/pull/7 Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3589	2023-01-05 01:50:26 -08:00
Aliaksandr Valialkin	ac890b3081	docs: update `-help` outputs for vm* tools	2023-01-03 23:27:31 -08:00
Aliaksandr Valialkin	d794e971fc	app/vmui: small usability improvements - Show in the line tooltip the number of the query which generates the given line. This simplifies comparison of lines generated by multiple queries. - Show metric name as __name__ label in the line tooltip in the same way as other labels are shown there. This makes the label information in the tooltip more consistent. - Properly quote label values with JSON.stringify(). This prevents from improper formatting when label values contain doublequote chars. - Remove double curly braces artifact at graph legend for lines without names and labels. - Properly use modifier for regular expressions across the code.	2022-12-29 15:00:10 -08:00
Aliaksandr Valialkin	71f2979669	app/vmselect/vmui: `make vmui-update` after `1720bddb4f`	2022-12-29 12:20:20 -08:00
ChenyuanHu	8dfe95761e	app/vmselect/prometheus: no need to manually call queryDuration.UpdateDuration (#3564) There is no need to manually call `queryDuration.UpdateDuration(startTime)`, because `defer queryDuration.UpdateDuration(startTime)` is already registered at the beginning of the function (L660).	2022-12-29 10:41:51 -08:00
Aliaksandr Valialkin	f27bb19213	app/vmselect/searchutils: accept partial RFC3339 values at `time`, `start` and `end` query args This simplifies manual usage of the APIs. For example, the following query would return the results over the 2022 year. /api/v1/query_range?start=2022&end=2023&step=1d&query=... This is equivalent to: /api/v1/query_range?start=2022-01-01T00:00:00Z&end=2023-01-01T00:00:00Z&step=1d&query=...	2022-12-28 19:46:21 -08:00
Yury Molodov	6f21435d2d	vmui: fix step field (#3561) * feat: use a unit next to the step value * app/vmselect/vmui: `make vmui-update` Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2022-12-28 16:01:33 -08:00
Artem Navoiev	393f4ab86f	update links to grafana dashboards (#3534) docs: update links to grafana dashboards Signed-off-by: Artem Navoiev <tenmozes@gmail.com>	2022-12-28 11:22:02 -08:00
Aliaksandr Valialkin	197e58f1f6	app/vmui: show min, max and avg lines at `Explore metrics` graphs when `instance` is selected in the same way as when only the `job` is selected This improves consistency of the graphs. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/3386	2022-12-23 23:21:23 -08:00


1252 Commits