docs: clarify why spare CPU and RAM resources are needed in capacity planning

Aliaksandr Valialkin 2021-07-13 15:48:25 +03:00
parent 8d0ec47be9
commit 8d99e94a52
4 changed files with 12 additions and 12 deletions

@@ -1106,10 +1106,10 @@ VictoriaMetrics capacity scales linearly with the available resources. The neede
The needed storage space for the given retention (the retention is set via `-retentionPeriod` command-line flag) can be extrapolated from disk space usage in a test run. For example, if `-storageDataPath` directory size becomes 10GB after a day-long test run on a production workload, then it will need at least `10GB*100=1TB` of disk space for `-retentionPeriod=100d` (100-days retention period).
-It is recommended leaving the following amounts of spare resources for reducing the probability of issues related to temporary spikes in the workload:
+It is recommended leaving the following amounts of spare resources:
-* 50% of free RAM
-* 50% of spare CPU
+* 50% of free RAM for reducing the probability of OOM (out of memory) crashes and slowdowns during temporary spikes in workload.
+* 50% of spare CPU for reducing the probability of slowdowns during temporary spikes in workload.
* At least 30% of free storage space at the directory pointed by `-storageDataPath` command-line flag.

@@ -306,10 +306,10 @@ Each node type - `vminsert`, `vmselect` and `vmstorage` - can run on the most su
The needed storage space for the given retention (the retention is set via `-retentionPeriod` command-line flag at `vmstorage`) can be extrapolated from disk space usage in a test run. For example, if the storage space usage is 10GB after a day-long test run on a production workload, then it will need at least `10GB*100=1TB` of disk space for `-retentionPeriod=100d` (100-days retention period). Storage space usage can be monitored with [the official Grafana dashboard for VictoriaMetrics cluster](#monitoring).
-It is recommended leaving the following amounts of spare resources on every node type for reducing the probability of issues related to temporary spikes in the workload:
+It is recommended leaving the following amounts of spare resources:
-* 50% of free RAM
-* 50% of spare CPU
+* 50% of free RAM across all the node types for reducing the probability of OOM (out of memory) crashes and slowdowns during temporary spikes in workload.
+* 50% of spare CPU across all the node types for reducing the probability of slowdowns during temporary spikes in workload.
* At least 30% of free storage space at the directory pointed by `-storageDataPath` command-line flag at `vmstorage` nodes.
Some capacity planning tips for VictoriaMetrics cluster:

@@ -1106,10 +1106,10 @@ VictoriaMetrics capacity scales linearly with the available resources. The neede
The needed storage space for the given retention (the retention is set via `-retentionPeriod` command-line flag) can be extrapolated from disk space usage in a test run. For example, if `-storageDataPath` directory size becomes 10GB after a day-long test run on a production workload, then it will need at least `10GB*100=1TB` of disk space for `-retentionPeriod=100d` (100-days retention period).
-It is recommended leaving the following amounts of spare resources for reducing the probability of issues related to temporary spikes in the workload:
+It is recommended leaving the following amounts of spare resources:
-* 50% of free RAM
-* 50% of spare CPU
+* 50% of free RAM for reducing the probability of OOM (out of memory) crashes and slowdowns during temporary spikes in workload.
+* 50% of spare CPU for reducing the probability of slowdowns during temporary spikes in workload.
* At least 30% of free storage space at the directory pointed by `-storageDataPath` command-line flag.

@@ -1110,10 +1110,10 @@ VictoriaMetrics capacity scales linearly with the available resources. The neede
The needed storage space for the given retention (the retention is set via `-retentionPeriod` command-line flag) can be extrapolated from disk space usage in a test run. For example, if `-storageDataPath` directory size becomes 10GB after a day-long test run on a production workload, then it will need at least `10GB*100=1TB` of disk space for `-retentionPeriod=100d` (100-days retention period).
-It is recommended leaving the following amounts of spare resources for reducing the probability of issues related to temporary spikes in the workload:
+It is recommended leaving the following amounts of spare resources:
-* 50% of free RAM
-* 50% of spare CPU
+* 50% of free RAM for reducing the probability of OOM (out of memory) crashes and slowdowns during temporary spikes in workload.
+* 50% of spare CPU for reducing the probability of slowdowns during temporary spikes in workload.
* At least 30% of free storage space at the directory pointed by `-storageDataPath` command-line flag.
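
The sizing arithmetic described in the changed docs can be sketched as a short calculation. This is only an illustration, not part of the commit: the function names `required_disk_gb` and `provision_for_headroom` are hypothetical, and the sketch assumes that "30% of free storage space" and "50% of free RAM/CPU" are read as fractions of the total provisioned resource that should stay unused at the observed load.

```python
def required_disk_gb(test_run_gb: float, test_run_days: float,
                     retention_days: float, free_fraction: float = 0.30) -> float:
    """Estimate the disk size for -storageDataPath (hypothetical helper).

    Extrapolates the disk usage measured in a test run linearly to the
    retention period, then enlarges the result so that `free_fraction`
    of the disk remains free when the retention window is full.
    """
    if not 0 <= free_fraction < 1:
        raise ValueError("free_fraction must be in [0, 1)")
    data_gb = test_run_gb / test_run_days * retention_days
    return data_gb / (1 - free_fraction)


def provision_for_headroom(observed_usage: float, spare_fraction: float = 0.50) -> float:
    """Return the total RAM or CPU to provision so that `spare_fraction`
    of it stays unused at the observed usage (hypothetical helper)."""
    if not 0 <= spare_fraction < 1:
        raise ValueError("spare_fraction must be in [0, 1)")
    return observed_usage / (1 - spare_fraction)


if __name__ == "__main__":
    # 10GB after a day-long test run, -retentionPeriod=100d:
    # 1000GB of data, plus room for 30% free space.
    print(round(required_disk_gb(10, 1, 100)))
    # 8 GiB of observed RAM usage with 50% kept spare.
    print(provision_for_headroom(8.0))
```

Under these assumptions, the 10GB-per-day example needs about 1.43TB of disk rather than the bare 1TB of data, and a node that uses 8 GiB of RAM at normal load would be given 16 GiB.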