docs: clarify why spare CPU and RAM resources are needed in capacity planning

2024-11-23 12:31:07 +01:00 · 2021-07-13 15:48:25 +03:00 · 2021-07-13 15:48:25 +03:00 · 8d99e94a52
commit 8d99e94a52
parent 8d0ec47be9
4 changed files with 12 additions and 12 deletions
--- a/README.md
+++ b/README.md
@ -1106,10 +1106,10 @@ VictoriaMetrics capacity scales linearly with the available resources. The neede

 The needed storage space for the given retention (the retention is set via `-retentionPeriod` command-line flag) can be extrapolated from disk space usage in a test run. For example, if `-storageDataPath` directory size becomes 10GB after a day-long test run on a production workload, then it will need at least `10GB*100=1TB` of disk space for `-retentionPeriod=100d` (100-days retention period).

-It is recommended leaving the following amounts of spare resources for reducing the probability of issues related to temporary spikes in the workload:
+It is recommended leaving the following amounts of spare resources:

-* 50% of free RAM
-* 50% of spare CPU
+* 50% of free RAM for reducing the probability of OOM (out of memory) crashes and slowdowns during temporary spikes in workload.
+* 50% of spare CPU for reducing the probability of slowdowns during temporary spikes in workload.
 * At least 30% of free storage space at the directory pointed by `-storageDataPath` command-line flag.


--- a/docs/Cluster-VictoriaMetrics.md
+++ b/docs/Cluster-VictoriaMetrics.md
@ -306,10 +306,10 @@ Each node type - `vminsert`, `vmselect` and `vmstorage` - can run on the most su

 The needed storage space for the given retention (the retention is set via `-retentionPeriod` command-line flag at `vmstorage`) can be extrapolated from disk space usage in a test run. For example, if the storage space usage is 10GB after a day-long test run on a production workload, then it will need at least `10GB*100=1TB` of disk space for `-retentionPeriod=100d` (100-days retention period). Storage space usage can be monitored with [the official Grafana dashboard for VictoriaMetrics cluster](#monitoring).

-It is recommended leaving the following amounts of spare resources on every node type for reducing the probability of issues related to temporary spikes in the workload:
+It is recommended leaving the following amounts of spare resources:

-* 50% of free RAM
-* 50% of spare CPU
+* 50% of free RAM across all the node types for reducing the probability of OOM (out of memory) crashes and slowdowns during temporary spikes in workload.
+* 50% of spare CPU across all the node types for reducing the probability of slowdowns during temporary spikes in workload.
 * At least 30% of free storage space at the directory pointed by `-storageDataPath` command-line flag at `vmstorage` nodes.

 Some capacity planning tips for VictoriaMetrics cluster:
--- a/docs/README.md
+++ b/docs/README.md
@ -1106,10 +1106,10 @@ VictoriaMetrics capacity scales linearly with the available resources. The neede

 The needed storage space for the given retention (the retention is set via `-retentionPeriod` command-line flag) can be extrapolated from disk space usage in a test run. For example, if `-storageDataPath` directory size becomes 10GB after a day-long test run on a production workload, then it will need at least `10GB*100=1TB` of disk space for `-retentionPeriod=100d` (100-days retention period).

-It is recommended leaving the following amounts of spare resources for reducing the probability of issues related to temporary spikes in the workload:
+It is recommended leaving the following amounts of spare resources:

-* 50% of free RAM
-* 50% of spare CPU
+* 50% of free RAM for reducing the probability of OOM (out of memory) crashes and slowdowns during temporary spikes in workload.
+* 50% of spare CPU for reducing the probability of slowdowns during temporary spikes in workload.
 * At least 30% of free storage space at the directory pointed by `-storageDataPath` command-line flag.


--- a/docs/Single-server-VictoriaMetrics.md
+++ b/docs/Single-server-VictoriaMetrics.md
@ -1110,10 +1110,10 @@ VictoriaMetrics capacity scales linearly with the available resources. The neede

 The needed storage space for the given retention (the retention is set via `-retentionPeriod` command-line flag) can be extrapolated from disk space usage in a test run. For example, if `-storageDataPath` directory size becomes 10GB after a day-long test run on a production workload, then it will need at least `10GB*100=1TB` of disk space for `-retentionPeriod=100d` (100-days retention period).

-It is recommended leaving the following amounts of spare resources for reducing the probability of issues related to temporary spikes in the workload:
+It is recommended leaving the following amounts of spare resources:

-* 50% of free RAM
-* 50% of spare CPU
+* 50% of free RAM for reducing the probability of OOM (out of memory) crashes and slowdowns during temporary spikes in workload.
+* 50% of spare CPU for reducing the probability of slowdowns during temporary spikes in workload.
 * At least 30% of free storage space at the directory pointed by `-storageDataPath` command-line flag.