Node_Exporter

mirror of https://github.com/prometheus/node_exporter.git synced 2024-11-24 22:06:49 +01:00

Author	SHA1	Message	Date
Peter Bueschel	da5972b539	Add gauges for allocated memory for queued UDP and TCP packages (#1503 ) * Two new states will be added to the tcpstat collector called rx_queued_bytes and tx_queued_bytes. For UDP datagrams an additional collector 'udp_queues' can be used to expose the total lengths of the tx_queue and rx_queue. @SuperQ and @discordianfish this change gives us the option to check for overloaded UDP + TCP processing. The names of the new TCP states and the UDP metric can be discussed. The current reasons are just: I don't want to add another collector for the same exposed file, so I just added the new states to the tcpstat collector. I chose the name 'udp_queue' instead of 'udpstat' as UDP has no state. Signed-off-by: Peter Bueschel <peter.bueschel@logmein.com>	2020-03-31 10:46:32 +02:00
Ben Kochie	4891b01b6c	Add changelog entry for #1647 Signed-off-by: Ben Kochie <superq@gmail.com>	2020-03-27 21:36:39 +01:00
Tom Wilkie	6496c24d61	Metrics for IO errors on Mac. (#1636 ) * Metrics for IO errors and retries on Mac. Signed-off-by: Tom Wilkie <tom@grafana.com>	2020-03-21 21:05:38 +01:00
Benjamin Drung	34d50e15d5	Add model_name and stepping to node_cpu_info metric The `node_cpu_info` metric contains some information like the `model` (which is an integer), but not the human readable model name. Also the stepping of the processor might be interesting, since different stepping of a processor might behave differently. Signed-off-by: Benjamin Drung <benjamin.drung@cloud.ionos.com>	2020-03-20 17:27:11 +01:00
Ben Kochie	ef7c05816a	Release 1.0.0-rc.0 (#1614 ) Update CHANGELOG/VERSION for 1.0.0-rc.0 release. * Add a note about new https settings to top-level README. * Mark --web.config flag as experimental. Signed-off-by: Ben Kochie <superq@gmail.com>	2020-02-20 13:42:47 +01:00
Daniel Hodges	ec62141388	Fix num cpu (#1561 ) * add a map of profilers to CPUids `runtime.NumCPU()` returns the number of CPUs that the process can run on. This number does not necessarily correlate to CPU ids if the affinity mask of the process is set. This change maintains the current behavior as default, but also allows the user to specify a range of CPUids to use instead. The CPU id is stored as the value of a map keyed on the profiler object's address. Signed-off-by: Joe Damato <jdamato@fastly.com> Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com> Signed-off-by: Daniel Hodges <hodges@uber.com> Co-authored-by: jdamato-fsly <55214354+jdamato-fsly@users.noreply.github.com>	2020-02-20 11:36:33 +01:00
Paul Gier	b40954dce5	new flag to disable all default collectors (#1460 ) * new flag to disable all default collectors Signed-off-by: Paul Gier <pgier@redhat.com> Co-authored-by: Ben Kochie <superq@gmail.com>	2020-02-20 11:03:33 +01:00
Ben Kochie	3e1b0f1bee	Don't count empty collection as success (#1613 ) Many collectors depend on underlying features to be enabled. This causes confusion about what "success" means. This changes the behavior of the `node_scrape_collector_success` metric. * When a collector is unable to find data don't return success. * Catch the no data error and send to Debug log level to avoid log spam. * Update collectors to support this new functionality. * Fix copy-pasta mistake in infiniband debug message. Closes: https://github.com/prometheus/node_exporter/issues/1323 Signed-off-by: Ben Kochie <superq@gmail.com>	2020-02-19 16:11:29 +01:00
Ben Kochie	1a75bc7b50	Fix up Darwin swap metrics * Add a changelog entry. * Remove redundant swap free metric. Signed-off-by: Ben Kochie <superq@gmail.com>	2020-02-19 15:52:47 +01:00
Silke Hofstra	8faa843fc4	Add Btrfs collector (#1512 ) * Add procfs/btrfs to vendor folder * Add Btrfs collector Resolves #1100 Signed-off-by: Silke Hofstra <silke@slxh.eu>	2020-02-19 15:48:51 +01:00
Ukri Niemimuukko	eac3e30f7f	rapl_linux collector This exposes RAPL statistics from /sys/class/powercap. Co-Authored-By: Ben Kochie <superq@gmail.com> Signed-off-by: Ukri Niemimuukko <ukri.niemimuukko@intel.com>	2020-02-01 12:06:30 +01:00
Paul Cameron	9bb37873a8	Add unix socket support for supervisord collector (#1592 ) * Add unix socket support for supervisord collector For example: --collector.supervisord.url=unix:///var/run/supervisor.sock Fixes prometheus/node_exporter#262 Signed-off-by: Paul Cameron <cameronpm@gmail.com>	2020-01-28 08:50:23 +01:00
Thomas Lin	3ddc82c2d8	Fixed inaccurate 'node_network_speed_bytes' when speeds are low (#1580 ) Integer division and the order of operations when converting Mbps to Bps results in a loss of accuracy if the interface speeds are set low. e.g. 100 Mbps is reported as 12000000 Bps, should be 12500000 10 Mbps is reported as 1000000 Bps, should be 1250000 Signed-off-by: Thomas Lin <t.lin@mail.utoronto.ca>	2020-01-01 13:10:53 +01:00
Peter Nicholson	a80b7d0bc5	Add softnet collector (#1576 ) Signed-off-by: Peter Nicholson <petergoods@hotmail.com>	2019-12-30 01:36:10 +01:00
Ben Kochie	0d9d7e961a	Update CHANGELOG Add/update entries for recent merged PRs. Signed-off-by: Ben Kochie <superq@gmail.com>	2019-11-25 21:50:00 +01:00
Matt Layher	da6b66371f	collector: reimplement sockstat collector with procfs (#1552 ) * collector: reimplement sockstat collector with procfs * collector: handle sockstat IPv4 disabled, debug logging Signed-off-by: Matt Layher <mdlayher@gmail.com>	2019-11-25 13:41:38 -06:00
John Belmonte	15e36e2230	fix typo in cpufreq metric names (#1510 ) Signed-off-by: John Belmonte <john@neggie.net>	2019-10-11 02:12:20 +09:00
Paul Gier	9f5225456d	fix order of items in CHANGELOG Signed-off-by: Paul Gier <pgier@redhat.com>	2019-09-25 14:39:43 -05:00
Paul Gier	4d72cb8059	add node_cpu_info metric Contains information gathered from /proc/cpuinfo Signed-off-by: Paul Gier <pgier@redhat.com>	2019-09-25 14:38:57 -05:00
Ben Kochie	82b7b1f732	Merge branch 'master' into coolingDevice	2019-09-09 17:44:03 +02:00
dt-rush	93fbb93a46	fix issue where rootfs path strips to the empty string (#1464 ) Change-type: patch Connects-to: #1463 Signed-off-by: dt-rush <nickp@balena.io>	2019-09-09 17:39:24 +02:00
Paul Gier	8c3de12c22	systemd: check version for availability of properties (#1413 ) The dbus property 'SystemState' and the timer property 'LastTriggerUSec' were added in version 212 of systemd. Check that the version of systemd is higher than 212 before attempting to query these properties `f755e3b74b` `dedabea4b3` Resolves issue #291 Signed-off-by: Paul Gier <pgier@redhat.com>	2019-09-04 16:27:25 +02:00
Alex Schmitz	664025d60c	Scrape cooling_device state Signed-off-by: Alex Schmitz <alex.schmitz@gmail.com>	2019-08-30 08:58:47 -05:00
Boris Momčilović	93c12e03a1	Ipvs firewall mark (#1455 ) * IPVS: include firewall mark label Signed-off-by: Boris Momčilović <boris@firstbeatmedia.com>	2019-08-27 14:24:11 +02:00
Richard Kojedzinszky	75462bf4fe	Scrape thermal_zone temperatures (#1425 ) * Scrape thermal_zone temperatures Signed-off-by: Richard Kojedzinszky <richard@kojedz.in>	2019-08-04 12:56:36 +02:00
Ben Kochie	10146109ec	Update CHANGELOG for #1433 Signed-off-by: Ben Kochie <superq@gmail.com>	2019-08-03 12:33:25 +02:00
Dipack P Panjabi	a7452023db	Added mountinfo changes to node_exporter (#1417 ) Use the extra information gleaned from the mountinfo file to add a 'mountaddr' field for NFS metrics. This helps prevent prometheus from ignoring mounts that come from the same URL, but are actually from different IP addresses. This commit also rebases to current master Signed-off-by: Dipack P Panjabi <dpanjabi@hudson-trading.com>	2019-07-28 11:32:40 +02:00
Ben Kochie	852b340a46	Add changelog entry for #1439 Signed-off-by: Ben Kochie <superq@gmail.com>	2019-07-28 10:38:41 +02:00
dt-rush	5d3e2ce2ef	properly strip path.rootfs from mountpoint labels (#1421 ) Change-type: patch Connects-to: #1418 Signed-off-by: dt-rush <nickp@balena.io>	2019-07-19 16:51:17 +02:00
Steven Kreuzer	d8e47a9f9f	Expose additional XFS runtime statistics (#1423 ) Include directory operation, read/write system call, and vnode runtime statistics for XFS filesystems. Signed-off-by: Steven Kreuzer <skreuzer@FreeBSD.org>	2019-07-15 16:28:09 +02:00
Ben Kochie	0de95ef8f3	Add changelog entry for #1414 Signed-off-by: Ben Kochie <superq@gmail.com>	2019-07-12 14:25:17 +02:00
Phil Frost	f693a71c06	Scrape CPU latency stats from /proc/schedstat (#1389 ) These are useful as a direct indication of CPU contention and task scheduler latency. Handy references: - https://github.com/torvalds/linux/blob/master/Documentation/scheduler/sched-stats.txt - https://doc.opensuse.org/documentation/leap/tuning/html/book.sle.tuning/cha.tuning.taskscheduler.html procfs is updated to pull in the enabling change: https://github.com/prometheus/procfs/pull/186 Signed-off-by: Phil Frost <phil@postmates.com>	2019-07-10 09:16:24 +02:00
Advait Bhatwadekar	3f49b31101	Closes issue #261 on node_exporter. (#1403 ) * Closes issue #261 on node_exporter. Delegated mdstat parsing to procfs project. mdadm_linux.go now only exports the metrics. -> Added disk labels: "fail", "spare", "active" to indicate disk status -> Changed metric node_md_disks_total ==> node_md_disks_required -> Removed test cases for mdadm_linux.go, as the functionality they tested for has been moved to procfs project. Signed-off-by: Advait Bhatwadekar <advait123@ymail.com>	2019-07-01 11:56:06 +02:00
mknapphrt	3108a50fb6	Fix systemd restart counter label from state to name (#1393 ) Signed-off-by: Mark Knapp <mknapp@hudson-trading.com>	2019-06-25 09:37:48 +02:00
Ben Kochie	c39f6749fc	Bugfix release 0.18.1 (#1366 ) Cherry-pick two bug fixes into 0.18.1. Signed-off-by: Ben Kochie <superq@gmail.com>	2019-06-04 14:29:33 +02:00
Ben Kochie	4a15edf0b6	Add changelog entry for #1364 Signed-off-by: Ben Kochie <superq@gmail.com>	2019-06-03 11:20:06 +02:00
Ben Kochie	fdf9846282	Fixup 0.17.0 changelog (#1354 ) * Fix ordering of CHANGE items by PR number. * Add missing CHANGE for #1003 Signed-off-by: Ben Kochie <superq@gmail.com>	2019-06-02 10:51:07 +01:00
Noam Meltzer	501ccf9fb4	Add --collector.netdev.device-whitelist flag (#1279 ) * Add --collector.netdev.device-whitelist flag Sometimes it is desired to monitor only one netdev. The golang regexp does not support a negated regex, so the ignored-devices flag is too cumbersome for this task. This change introduces a new flag: accept-devices, which is mutually exclusive to ignored-devices. This flag allows specifying ONLY the netdev you'd like. Signed-off-by: Noam Meltzer <noam@cynerio.co>	2019-05-31 17:55:50 +02:00
David O'Rourke	814ef064c0	meminfo: Fix the size mismatch in the swapTotal check mib for BSD. (#1345 ) Signed-off-by: David O'Rourke <david.orourke@gmail.com>	2019-05-14 17:42:36 -05:00
Ben Kochie	f97f01c46c	Update for 0.18.0 release (#1337 ) * Update CHANGELOG for release. * Bump VERSION. * Update vendoring. Signed-off-by: Ben Kochie <superq@gmail.com>	2019-05-09 13:19:12 -05:00
Daniel Hodges	7882009870	Add perf exporter (#1274 ) Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2019-05-07 13:21:41 +02:00
Ben Kochie	78b9eb9c2c	Use 64-bit Darwin netstat counters (#1319 ) Avoid 32-bit counter rollovers. Signed-off-by: Ben Kochie <superq@gmail.com>	2019-04-25 10:07:56 +02:00
Christian Hoffmann	36e3b2a923	textfile: use opened file's mtime as timestamp (#1326 ) Previously, the node_textfile_mtime_seconds metric was based on the Fileinfo.ModTime() of the ioutil.ReadDir() return value. This is based on lstat() and therefore has unintended consequences for symlinks (modification time of the symlink instead of the symlink target is returned). It is also racy as the lstat() is performed before reading the file. This commit changes the node_textfile_mtime_seconds metric to be based on a fresh Stat() call on the open file. This eliminates the race and works as expected for symlinks. Fixes #1324. Signed-off-by: Christian Hoffmann <mail@hoffmann-christian.info>	2019-04-18 17:47:04 +02:00
Daniele Sluijters	cc2fd82008	Expose /proc/pressure (#1261 ) This enables the collection of pressure stall information as exposed by the `/proc/pressure` interface added in the 4.20 release of the Linux kernel. Closes #1174 Signed-off-by: Daniele Sluijters <daenney@users.noreply.github.com>	2019-04-18 12:19:20 +02:00
Paul Gier	cc847f2f44	collector/cpu: split cpu freq metrics into separate collector (#1253 ) The cpu frequency information is not always needed and/or available. This change allows the cpu frequency metrics to be enabled/disabled separately from the other cpu metrics, and also prevents a frequency metric failure (such as a parse error) from failing the main cpu collector. Fixes #1241 Signed-off-by: Paul Gier <pgier@redhat.com>	2019-02-19 17:22:54 +01:00
Ben Kochie	f028b81615	Update systemd blacklist (#1255 ) Include additional unit types in the default systemd collector blacklist. Signed-off-by: Ben Kochie <superq@gmail.com>	2019-02-17 17:57:15 +01:00
Paul Gier	cb9e23c536	Systemd refactor (#1254 ) This reduces the systemd metric collection time by using a wait group and goroutines to allow the systemd metric calls to happen concurrently. Also, makes the start time, restarts, tasks_max, and tasks_current metrics disabled by default because these can be time consuming to gather. Signed-off-by: Paul Gier <pgier@redhat.com>	2019-02-11 23:27:21 +01:00
Sachi King	18fc512fc4	Bond: Monitor bond mii_status not link operstate (#1124 ) With a bond interface the state of the slave interface from the bond's point of view is reflected in `mii_status` and is independent of the link's `operstate`. When a bond is monitored with `miimon`, `mii_status` will reflect the state of the physical link as configured via the operator. When a bond is monitored via `arp_interval` the `mii_status` will reflect the results of the bond ARP checking. This means the link can be down from the bond's point of view, but up from a physical connection point of view. If a bond is not monitored via miimon or arp, the `mii_status` should likely be always `up`, however I have observed a case where this is not true and the `operstate` is `up` while `mii_status` is `down`. Kernel bond documentation stresses that a bond should not be configured without one of `mii_mon` or `arp_interval` configured however. This change results in the metric 'node_bonding_active' matching the up/down state of the bond's point of view rather than operstate. Signed-off-by: Sachi King <nakato@nakato.io>	2019-02-10 11:00:04 +01:00
Paul Gier	e0d6d11859	netclass_linux: remove varying labels from the 'up' metric (#1243 ) * netclass_linux: remove varying labels from the 'up' metric This moves the variable label values such as 'operstate' out of the 'network_up' metric and into a separate metric called '_info'. This allows the 'up' metric to remain continuous over state changes. Fixes #1236 Signed-off-by: Paul Gier <pgier@redhat.com>	2019-02-07 15:59:32 +01:00
Johannes 'fish' Ziemke	6ea0aa73e4	Rename interface to device in netclass collector (#1224 ) * Rename interface to device in netclass collector This makes it consistent with other networking metrics like node_network_receive_bytes_total This closes #1223 Signed-off-by: Johannes 'fish' Ziemke <github@freigeist.org>	2019-02-06 20:02:48 +01:00

102 Commits