Node_Exporter

mirror of https://github.com/prometheus/node_exporter.git synced 2024-11-23 20:36:21 +01:00

Author	SHA1	Message	Date
Anthony D'Atri	8b466360a3	Modest doc improvements (#1876 ) * Modest doc improvements Signed-off-by: Anthony D'Atri <anthony.datri@gmail.com>	2020-11-25 16:46:58 +01:00
Louis Taylor	c8d5159fed	readme: fix link to PSI documentation (#1864 ) The txt was changed to rst: https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/tree/Documentation/accounting/psi.rst But it's probably better to link to the rendered docs, since the link should be more stable. Signed-off-by: Louis Taylor <louis@kragniz.eu>	2020-11-24 18:33:54 +01:00
胡玮文	2c1d2a6efd	Update the link to prometheus-dcgm The original link is broken (404). Signed-off-by: 胡玮文 <huww98@outlook.com>	2020-07-14 12:24:38 +02:00
Calle Pettersson	ba20f5e061	Update windows_exporter reference Signed-off-by: Calle Pettersson <calle@cape.nu>	2020-06-01 21:07:42 +02:00
Daniel Hodges	b14168cf6a	Add perf tracepoint collection flag (#1664 ) * Add tracepoint collector option for perf collector Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2020-04-17 12:02:08 +02:00
Michael Vorburger ⛑️	4135c00d33	minor README doc fix re. collector.perf.cpus Signed-off-by: Michael Vorburger <mike@vorburger.ch>	2020-04-17 11:02:26 +02:00
Peter Bueschel	da5972b539	Add gauges for allocated memory for queued UDP and TCP packages (#1503 ) * Two new states will be added to the tcpstat collector called rx_queued_bytes and tx_queued_bytes. For UDP datagrams an additional collector 'udp_queues' can be used to expose the total lengths of the tx_queue and rx_queue. @SuperQ and @discordianfish this changes gives us the option to check for overloaded UDP + TCP processing. The names of the new TCP states and the UDP metric can be discussed. The current reasons are just: I don't want to add another collector for the same exposed file, so I just added the new states to the tcpstat collector. I chose the name 'udp_queue' instead of 'udpstat' as UDP has no state. Signed-off-by: Peter Bueschel <peter.bueschel@logmein.com>	2020-03-31 10:46:32 +02:00
Ben Kochie	ef7c05816a	Release 1.0.0-rc.0 (#1614 ) Update CHANGELOG/VERSION for 1.0.0-rc.0 release. * Add a note about new https settings to top-level README. * Mark --web.config flag as experimental. Signed-off-by: Ben Kochie <superq@gmail.com>	2020-02-20 13:42:47 +01:00
Daniel Hodges	ec62141388	Fix num cpu (#1561 ) * add a map of profilers to CPUids `runtime.NumCPU()` returns the number of CPUs that the process can run on. This number does not necessarily correlate to CPU ids if the affinity mask of the process is set. This change maintains the current behavior as default, but also allows the user to specify a range of CPUids to use instead. The CPU id is stored as the value of a map keyed on the profiler object's address. Signed-off-by: Joe Damato <jdamato@fastly.com> Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com> Signed-off-by: Daniel Hodges <hodges@uber.com> Co-authored-by: jdamato-fsly <55214354+jdamato-fsly@users.noreply.github.com>	2020-02-20 11:36:33 +01:00
Ukri Niemimuukko	eac3e30f7f	rapl_linux collector This exposes RAPL statistics from /sys/class/powercap. Co-Authored-By: Ben Kochie <superq@gmail.com> Signed-off-by: Ukri Niemimuukko <ukri.niemimuukko@intel.com>	2020-02-01 12:06:30 +01:00
Peter Nicholson	a80b7d0bc5	Add softnet collector (#1576 ) Signed-off-by: Peter Nicholson <petergoods@hotmail.com>	2019-12-30 01:36:10 +01:00
Alex Schmitz	664025d60c	Scrape cooling_device state Signed-off-by: Alex Schmitz <alex.schmitz@gmail.com>	2019-08-30 08:58:47 -05:00
Sandro Jäckel	697c2deed5	Update rootfs syntax in Docker example (#1443 ) Signed-off-by: Sandro Jäckel <sandro.jaeckel@gmail.com>	2019-08-07 09:19:20 +02:00
Philip Gough	2d95ecaa96	Extends uname collector to export on Darwin OS (#1433 ) Adds uname collector support for Darwin and OpenBSD Signed-off-by: Philip Gough <philip.p.gough@gmail.com>	2019-08-03 12:32:43 +02:00
Phil Frost	f693a71c06	Scrape CPU latency stats from /proc/schedstat (#1389 ) These are useful as a direct indication of CPU contention and task scheduler latency. Handy references: - https://github.com/torvalds/linux/blob/master/Documentation/scheduler/sched-stats.txt - https://doc.opensuse.org/documentation/leap/tuning/html/book.sle.tuning/cha.tuning.taskscheduler.html procfs is updated to pull in the enabling change: https://github.com/prometheus/procfs/pull/186 Signed-off-by: Phil Frost <phil@postmates.com>	2019-07-10 09:16:24 +02:00
Daniel Hodges	7882009870	Add perf exporter (#1274 ) Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>	2019-05-07 13:21:41 +02:00
Daniele Sluijters	5b4140e0bd	README: Move pressure to enabled table (#1325 ) Follow-up from #1261. Signed-off-by: Daniele Sluijters <daenney@users.noreply.github.com>	2019-04-18 13:52:14 +02:00
Daniele Sluijters	cc2fd82008	Expose /proc/pressure (#1261 ) This enables the collection of pressure stall information as exposed by the `/proc/pressure` interface added in the 4.20 release of the Linux kernel. Closes #1174 Signed-off-by: Daniele Sluijters <daenney@users.noreply.github.com>	2019-04-18 12:19:20 +02:00
Johannes 'fish' Ziemke	d2136aace0	Update README: Add note about ts in textfile This closes #1284	2019-03-19 11:23:17 +01:00
Paul Gier	cc847f2f44	collector/cpu: split cpu freq metrics into separate collector (#1253 ) The cpu frequency information is not always needed and/or available. This change allows the cpu frequency metrics to be enabled/disabled separately from the other cpu metrics, and also prevents a frequency metric failure (such as a parse error) from failing the main cpu collector. Fixes #1241 Signed-off-by: Paul Gier <pgier@redhat.com>	2019-02-19 17:22:54 +01:00
Ralf Horstmann	3867ad5ab0	Add diskstats collector for OpenBSD (#1250 ) * Add diskstats collector for OpenBSD Tested on i386 and amd64, OpenBSD 6.4 and -current. * Refactor diskstats collectors This moves common descriptors from Linux, Darwin, OpenBSD diskstats collectors into diskstats_common.go Signed-off-by: Ralf Horstmann <ralf+github@ackstorm.de>	2019-02-06 11:36:22 +01:00
David O'Rourke	d442108d7a	collector: Implement uname collector for FreeBSD (#1239 ) * collector: Implement uname collector for FreeBSD Signed-off-by: David O'Rourke <david.orourke@gmail.com>	2019-02-05 17:39:24 +01:00
Jon Davies	e766485286	Add kstat-based Solaris metrics (#1197 ) * collector/loadavg_solaris.go: Use libkstat to gather load averages. * go.mod: Added go-kstat. * boot_time_solaris.go: Added. * cpu_solaris.go: Added. * README.md: Updated entries for Solaris. * collector/zfs_solaris.go: Added. * CHANGELOG.md: Added note about kstat-based Solaris metrics. Signed-off-by: Jonathan Davies <jpds@protonmail.com>	2019-01-12 13:33:56 +01:00
Johannes 'fish' Ziemke	bcec99e0aa	Add link to prometheus-dcgm (#1164 ) Signed-off-by: Johannes 'fish' Ziemke <github@freigeist.org>	2018-11-19 19:35:01 +01:00
Bryan Boreham	f0d2a06b11	Update readme (#1107 ) * State that wifi collector is disabled by default Signed-off-by: Bryan Boreham <bjboreham@gmail.com> * Add the 'processes' collector to the Readme Signed-off-by: Bryan Boreham <bjboreham@gmail.com>	2018-10-11 18:27:41 +02:00
Yecheng Fu	0f9842f20a	[continue 912] strip rootfs prefix for run in docker (#1058 ) * strip rootfs prefix for run in docker * Use `/` as default value of path.rootfs, and parse mounts from `/proc/1/mounts`. * No need to mount `/proc` and `/sys` because we share host's PID namespace, which allows processes within the container to see all of the processes on the system. Closes: #66 Signed-off-by: Ivan Mikheykin <ivan.mikheykin@flant.com> Signed-off-by: Yecheng Fu <cofyc.jackson@gmail.com>	2018-10-04 14:11:21 +02:00
Andrew Banchich	5da107b02c	Add missing words and update markdown syntax (#1095 ) Signed-off-by: Andrew Banchich <andrewbanchich@gmail.com>	2018-10-03 09:03:25 +02:00
Ben Kochie	354115511c	Add note about SYS_TIME capability for Docker. (#1001 ) Signed-off-by: Ben Kochie <superq@gmail.com>	2018-07-16 18:30:19 +02:00
Jan Klat	c4102f1175	Add sys/class/net parsing from procfs and expose its metrics (#851 ) * add sys/class/net parsing from procfs and expose its metrics Signed-off-by: Jan Klat <jenik@klatys.cz> * change code to use int pointers per procfs change, move netclass to separate collector, change metric naming Signed-off-by: Jan Klat <jenik@klatys.cz> * bump year in licence, remove redundant newline, correct fixtures Signed-off-by: Jan Klat <jenik@klatys.cz> * fix style Signed-off-by: Jan Klat <jenik@klatys.cz> * change carrier changes to counter type Signed-off-by: Jan Klat <jenik@klatys.cz> * fix e2e output Signed-off-by: Jan Klat <jenik@klatys.cz> * add fixtures Signed-off-by: Jan Klat <jenik@klatys.cz> * update vendor, use fixtures correctly Signed-off-by: Jan Klat <jenik@klatys.cz> * change fixtures (device in /sys/class/net should be symlinked) Signed-off-by: Jan Klat <jenik@klatys.cz> * correct fixtures for 64k page, updated readme Signed-off-by: Jan Klat <jenik@klatys.cz>	2018-07-16 15:08:18 +02:00
Johannes 'fish' Ziemke	a6a8ec3c1c	Remove travis build badge Signed-off-by: Johannes 'fish' Ziemke <github@freigeist.org>	2018-05-30 19:16:18 +02:00
Ben Kochie	361b5bf85d	Merge pull request #852 from prometheus/remove-gmond Remove gmond collector	2018-04-27 10:02:16 +02:00
Johannes 'fish' Ziemke	fd66a86a30	Remove gmond collector Signed-off-by: Johannes 'fish' Ziemke <github@freigeist.org>	2018-04-17 20:20:24 +02:00
mischief	693eb82139	update README.md and CHANGELOG.md to reflect new bsd boottime collector Signed-off-by: mischief <mischief@offblast.org>	2018-04-15 08:38:47 +00:00
Karsten Weiss	7e392e6634	Fix spelling mistakes found by codespell Signed-off-by: Karsten Weiss <knweiss@gmail.com>	2018-04-09 18:27:17 +02:00
Brian Brazil	c8c144587e	Enable bonding collector by default. (#872 ) Signed-off-by: Brian Brazil <brian.brazil@robustperception.io>	2018-03-29 15:18:12 +01:00
Ben Kochie	6468e7c80b	Enable NFS client metrics by default. (#828 ) Enable NFS client metrics by default now that it nolonger prints errors on scrape if there are no metrics to display. Also fixup the nfsd README to match the nfs entry.	2018-02-16 15:42:47 +01:00
Ben Kochie	822bf91d14	Update README with nfsd collector. (#826 ) Add missing documentation about the `nfsd` collector.	2018-02-16 09:46:41 +01:00
Ben Kochie	111e3af437	Remove obsolete megacli collector. (#798 ) This collector has been replaced by the textfile collector tool `storcli.py`.	2018-01-23 11:25:42 +01:00
Ben Kochie	bc38ffc538	Update collect[] param documentation (#784 ) Improve recommendations and wording around advanced use of the collect[] param. Remove example that causes users to copy-and-paste it.	2018-01-10 15:16:33 +01:00
Ben Kochie	8f9c8a060d	Update README Add OpenBSD to supported list for meminfo collector[0]. [0]: https://github.com/prometheus/node_exporter/pull/724	2018-01-04 10:33:57 +01:00
david	eb3a917bd8	Use host PID namespace in docker example (#672 ) * Use host PID namespace in docker example See https://github.com/prometheus/node_exporter/issues/671 * Update readme for readability * Fix comments in readme	2017-11-02 12:07:40 +01:00
Siavash Safi	f3a7022602	Add `collect[]` parameter (#699 ) * Add `collect[]` parameter * Add TODo comment about staticcheck ignored * Restore promhttp.HandlerOpts * Log a warning and return HTTP error instead of failing * Check collector existence and status, cleanups * Fix warnings and error messages * Don't panic, return error if collector registration failed * Update README	2017-10-14 14:23:42 +02:00
Ben Kochie	8f9edf87b5	Add extra notes to Building section (#694 ) * Add link to Golang * Add note about RHEL/CentOS build dep.	2017-10-11 11:46:13 +02:00
Wei Wei	1e4af21256	add rslave for docker example, so node_exporter can receive host mount/unmount events (#660 )	2017-10-11 11:18:30 +02:00
Calle Pettersson	859a825bb8	Replace --collectors.enabled with per-collector flags (#640 ) * Move NodeCollector into package collector * Refactor collector enabling * Update README with new collector enabled flags * Fix out-of-date inline flag reference syntax * Use new flags in end-to-end tests * Add flag to disable all default collectors * Track if a flag has been set explicitly * Add --collectors.disable-defaults to README * Revert disable-defaults flag * Shorten flags * Fixup timex collector registration * Fix end-to-end tests * Change procfs and sysfs path flags * Fix review comments	2017-09-28 15:06:26 +02:00
Sami Kerola	3762191e66	Add timex collector (#664 ) This collector is based on adjtimex(2) system call. The collector returns three values, status if time is synchronised, offset to remote reference, and local clock frequency adjustment. Values are taken from kernel time keeping data structures to avoid getting involved how the synchronisation is implemented. By that I mean one should not care if time is update using ntpd, systemd.timesyncd, ptpd, and so on. Since all time sync implementation will always end up telling to kernel what is the status with time one can simply omit the software in between, and look results of the syncing. As a positive side effect this makes collector very quick and conceptually specific, this does not monitor availability of NTP server, or network in between, or dns resolution, and other unrelated but necessary things. Minimum set of values to keep eye on are the following three: The node_timex_sync_status tells if local clock is in sync with a remote clock. Value is set to zero when synchronisation to a reliable server is lost, or a time sync software is misconfigured. The node_timex_offset_seconds tells how much local clock is off when compared to reference. In case of multiple time references this value is outcome of RFC 5905 adjustment algorithm. Ideally offset should be close to zero, and it depends about use case how large value is acceptable. For example a typical web server is probably fine if offset is about 0.1 or less, but that would not be good enough for mobile phone base station operator. The node_timex_freq tells amount of adjustment to local clock tick frequency. For example if offset is one second and growing the local clock will need instruction to tick quicker. Number value itself is not very important, and occasional small adjustments are fine. When frequency is unusually in stable one can assume quality of time stamps will not be accurate to very far in sub second range. Obviously explaining why local clock frequency behaves like a passenger in roller coaster is different matter. Explanations can vary from system load, to environmental issues such as a machine being physically too hot. Rest of the measurements can help when debugging. If you run a clock server do probably want to collect and keep track of everything. Pull-request: https://github.com/prometheus/node_exporter/pull/664	2017-09-19 07:54:06 -07:00
Leonid Evdokimov	c169b4b1c5	Add metrics from SNTPv4 packet to ntp collector & add ntpd sanity check (#655 ) * Add metrics from SNTPv4 packet to ntp collector & add ntpd sanity check 1. Checking local clock against remote NTP daemon is bad idea, local ntpd acting as a client should do it better and avoid excessive load on remote NTP server so the collector is refactored to query local NTP server. 2. Checking local clock against remote one does not check local ntpd itself. Local ntpd may be down or out of sync due to network issues, but clock will be OK. 3. Checking NTP server using sanity of it's response is tricky and depends on ntpd implementation, that's why common `node_ntp_sanity` variable is exported. * `govendor add golang.org/x/net/ipv4`, it is dependency of github.com/beevik/ntp * Update github.com/beevik/ntp to include boring SNTP fix * Use variable name from RFC5905 * ntp: move code to make export of raw metrics more explicit * Move NTP math to `github.com/beevik/ntp` * Make `golint` happy * Add some brief docs explaining `ntp` #655 and `timex` #664 modules * ntp: drop XXX comment that got its decision * ntp: add `_seconds` suffix to relevant metrics * Better `node_ntp_leap` comment * s/node_ntp_reftime/node_ntp_reference_timestamp_seconds/ as requested by @discordianfish * Extract subsystem name to const as suggested by @SuperQ	2017-09-19 10:36:14 +02:00
Ben Kochie	9947f602f3	Add buildkite status badge.	2017-08-24 12:29:34 +02:00
Joe Handzik	4b011bfe44	Clarify Infiniband collector support (#643 ) Tested a DL360 Gen9 box with an Omni-Path adapter in it. The existing InfiniBand collector can provide support for the same metrics on Omni-Path cards as well. Signed-Off-By: Joe Handzik <joseph.t.handzik@hpe.com>	2017-08-16 07:32:54 +02:00
Calle Pettersson	dfe07eaae8	Switch to kingpin flags (#639 ) * Switch to kingpin flags * Fix logrus vendoring * Fix flags in main tests * Fix vendoring versions	2017-08-12 15:07:24 +02:00

1 2 3

131 Commits