Node_Exporter

mirror of https://github.com/prometheus/node_exporter.git synced 2024-12-30 15:50:08 +01:00

Author	SHA1	Message	Date
Ben Kochie	0fdc089187	Change systemd unit filtering (#1083 ) * Change systemd unit filtering Get all units from systemd and filter in Go. * Improves compatibility with older versions of systemd. * Improve debugging by printing when units pass the filter. * Remove extraneous newlines from log messages. Signed-off-by: Ben Kochie <superq@gmail.com>	2018-09-24 15:04:55 +02:00
Luca Bruno	4672ea1671	collector/timex: remove cgo dependency (#1079 ) This removes the cgo import from timex collector, as it was only used to define two constants. Those are part of the Linux kernel<->userspace interface, thus there is no need to depend on libc to source them: https://github.com/torvalds/linux/blob/v4.18/include/uapi/linux/timex.h Signed-off-by: Luca Bruno <luca.bruno@coreos.com>	2018-09-20 11:51:34 +02:00
Björn Rabenstein	1c9ea46cca	Update vendoring for client_golang and friends (#1076 ) Signed-off-by: beorn7 <beorn@soundcloud.com>	2018-09-17 17:09:52 +02:00
Ben Kochie	ebdd524123	Correctly cast Darwin memory info (#1060 ) * Correctly cast Darwin memory info * Cast stats to float64 before doing math on them to avoid integer wrapping. * Remove invalid `_total` suffix from gauge values. * Handle counters in `meminfo.go`. Signed-off-by: Ben Kochie <superq@gmail.com>	2018-09-07 22:27:52 +02:00
Marco Tulio R Braga	05e55bddad	Fix typo on description of read_time_seconds_total (#1057 ) Fix typo on unit description of metric `*read_time_seconds_total` from milliseconds to seconds. Signed-off-by: Marco Tulio R Braga <marco.tulio@mtulio.eng.br>	2018-09-02 09:46:45 +02:00
Dan Fredell	c52e0d3353	Fix SmartOS build #1017 (#1018 ) Signed-off-by: Dan Fredell <Dan.Fredell@gmail.com>	2018-08-23 10:57:15 +00:00
James Hartig	60c827231a	NRestarts or NRefused aren't available on older systemd versions (#1039 ) * If NRestarts or NRefused are not available, don't ignore the unit itself * Don't report systemd metrics (NRestarts/NRefused) that are not available Signed-off-by: James Hartig <james@getadmiral.com>	2018-08-14 14:28:26 +02:00
Ben Kochie	fe5a117831	Handle vanishing PIDs (#1043 ) PIDs can vanish (exit) from /proc/ between gathering the list of PIDs and getting all of their stats. * Ignore file not found errors. * Explicitly count the PIDs we find. * Cleanup some error style issues. Signed-off-by: Ben Kochie <superq@gmail.com>	2018-08-13 17:27:23 +02:00
Ben Kochie	0662673ad6	Disable wifi collector by default (#1037 ) * Disable wifi collector by default Disable the wifi collector by default due to suspected cashing issues and goroutine leaks. * https://github.com/prometheus/node_exporter/issues/870 * https://github.com/prometheus/node_exporter/issues/1008 Signed-off-by: Ben Kochie <superq@gmail.com>	2018-08-07 10:27:20 +02:00
Ben Kochie	5d23ad0ca7	Fix supervisord collector (#978 ) * Replace supervisord xmlrpc library * Use `github.com/mattn/go-xmlrpc` that doesn't leak goroutines. * Fix uptime metric * Use Prometheus best practices for uptime metric. * Use "start time" rather than "uptime". * Don't emit a start time if the process is down. * Add changelog entry. * Add example compatibility rules. Signed-off-by: Ben Kochie <superq@gmail.com>	2018-08-06 16:54:46 +02:00
Julius Volz	2c52b8c761	systemd: Remove unneeded/unhandled error returns (#1035 ) Signed-off-by: Julius Volz <julius.volz@gmail.com>	2018-08-05 16:55:25 +02:00
Christian Hoffmann	6bdc5558ec	build: make staticcheck happy by using real regexp patterns #1025 (#1026 ) Signed-off-by: Christian Hoffmann <mail@hoffmann-christian.info>	2018-07-30 07:57:18 +02:00
Hannes Körber	14a4f0028e	Enable nfs protocol (#998 ) * vendor: Update prometheus/procfs Signed-off-by: Hannes Körber <hannes.koerber@haktec.de> * mountstats: Use new NFS protocol field In https://github.com/prometheus/procfs/pull/100, the NFSTransportStats struct was expanded by a field called protocol that specifies the NFS protocol in use, either "tcp" or "udp". This commit adds the protocol as a label to all NFS metrics exported via the mountstats collector. Signed-off-by: Hannes Körber <hannes.koerber@haktec.de> * Update fixtures for UDP mount Signed-off-by: Hannes Körber <hannes.koerber@haktec.de>	2018-07-24 00:47:12 +02:00
Johannes Wienke	5c780d132c	Exclude only subdirectories of /var/lib/docker (#1003 ) It is quite common to put /var/lib/docker itself on a separate partition and that should be monitored as well. Signed-off-by: Johannes Wienke <languitar@semipol.de>	2018-07-23 15:43:42 +02:00
Ben Kochie	23f95c8e04	Fix ntp collector thread safety (#1014 ) Make the ntp collector thread safe by wrapping a mutex lock around the leapMidnight variable. Signed-off-by: Ben Kochie <superq@gmail.com>	2018-07-22 14:36:33 +02:00
xginn8	140b8b85c3	Filter out uninstalled systemd units when collecting all units (#1011 ) fixes #567 Signed-off-by: Matthew McGinn <mamcgi@gmail.com>	2018-07-22 09:20:03 +02:00
Sven Lange	2ae8c1c7a7	Add systemd uptime metric collection (#952 ) * Add systemd uptime metric collection Signed-off-by: Sven Lange <tdl@hadiko.de>	2018-07-18 16:02:05 +02:00
neiledgar	7e4d9bd150	Update wifi stats to support multiple stations (#977 ) (#980 ) Signed-off-by: neiledgar <neil.edgar@btinternet.com>	2018-07-16 16:02:25 +02:00
xginn8	9b97f44a70	Add a counter for refused socket unit connections, available as of systemd 239 (#995 ) Signed-off-by: xginn8 <mamcgi@gmail.com>	2018-07-16 16:01:42 +02:00
Brandon Gilmore	76bbd8dd18	Use /proc/mounts instead of statfs(2) for ro state (#1002 ) While the statfs(2) approach is reliable for normally mounted filesystems, the flags returned can be inconsistent when filesystem has been remounted read-only after encountering an error. The returned flags do accurately represent the internal state of the filesystem, but they do not reflect whether the VFS layer will accept writes. Instead, it makes sense to parse the current VFS mount state from the options field in /proc/mounts since it takes precedence. Signed-off-by: Brandon Gilmore <bgilmore@valvesoftware.com>	2018-07-16 15:56:27 +02:00
Jan Klat	c4102f1175	Add sys/class/net parsing from procfs and expose its metrics (#851 ) * add sys/class/net parsing from procfs and expose its metrics Signed-off-by: Jan Klat <jenik@klatys.cz> * change code to use int pointers per procfs change, move netclass to separate collector, change metric naming Signed-off-by: Jan Klat <jenik@klatys.cz> * bump year in licence, remove redundant newline, correct fixtures Signed-off-by: Jan Klat <jenik@klatys.cz> * fix style Signed-off-by: Jan Klat <jenik@klatys.cz> * change carrier changes to counter type Signed-off-by: Jan Klat <jenik@klatys.cz> * fix e2e output Signed-off-by: Jan Klat <jenik@klatys.cz> * add fixtures Signed-off-by: Jan Klat <jenik@klatys.cz> * update vendor, use fixtures correctly Signed-off-by: Jan Klat <jenik@klatys.cz> * change fixtures (device in /sys/class/net should be symlinked) Signed-off-by: Jan Klat <jenik@klatys.cz> * correct fixtures for 64k page, updated readme Signed-off-by: Jan Klat <jenik@klatys.cz>	2018-07-16 15:08:18 +02:00
mknapphrt	09b4305090	Changed the way that stuck mounts are handled. If a mount fails to return, it will stop being queried until it returns. (#997 ) Fixed spelling mistakes. Update transport_generic.go Changed to a mutex approach instead of channels and added a timeout before declaring a mount stuck. Removed unnecessary lock channel and clarified some var names. Fixed style nits. Signed-off-by: Mark Knapp <mknapp@hudson-trading.com>	2018-07-14 11:10:28 +02:00
xginn8	ac5a981761	Adding socket stat collection for systemd socket units (#968 ) Signed-off-by: xginn8 <mamcgi@gmail.com>	2018-07-05 16:26:48 +02:00
xginn8	8af84a215d	Add support for NRestarts counter introduced in systemd 235 (#992 ) * Add support for NRestarts counter introduced in systemd 235 `.service` units increment this counter any time the Restart= condition is triggered. Signed-off-by: Matthew McGinn <mamcgi@gmail.com>	2018-07-05 13:31:45 +02:00
Ben Kochie	107e5dfecc	Fix mdadm collector issues (#985 ) * Send "Personality unknown" to debug, not info, remove unnecessary newline. * Add support for "linear" personality. * Always set number of active disks to 0 when a device is inactive. * Add total disks calculation to unknown personalites. Signed-off-by: Ben Kochie <superq@gmail.com>	2018-07-02 12:38:20 +02:00
Derek Marcotte	2678d68dcc	Fix for #945 , cpu temperature is signed. (#965 ) * Fix for #945, cpu temperature is signed. Added a type conversion to cpu temperature sysctl. Will still collect/report -1 when the value is -1, this is because it should be up to interpretation whether this is the correct value for the system or not. Some drivers will report -1 for cpu temperature. Other sensors will report "an input into the fan control algorithm", i.e. not the actual temperature, but how much fan it wants. Some people cool their machines with liquid nitrogen. Signed-off-by: Derek Marcotte <554b8425@razorfever.net>	2018-06-07 15:01:25 +02:00
Brad Beam	e3cf1d5187	Adding support for evaluating octal characters in mountpoint (#954 ) Signed-off-by: Brad Beam <brad.beam@b-rad.info>	2018-06-06 16:49:19 +02:00
Pavlo Kutishchev	456bf5094a	Add processes exporter (#950 ) * Add processes exporter Signed-off-by: Pavel Kutishchev <pavel.kutishchev@olx.com> Signed-off-by: Ben Kochie <superq@gmail.com>	2018-06-05 19:38:32 +02:00
Alexey Kopytov	dd98a09bb2	A couple of ARM64-related fixes (#934 ) * Do not rely on AArch64 CPUs to support 32-bit ARM for cross-testing. Signed-off-by: Alexey Kopytov <akopytov@gmail.com> * aarch64 like ppc64le reports 64k node_sockstat_TCP_mem_bytes due to 64k pages. Signed-off-by: Alexey Kopytov <akopytov@gmail.com>	2018-05-14 15:55:49 +02:00
Steve Kotsopoulos	84dc362b05	Align Darwin disk stat names with Linux (#930 ) Signed-off-by: Steve Kotsopoulos <sk@fywss.com>	2018-05-02 11:32:55 +02:00
Mario Trangoni	24a28fcc9e	Remove unused func, var, and const (#928 ) Signed-off-by: Mario Trangoni <mjtrangoni@gmail.com>	2018-04-29 14:35:43 +02:00
Mario Trangoni	c9f421d0dd	Fix some golint issues (#927 ) * collector/cpu_: rename nodeCpuSecondsDesc to nodeCPUSecondsDesc Signed-off-by: Mario Trangoni <mjtrangoni@gmail.com> collector/qdisc_linux.go: add NewQdiscStatCollector comment Signed-off-by: Mario Trangoni <mjtrangoni@gmail.com> * collector/cpu_linux.go: rename core_map to coreMap Signed-off-by: Mario Trangoni <mjtrangoni@gmail.com>	2018-04-29 14:34:47 +02:00
Ben Kochie	361b5bf85d	Merge pull request #852 from prometheus/remove-gmond Remove gmond collector	2018-04-27 10:02:16 +02:00
Ben Kochie	b10ca77680	Fix /proc/net/dev/ interface name handling * Allow any character (UTF-8) for Linux interface names. Signed-off-by: Ben Kochie <superq@gmail.com>	2018-04-18 12:53:59 +02:00
Ben Kochie	1ab4a460c7	Update ppc64le end-to-end fixture. Signed-off-by: Ben Kochie <superq@gmail.com>	2018-04-18 09:12:21 +02:00
Johannes 'fish' Ziemke	fd66a86a30	Remove gmond collector Signed-off-by: Johannes 'fish' Ziemke <github@freigeist.org>	2018-04-17 20:20:24 +02:00
Ben Kochie	0f5be132ac	Merge pull request #904 from prometheus/superq/if_alias Fix parsing of interface aliases in netdev linux	2018-04-17 13:37:21 +02:00
Ben Kochie	a528966dcd	Fix parsing of interface aliases in netdev linux Very old kernels expose interface aliases as `foo0:0`, adjust the line parsing to handle these names. Signed-off-by: Ben Kochie <superq@gmail.com>	2018-04-17 13:15:02 +02:00
Ben Kochie	f6008b242b	Merge pull request #901 from mischief/bsd_boottime collector: implement node_boot_time_seconds for OpenBSD/NetBSD/Darwin	2018-04-17 07:48:39 +02:00
Jürgen Hötzel	de0632c2e9	Fix memory corruption when number of filesystems > 16 (#900 ) Signed-off-by: Juergen Hoetzel <juergen@archlinux.org>	2018-04-16 12:39:15 +02:00
mischief	26a385d7ab	collector: implement node_boot_time_seconds for OpenBSD/NetBSD/Darwin Signed-off-by: mischief <mischief@offblast.org>	2018-04-15 08:26:46 +00:00
Ben Kochie	015b86670a	Update ppc64le e2e output. Signed-off-by: Ben Kochie <superq@gmail.com>	2018-04-14 15:28:06 +02:00
Ben Kochie	0507b0c9a2	Fix formatting. Signed-off-by: Ben Kochie <superq@gmail.com>	2018-04-14 15:02:20 +02:00
Dmitriy Lukyanchikov	eddd1b9357	Fix netdev collector for linux (#890 ) fix variable name, fix transmitHeader extracting modify fixtures to run tests with updated netdev_linux collector Signed-off-by: dmitriy-lukyanchikov <d.lukyanchikov@anchorfree.com>	2018-04-14 13:58:56 +02:00
Derek Marcotte	fe86e908da	Update ppc64 fixtures to unbreak end-to-end. `efc1fdb` added new labels. Signed-off-by: Derek Marcotte <554b8425@razorfever.net>	2018-04-13 06:33:38 -04:00
Karsten Weiss	7e392e6634	Fix spelling mistakes found by codespell Signed-off-by: Karsten Weiss <knweiss@gmail.com>	2018-04-09 18:27:17 +02:00
Karsten Weiss	efc1fdb6d0	cpu: Add a 2nd label 'package' to metric node_cpu_core_throttles_total (#871 ) * cpu: Add a 2nd label 'package' to metric node_cpu_core_throttles_total This commit fixes the node_cpu_core_throttles_total metrics on multi-socket systems as the core_ids are the same for each package. I.e. we need to count them seperately. Rename the node_package_throttles_total metric label `node` to `package`. Reorganize the sys.ttar archive and use the same symlinks as the Linux kernel. Also, the new fixtures now use a dual-socket dual-core cpu w/o HT/SMT (node0: cpu0+1, node1: cpu2+3) as well as processor-less (memory-only) NUMA node 'node2' (this is a very rare case). Signed-off-by: Karsten Weiss <knweiss@gmail.com> * cpu: Use the direct /sys path to the cpu files. Use the direct path /sys/devices/system/cpu/cpu[0-9]* (without symlinks) instead of /sys/bus/cpu/devices/cpu[0-9]. The latter path also does not exist e.g. on RHEL 6.9's kernel. Signed-off-by: Karsten Weiss <knweiss@gmail.com> cpu: Reverse core+package throttle processing order Signed-off-by: Karsten Weiss <knweiss@gmail.com> * cpu: Add documentation URLs Signed-off-by: Karsten Weiss <knweiss@gmail.com>	2018-04-09 18:01:52 +02:00
Brian Brazil	31ce32f1fe	Greatly trim what netstat collector exposes by default (#876 ) Netstat is 40% of the metrics on my laptop, many of which are highly detailed information about IP internals in the kernel. ~300 such metrics on every machine in your fleet is excessive, so focus on key metrics by default, overridable by the user. Fixes #515 Signed-off-by: Brian Brazil <brian.brazil@robustperception.io>	2018-03-30 19:28:08 +01:00
Ben Kochie	cf3edadcbb	Update fixtures * Add oom_kill to fixture. * Update e2e outputs. * Put regexp in order. Signed-off-by: Ben Kochie <superq@gmail.com>	2018-03-29 22:00:02 +01:00
Brian Brazil	499c342fed	Greatly reduce the metrics vmstat returns by default. Vmstat has over 100 fields, most of which are highly detailed debug information. Trim this down to only essential fields by default, configurable by flag. Signed-off-by: Brian Brazil <brian.brazil@robustperception.io>	2018-03-29 22:00:02 +01:00
Brian Brazil	c8c144587e	Enable bonding collector by default. (#872 ) Signed-off-by: Brian Brazil <brian.brazil@robustperception.io>	2018-03-29 15:18:12 +01:00
Ben Kochie	779090db7e	Update ppc64le fixture (#867 ) Update to match standard e2e output. Signed-off-by: Ben Kochie <superq@gmail.com>	2018-03-27 17:05:20 +02:00
Mario Trangoni	1f11a86d59	Fix nfs golint issues (#863 ) * procfs: update vendoring Signed-off-by: Mario Trangoni <mjtrangoni@gmail.com> * procfs: fix e2e tests after nfs changes Signed-off-by: Mario Trangoni <mjtrangoni@gmail.com>	2018-03-22 22:25:37 +01:00
Ben Kochie	7b720df1c5	Use lowercase cpu label name in interrupts (#849 ) To match other CPU related metric labels, use a lowercase named label.	2018-03-08 15:04:49 +01:00
Johannes 'fish' Ziemke	424ca8e322	Drop exec_ in boot_timestamp_seconds on *bsd (#839 ) This closes #827.	2018-03-08 12:59:48 +01:00
colmbuckley	098f975b48	Correct the ClocksPerSec scaling factor on Darwin (#846 ) * Update cpu_darwin.go Change the definition of ClocksPerSec to read from limits.h * Update cpu_darwin.go	2018-03-07 11:56:57 +01:00
Julius Volz	864a6ee935	Treat custom textfile metric timestamps as errors (#769 ) This is clearer behavior and users will notice and fix their textfiles faster than if we just output a warning.	2018-02-27 19:43:38 +01:00
Rene Treffer	c504c7e264	Only report core throttles per core, not per cpu (#836 ) * Only report core throttles per core, not per cpu * Add topology/core_id to the cpu sysfs fixtures * Add new cpu fixtures to ttar file * Merge core_id reading and thermal throttle accounting * Declare core_id	2018-02-27 19:43:15 +01:00
Ben Kochie	e0d54a509c	Cleanup NFS metrics (#834 ) * Cleanup NFS metrics * Update `nfs` metric names to match `nfsd`. * Remove uneeded `tcp` label from TCP connections metric. * Remove uneeded `v` on `nfsd` metrics. * Enable all `nfs` v4 client metrics. * Remove `nfs` metric name overrides. * Add ppc64le fixture. * Fix typo.	2018-02-21 07:25:41 +01:00
Ben Kochie	3f41a2fecb	Update ppc64le fixture (#832 ) Updates fixture for ppc64le arch to latest output.	2018-02-19 20:43:33 +01:00
Ben Kochie	d33a447047	Remove deprecated prometheus.InstrumentHandlerFunc (#831 ) Update Prometheus client golang use to use `promhttp.Handler()` instead of `prometheus.InstrumentHandlerFunc()`.	2018-02-19 15:44:59 +01:00
Richard Elling	d7348a5c78	updates for zfsonlinux 0.7.5 (#779 ) * updates for zfsonlinux 0.7.5 * add constants for KSTAT_DATA_* types * added e2e test for negative values represented by uint64 that can result from ZFS bugs	2018-02-16 15:46:31 +01:00
Ben Kochie	6468e7c80b	Enable NFS client metrics by default. (#828 ) Enable NFS client metrics by default now that it nolonger prints errors on scrape if there are no metrics to display. Also fixup the nfsd README to match the nfs entry.	2018-02-16 15:42:47 +01:00
Ralf Horstmann	8d9c7ca659	Use swpginuse instead of swpgonly in meminfo_openbsd (#813 ) All tools in OpenBSD base system use swpginuse instead of swpgonly for reporting swap usage (snmpd, swapctl, top, vmstat), so let memory collector use that as well for consistency.	2018-02-16 11:34:41 +01:00
Kasinath Kottukkal	f6965e1812	Add overlay to defIgnoredFSTypes (#824 ) * Add overlay to defIgnoredFSTypes To avoid statfs() errors if node_exporter is running as non privileged user. * Updated defIngoredFSTypes values in sorted order	2018-02-16 09:47:50 +01:00
Ben Kochie	01bd99fb1a	Refactor NFS client collector (#816 ) * Update vendor github.com/prometheus/procfs/... * Refactor NFS collector Use new procfs library to parse NFS client stats. * Ignore nfs proc file not existing. * Refactor with reflection to walk the structs.	2018-02-15 13:40:38 +01:00
Brian Brazil	52c031890e	Add _seconds suffix to node_time. (#823 )	2018-02-14 16:59:08 +00:00
Ben Kochie	05eabe60fb	Fix error output in nfsd collector. (#821 )	2018-02-14 13:57:35 +01:00
Ben Kochie	3de2542d21	Fix NFSd metric type (#819 ) RPC Count should be a counter, not a gauge.	2018-02-13 17:03:22 +01:00
Matt Layher	544488ddd6	Fix remaining metric naming issues (#799 )	2018-02-12 18:53:31 +01:00
Ben Kochie	6a041692ed	Add NFS Server metrics collector. (#803 ) * Add NFS Server metrics collector. * Add File Handles metrics. * Add nfsd IO stats. * Add metrics for NFSd threads. * Add metrics for NFSd read ahead cache. * Add NFSd network traffic counters. * Add RPC metrics. * Add V2 requests metrics. * Add NFSv3 metrics. * Add NFSv4 metrics. * Update reply cache comment. * Update help text.	2018-02-12 17:56:05 +01:00
Brian Brazil	1072f2868d	Fix log level regression in #533	2018-02-07 15:16:20 +00:00
Brian Brazil	7e41a2b279	Ignore /var/lib/docker by default. (#814 ) The node exporter runs unprivileged, so it cannot statfs any filesystems under this directory causing log spam. In addition there tends to be high churn in the filesystems here (as it's basically application monitoring) which can cause high cardinaltiy and in one case caused Prometheus's index symbol table to get very large. Accordingly this should be ignored to reduce log spam and avoid performance issues. The filesystems themselves can in principle be monitored via container oriented exporters, and the underlying filesystems will still be monitored.	2018-02-06 17:10:59 +01:00
Ralf Horstmann	29ac809e48	Use unified CPU metric description on OpenBSD (#810 )	2018-02-01 23:59:19 +01:00
Derek Marcotte	fde5d2c6c9	Remove unsafe typecasts from sysctl_bsd getStructTimeval. (#741 ) There is a simpler way.	2018-02-01 18:43:40 +01:00
Ben Kochie	14d60958d6	Unify CPU collector conventions (#806 ) * Unify CPU collector conventions Add a common CPU metric description. * All collectors use the same `nodeCpuSecondsDesc`. * All collectors drop the `cpu` prefix for `cpu` label values. * Fix subsystem string in cpu_freebsd. * Fix Linux CPU freq label names.	2018-02-01 18:42:20 +01:00
Ralf Horstmann	e3c76b1f0c	Add OpenBSD CPU collector (#805 )	2018-02-01 18:33:49 +01:00
Tom Wilkie	6833eec187	Fix tests.	2018-01-31 15:22:17 +00:00
Tom Wilkie	0316bacceb	Only use one dbus connection, required some refactoring.	2018-01-31 15:19:18 +00:00
Tom Wilkie	a7fd6b8743	Export systemd timer last trigger sec.	2018-01-31 15:07:04 +00:00
Ben Kochie	111e3af437	Remove obsolete megacli collector. (#798 ) This collector has been replaced by the textfile collector tool `storcli.py`.	2018-01-23 11:25:42 +01:00
Julius Volz	6cac74f0e0	Add unit suffix to textfile collector mtime metric (#796 )	2018-01-22 14:02:19 +01:00
Brian Brazil	a98067a294	Make metrics better follow guidelines (#787 ) * Improve stat linux metric names. cpu is no longer used. * node_cpu -> node_cpu_seconds_total for Linux * Improve filesystem metric names with units * Improve units and names of linux disk stats Remove sector metrics, the bytes metrics cover those already. * Infiniband counters should end in _total * Improve timex metric names, convert to more normal units. See `3c073991eb/kernel/time/ntp.c (L909)` for what stabil means, looks like a moving average of some form. * Update test fixture * For meminfo metrics that had "kB" units, add _bytes * Interrupts counter should have _total	2018-01-17 17:55:55 +01:00
Ben Kochie	b4d7ba119a	Add fixture for ppc64le (#785 ) * Add support for per-architecture fixtures. * Add output for ppc64le.	2018-01-11 13:56:19 +01:00
Nick Owens	0629a081db	multiply page size after float64 coercion to avoid signed integer overflow (#780 )	2018-01-08 15:36:49 +01:00
Franz Pletz	d432f9857e	Use uint64 in the ZFS collector (#714 ) ZFS metrics can also be unsigned 64-bit integers that won't fit in int64 and causes the whole collector to fail.	2018-01-06 12:36:55 +01:00
Derek Marcotte	477fe4665a	Move FreeBSD/DragonflyBSD out of meminfo add kvm. (#547 ) * Move FreeBSD/DragonflyBSD out of meminfo add kvm. This gives us SwapUsed, and everything under one roof. * Fix typos per review. * Update to use newer API. * Remove premature optimization per PR feedback.	2018-01-04 12:23:26 +01:00
Sevag Hanssian	4329b0a86b	Add summary metrics for systemd exporter (#765 )	2018-01-04 11:49:36 +01:00
Matthieu Guegan	d6ef10bb56	Add openbsd meminfo (#724 ) * Implements meminfo collector for OpenBSD This is a rework of #151. * Fix CGO import * Add some useful metrics * Rename total -> size for normalization	2018-01-04 10:32:08 +01:00
Ben Kochie	7f6c59e198	Ignore more virtual filesystems (#775 ) Add additional Linux virtual filesystem types to the default list.	2018-01-03 17:22:02 +01:00
Netmonk	2aa8d0eb0c	[FIX] Exclude Linux proc from filesystem type regexp (#774 ) * [FIX] Issue 63, error on excluding proc filesystem on linux, improving regexp * [FIX] Reordering filter order	2018-01-03 11:40:32 +01:00
Julius Volz	f536857ac6	Fix e2e tests after textfile custom timestamp removal (#768 )	2017-12-24 11:54:33 +01:00
Shubheksha Jalan	1f2458f42c	Filter out testfile metrics correctly when using `collect[]` filters (#763 ) * remove injection hook for textfile metrics, convert them to prometheus format * add support for summaries * add support for histograms * add logic for handling inconsistent labels within a metric family for counter, gauge, untyped * change logic for parsing the metrics textfile * fix logic to adding missing labels * Export time and error metrics for textfiles * Add tests for new textfile collector, fix found bugs * refactor Update() to split into smaller functions * remove parseTextFiles(), fix import issue * add mtime metric directly to channel, fix handling of mtime during testing * rename variables related to labels * refactor: add default case, remove if guard for metrics, remove extra loop and slice * refactor: remove extra loop iterating over metric families * test: add test case for different metric type, fix found bug * test: add test for metrics with inconsistent labels * test: add test for histogram * test: add test for histogram with extra dimension * test: add test for summary * test: add test for summary with extra dimension * remove unnecessary creation of protobuf * nit: remove extra blank line	2017-12-23 20:21:58 +01:00
Ben Kochie	cd2a17176a	Add full make to CircleCI (#761 ) * Add full make to CircleCI Ensure end-to-end test is run. * Fix go fmt error. * Fix end-to-end output.	2017-12-21 16:24:23 +01:00
Wei Li	1e9bb4ec3a	textfile: fix duplicate metrics error (#738 ) The textfile gatherer should only be added to gatherer list once. Signed-off-by: Li Wei <liwei@anbutu.com>	2017-12-06 17:05:40 +01:00
Kristian Klausen	a96f1738b3	netdev: Change valueType to CounterValue (#749 ) All the metric only goes up, so the type should be counter. This also add _total to all the metric name. Fix: #747	2017-12-06 13:58:35 +01:00
Ben Kochie	2a80537547	Split out guest cpu metrics on Linux. (#744 ) Linux "guest" metrics for VMs are already accounted for in node_cpu `user` and `nice` metrics. Separate these into their own metric to avoid duplication of data.	2017-11-23 15:04:47 +01:00
Karsten Weiss	a8d7d1101a	cpu: Support processor-less (memory-only) NUMA nodes (#734 ) * cpu: Support processor-less (memory-only) NUMA nodes Processor-less (memory-only) NUMA nodes exist e.g. in systems that use Intel Optane drives for RAM expansion using Intel Memory Drive Technology (IMDT). IMDT RAM expansion supports two modes: * "Unify Remote Memory domains": present a processor-less (memory-only) NUMA domain, which is the default * "Expand local memory domains": to expand each processor’s memory domain with a portion of the memory made available by Optane and IMDT This commit fixes a crash in the first case (when "cpulist" is empty). Here's an example of such a system: $ numastat -m\|head -n5 Per-node system memory usage (in MBs): Node 0 Node 1 Node 2 Total --------------- --------------- --------------- --------------- MemTotal 118239.56 130816.00 464384.00 713439.56 $ for i in {0..2}; do echo -n "$i: " ; cat /sys/bus/node/devices/node$i/cpulist ; done 0: 0-7,16-23 1: 8-15,24-31 2: $ /opt/vsmp/bin/vsmpversion -vvv Memory Drive Technology: 8.2.1455.74 (Sep 28 2017 13:09:59) System configuration: Boards: 3 1 x Proc. + I/O + Memory 2 x NVM devices (Intel SSDPED1K375GAQ) Processors: 2, Cores: 16, Threads: 32 Intel(R) Xeon(R) CPU E5-2667 v4 @ 3.20GHz Stepping 01 Memory (MB): 713472 (of 977450), Cache: 251416, Private: 12562 1 x 249088MB [262036/ 678/12270] 1 x 232192MB [357707/125369/ 146] 82:00.0#1 1 x 232192MB [357707/125369/ 146] 83:00.0#1 * cpu: rename some variables (pkg => node) * cpu: Use %v not %q in log.Debugf() format strings	2017-11-10 15:31:26 +01:00
Matt Layher	f6f9c8d6cc	Add and use sysReadFile in hwmon collector (#728 )	2017-11-07 07:49:37 +01:00
Tobias Klauser	d73f1e60c4	Simplify Utsname string conversion (#716 ) * Update golang.org/x/sys/unix This allows to use simplified string conversion of Utsname members. * Simplify Utsname string conversion Use Utsname from golang.org/x/sys/unix which contains byte array instead of int8/uint8 array members. This allows to simplify the string conversions of these members.	2017-11-02 11:57:14 +01:00

1 2 3 4 5 ...

584 Commits