System monitoring

Yak shaving my router crash

February 18, 2020 — March 5, 2020

computers are awful
data sets
Figure 1

Ok, my Raspberry pi VPN router kept crashing (spoiler: it was both overheating and failing to load iptables rules) so I needed to try to monitor it to diagnose it. Then I went into a system monitoring rabbit hole.

Time series databases such as RRD are probably handy. RRD looks simple enough. You just run a job that measures something at intervals and it records it for you. The tides of hype have surfaces some more fancy and/or turnkey solutions.

netdata has been recommended to me as something with native raspberry pi support (i.e. it looks for overheating). Open source.

There are more elaborate solutions targeting big infrastructure and networks in particular such as icinga and opennms, but I have not used these as RRD ended up being sufficient.

If I wanted to monitor only server traffic logs, I could use goaccess.