Anyone have any good server health monitoring tools (RAM usage, CPU usage, disk usage, etc.)? I also want to be able to log these things for historical data and trend mapping.
@screem@yarn.yarnpods.com Grafana, I think, but I don’t know much about those kinds of things myself
@screem@yarn.yarnpods.com Prometheus. I’ll give you more details when I get back home (out at the moment)
At a high level:
- Deploy Prometheus as your metrics storage and query engine (it also does the scraping)
- Deploy node_exporter on each server for collecting and exposing CPU, memory, disk I/O, network and much more
- Deploy Grafana for querying and dashboards
- Deploy Alertmanager for alerting and notifications
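
To give a feel for what node_exporter exposes, here is a rough Python sketch that reads its plain-text metrics and works out memory usage. It assumes a Linux host with node_exporter on its default port 9100 and no timestamps on the metric lines; the function names and the sample values are made up for illustration. In a real setup Prometheus does the scraping and storage for you, so this is only for poking around.

```python
from urllib.request import urlopen

# Assumed endpoint: node_exporter's default port on the local machine.
NODE_EXPORTER_URL = "http://localhost:9100/metrics"


def fetch_metrics_text(url=NODE_EXPORTER_URL):
    """Download the raw text that node_exporter serves."""
    with urlopen(url, timeout=5) as response:
        return response.read().decode("utf-8")


def parse_metrics(text):
    """Parse 'name{labels} value' lines into a dict of floats.

    Comment lines (# HELP / # TYPE) are skipped. Assumes no trailing
    timestamp, so the value is whatever follows the last space.
    """
    metrics = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        name, _, value = line.rpartition(" ")
        metrics[name] = float(value)
    return metrics


def memory_used_percent(metrics):
    """Percent of RAM in use, from the Linux memory metrics."""
    total = metrics["node_memory_MemTotal_bytes"]
    available = metrics["node_memory_MemAvailable_bytes"]
    return 100.0 * (1.0 - available / total)


# Made-up sample in the same format node_exporter serves.
SAMPLE = """\
# HELP node_memory_MemTotal_bytes Memory information field MemTotal_bytes.
# TYPE node_memory_MemTotal_bytes gauge
node_memory_MemTotal_bytes 8e+09
node_memory_MemAvailable_bytes 2e+09
node_cpu_seconds_total{cpu="0",mode="idle"} 12345.67
"""

if __name__ == "__main__":
    print(f"RAM used: {memory_used_percent(parse_metrics(SAMPLE)):.1f}%")
```

To try it against a live server, swap `SAMPLE` for `fetch_metrics_text()`. For the historical data and trends you asked about, you would let Prometheus scrape that same endpoint on a schedule and graph it in Grafana rather than polling by hand.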