This project looks for ways to capture hardware-specific data and use it later to predict unexpected behavior. Candidate tools include collectd, the Prometheus node_exporter, ipmi_exporter, grafana-agent, and others.
Goal for this Hackweek
Extend the solution we already have so that it collects specific monitoring information depending on the workload, and use that data to predict possible events and reduce business downtime.
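As a rough illustration of what "predicting an event" could mean here, the sketch below fits a straight line to recent samples of one hardware metric and estimates how long until it crosses a limit. This is only one possible approach and not the project's chosen method. The function names, the sample values, and the 80-degree threshold are all hypothetical, and the samples are assumed to have already been pulled from whichever collector is in use as (timestamp in seconds, value) pairs.

```python
from typing import Optional, Sequence, Tuple

Sample = Tuple[float, float]  # (timestamp in seconds, metric value)


def fit_line(samples: Sequence[Sample]) -> Tuple[float, float]:
    """Ordinary least-squares fit; returns (slope, intercept)."""
    n = len(samples)
    if n < 2:
        raise ValueError("need at least two samples")
    mean_t = sum(t for t, _ in samples) / n
    mean_v = sum(v for _, v in samples) / n
    var_t = sum((t - mean_t) ** 2 for t, _ in samples)
    if var_t == 0:
        raise ValueError("timestamps must not all be identical")
    cov = sum((t - mean_t) * (v - mean_v) for t, v in samples)
    slope = cov / var_t
    return slope, mean_v - slope * mean_t


def seconds_until_threshold(samples: Sequence[Sample],
                            threshold: float) -> Optional[float]:
    """Estimate seconds from the last sample until the fitted trend
    reaches `threshold`. Returns 0.0 if the trend is already at or above
    it, and None if the trend is flat or falling."""
    slope, intercept = fit_line(samples)
    last_t = samples[-1][0]
    current = slope * last_t + intercept
    if current >= threshold:
        return 0.0
    if slope <= 0:
        return None
    return (threshold - intercept) / slope - last_t


# Hypothetical temperature readings taken one minute apart.
readings = [(0, 50.0), (60, 52.0), (120, 54.0), (180, 56.0)]
eta = seconds_until_threshold(readings, threshold=80.0)
print("estimated seconds until threshold:", eta)
```

A linear trend is deliberately simple and will miss sudden failures. It is meant only as a starting point that could be swapped for a better model once real workload data is available.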
This project is part of:
Hack Week 21