an invention by cbosdonnat
Project Description
A supportconfig
provides a lot of files and data from the system, but it is often hard to spot the real issue in it. The idea of this project is to get machine-readable output for the supportconfig data and analyze them.
Then we would try to provide hints using the tool about what is wrong.
The name of this tool is: uyuni-health-check
.
GitHub repository: https://github.com/uyuni-project/poc-uyuni-health-check
Summary:
- Research about machine learning log anomaly detectors: few alternatives out there.
- Getting custom metrics for Salt and Uyuni via prometheus exporter from live server.
- Setting up Loki to process relevant Uyuni logs from live server.
- Allow data visualization with Grafana.
- Really easy-to-use CLI tool to run "health checks" and get feedback.
Details:
- Grafana, Loki, Uyuni prometheus exporter and all other components run on "containers"
- The containers run on the Uyuni server. "podman" is required on the server.
- CLI tool takes care of building and deploying the "container" image to the server, collect the metrics and provide output on the command line.
- Prometheus / Grafana expose containers metrics.
Goals for Hackweek #23
- Enhance and collect more Uyuni / Salt metrics.
- Use "supportconfig" as source for logs/metrics instead of live server.
Achievements during HW #23
- ...
Goals for Hackweek #22
- Improve CLI and performance.
- Fix memory leak on "uyuni-health-exporter".
- Complete automated deployment of Loki and other containers.
Achievements during HW #22:
- Fix memory leak on uyuni-health-exporter.
- Fix python packaging and installation.
- Deploy grafana and prometheus dashboard.
- Fix loki and promtail deployments.
- Run all containers in the same POD.
- Unify console logging across deployment functions.
- More friendly CLI with new functions.
- Containers are not wiped by default after executions.
- Minor and cosmetic changes.
- Update README.md to reflect latest changes
Goals for this Hackweek #21
- Getting a machine readable version of supportconfig
- First analysis and tweaking
Looking for hackers with the skills:
supportconfig analysis tool dashboard monitoring grafana loki prometheus python3 uyuni susemanager
This project is part of:
Hack Week 21 Hack Week 22 Hack Week 23
Activity
Comments
-
over 1 year ago by PSuarezHernandez | Reply
I've updated project description to reflect latest changes after Hackweek 22!
-
12 months ago by PSuarezHernandez | Reply
Let's keep hacking on this project during upcoming Hackweek 23!
Similar Projects
Update my own python audio and video time-lapse and motion capture apps and publish by dmair
Project Description
Many years ago, in my...