Project Description

A supportconfig provides a lot of files and data from the system, but it is often hard to spot the real issue in it. The idea of this project is to get machine-readable output for the supportconfig data and analyze them.

Then we would try to provide hints using the tool about what is wrong.

The name of this tool is: uyuni-health-check.

GitHub repository: https://github.com/uyuni-project/poc-uyuni-health-check

Summary:

  • Research about machine learning log anomaly detectors: few alternatives out there.
  • Getting custom metrics for Salt and Uyuni via prometheus exporter from live server.
  • Setting up Loki to process relevant Uyuni logs from live server.
  • Allow data visualization with Grafana.
  • Really easy-to-use CLI tool to run "health checks" and get feedback.

Details:

  • Grafana, Loki, Uyuni prometheus exporter and all other components run on "containers"
  • The containers run on the Uyuni server. "podman" is required on the server.
  • CLI tool takes care of building and deploying the "container" image to the server, collect the metrics and provide output on the command line.
  • Prometheus / Grafana expose containers metrics.

Goals for Hackweek #23

  • Enhance and collect more Uyuni / Salt metrics.
  • Use "supportconfig" as source for logs/metrics instead of live server.

Achievements during HW #23

  • ...

Goals for Hackweek #22

  • Improve CLI and performance.
  • Fix memory leak on "uyuni-health-exporter".
  • Complete automated deployment of Loki and other containers.

Achievements during HW #22:

  • Fix memory leak on uyuni-health-exporter.
  • Fix python packaging and installation.
  • Deploy grafana and prometheus dashboard.
  • Fix loki and promtail deployments.
  • Run all containers in the same POD.
  • Unify console logging across deployment functions.
  • More friendly CLI with new functions.
  • Containers are not wiped by default after executions.
  • Minor and cosmetic changes.
  • Update README.md to reflect latest changes

Goals for this Hackweek #21

  • Getting a machine readable version of supportconfig
  • First analysis and tweaking

This project is part of:

Hack Week 21 Hack Week 22 Hack Week 23

Activity

  • 28 days ago: pinvernizzi liked this project.
  • 28 days ago: oscar-barrios liked this project.
  • about 1 month ago: juliogonzalezgil liked this project.
  • about 2 months ago: emendonca liked this project.
  • over 1 year ago: cbosdonnat added keyword "uyuni" to this project.
  • over 1 year ago: cbosdonnat added keyword "susemanager" to this project.
  • over 1 year ago: cbosdonnat added keyword "monitoring" to this project.
  • over 1 year ago: cbosdonnat added keyword "grafana" to this project.
  • over 1 year ago: cbosdonnat added keyword "loki" to this project.
  • over 1 year ago: cbosdonnat added keyword "prometheus" to this project.
  • over 1 year ago: cbosdonnat added keyword "python3" to this project.
  • over 1 year ago: rangelino liked this project.
  • over 1 year ago: ygutierrez liked this project.
  • over 1 year ago: cbbayburt liked this project.
  • over 1 year ago: j_renner liked this project.
  • over 1 year ago: mbussolotto liked this project.
  • over 1 year ago: firoyang liked this project.
  • over 1 year ago: PSuarezHernandez joined this project.
  • over 1 year ago: PSuarezHernandez liked this project.
  • over 1 year ago: cbosdonnat started this project.
  • over 1 year ago: cbosdonnat added keyword "supportconfig" to this project.
  • over 1 year ago: cbosdonnat added keyword "analysis" to this project.
  • over 1 year ago: cbosdonnat added keyword "tool" to this project.
  • over 1 year ago: cbosdonnat added keyword "dashboard" to this project.
  • over 1 year ago: cbosdonnat originated this project.

  • Comments

    • PSuarezHernandez
      10 months ago by PSuarezHernandez | Reply

      I've updated project description to reflect latest changes after Hackweek 22!

    • PSuarezHernandez
      2 months ago by PSuarezHernandez | Reply

      Let's keep hacking on this project during upcoming Hackweek 23!

    Similar Projects

    Saline (state deployment control and monitoring tool for SUSE Manager/Uyuni) by vizhestkov

    [comment]: # (Please use the project descriptio...


    Nodes Overview by lrangasamy

    [comment]: # (Please use the project descriptio...


    Visualization of historical sar(1) archives by ggherdovich

    Project Description

    The sar(1) tool, fr...


    Visualization of historical sar(1) archives by ggherdovich

    Project Description

    The sar(1) tool, fr...


    TinyTutor - an AI chat-bot powered children's tutor by dmulder

    [comment]: # (Please use the project descriptio...


    Cluster-Tester for SAP HANA System Replication Cluster by fmherschel

    [comment]: # (Please use the project descriptio...


    Publish two personally productive projects (preferably on github) by dmair

    Since this will be my first real "from the star...


    A quantum physics experiment puzzle (designed with Google's CP-SAT solver) by moio

    [![link to video player demoing the result](htt...