Project Description

A supportconfig provides a lot of files and data from the system, but it is often hard to spot the real issue in it. The idea of this project is to get machine-readable output from the supportconfig data and analyze it.

The tool would then try to provide hints about what is wrong.
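As an illustration of what such hints could look like, here is a minimal Python sketch of rule-based checks over data that has already been parsed. The field names (`disk_usage_percent`, `salt_failed_jobs`), the thresholds and the hint texts are hypothetical and are not taken from the tool.

```python
from typing import Callable, Dict, List, Tuple

# Each rule pairs a predicate over the parsed data with a human-readable hint.
# Field names and thresholds are hypothetical, chosen only for illustration.
Rule = Tuple[Callable[[Dict[str, float]], bool], str]

RULES: List[Rule] = [
    (lambda d: d.get("disk_usage_percent", 0) > 90,
     "Disk usage is above 90%: check free space on the server."),
    (lambda d: d.get("salt_failed_jobs", 0) > 0,
     "Some Salt jobs failed: inspect the Salt logs."),
]


def collect_hints(data: Dict[str, float]) -> List[str]:
    """Return the hint of every rule whose predicate matches the data."""
    return [hint for predicate, hint in RULES if predicate(data)]


if __name__ == "__main__":
    for hint in collect_hints({"disk_usage_percent": 95, "salt_failed_jobs": 0}):
        print(hint)
```

Keeping the rules in a plain list makes it cheap to add a new check without touching the evaluation logic.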

The name of this tool is: uyuni-health-check.

GitHub repository: https://github.com/uyuni-project/poc-uyuni-health-check

Summary:

  • Researched machine-learning log anomaly detectors: few alternatives are available.
  • Collect custom metrics for Salt and Uyuni from a live server via a Prometheus exporter.
  • Set up Loki to process the relevant Uyuni logs from a live server.
  • Visualize the data with Grafana.
  • Provide an easy-to-use CLI tool to run "health checks" and get feedback.
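To make the exporter idea more concrete, the following Python sketch renders a few gauge samples in the Prometheus text exposition format, which is what an exporter serves on its metrics endpoint. The metric names and values are hypothetical, and the real exporter may well rely on a client library instead of formatting the text by hand.

```python
from typing import Dict, List, Tuple

# (metric name, help text, labels, value); all names used here are hypothetical.
Sample = Tuple[str, str, Dict[str, str], float]


def render_metrics(samples: List[Sample]) -> str:
    """Render gauge samples in the Prometheus text exposition format.

    Assumes samples of the same metric are adjacent in the input and that
    label values contain no quotes, backslashes or newlines (no escaping).
    """
    lines: List[str] = []
    seen = set()
    for name, help_text, labels, value in samples:
        if name not in seen:
            # HELP and TYPE are emitted once per metric name.
            lines.append(f"# HELP {name} {help_text}")
            lines.append(f"# TYPE {name} gauge")
            seen.add(name)
        label_str = ",".join(f'{k}="{v}"' for k, v in sorted(labels.items()))
        suffix = f"{{{label_str}}}" if label_str else ""
        lines.append(f"{name}{suffix} {value}")
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(render_metrics([
        ("salt_jobs_active", "Number of active Salt jobs", {}, 3),
        ("uyuni_minions", "Minions by key state", {"state": "accepted"}, 12),
    ]), end="")
```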

Details:

  • Grafana, Loki, the Uyuni Prometheus exporter and all other components run in containers.
  • The containers run on the Uyuni server, so "podman" is required on the server.
  • The CLI tool takes care of building and deploying the container image to the server, collecting the metrics and printing the output on the command line.
  • Prometheus / Grafana expose container metrics.
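The deployment flow described above can be sketched as a handful of podman invocations. The Python snippet below only builds and prints the command lines rather than running them; the pod name, the exporter image tag, the build context and the published port are placeholders, not the values the tool actually uses.

```python
import shlex
from typing import List

# Pod name, image tags and ports below are hypothetical placeholders.
POD_NAME = "health-check-pod"


def build_image_cmd(tag: str, context_dir: str) -> List[str]:
    """Command that builds a container image from a build context."""
    return ["podman", "build", "-t", tag, context_dir]


def create_pod_cmd(pod: str, ports: List[str]) -> List[str]:
    """Command that creates a pod publishing the given host:container ports."""
    cmd = ["podman", "pod", "create", "--name", pod]
    for port in ports:
        cmd += ["--publish", port]
    return cmd


def run_in_pod_cmd(pod: str, name: str, image: str) -> List[str]:
    """Command that starts a detached container inside an existing pod."""
    return ["podman", "run", "--detach", "--pod", pod, "--name", name, image]


if __name__ == "__main__":
    plan = [
        build_image_cmd("uyuni-health-exporter", "./exporter"),
        create_pod_cmd(POD_NAME, ["3000:3000"]),
        run_in_pod_cmd(POD_NAME, "exporter", "uyuni-health-exporter"),
        run_in_pod_cmd(POD_NAME, "grafana", "docker.io/grafana/grafana"),
    ]
    for cmd in plan:
        # Print only; a real tool would hand each list to subprocess.run().
        print(shlex.join(cmd))
```

Building the commands as argument lists keeps them safe to pass to `subprocess.run` without shell quoting issues.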

Goals for Hackweek #23

  • Enhance and collect more Uyuni / Salt metrics.
  • Use "supportconfig" as source for logs/metrics instead of live server.
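One possible way to feed logs from an unpacked supportconfig into Loki, instead of tailing files on a live server, is to build push requests directly. The Python sketch below only assembles the JSON body for Loki's push endpoint (`/loki/api/v1/push`); the label names and the sample log line are hypothetical, and the project itself may take a different route, for example through promtail.

```python
import json
from typing import Dict, Iterable, Tuple


def loki_push_payload(labels: Dict[str, str],
                      entries: Iterable[Tuple[int, str]]) -> str:
    """Build a JSON body for Loki's push endpoint (/loki/api/v1/push).

    entries are (unix_time_in_nanoseconds, log_line) pairs; Loki expects
    each timestamp to be sent as a string.
    """
    values = [[str(ts_ns), line] for ts_ns, line in entries]
    return json.dumps({"streams": [{"stream": labels, "values": values}]})


if __name__ == "__main__":
    # Labels and the log line are made up for illustration.
    body = loki_push_payload(
        {"job": "supportconfig", "component": "salt"},
        [(1700000000000000000, "ERROR example log line")],
    )
    print(body)
```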

Achievements during HW #23

  • ...

Goals for Hackweek #22

  • Improve CLI and performance.
  • Fix memory leak on "uyuni-health-exporter".
  • Complete automated deployment of Loki and other containers.

Achievements during HW #22

  • Fix memory leak on uyuni-health-exporter.
  • Fix Python packaging and installation.
  • Deploy Grafana and Prometheus dashboard.
  • Fix Loki and promtail deployments.
  • Run all containers in the same pod.
  • Unify console logging across deployment functions.
  • Friendlier CLI with new functions.
  • Containers are not wiped by default after execution.
  • Minor and cosmetic changes.
  • Update README.md to reflect the latest changes.

Goals for Hackweek #21

  • Getting a machine-readable version of supportconfig
  • First analysis and tweaking
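A first step towards a machine-readable supportconfig could be a small parser like the Python sketch below. It assumes the usual layout of supportconfig text files, where each section starts with a `#==[ Type ]====...#` header followed by a `# title` line (a command or a file path); that layout is an assumption here and should be checked against real files. The output structure is invented for illustration.

```python
import json
import re
from typing import Any, Dict, List

# Assumed header layout: "#==[ Command ]======...#"
SECTION_RE = re.compile(r"^#==\[ (.+?) \]=+#\s*$")


def parse_supportconfig(text: str) -> List[Dict[str, Any]]:
    """Split supportconfig text into sections with type, title and lines."""
    sections: List[Dict[str, Any]] = []
    current = None
    expect_title = False
    for line in text.splitlines():
        match = SECTION_RE.match(line)
        if match:
            current = {"type": match.group(1), "title": "", "lines": []}
            sections.append(current)
            expect_title = True
            continue
        if current is None:
            continue  # ignore anything before the first section header
        if expect_title and line.startswith("# "):
            # Only the line right after a header is treated as the title.
            current["title"] = line[2:].strip()
        else:
            current["lines"].append(line)
        expect_title = False
    return sections


if __name__ == "__main__":
    sample = (
        "#==[ Command ]======================================#\n"
        "# /bin/uname -a\n"
        "Linux host 5.14\n"
    )
    print(json.dumps(parse_supportconfig(sample), indent=2))
```

Once the sections are plain dictionaries, they can be dumped to JSON or searched by rules without re-reading the raw files.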

This project is part of:

Hack Week 21, Hack Week 22, Hack Week 23

Activity

  • about 2 years ago: cbosdonnat added keyword "uyuni" to this project.
  • about 2 years ago: cbosdonnat added keyword "susemanager" to this project.
  • about 2 years ago: cbosdonnat added keyword "monitoring" to this project.
  • about 2 years ago: cbosdonnat added keyword "grafana" to this project.
  • about 2 years ago: cbosdonnat added keyword "loki" to this project.
  • about 2 years ago: cbosdonnat added keyword "prometheus" to this project.
  • about 2 years ago: cbosdonnat added keyword "python3" to this project.
  • about 2 years ago: PSuarezHernandez joined this project.
  • about 2 years ago: cbosdonnat started this project.
  • about 2 years ago: cbosdonnat added keyword "supportconfig" to this project.
  • about 2 years ago: cbosdonnat added keyword "analysis" to this project.
  • about 2 years ago: cbosdonnat added keyword "tool" to this project.
  • about 2 years ago: cbosdonnat added keyword "dashboard" to this project.
  • about 2 years ago: cbosdonnat originated this project.

  • Comments

    • PSuarezHernandez
      over 1 year ago

      I've updated project description to reflect latest changes after Hackweek 22!

    • PSuarezHernandez
      10 months ago

      Let's keep hacking on this project during upcoming Hackweek 23!
