Project Overview:

This project aims to create a lightweight, scalable Python-based tool to manage and monitor servers via Redfish APIs. The current state allows basic system status checks (power, health, model), but PSU monitoring, sensor readings, and firmware updates are inconsistent due to differences in chassis IDs and server board variations.

The goal is to create a unified, automated solution that works across multiple Intel server boards, allowing administrators to monitor system health, temperature, PSU status, and perform firmware upgrades centrally.

Goals for Hack Week:

Dynamic Chassis Detection:

Replace hardcoded paths with a discovery mechanism that detects all chassis members per server.

Scalable Sensor Monitoring:

Monitor temperatures and PSUs for all servers in a data center.

Reduce manual intervention; allow Ctrl+C to exit the monitor loop safely.

Firmware Upload & Apply:

Maintain current working Redfish firmware upload functionality.

Extend to support multiple server models with minimal changes.

Ease of Use & Automation:

Single redfish_tool.py CLI for status, PSU info, sensor monitoring, and firmware updates.

Configurable via config.json for host, username, and password.

Documentation & Best Practices:

Clear instructions for adding new servers.

Include error handling for unsupported endpoints or missing sensors.

Collaboration & Skills Needed:

Python 3: Experience with REST APIs, JSON handling.

Redfish API knowledge: Familiarity with system management, PSU, thermal, and update services.

Hardware Knowledge: Understanding Intel server board variations, chassis, and PSU layouts.

Optional: Experience with firmware flashing and remote BMC management.

Looking for hackers with the skills:

Nothing? Add some keywords!

This project is part of:

Hack Week 25

Activity

  • about 4 hours ago: Squinata originated this project.

  • Comments

    Be the first to comment!

    Similar Projects

    This project is one of its kind!