Hewlett Packard Enterprise (HPE) is one of the largest and most reputable manufacturers of server equipment, widely used in data centers worldwide. HPE offers a variety of servers tailored to different organizational capacities and needs. Some of the most popular HPE server series include ProLiant ML, DL, and BL, known for their flexibility, security, and high reliability.
To ensure the proper operation and management of HPE servers, the Integrated Lights-Out (ILO) interface has been designed and implemented. In addition to managing HPE servers, ILO is used for monitoring these devices, allowing for the health status of all server components—such as processors, RAM modules, power supplies, controllers, disks, fans, and temperature sensors—to be monitored.
The Moein monitoring platform, developed by Behpaya, supports monitoring all HPE servers via ILO, including ILO versions 3, 4, and 5. The platform is designed to provide a comprehensive overview of all server components at a glance, allowing administrators to assess the overall health of their servers easily. A complete list of monitored metrics for HPE servers is available on the Behpaya website.
The following image illustrates an HPE Gen10 server monitored by Moein. As shown, the interface provides a detailed view of processor status, RAM modules, controllers, fans, power supplies, disk health, and network interfaces. In addition to displaying each module’s status separately, their physical placement within the server is also shown. For memory modules, their positioning per processor is indicated, along with occupied and available RAM slots and the status of each slot. Similarly, disk locations are displayed according to their enclosure (BOX), along with their health status. This enables administrators to quickly identify any issues within the server.
The following physical server components are monitored in Moein:
Each server has at least two processors, making processor health one of the most critical metrics monitored by Moein. Additional parameters such as core count, processor speed, maximum thread count, and L1, L2, and L3 cache sizes are also collected and displayed. Any change in processor health triggers an alert to the administrator.
In physical servers, each processor has multiple DIMM slots for RAM modules. Moein collects and displays information on each memory module, including its status, operational state, capacity, location, type, and technology. Any changes in RAM module status trigger an alert to the administrator.
HPE servers use controllers to manage drives (disks) and support various RAID levels. Given the critical nature of data storage, monitoring controller health is essential. The Moein monitoring system includes functionality to track controller health status.
Disks store data on servers, making their health and location vital factors to monitor. Moein not only monitors disk health but also collects and displays details such as disk type, speed, capacity, serial number, and physical placement within the server.
Power supplies provide the necessary electrical power for servers. HPE servers typically use redundant power supplies to enhance reliability. Moein monitors power supply health, input voltage, power consumption, redundancy status, model, serial number, and fault conditions.
Fans play a crucial role in cooling server components. Malfunctioning fans can lead to overheating, reducing server performance. Moein monitors fan health, installation status, location, and speed to ensure proper cooling.
HPE servers are equipped with numerous temperature sensors to monitor different areas of the system. Some servers contain up to 30 sensors, making their monitoring essential. Moein collects temperature sensor readings and provides configurable thresholds for alerts. This eliminates the need for continuous manual monitoring, as Moein will notify operators of any temperature anomalies.