Advanced Visualization and Data Analysis of HPC Cluster and User Application Behavior

SC21

Abstract

This work presents cutting-edge visualization, monitoring, and management solutions for HPC systems to understand the status of high-performance computing platforms and provide insight into the interactions among platform components. Benefiting from the greatly increased level of detail available from modern baseboard management controllers through Redfish Telemetry and real-time correlations via API and CLI interfaces to HPC job schedulers, this work provides much greater detail than previous similar projects.

Date
Nov 14, 2021 12:00 AM — Nov 19, 2021 12:00 AM
Event
Location
Saint Louis, Missouri (Virtual)
Jie Li
Jie Li
Ph.D. candidate in Computer Science