AI RESEARCH

Instant GPU Efficiency Visibility at Fleet Scale

arXiv CS.LG

ArXi:2605.20799v1 Announce Type: cross We present Overall FLOP Utilization (OFU), a hardware-level, precision-agnostic GPU efficiency metric for AI workloads on HPC systems, derived from two on-chip performance counters: Tensor Pipe Activity and SM clock frequency. OFU requires no application instrumentation and works across GPU generations and numeric precisions.