Short Communication: GPU-Aware Scheduling for Kubernetes Clusters

Grace O'Leary; Chen Xu

Received: Oct 10, 2025

Published: Oct 30, 2025

Abstract

We add GPU topology awareness to Kubernetes scheduler plugins, reducing inter-GPU communication latency by 19% in multi-node AI workloads.
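As a rough illustration of what topology-aware placement can mean, the sketch below scores each candidate node by how well connected its best set of free GPUs is, then picks the highest-scoring node. It is not the authors' plugin: the link types, the weights in `LINK_SCORE`, the `NODES` layout, and the function names are all hypothetical, chosen only to make the idea concrete. A real Kubernetes scheduler plugin would be written in Go against the scheduling framework and would obtain topology from the device layer rather than a hard-coded table.

```python
from itertools import combinations

# Hypothetical link weights: a higher value stands for a faster GPU-to-GPU path.
LINK_SCORE = {"NVLINK": 3, "PCIE": 2, "CROSS_SOCKET": 1}

# Hypothetical cluster state: node name -> (free GPU ids, link type per GPU pair).
NODES = {
    "node-a": ([0, 1], {frozenset({0, 1}): "PCIE"}),
    "node-b": ([0, 1, 2], {
        frozenset({0, 1}): "NVLINK",
        frozenset({0, 2}): "CROSS_SOCKET",
        frozenset({1, 2}): "CROSS_SOCKET",
    }),
}


def best_gpu_set(free_gpus, links, k):
    """Return (score, gpus) for the best-connected k free GPUs on one node.

    The score is the sum of link weights over every pair in the chosen set.
    Returns None when fewer than k GPUs are free.
    """
    if len(free_gpus) < k:
        return None
    best = None
    for combo in combinations(sorted(free_gpus), k):
        score = sum(LINK_SCORE[links[frozenset(pair)]]
                    for pair in combinations(combo, 2))
        if best is None or score > best[0]:
            best = (score, combo)
    return best


def pick_node(nodes, k):
    """Pick the node whose best k-GPU set scores highest, or None if none fit."""
    ranked = []
    for name, (free, links) in nodes.items():
        result = best_gpu_set(free, links, k)
        if result is not None:
            ranked.append((result[0], name, result[1]))
    if not ranked:
        return None
    _, name, gpus = max(ranked)  # ties are broken by node name
    return name, gpus


if __name__ == "__main__":
    print(pick_node(NODES, 2))
```

With the sample data, a two-GPU request lands on `node-b` because its NVLink pair outscores the PCIe pair on `node-a`. The exhaustive search over GPU subsets is only practical for the small per-node GPU counts typical of a single server.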

Cite this article

O'Leary, G., & Xu, C. (2025). Short Communication: GPU-Aware Scheduling for Kubernetes Clusters. Research Explorations in Global Knowledge & Technology (REGKT), 4(1). Retrieved from https://regkt.com/article.php?id=185&slug=short-communication-gpu-aware-scheduling-for-kubernetes-clusters
