Short Communication: GPU-Aware Scheduling for Kubernetes Clusters

short-communication
Received: Oct 10, 2025
Published: Oct 30, 2025
Authors: Grace O'Leary ✉ Chen Xu

Abstract

We add GPU topology awareness to Kubernetes scheduler plugins, reducing inter-GPU communication latency by 19% in multi-node AI workloads.

⬇ Download

Cite this article

O'Leary, G. & Xu, C. (2025). Short Communication: GPU-Aware Scheduling for Kubernetes Clusters. Research Explorations in Global Knowledge & Technology (REGKT), 4 (1). Retrieved from https://regkt.com/article.php?id=185&slug=short-communication-gpu-aware-scheduling-for-kubernetes-clusters

Premium Membership Required

You need a premium account to view or download this article.

Become Premium