Case Study: GPU Quota Markets for Shared Model Serving

case-study
Received: Oct 9, 2024
Published: Nov 6, 2024
Authors: Maya Smith ✉ Owen Brown

Abstract

A marketplace introduced internal GPU quota markets with preemption and buybacks, improving utilization by 14% while protecting latency SLOs.

⬇ Download

Cite this article

Smith, M. & Brown, O. (2024). Case Study: GPU Quota Markets for Shared Model Serving. Research Explorations in Global Knowledge & Technology (REGKT), 3 (10). Retrieved from https://regkt.com/article.php?id=284&slug=case-study-gpu-quota-markets-for-shared-model-serving

Premium Membership Required

You need a premium account to view or download this article.

Become Premium