Review: Elastic Inference on Cloud GPUs - Scaling Intelligence Efficiently

review
Received: Dec 11, 2023
Published: Dec 31, 2023
Authors: Hiro Weiss (corresponding author), Vera Fiorini, Adie Ivanov

Abstract

This review surveys GPU elasticity mechanisms for model inference on cloud infrastructure, highlighting their performance gains and billing efficiency.

Cite this article

Weiss, H., Fiorini, V., & Ivanov, A. (2023). Review: Elastic Inference on Cloud GPUs - Scaling Intelligence Efficiently. Research Explorations in Global Knowledge & Technology (REGKT), 2(10). Retrieved from https://regkt.com/article.php?id=492&slug=review-elastic-inference-cloud-gpus-scaling-intelligence
