Research Article: Adaptive Caching for Model Inference Routers

Received: Dec 8, 2022
Published: Dec 31, 2022
Authors: Nora Lee ✉, Olivia Jackson, Sophie Smith

Abstract

We introduce feedback-aware cache replacement for model routers, achieving 17% lower latency under burst traffic.

Cite this article

Lee, N., Jackson, O., & Smith, S. (2022). Research Article: Adaptive Caching for Model Inference Routers. Research Explorations in Global Knowledge & Technology (REGKT), 1(11). Retrieved from https://regkt.com/article.php?id=661&slug=adaptive-caching-for-model-inference-routers
