Short Communication: GPU-Aware Cost Models for ML Compilers

short-communication
Received: Dec 13, 2023
Published: Dec 31, 2023
Authors: Suriu Orlov ✉ Verau Weiss Veraa Yilmaz

Abstract

Introducing a GPU instruction-level cost model improving compilation time and kernel fusion efficiency for TensorFlow XLA.

⬇ Download

Cite this article

Orlov, S., Weiss, V., & Yilmaz, V. (2023). Short Communication: GPU-Aware Cost Models for ML Compilers. Research Explorations in Global Knowledge & Technology (REGKT), 2 (10). Retrieved from https://regkt.com/article.php?id=495&slug=short-communication-gpu-aware-cost-models-ml-compilers

Premium Membership Required

You need a premium account to view or download this article.

Become Premium