Research Article: Adaptive De-duplication for Lakehouse CDC Tables

research-article
Received: Jun 4, 2024
Published: Jul 1, 2024
Authors: Kartik Taylor ✉ Aarohi Nguyen

Abstract

We present an adaptive deduplication operator that tunes keys by lateness profiles and watermark jitter, lowering duplicate reads to 0.03% in production.

⬇ Download

Cite this article

Taylor, K. & Nguyen, A. (2024). Research Article: Adaptive De-duplication for Lakehouse CDC Tables. Research Explorations in Global Knowledge & Technology (REGKT), 3 (6). Retrieved from https://regkt.com/article.php?id=256&slug=adaptive-deduplication-lakehouse-cdc-tables

Premium Membership Required

You need a premium account to view or download this article.

Become Premium