Research Article: Adaptive De-duplication for Lakehouse CDC Tables
Abstract
We present an adaptive deduplication operator that tunes its keys based on lateness profiles and watermark jitter, reducing duplicate reads to 0.03% in production.
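
As a rough illustration of the idea, the sketch below derives a deduplication retention window from a lateness profile and from watermark jitter, then drops repeated change events inside that window. The abstract does not state how keys are tuned, so everything here is an assumption made for illustration: the function names `retention_window` and `deduplicate`, the (key, version, event_time) event shape, the 99th-percentile rule, and the use of the standard deviation of watermark gaps as the jitter measure. It adapts a window rather than the keys themselves and should not be read as the authors' operator.

```python
import math
from statistics import pstdev


def retention_window(lateness_samples, watermark_gaps, min_window=1.0, pct=0.99):
    """Hypothetical rule: high-percentile lateness plus watermark jitter (seconds)."""
    if not lateness_samples:
        return min_window
    ordered = sorted(lateness_samples)
    # Nearest-rank percentile of the observed lateness profile.
    idx = min(len(ordered) - 1, math.ceil(pct * len(ordered)) - 1)
    p_late = ordered[idx]
    # Jitter is taken as the population std-dev of gaps between watermarks (assumption).
    jitter = pstdev(watermark_gaps) if len(watermark_gaps) > 1 else 0.0
    return max(min_window, p_late + jitter)


def deduplicate(events, window):
    """Keep the first (key, version) event; drop repeats within `window` seconds."""
    seen = {}  # (key, version) -> event_time of the last emitted event
    out = []
    for key, version, event_time in events:
        ident = (key, version)
        last = seen.get(ident)
        if last is not None and abs(event_time - last) <= window:
            continue  # repeat inside the retention window: treat as duplicate
        seen[ident] = event_time
        out.append((key, version, event_time))
    return out


if __name__ == "__main__":
    window = retention_window([1, 2, 3, 4, 10], [4, 6])
    events = [("a", 1, 100.0), ("a", 1, 105.0), ("a", 2, 106.0), ("a", 1, 200.0)]
    print(window, deduplicate(events, window))
```

In this sketch a wider lateness profile or noisier watermarks lengthen the window, which trades more retained state for fewer duplicates slipping through; a production operator would also need state eviction, which is omitted here.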
Cite this article
Taylor, K. & Nguyen, A. (2024). Research Article: Adaptive De-duplication for Lakehouse CDC Tables. Research Explorations in Global Knowledge & Technology (REGKT), 3(6). Retrieved from https://regkt.com/article.php?id=256&slug=adaptive-deduplication-lakehouse-cdc-tables