Feature Engineering at Scale: Automating Machine Learning Readiness in Data Lakes
Abstract
Feature engineering remains one of the most critical and labor-intensive steps in machine learning development. This study proposes an automated feature engineering framework integrated directly into modern data lake architectures. The framework enables reusable, versioned, and auditable feature pipelines that improve model reproducibility and deployment velocity. Case studies illustrate how automation significantly reduces time-to-model while maintaining data quality standards.
Cite this article
(2023). Feature Engineering at Scale: Automating Machine Learning Readiness in Data Lakes. Research Explorations in Global Knowledge & Technology (REGKT), 2 (1). Retrieved from https://regkt.com/article.php?id=748&slug=feature-engineering-at-scale-automating-ml-readiness