Optimizing Large-Scale ETL Workflows Using Distributed Processing and Intelligent Scheduling
Abstract
As organizations process increasingly large datasets, traditional ETL workflows struggle to meet performance and reliability expectations. This paper presents an optimization framework combining distributed data processing with intelligent job scheduling techniques. By leveraging workload profiling and adaptive resource allocation, the proposed approach reduces execution latency and operational costs. Empirical evaluation across multiple cloud environments confirms significant improvements in throughput and fault tolerance.
Cite this article
(2022). Optimizing Large-Scale ETL Workflows Using Distributed Processing and Intelligent Scheduling. Research Explorations in Global Knowledge & Technology (REGKT), 1 (2). Retrieved from https://regkt.com/article.php?id=747&slug=optimizing-large-scale-etl-workflows-distributed-processing