Research Article: Self-Healing DAGs with Learned Retry Policies

research-article
Received: Oct 10, 2025
Published: Oct 30, 2025
Authors: Farah Aziz ✉ Mikhail Sokolov

Abstract

We train a policy to adjust retries and backoff using pipeline telemetry. In production, failed task MTTR decreased by 41% and resourcing costs dropped 12%.

⬇ Download

Cite this article

Aziz, F. & Sokolov, M. (2025). Research Article: Self-Healing DAGs with Learned Retry Policies. Research Explorations in Global Knowledge & Technology (REGKT), 3 (6). Retrieved from https://regkt.com/article.php?id=127&slug=self-healing-dags-with-learned-retry-policies

Premium Membership Required

You need a premium account to view or download this article.

Become Premium