Review: LLM Evaluation in Production – Freshness, Harms, and Drift
Abstract
We synthesize eval suites for freshness, harm probes, and drift detection with cost-quality tradeoffs.
Cite this article
Nguyen, A. & Evans, I. (2024). Review: LLM Evaluation in Production – Freshness, Harms, and Drift. Research Explorations in Global Knowledge & Technology (REGKT), 3(12). Retrieved from https://regkt.com/article.php?id=367&slug=review-llm-evaluation-in-production-freshness-harms-drift