https://www.sh-reya.com/blog/in-defense-ai-evals/

https://567-labs.github.io/systematically-improving-rag/talks/

https://arxiv.org/abs/2508.21038

https://gradientflow.substack.com/p/quick-wins-for-your-ai-eval-strategy

https://docs.google.com/presentation/d/1FsmB0nCR5xdes-scAUPx20gKm18MR2cqp1D3w9-iGmg/edit?slide=id.p#slide=id.p Stop Distracting Your Coding Agents - The death of RAG (embedding search)

https://hamel.dev/blog/posts/evals-faq/

https://x.com/JuliaANeagu/status/1964704824299253888