Short for "evaluation." A benchmark or test suite used to measure how well an AI model performs. "Write better evals" is a common 2025 refrain.
"Our eval suite catches regressions better than any Q&A meeting."
No comments yet — say something.
Add your own interpretation of "eval".