Blog | Weave Docs

pgermishuys · 7 min read

Agent Evals in Weave

What agent evaluations are, the main types that matter, and how Weave structures them to keep comparisons useful and honest.

Agent Evals in Weave &ZeroWidthSpace; It is easy to build an impressive agent demo. It is much harder to know whether the agent is actually improving. That is the role of an agent evaluation: taking a behavior you care about and turning it into something you can measure repeatedly.