promptdojo_

Add evals and traces — measure the agent, don't trust it — step 2 of 9

Four eval patterns: exact, contains, schema, judge. Match each case to its strongest fit. Which combination is correct?

read, then continue.