Add evals and traces — measure the agent, don't trust it — step 2 of 9
Four eval patterns: exact, contains, schema, judge. Match each case to its strongest fit. Which combination is correct?
⌘↵ runs the editor.read, then continue.
Add evals and traces — measure the agent, don't trust it — step 2 of 9
Four eval patterns: exact, contains, schema, judge. Match each case to its strongest fit. Which combination is correct?
⌘↵ runs the editor.read, then continue.