LLM-as-judge — when the judge is another model — step 4 of 9
Read the editor. The fake judge has pure position bias — it always picks slot A, no matter what content sits there. We run the same pair in both orders, then check whether the same physical output won both times.
What does the script print on the third line?
⌘↵ runs the editor. Read, then continue.
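For reference, the order-swap check described above can be sketched roughly as follows. This is not the script in the editor; it is a minimal illustration, and the names `fake_judge`, `winner_text`, `output_x`, and `output_y` are assumptions made up for this sketch.

```python
def fake_judge(slot_a: str, slot_b: str) -> str:
    """Hypothetical judge with pure position bias: it ignores the content
    of both slots and always answers 'A'."""
    return "A"


def winner_text(slot_a: str, slot_b: str) -> str:
    """Map the judge's slot verdict back to the actual output that won."""
    verdict = fake_judge(slot_a, slot_b)
    return slot_a if verdict == "A" else slot_b


# Two placeholder outputs to compare (illustrative strings only).
output_x = "response from model X"
output_y = "response from model Y"

# Run the same pair in both orders.
first_winner = winner_text(output_x, output_y)   # X sits in slot A
second_winner = winner_text(output_y, output_x)  # Y sits in slot A

# A judge that reads content picks the same output regardless of order.
# A position-biased judge flips its winner when the slots are swapped.
consistent = first_winner == second_winner
```

Comparing the winning outputs rather than the slot labels is the point of the check: the slot label is "A" in both runs, which looks stable, while the output that actually won changes with the ordering.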