The four breakage classes — sort any LLM failure before you touch the prompt — step 3 of 9
Each class lives in a different trace field
You read a trace by knowing which fields point at which class. Not every field is equally useful at every moment. Read them in priority order.
The trace shape
A real production trace has a lot of fields. Five of them are what you need to classify a failure:
| Field | What it tells you |
|---|---|
| `retrieved_chunks` | What the retriever returned. Empty? Wrong? Right? |
| `rendered_prompt` | The exact prompt the model saw — variables substituted, system + user merged |
| `raw_output` | What the model literally returned, before any post-processing |
| `output_after_postprocess` | What your code did to the raw output before showing it to the user |
| `tools_called` | Which tools ran, in what order |
A trace without these fields is a trace you can't classify against. If your logging only captures the final user-facing string, you're flying blind — fix logging before you debug the next failure.
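To make the shape concrete, here is a minimal sketch of those five fields as a Python `TypedDict`. The field names come from the table above; the value types are assumptions (your logging layer may store chunks as richer objects than plain strings):

```python
from typing import TypedDict

class Trace(TypedDict):
    retrieved_chunks: list[str]        # what the retriever returned
    rendered_prompt: str               # the exact prompt the model saw
    raw_output: str                    # model output before post-processing
    output_after_postprocess: str      # what the user actually saw
    tools_called: list[str]            # tool names, in call order
```

If your trace records can't populate all five keys, that is the logging gap to close first.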
Priority order: which field first
Three of the classes show up in the trace as mechanical signals. The fourth (prompt ambiguity) can't be detected from fields alone. So your classifier walks the mechanical signals first and lands on prompt as the fallthrough.
1. **Retrieved chunks.** Open them. Are they relevant to the question? Are they from the right account, customer, document, or time range? If `retrieved_chunks` looks wrong: class 1 (retrieval). Stop here.
2. **`raw_output` vs `output_after_postprocess`.** Are they different? Diff them. If post-processing changed something meaningful: class 4 (parse). Stop here.
3. **`retrieved_chunks` is empty.** The retriever had nothing to give the model, but the model answered anyway. That's class 3 (hallucination). Stop here.
4. **None of the above.** Retrieval was fine, the trace is clean, the post-processing didn't touch the output. The model still went wrong. That points at class 2 (prompt). Read the rendered prompt by hand. Could a careful reader pick a different reasonable answer than the one you wanted?
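Step 2's "diff them" needs nothing beyond the standard library. A sketch, assuming both output fields are plain strings (the function name `show_postprocess_diff` is mine, not from this lesson):

```python
import difflib

def show_postprocess_diff(trace: dict) -> str:
    """Unified diff between what the model returned and what the user saw.

    An empty diff rules out class 4 (parse); a non-empty one is where
    parse bugs hide.
    """
    return "\n".join(difflib.unified_diff(
        trace["raw_output"].splitlines(),
        trace["output_after_postprocess"].splitlines(),
        fromfile="raw_output",
        tofile="output_after_postprocess",
        lineterm="",
    ))
```

An empty return string means post-processing was a no-op on this trace; anything else deserves a close read before you blame the model.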
Why this order
Retrieval bugs are the most common in production and the easiest to spot — the wrong chunk is right there in the log. Prompt bugs are next most common and require slower reading. Parse bugs hide in the diff between raw and post-processed. True hallucinations are the least common cause of customer-visible failures, despite being the most blamed. Most "the model hallucinated" reports are actually class 1, 2, or 4 in disguise.
What this lesson does NOT do
It does NOT teach you to fix any of these classes. Fixing retrieval is chapter 22. Fixing prompts is chapter 19. Fixing parse bugs is chapter 23. This lesson teaches the practice of putting the failure in the right bucket so you fix it at the right layer. The wrong fix at the wrong layer is how teams burn weeks on a bug.
What you'll build
A `classify_failure(trace)` function that reads the four key fields in priority order and returns one of `"retrieval"`, `"prompt"`, `"hallucination"`, or `"parse"`. That function becomes the front door for every wrong-output ticket on your team.