The classifier output is the most important constraint

A routing system fails the first time the classifier returns a string that isn't in your dispatch table. KeyError. The user gets a 500. You get paged.

The fix is constraining the classifier before it answers, not after. Three ways, in increasing reliability:

1. Enum in the prompt (weakest)

"Classify the user message as one of: billing, technical, general.
 Respond with ONLY the category word, nothing else."

This usually works. Modern models are obedient. But "usually" is not a reliability guarantee. Production routers see drift here: "Billing" (capital B), "billing or technical", "general question". Each is a KeyError waiting to happen.

2. Structured output with an `enum` schema (strong)

{
  "type": "object",
  "properties": {
    "category": {"type": "string", "enum": ["billing", "technical", "general"]}
  },
  "required": ["category"]
}

Now the model can't return "Billing" even if it wants to — the schema rejects it. Anthropic, OpenAI, Google all support this via their "tool use as structured output" trick: define a tool whose input schema is your classification schema, force the model to call it, and read the input.

3. Validate-then-fallback (production-grade)

Even with structured output, you want one more layer:

category = classification.get("category", "")
if category not in ALLOWED:
    category = "general"  # or "fallback", or whatever your default is

This is the load-bearing line in every routing system that's been in production for more than a month. Models drift. Schemas can be bypassed by a prompt-injected user. The dispatch table needs a safe default.

What people get wrong

The most common failure mode in beginner routers is using the model to do both the classification and the answer in one call — "classify this and respond appropriately." The model usually does, but you've lost the dispatch decoupling. You can't swap out a specialist. You can't measure per-route cost. You can't log which route got chosen. Routing earns its value from the explicit two-call shape — keep them separate.

⌘↵ runs the editor.read, then continue.

promptdojo_›phase 03 · llm apis›ch 16 · agent loops

lesson 3 of 5 · routing — pick the path before doing the workstep 3 / 9

The classifier output is the most important constraint

A routing system fails the first time the classifier returns a string that isn't in your dispatch table. KeyError. The user gets a 500. You get paged.

The fix is constraining the classifier before it answers, not after. Three ways, in increasing reliability:

1. Enum in the prompt (weakest)

"Classify the user message as one of: billing, technical, general.
 Respond with ONLY the category word, nothing else."

2. Structured output with an `enum` schema (strong)

{
  "type": "object",
  "properties": {
    "category": {"type": "string", "enum": ["billing", "technical", "general"]}
  },
  "required": ["category"]
}

3. Validate-then-fallback (production-grade)

Even with structured output, you want one more layer:

category = classification.get("category", "")
if category not in ALLOWED:
    category = "general"  # or "fallback", or whatever your default is

What people get wrong

⌘↵ runs the editor.read, then continue.

Routing — pick the path before doing the work — step 3 of 9

The classifier output is the most important constraint

1. Enum in the prompt (weakest)

2. Structured output with an enum schema (strong)

3. Validate-then-fallback (production-grade)

What people get wrong

Routing — pick the path before doing the work — step 3 of 9

The classifier output is the most important constraint

1. Enum in the prompt (weakest)

2. Structured output with an enum schema (strong)

3. Validate-then-fallback (production-grade)

What people get wrong

2. Structured output with an `enum` schema (strong)

2. Structured output with an `enum` schema (strong)