15:[["$","script",null,{"type":"application/ld+json","dangerouslySetInnerHTML":{"__html":"$1e"}}],["$","$L1f",null,{"tree":{"toc":{"version":"0.1.0","generatedAt":"1970-01-01T00:00:00.000Z","chapters":[{"number":0,"slug":"before-you-build","title":"from worker to builder","blurb":"you are not here to become a programmer overnight. you are here to stop doing repeated work by hand. learn the builder loop, turn old workplace deliverables into interactive site artifacts, and build your first reusable tool before syntax appears.","status":"live","lessonCount":10,"stepCount":14,"liveLessonCount":10,"liveStepCount":14,"estMinutes":74,"xpTotal":36},{"number":1,"slug":"variables","title":"first code-reading lab — names, values, and simple outputs","blurb":"this is not a variables chapter for future Python programmers. it is your first lab for reading tiny AI-generated scripts: spot the names, run the output, and understand enough to change one thing safely.","status":"live","lessonCount":3,"stepCount":26,"liveLessonCount":3,"liveStepCount":26,"estMinutes":22,"xpTotal":88},{"number":2,"slug":"functions","title":"functions — and the missing-return bug ai ships","blurb":"ai writes functions constantly, and can silently forget the `return` line. learn to spot the missing return on sight.","status":"live","lessonCount":3,"stepCount":27,"liveLessonCount":3,"liveStepCount":27,"estMinutes":21,"xpTotal":94},{"number":3,"slug":"lists-and-dicts","title":"lists and dicts — the bones of every api","blurb":"every json response you've ever copied out of chatgpt or a rest api is some mix of two things: lists and dicts. read them on sight.","status":"live","lessonCount":3,"stepCount":27,"liveLessonCount":3,"liveStepCount":27,"estMinutes":21,"xpTotal":94},{"number":4,"slug":"loops","title":"loops — predict the output","blurb":"ai writes a loop every time you say *for each*. off-by-one bugs hide in the boundaries. read it before you trust it.","status":"live","lessonCount":3,"stepCount":28,"liveLessonCount":3,"liveStepCount":28,"estMinutes":21,"xpTotal":97},{"number":5,"slug":"conditionals","title":"conditionals — where ai silently bugs","blurb":"`if` looks simple. the traps inside it — empty values, `==` vs `is`, the difference between `0` and `None` — are where ai quietly ships wrong code.","status":"live","lessonCount":2,"stepCount":17,"liveLessonCount":2,"liveStepCount":17,"estMinutes":14,"xpTotal":57},{"number":6,"slug":"tracebacks","title":"tracebacks — cursor wrote this and crashed","blurb":"when python crashes, it tells you exactly what happened and where. most non-engineers panic at the wall of text. you're going to learn to read it.","status":"live","lessonCount":3,"stepCount":26,"liveLessonCount":3,"liveStepCount":26,"estMinutes":21,"xpTotal":91},{"number":7,"slug":"mutation-and-state","title":"mutation — why your code mysteriously breaks","blurb":"when a list inside a function changes the list outside the function, that's mutation. ai does this constantly without flagging it, and it's the bug class that takes the longest to find.","status":"live","lessonCount":2,"stepCount":17,"liveLessonCount":2,"liveStepCount":17,"estMinutes":14,"xpTotal":57},{"number":8,"slug":"modules-and-imports","title":"modules, imports, and why your venv hates you","blurb":"half of `pip install x` failures are environment confusion, not python bugs. learn what `import` actually does, what a virtual env is for, and why your script can't find the package you just installed.","status":"live","lessonCount":3,"stepCount":24,"liveLessonCount":3,"liveStepCount":24,"estMinutes":22,"xpTotal":81},{"number":9,"slug":"error-handling","title":"error handling — when ai's code crashes mid-flight","blurb":"ai loves a happy path. the moment a file isn't there or an api blinks, the script blows up. `try/except` is how you keep the program alive long enough to log what went wrong.","status":"live","lessonCount":3,"stepCount":27,"liveLessonCount":3,"liveStepCount":27,"estMinutes":23,"xpTotal":96},{"number":10,"slug":"files-and-io","title":"files and i/o — moving data in and out","blurb":"reading a csv, writing a log, parsing a json dump. the first thing ai does in any real project is touch a file. learn the few patterns it reaches for and the one it forgets.","status":"live","lessonCount":4,"stepCount":34,"liveLessonCount":4,"liveStepCount":34,"estMinutes":32,"xpTotal":119},{"number":11,"slug":"classes-basics","title":"classes — reading what ai just wrote you","blurb":"ai ships classes constantly: sqlalchemy models, fastapi schemas, custom exceptions. you don't need to design them. you need to read one without flinching.","status":"live","lessonCount":3,"stepCount":27,"liveLessonCount":3,"liveStepCount":27,"estMinutes":25,"xpTotal":96},{"number":12,"slug":"http-and-apis","title":"http and apis — making the call","blurb":"every ai script eventually calls an api. learn the shape of `httpx.get`, what a status code means, and how to pull a value out of the json that comes back.","status":"live","lessonCount":4,"stepCount":34,"liveLessonCount":4,"liveStepCount":34,"estMinutes":32,"xpTotal":122},{"number":13,"slug":"llm-apis","title":"llm apis — talking to claude and openai","blurb":"every ai feature you ship eventually calls a model api. learn the messages pattern, how to read the response, and the four lines ai writes every single time.","status":"live","lessonCount":4,"stepCount":27,"liveLessonCount":4,"liveStepCount":27,"estMinutes":45,"xpTotal":87},{"number":14,"slug":"structured-output","title":"structured output — making the model return json","blurb":"free-form text breaks every pipeline. learn the schema-first pattern ai uses to get reliable json back, validate it with pydantic, and catch the model's lies before they hit prod.","status":"live","lessonCount":3,"stepCount":24,"liveLessonCount":3,"liveStepCount":24,"estMinutes":33,"xpTotal":79},{"number":15,"slug":"mcp","title":"mcp — the model context protocol","blurb":"mcp is the new standard for plugging tools and data sources into ai agents. learn what an mcp server actually is, how claude code lists tools, and why this is replacing one-off integrations everywhere.","status":"live","lessonCount":3,"stepCount":26,"liveLessonCount":3,"liveStepCount":26,"estMinutes":40,"xpTotal":85},{"number":16,"slug":"agent-loops","title":"agent loops — tool_use and the request/tool/respond cycle","blurb":"an agent isn't magic. it's a while loop. learn the actual cycle claude code, cursor, and every other agent uses: model returns tool_use, you run the tool, you send the result back, repeat until end_turn.","status":"live","lessonCount":6,"stepCount":47,"liveLessonCount":6,"liveStepCount":47,"estMinutes":69,"xpTotal":153},{"number":17,"slug":"git-and-github","title":"git and github cli — the ai builder's actual workflow","blurb":"cursor and claude code commit on your behalf. reading those commits — and undoing the bad ones — is your job. learn the four-state model, the commands you'll run every day, and what `gh` does that `git` can't.","status":"live","lessonCount":3,"stepCount":24,"liveLessonCount":3,"liveStepCount":24,"estMinutes":35,"xpTotal":79},{"number":18,"slug":"secrets-and-env","title":"secrets — .env, api keys, and what not to commit","blurb":"ai ships keys to github all the time. learn the .env pattern, why os.getenv is non-negotiable, what to do when a key leaks, and the gitignore lines you need on day one.","status":"live","lessonCount":2,"stepCount":16,"liveLessonCount":2,"liveStepCount":16,"estMinutes":15,"xpTotal":58},{"number":19,"slug":"prompting","title":"prompting cursor and claude code effectively","blurb":"the difference between a one-shot ai session and a four-hour debugging spiral is almost always the first prompt. learn the structure that gets you usable code.","status":"live","lessonCount":5,"stepCount":36,"liveLessonCount":5,"liveStepCount":36,"estMinutes":59,"xpTotal":118},{"number":20,"slug":"agent-traces","title":"reading agent traces and telemetry","blurb":"when an agent fails, the trace tells you exactly where. learn to read tool calls, tool results, and stop reasons — the json breadcrumbs every agent leaves behind.","status":"live","lessonCount":3,"stepCount":25,"liveLessonCount":3,"liveStepCount":25,"estMinutes":29,"xpTotal":88},{"number":21,"slug":"evals","title":"eval-driven ai development","blurb":"if you can't test it, you can't ship it. learn the simple-but-strict eval patterns that separate ai features that work from ones that just feel like they do.","status":"live","lessonCount":5,"stepCount":34,"liveLessonCount":5,"liveStepCount":34,"estMinutes":56,"xpTotal":112},{"number":22,"slug":"context-and-retrieval","title":"context and retrieval — feeding the model real data","blurb":"rag without the overengineering. chunking, embeddings, vector search, and the small set of patterns that make a model answer from your data instead of its training set.","status":"live","lessonCount":5,"stepCount":43,"liveLessonCount":5,"liveStepCount":43,"estMinutes":65,"xpTotal":143},{"number":23,"slug":"production-tradeoffs","title":"production tradeoffs — cost, latency, quality","blurb":"the three numbers every shipped llm feature lives or dies by. token math, caching, streaming, batching, and the small set of decisions that move the product more than a model swap ever will.","status":"live","lessonCount":4,"stepCount":33,"liveLessonCount":4,"liveStepCount":33,"estMinutes":50,"xpTotal":108},{"number":24,"slug":"debugging-output","title":"debugging broken ai output","blurb":"when the model lies to your customer. the methodology for narrowing down what went wrong, the four most-common breakage classes, and the discipline that separates 'we shipped a fix' from 'we blamed the model and shrugged'.","status":"live","lessonCount":4,"stepCount":34,"liveLessonCount":4,"liveStepCount":34,"estMinutes":55,"xpTotal":109},{"number":25,"slug":"capstone","title":"capstone — ship the system","blurb":"wire it all together. the prompt, the call, the validation, the trace, the eval, the MCP tool. less a tutorial demo, more the smallest end-to-end llm feature you could ship to a real user. (retrieval and prompt-cache cost work live in chapters 22-23 — extend the capstone with them when you scale past the demo input set.)","status":"live","lessonCount":7,"stepCount":62,"liveLessonCount":7,"liveStepCount":62,"estMinutes":91,"xpTotal":219},{"number":26,"slug":"agent-harnesses","title":"agent harnesses — the layer between you and the raw api","blurb":"claude code, cursor, aider, codex cli — they're all the same four layers wrapped around the same model api. learn what those layers are, what each adds, and what you'd build yourself if you had to.","status":"live","lessonCount":4,"stepCount":28,"liveLessonCount":4,"liveStepCount":28,"estMinutes":55,"xpTotal":87},{"number":27,"slug":"ai-image-generation","title":"ai image generation — from prompt to production asset","blurb":"the 2026 image model landscape, the prompts that work, and the pipeline that turns one good idea into a hundred ready-to-ship images. nano-banana, flux, midjourney, ideogram — when each wins and what they cost.","status":"live","lessonCount":3,"stepCount":24,"liveLessonCount":3,"liveStepCount":24,"estMinutes":60,"xpTotal":67},{"number":28,"slug":"ai-video-generation","title":"ai video generation — sora, veo, higgsfield, and the second wave","blurb":"video is the hardest content type to generate, the most expensive, and the most strategically interesting. learn the 2026 model lineup, the camera-control patterns that separate slop from craft, and the cost math that decides whether your idea is viable.","status":"live","lessonCount":3,"stepCount":23,"liveLessonCount":3,"liveStepCount":23,"estMinutes":50,"xpTotal":61},{"number":29,"slug":"programmatic-design","title":"programmatic design — code-driven video and the new design pipeline","blurb":"ai generates raw assets; code stitches them into something shippable. hyperframes, remotion, claude design — when each tool wins, how they combine, and the data-driven workflows that turn one template into a hundred videos.","status":"live","lessonCount":3,"stepCount":24,"liveLessonCount":3,"liveStepCount":24,"estMinutes":46,"xpTotal":66},{"number":30,"slug":"harness-engineering","title":"harness engineering — the discipline behind the model","blurb":"every coding agent is a model plus a harness. the model is bought; the harness is engineered. learn the craft: how to ratchet rules from failures, fight context rot, design long-horizon loops, wire hooks as enforcement, and read the haas shift that's reshaping what you build vs buy.","status":"live","lessonCount":6,"stepCount":52,"liveLessonCount":6,"liveStepCount":52,"estMinutes":120,"xpTotal":143},{"number":31,"slug":"intro-to-terminal","title":"intro to terminal — the text way to drive your computer","blurb":"you've never opened a terminal. by the end of this chapter you have, and you can move around your files, make folders, and read files without touching the mouse. it's a keyboard shortcut, not a cockpit. every tool in the rest of this course assumes you can do this, so we do it first.","status":"live","lessonCount":3,"stepCount":23,"liveLessonCount":3,"liveStepCount":23,"estMinutes":26,"xpTotal":50},{"number":32,"slug":"intro-to-claude-cli","title":"intro to claude cli — the door to real building","blurb":"you've used claude in a chat window. the claude cli is the same model with its hands on your actual files. this chapter installs it, signs you in, and runs your first real command. by the end you've watched an ai read, plan, and change things on your machine, and you know when to reach for the cli instead of the chat box.","status":"live","lessonCount":3,"stepCount":23,"liveLessonCount":3,"liveStepCount":23,"estMinutes":32,"xpTotal":48},{"number":33,"slug":"intro-to-codex-cli","title":"intro to openai codex cli — a second tool in the kit","blurb":"you know the claude cli. the openai codex cli does the same job, an ai working in your terminal on your real files, with a different company behind it. this chapter installs it, signs you in, and runs your first command. most of what you already know carries straight over, so this chapter is mostly about what is different and when to reach for which.","status":"live","lessonCount":3,"stepCount":23,"liveLessonCount":3,"liveStepCount":23,"estMinutes":32,"xpTotal":45},{"number":34,"slug":"team-skills","title":"claude skills for teams — playbooks your whole team shares","blurb":"a claude skill is a packaged set of instructions — your team's playbook — that claude loads when it's relevant, so nobody has to re-explain it. this chapter is for people who manage teams. it covers what a skill is, how a team shares and provisions skills, real examples for hr, legal, and ops work, when a skill beats a one-off prompt, and the governance you need before any skill touches real work.","status":"live","lessonCount":4,"stepCount":28,"liveLessonCount":4,"liveStepCount":28,"estMinutes":41,"xpTotal":60},{"number":35,"slug":"dataframes-numpy-pandas","title":"dataframes with numpy and pandas","blurb":"tables are the working surface of applied ml. learn rows, columns, missing values, joins, aggregates, and the dataframe habits ai-generated notebooks assume.","status":"live","lessonCount":5,"stepCount":35,"liveLessonCount":5,"liveStepCount":35,"estMinutes":35,"xpTotal":130},{"number":36,"slug":"sql-for-ml-datasets","title":"sql for ml datasets","blurb":"most training data starts in a database. learn the select, join, filter, aggregate, and leakage traps that decide whether a model is learning signal or nonsense.","status":"live","lessonCount":4,"stepCount":28,"liveLessonCount":4,"liveStepCount":28,"estMinutes":28,"xpTotal":104},{"number":37,"slug":"dataset-pipelines","title":"dataset formats, ingestion, and validation pipelines","blurb":"a dataset is a product surface. build ingestion, validation, partitions, manifests, and checkpoints so the next run is not a mystery.","status":"live","lessonCount":5,"stepCount":35,"liveLessonCount":5,"liveStepCount":35,"estMinutes":37,"xpTotal":130},{"number":38,"slug":"ml-math-and-stats","title":"ml math and statistics that actually show up","blurb":"vectors, probability, distributions, correlation, and uncertainty are not trivia. they are how you read model behavior without worshipping it.","status":"live","lessonCount":5,"stepCount":35,"liveLessonCount":5,"liveStepCount":35,"estMinutes":37,"xpTotal":130},{"number":39,"slug":"supervised-learning-workflows","title":"supervised learning workflows","blurb":"labels, splits, baselines, training, prediction, and evaluation. the supervised workflow is the first complete model loop.","status":"live","lessonCount":5,"stepCount":35,"liveLessonCount":5,"liveStepCount":35,"estMinutes":40,"xpTotal":118},{"number":40,"slug":"unsupervised-learning-and-embeddings","title":"unsupervised learning, embeddings, and recommenders","blurb":"not every useful model has labels. cluster, compare, retrieve, and recommend by turning examples into useful neighborhoods.","status":"live","lessonCount":4,"stepCount":28,"liveLessonCount":4,"liveStepCount":28,"estMinutes":32,"xpTotal":95},{"number":41,"slug":"metrics-and-error-analysis","title":"metrics, slices, and error analysis","blurb":"accuracy is a blunt instrument. learn confusion matrices, precision, recall, thresholds, slices, and failure notes so model quality has evidence.","status":"live","lessonCount":5,"stepCount":35,"liveLessonCount":5,"liveStepCount":35,"estMinutes":37,"xpTotal":133},{"number":42,"slug":"pytorch-tensors-and-autograd","title":"pytorch tensors and autograd","blurb":"read tensor code without flinching. tensors, shapes, broadcasting, gradients, and autograd are the grammar of modern deep learning scripts.","status":"live","lessonCount":4,"stepCount":28,"liveLessonCount":4,"liveStepCount":28,"estMinutes":30,"xpTotal":107},{"number":43,"slug":"training-loops-and-optimizers","title":"training loops, backprop, optimizers, and schedulers","blurb":"the training loop is where models change. learn loss, gradients, optimizer steps, schedules, checkpoints, and the bugs ai ships there.","status":"live","lessonCount":5,"stepCount":35,"liveLessonCount":5,"liveStepCount":35,"estMinutes":37,"xpTotal":133},{"number":44,"slug":"deep-learning-architectures","title":"cnns, transformers, and useful llm internals","blurb":"architecture literacy for builders: convolution, attention, tokens, decoding, kv cache, quantization, and what those choices do to cost and behavior.","status":"live","lessonCount":5,"stepCount":35,"liveLessonCount":5,"liveStepCount":35,"estMinutes":37,"xpTotal":133},{"number":45,"slug":"feature-experiments-registries","title":"feature pipelines, experiment tracking, and registries","blurb":"features, runs, configs, artifacts, and registries are how ml work becomes repeatable instead of a lucky notebook.","status":"live","lessonCount":5,"stepCount":35,"liveLessonCount":5,"liveStepCount":35,"estMinutes":37,"xpTotal":133},{"number":46,"slug":"model-serving-and-mlops","title":"model serving, ci/cd, and mlops","blurb":"serving a model means handling inputs, versions, routes, batch jobs, ci gates, rollback, and production failures deliberately.","status":"live","lessonCount":6,"stepCount":42,"liveLessonCount":6,"liveStepCount":42,"estMinutes":44,"xpTotal":159},{"number":47,"slug":"monitoring-cloud-portfolio","title":"monitoring, drift, cloud scale, and portfolio launch","blurb":"the last mile: logs, drift, alerts, retraining decisions, cloud cost, gpu constraints, architecture docs, demos, and role stories.","status":"live","lessonCount":6,"stepCount":42,"liveLessonCount":6,"liveStepCount":42,"estMinutes":44,"xpTotal":158}]},"accessToc":{"chapters":[{"slug":"before-you-build","lessons":[{"slug":"from-worker-to-builder","steps":[{"id":"before-you-build/from-worker-to-builder/s01-01-intro"}]},{"slug":"upgrade-dull-artifacts-into-interactive-sites","steps":[{"id":"before-you-build/upgrade-dull-artifacts-into-interactive-sites/s01-01-intro"},{"id":"before-you-build/upgrade-dull-artifacts-into-interactive-sites/s02-02-build-brief"},{"id":"before-you-build/upgrade-dull-artifacts-into-interactive-sites/s03-03-your-artifact-brief"}]},{"slug":"why-building-matters-now","steps":[{"id":"before-you-build/why-building-matters-now/s01-01-intro"}]},{"slug":"the-builder-loop","steps":[{"id":"before-you-build/the-builder-loop/s01-01-intro"}]},{"slug":"your-first-builder-brief","steps":[{"id":"before-you-build/your-first-builder-brief/s01-01-intro"}]},{"slug":"what-the-ai-is-doing","steps":[{"id":"before-you-build/what-the-ai-is-doing/s01-01-intro"}]},{"slug":"build-your-first-reusable-tool","steps":[{"id":"before-you-build/build-your-first-reusable-tool/s01-01-intro"}]},{"slug":"from-prompt-to-project","steps":[{"id":"before-you-build/from-prompt-to-project/s01-01-intro"}]},{"slug":"why-code-appears-later","steps":[{"id":"before-you-build/why-code-appears-later/s01-01-intro"}]},{"slug":"use-it-again","steps":[{"id":"before-you-build/use-it-again/s01-01-intro"},{"id":"before-you-build/use-it-again/s02-02-which-fix-compounds"},{"id":"before-you-build/use-it-again/s03-03-save-version-two"}]}]},{"slug":"variables","lessons":[{"slug":"naming-things","steps":[{"id":"variables/naming-things/s01-01-intro"},{"id":"variables/naming-things/s02-02-which-name-is-valid"},{"id":"variables/naming-things/s03-03-assignment"},{"id":"variables/naming-things/s04-04-predict-the-score"},{"id":"variables/naming-things/s05-05-fill-the-name"},{"id":"variables/naming-things/s06-06-fix-the-typo"},{"id":"variables/naming-things/s07-07-write-greeting"},{"id":"variables/naming-things/s08-08-checkpoint"}]},{"slug":"types-on-sight","steps":[{"id":"variables/types-on-sight/s01-01-intro"},{"id":"variables/types-on-sight/s02-02-spot-the-type"},{"id":"variables/types-on-sight/s03-03-string-plus-int"},{"id":"variables/types-on-sight/s04-04-predict-the-error"},{"id":"variables/types-on-sight/s05-05-fill-the-cast"},{"id":"variables/types-on-sight/s06-06-fix-the-none-check"},{"id":"variables/types-on-sight/s07-07-fix-the-coercion"},{"id":"variables/types-on-sight/s08-08-write-the-counter"},{"id":"variables/types-on-sight/s09-09-checkpoint"}]},{"slug":"print-and-fstrings","steps":[{"id":"variables/print-and-fstrings/s01-01-intro"},{"id":"variables/print-and-fstrings/s02-02-which-output"},{"id":"variables/print-and-fstrings/s03-03-fstring-anatomy"},{"id":"variables/print-and-fstrings/s04-04-predict-the-print"},{"id":"variables/print-and-fstrings/s05-05-fill-the-format"},{"id":"variables/print-and-fstrings/s06-06-fix-the-quotes"},{"id":"variables/print-and-fstrings/s07-07-fix-the-brace"},{"id":"variables/print-and-fstrings/s08-08-write-the-receipt"},{"id":"variables/print-and-fstrings/s09-09-checkpoint"}]}]},{"slug":"functions","lessons":[{"slug":"return-values","steps":[{"id":"functions/return-values/s01-01-intro"},{"id":"functions/return-values/s02-02-which-returns"},{"id":"functions/return-values/s03-03-return-vs-print"},{"id":"functions/return-values/s04-04-predict-the-result"},{"id":"functions/return-values/s05-05-fill-the-return"},{"id":"functions/return-values/s06-06-fix-the-missing-return"},{"id":"functions/return-values/s07-07-fix-print-vs-return"},{"id":"functions/return-values/s08-08-write-double"},{"id":"functions/return-values/s09-09-checkpoint"}]},{"slug":"arguments-and-defaults","steps":[{"id":"functions/arguments-and-defaults/s01-01-intro"},{"id":"functions/arguments-and-defaults/s02-02-positional-vs-keyword"},{"id":"functions/arguments-and-defaults/s03-03-defaults"},{"id":"functions/arguments-and-defaults/s04-04-predict-the-default"},{"id":"functions/arguments-and-defaults/s05-05-fill-the-keyword"},{"id":"functions/arguments-and-defaults/s06-06-fix-the-arg-order"},{"id":"functions/arguments-and-defaults/s07-07-fix-the-missing-default"},{"id":"functions/arguments-and-defaults/s08-08-write-greet"},{"id":"functions/arguments-and-defaults/s09-09-checkpoint"}]},{"slug":"closures-and-decorators","steps":[{"id":"functions/closures-and-decorators/s01-01-intro"},{"id":"functions/closures-and-decorators/s02-02-which-line-is-the-closure"},{"id":"functions/closures-and-decorators/s03-03-the-at-sign"},{"id":"functions/closures-and-decorators/s04-04-predict-the-counter"},{"id":"functions/closures-and-decorators/s05-05-fill-the-decorator"},{"id":"functions/closures-and-decorators/s06-06-fix-the-missing-return"},{"id":"functions/closures-and-decorators/s07-07-fix-the-decorator-call"},{"id":"functions/closures-and-decorators/s08-08-write-tiny-decorator"},{"id":"functions/closures-and-decorators/s09-09-checkpoint"}]}]},{"slug":"lists-and-dicts","lessons":[{"slug":"the-bones-of-apis","steps":[{"id":"lists-and-dicts/the-bones-of-apis/s01-01-intro"},{"id":"lists-and-dicts/the-bones-of-apis/s02-02-which-is-which"},{"id":"lists-and-dicts/the-bones-of-apis/s03-03-indexing"},{"id":"lists-and-dicts/the-bones-of-apis/s04-04-predict-the-index"},{"id":"lists-and-dicts/the-bones-of-apis/s05-05-fill-the-key"},{"id":"lists-and-dicts/the-bones-of-apis/s06-06-fix-the-keyerror"},{"id":"lists-and-dicts/the-bones-of-apis/s07-07-fix-the-index"},{"id":"lists-and-dicts/the-bones-of-apis/s08-08-write-append"},{"id":"lists-and-dicts/the-bones-of-apis/s09-09-checkpoint"}]},{"slug":"list-comprehensions","steps":[{"id":"lists-and-dicts/list-comprehensions/s01-01-intro"},{"id":"lists-and-dicts/list-comprehensions/s02-02-which-comp-shape"},{"id":"lists-and-dicts/list-comprehensions/s03-03-with-filter"},{"id":"lists-and-dicts/list-comprehensions/s04-04-predict-the-comp"},{"id":"lists-and-dicts/list-comprehensions/s05-05-fill-the-comp"},{"id":"lists-and-dicts/list-comprehensions/s06-06-fix-the-comp"},{"id":"lists-and-dicts/list-comprehensions/s07-07-fix-the-filter"},{"id":"lists-and-dicts/list-comprehensions/s08-08-write-comp"},{"id":"lists-and-dicts/list-comprehensions/s09-09-checkpoint"}]},{"slug":"nested-data-shapes","steps":[{"id":"lists-and-dicts/nested-data-shapes/s01-01-intro"},{"id":"lists-and-dicts/nested-data-shapes/s02-02-which-path"},{"id":"lists-and-dicts/nested-data-shapes/s03-03-walking-the-shape"},{"id":"lists-and-dicts/nested-data-shapes/s04-04-predict-the-deep-key"},{"id":"lists-and-dicts/nested-data-shapes/s05-05-fill-the-index"},{"id":"lists-and-dicts/nested-data-shapes/s06-06-fix-the-wrong-path"},{"id":"lists-and-dicts/nested-data-shapes/s07-07-fix-the-list-vs-dict"},{"id":"lists-and-dicts/nested-data-shapes/s08-08-write-extract-emails"},{"id":"lists-and-dicts/nested-data-shapes/s09-09-checkpoint"}]}]},{"slug":"loops","lessons":[{"slug":"predict-the-output","steps":[{"id":"loops/predict-the-output/s01-01-intro"},{"id":"loops/predict-the-output/s02-02-which-loop-shape"},{"id":"loops/predict-the-output/s03-03-range"},{"id":"loops/predict-the-output/s04-04-predict-the-list-loop"},{"id":"loops/predict-the-output/s05-05-predict-the-range"},{"id":"loops/predict-the-output/s06-06-fill-the-loop-variable"},{"id":"loops/predict-the-output/s07-07-fix-the-off-by-one"},{"id":"loops/predict-the-output/s08-08-fix-the-dict-loop"},{"id":"loops/predict-the-output/s09-09-write-sum-loop"},{"id":"loops/predict-the-output/s10-10-checkpoint"}]},{"slug":"while-and-break","steps":[{"id":"loops/while-and-break/s01-01-intro"},{"id":"loops/while-and-break/s02-02-which-while-shape"},{"id":"loops/while-and-break/s03-03-break-and-continue"},{"id":"loops/while-and-break/s04-04-predict-the-while"},{"id":"loops/while-and-break/s05-05-fill-the-condition"},{"id":"loops/while-and-break/s06-06-fix-the-infinite-loop"},{"id":"loops/while-and-break/s07-07-fix-the-break"},{"id":"loops/while-and-break/s08-08-write-countdown"},{"id":"loops/while-and-break/s09-09-checkpoint"}]},{"slug":"enumerate-and-zip","steps":[{"id":"loops/enumerate-and-zip/s01-01-intro"},{"id":"loops/enumerate-and-zip/s02-02-which-pair"},{"id":"loops/enumerate-and-zip/s03-03-zip-pairs"},{"id":"loops/enumerate-and-zip/s04-04-predict-enumerate"},{"id":"loops/enumerate-and-zip/s05-05-fill-the-builtin"},{"id":"loops/enumerate-and-zip/s06-06-fix-the-range-len"},{"id":"loops/enumerate-and-zip/s07-07-fix-the-zip-mismatch"},{"id":"loops/enumerate-and-zip/s08-08-write-numbered-list"},{"id":"loops/enumerate-and-zip/s09-09-checkpoint"}]}]},{"slug":"conditionals","lessons":[{"slug":"truthiness-bugs","steps":[{"id":"conditionals/truthiness-bugs/s01-01-intro"},{"id":"conditionals/truthiness-bugs/s02-02-truthiness-quiz"},{"id":"conditionals/truthiness-bugs/s03-03-eq-vs-is"},{"id":"conditionals/truthiness-bugs/s04-04-predict-the-empty-list"},{"id":"conditionals/truthiness-bugs/s05-05-fill-the-condition"},{"id":"conditionals/truthiness-bugs/s06-06-fix-the-zero-bug"},{"id":"conditionals/truthiness-bugs/s07-07-fix-the-is-comparison"},{"id":"conditionals/truthiness-bugs/s08-08-checkpoint"}]},{"slug":"elif-and-pattern-match","steps":[{"id":"conditionals/elif-and-pattern-match/s01-01-intro"},{"id":"conditionals/elif-and-pattern-match/s02-02-which-fires-first"},{"id":"conditionals/elif-and-pattern-match/s03-03-match-statement"},{"id":"conditionals/elif-and-pattern-match/s04-04-predict-the-elif"},{"id":"conditionals/elif-and-pattern-match/s05-05-fill-the-elif"},{"id":"conditionals/elif-and-pattern-match/s06-06-fix-the-elif-order"},{"id":"conditionals/elif-and-pattern-match/s07-07-fix-the-match"},{"id":"conditionals/elif-and-pattern-match/s08-08-write-grade"},{"id":"conditionals/elif-and-pattern-match/s09-09-checkpoint"}]}]},{"slug":"tracebacks","lessons":[{"slug":"reading-the-stack","steps":[{"id":"tracebacks/reading-the-stack/s01-01-anatomy"},{"id":"tracebacks/reading-the-stack/s02-02-which-error"},{"id":"tracebacks/reading-the-stack/s03-03-error-types"},{"id":"tracebacks/reading-the-stack/s04-04-predict-the-error"},{"id":"tracebacks/reading-the-stack/s05-05-fix-nameerror"},{"id":"tracebacks/reading-the-stack/s06-06-fix-typeerror"},{"id":"tracebacks/reading-the-stack/s07-07-fix-attributeerror"},{"id":"tracebacks/reading-the-stack/s08-08-checkpoint"}]},{"slug":"the-five-error-classes","steps":[{"id":"tracebacks/the-five-error-classes/s01-01-intro"},{"id":"tracebacks/the-five-error-classes/s02-02-which-error-fits"},{"id":"tracebacks/the-five-error-classes/s03-03-zero-division-and-value"},{"id":"tracebacks/the-five-error-classes/s04-04-predict-the-class"},{"id":"tracebacks/the-five-error-classes/s05-05-fix-zerodivision"},{"id":"tracebacks/the-five-error-classes/s06-06-fix-valueerror"},{"id":"tracebacks/the-five-error-classes/s07-07-fix-indexerror"},{"id":"tracebacks/the-five-error-classes/s08-08-write-defensive"},{"id":"tracebacks/the-five-error-classes/s09-09-checkpoint"}]},{"slug":"print-debugging","steps":[{"id":"tracebacks/print-debugging/s01-01-intro"},{"id":"tracebacks/print-debugging/s02-02-which-print-helps"},{"id":"tracebacks/print-debugging/s03-03-breakpoint-explained"},{"id":"tracebacks/print-debugging/s04-04-predict-the-print-trace"},{"id":"tracebacks/print-debugging/s05-05-fill-the-print"},{"id":"tracebacks/print-debugging/s06-06-fix-the-silent-bug"},{"id":"tracebacks/print-debugging/s07-07-fix-the-print-placement"},{"id":"tracebacks/print-debugging/s08-08-write-trace-the-loop"},{"id":"tracebacks/print-debugging/s09-09-checkpoint"}]}]},{"slug":"mutation-and-state","lessons":[{"slug":"why-it-breaks","steps":[{"id":"mutation-and-state/why-it-breaks/s01-01-intro"},{"id":"mutation-and-state/why-it-breaks/s02-02-which-mutates"},{"id":"mutation-and-state/why-it-breaks/s03-03-shared-reference"},{"id":"mutation-and-state/why-it-breaks/s04-04-predict-the-mutation"},{"id":"mutation-and-state/why-it-breaks/s05-05-fill-the-copy"},{"id":"mutation-and-state/why-it-breaks/s06-06-fix-the-default-arg"},{"id":"mutation-and-state/why-it-breaks/s07-07-fix-the-shared-list"},{"id":"mutation-and-state/why-it-breaks/s08-08-checkpoint"}]},{"slug":"copy-vs-reference","steps":[{"id":"mutation-and-state/copy-vs-reference/s01-01-intro"},{"id":"mutation-and-state/copy-vs-reference/s02-02-which-copies"},{"id":"mutation-and-state/copy-vs-reference/s03-03-deep-copy"},{"id":"mutation-and-state/copy-vs-reference/s04-04-predict-the-shallow"},{"id":"mutation-and-state/copy-vs-reference/s05-05-fill-the-copy-call"},{"id":"mutation-and-state/copy-vs-reference/s06-06-fix-the-shallow-bug"},{"id":"mutation-and-state/copy-vs-reference/s07-07-fix-the-dict-copy"},{"id":"mutation-and-state/copy-vs-reference/s08-08-write-clone"},{"id":"mutation-and-state/copy-vs-reference/s09-09-checkpoint"}]}]},{"slug":"modules-and-imports","lessons":[{"slug":"why-venv-hates-you","steps":[{"id":"modules-and-imports/why-venv-hates-you/s01-01-what-import-does"},{"id":"modules-and-imports/why-venv-hates-you/s02-02-which-import-works"},{"id":"modules-and-imports/why-venv-hates-you/s03-03-from-import"},{"id":"modules-and-imports/why-venv-hates-you/s04-04-predict-the-import"},{"id":"modules-and-imports/why-venv-hates-you/s05-05-venv-explained"},{"id":"modules-and-imports/why-venv-hates-you/s06-06-fill-the-import"},{"id":"modules-and-imports/why-venv-hates-you/s07-07-fix-the-import"},{"id":"modules-and-imports/why-venv-hates-you/s08-08-checkpoint"}]},{"slug":"from-imports-and-aliases","steps":[{"id":"modules-and-imports/from-imports-and-aliases/s01-01-aliases"},{"id":"modules-and-imports/from-imports-and-aliases/s02-02-which-alias"},{"id":"modules-and-imports/from-imports-and-aliases/s03-03-multi-and-star"},{"id":"modules-and-imports/from-imports-and-aliases/s04-04-predict-the-alias"},{"id":"modules-and-imports/from-imports-and-aliases/s05-05-fill-the-alias"},{"id":"modules-and-imports/from-imports-and-aliases/s06-06-fix-the-alias"},{"id":"modules-and-imports/from-imports-and-aliases/s07-07-fix-the-name-clash"},{"id":"modules-and-imports/from-imports-and-aliases/s08-08-write-import"},{"id":"modules-and-imports/from-imports-and-aliases/s09-09-checkpoint"}]},{"slug":"ml-package-map","steps":[{"id":"modules-and-imports/ml-package-map/s01-01-intro"},{"id":"modules-and-imports/ml-package-map/s02-02-choose-the-risk"},{"id":"modules-and-imports/ml-package-map/s03-03-predict-the-checks"},{"id":"modules-and-imports/ml-package-map/s04-04-fill-the-return"},{"id":"modules-and-imports/ml-package-map/s05-05-fix-the-overtrust"},{"id":"modules-and-imports/ml-package-map/s06-06-write-count-ready"},{"id":"modules-and-imports/ml-package-map/s07-07-checkpoint"}]}]},{"slug":"error-handling","lessons":[{"slug":"try-except-basics","steps":[{"id":"error-handling/try-except-basics/s01-01-intro"},{"id":"error-handling/try-except-basics/s02-02-which-block-runs"},{"id":"error-handling/try-except-basics/s03-03-naming-the-error"},{"id":"error-handling/try-except-basics/s04-04-predict-the-fallback"},{"id":"error-handling/try-except-basics/s05-05-fill-the-keyword"},{"id":"error-handling/try-except-basics/s06-06-fix-the-bare-except"},{"id":"error-handling/try-except-basics/s07-07-fix-the-wrong-exception"},{"id":"error-handling/try-except-basics/s08-08-write-safe-int"},{"id":"error-handling/try-except-basics/s09-09-checkpoint"}]},{"slug":"catching-specific-errors","steps":[{"id":"error-handling/catching-specific-errors/s01-01-intro"},{"id":"error-handling/catching-specific-errors/s02-02-which-except-runs"},{"id":"error-handling/catching-specific-errors/s03-03-multiple-excepts"},{"id":"error-handling/catching-specific-errors/s04-04-predict-the-branch"},{"id":"error-handling/catching-specific-errors/s05-05-fill-the-as-variable"},{"id":"error-handling/catching-specific-errors/s06-06-fix-the-broad-except"},{"id":"error-handling/catching-specific-errors/s07-07-fix-the-wrong-class"},{"id":"error-handling/catching-specific-errors/s08-08-write-safe-lookup"},{"id":"error-handling/catching-specific-errors/s09-09-checkpoint"}]},{"slug":"raising-and-custom-exceptions","steps":[{"id":"error-handling/raising-and-custom-exceptions/s01-01-intro"},{"id":"error-handling/raising-and-custom-exceptions/s02-02-which-raise-fires"},{"id":"error-handling/raising-and-custom-exceptions/s03-03-custom-exception-classes"},{"id":"error-handling/raising-and-custom-exceptions/s04-04-predict-the-raise"},{"id":"error-handling/raising-and-custom-exceptions/s05-05-fill-the-raise-keyword"},{"id":"error-handling/raising-and-custom-exceptions/s06-06-fix-the-silent-swallow"},{"id":"error-handling/raising-and-custom-exceptions/s07-07-fix-the-bare-raise"},{"id":"error-handling/raising-and-custom-exceptions/s08-08-write-validate-age"},{"id":"error-handling/raising-and-custom-exceptions/s09-09-checkpoint"}]}]},{"slug":"files-and-io","lessons":[{"slug":"reading-and-writing","steps":[{"id":"files-and-io/reading-and-writing/s01-01-intro"},{"id":"files-and-io/reading-and-writing/s02-02-which-mode"},{"id":"files-and-io/reading-and-writing/s03-03-with-block"},{"id":"files-and-io/reading-and-writing/s04-04-predict-the-readlines"},{"id":"files-and-io/reading-and-writing/s05-05-fill-the-mode"},{"id":"files-and-io/reading-and-writing/s06-06-fix-the-no-with"},{"id":"files-and-io/reading-and-writing/s07-07-fix-the-wrong-mode"},{"id":"files-and-io/reading-and-writing/s08-08-write-log-line"},{"id":"files-and-io/reading-and-writing/s09-09-checkpoint"}]},{"slug":"pathlib-basics","steps":[{"id":"files-and-io/pathlib-basics/s01-01-intro"},{"id":"files-and-io/pathlib-basics/s02-02-which-call-is-pathlib"},{"id":"files-and-io/pathlib-basics/s03-03-read-write-and-glob"},{"id":"files-and-io/pathlib-basics/s04-04-predict-the-suffix"},{"id":"files-and-io/pathlib-basics/s05-05-fill-the-method"},{"id":"files-and-io/pathlib-basics/s06-06-fix-the-string-concat"},{"id":"files-and-io/pathlib-basics/s07-07-fix-the-missing-check"},{"id":"files-and-io/pathlib-basics/s08-08-write-list-text-files"},{"id":"files-and-io/pathlib-basics/s09-09-checkpoint"}]},{"slug":"csv-and-jsonl","steps":[{"id":"files-and-io/csv-and-jsonl/s01-01-intro"},{"id":"files-and-io/csv-and-jsonl/s02-02-which-reader"},{"id":"files-and-io/csv-and-jsonl/s03-03-jsonl-line-by-line"},{"id":"files-and-io/csv-and-jsonl/s04-04-predict-the-row"},{"id":"files-and-io/csv-and-jsonl/s05-05-fill-the-dictreader"},{"id":"files-and-io/csv-and-jsonl/s06-06-fix-the-encoding"},{"id":"files-and-io/csv-and-jsonl/s07-07-fix-the-jsonl-parse"},{"id":"files-and-io/csv-and-jsonl/s08-08-write-load-jsonl"},{"id":"files-and-io/csv-and-jsonl/s09-09-checkpoint"}]},{"slug":"dataset-manifests-and-formats","steps":[{"id":"files-and-io/dataset-manifests-and-formats/s01-01-intro"},{"id":"files-and-io/dataset-manifests-and-formats/s02-02-choose-the-risk"},{"id":"files-and-io/dataset-manifests-and-formats/s03-03-predict-the-checks"},{"id":"files-and-io/dataset-manifests-and-formats/s04-04-fill-the-return"},{"id":"files-and-io/dataset-manifests-and-formats/s05-05-fix-the-overtrust"},{"id":"files-and-io/dataset-manifests-and-formats/s06-06-write-manifest-entry"},{"id":"files-and-io/dataset-manifests-and-formats/s07-07-checkpoint"}]}]},{"slug":"classes-basics","lessons":[{"slug":"reading-a-class","steps":[{"id":"classes-basics/reading-a-class/s01-01-intro"},{"id":"classes-basics/reading-a-class/s02-02-which-is-the-method"},{"id":"classes-basics/reading-a-class/s03-03-self-and-init"},{"id":"classes-basics/reading-a-class/s04-04-predict-the-instance"},{"id":"classes-basics/reading-a-class/s05-05-fill-the-self"},{"id":"classes-basics/reading-a-class/s06-06-fix-the-missing-self"},{"id":"classes-basics/reading-a-class/s07-07-fix-the-init-args"},{"id":"classes-basics/reading-a-class/s08-08-write-counter"},{"id":"classes-basics/reading-a-class/s09-09-checkpoint"}]},{"slug":"instance-vs-class","steps":[{"id":"classes-basics/instance-vs-class/s01-01-intro"},{"id":"classes-basics/instance-vs-class/s02-02-which-attribute-is-shared"},{"id":"classes-basics/instance-vs-class/s03-03-mutable-class-attr-trap"},{"id":"classes-basics/instance-vs-class/s04-04-predict-the-shared-list"},{"id":"classes-basics/instance-vs-class/s05-05-fill-the-self"},{"id":"classes-basics/instance-vs-class/s06-06-fix-the-shared-default"},{"id":"classes-basics/instance-vs-class/s07-07-fix-the-class-attr-typo"},{"id":"classes-basics/instance-vs-class/s08-08-write-counter-class"},{"id":"classes-basics/instance-vs-class/s09-09-checkpoint"}]},{"slug":"dataclasses","steps":[{"id":"classes-basics/dataclasses/s01-01-intro"},{"id":"classes-basics/dataclasses/s02-02-which-decorator"},{"id":"classes-basics/dataclasses/s03-03-defaults-and-factories"},{"id":"classes-basics/dataclasses/s04-04-predict-the-repr"},{"id":"classes-basics/dataclasses/s05-05-fill-the-decorator"},{"id":"classes-basics/dataclasses/s06-06-fix-the-default-list"},{"id":"classes-basics/dataclasses/s07-07-fix-the-frozen-mutation"},{"id":"classes-basics/dataclasses/s08-08-write-task-dataclass"},{"id":"classes-basics/dataclasses/s09-09-checkpoint"}]}]},{"slug":"http-and-apis","lessons":[{"slug":"the-shape-of-a-call","steps":[{"id":"http-and-apis/the-shape-of-a-call/s01-01-intro"},{"id":"http-and-apis/the-shape-of-a-call/s02-02-which-status-is-success"},{"id":"http-and-apis/the-shape-of-a-call/s03-03-parsing-the-json"},{"id":"http-and-apis/the-shape-of-a-call/s04-04-predict-the-key"},{"id":"http-and-apis/the-shape-of-a-call/s05-05-fill-the-method"},{"id":"http-and-apis/the-shape-of-a-call/s06-06-fix-the-status-check"},{"id":"http-and-apis/the-shape-of-a-call/s07-07-fix-the-key-access"},{"id":"http-and-apis/the-shape-of-a-call/s08-08-write-extract-name"},{"id":"http-and-apis/the-shape-of-a-call/s09-09-checkpoint"}]},{"slug":"status-and-errors","steps":[{"id":"http-and-apis/status-and-errors/s01-01-intro"},{"id":"http-and-apis/status-and-errors/s02-02-which-status-family"},{"id":"http-and-apis/status-and-errors/s03-03-error-bodies-and-retries"},{"id":"http-and-apis/status-and-errors/s04-04-predict-the-status-handling"},{"id":"http-and-apis/status-and-errors/s05-05-fill-the-raise-for-status"},{"id":"http-and-apis/status-and-errors/s06-06-fix-the-missing-check"},{"id":"http-and-apis/status-and-errors/s07-07-fix-the-broad-retry"},{"id":"http-and-apis/status-and-errors/s08-08-write-classify-status"},{"id":"http-and-apis/status-and-errors/s09-09-checkpoint"}]},{"slug":"parsing-nested-responses","steps":[{"id":"http-and-apis/parsing-nested-responses/s01-01-intro"},{"id":"http-and-apis/parsing-nested-responses/s02-02-which-access-is-safe"},{"id":"http-and-apis/parsing-nested-responses/s03-03-walking-lists-and-dicts"},{"id":"http-and-apis/parsing-nested-responses/s04-04-predict-the-deep-key"},{"id":"http-and-apis/parsing-nested-responses/s05-05-fill-the-get-default"},{"id":"http-and-apis/parsing-nested-responses/s06-06-fix-the-keyerror-chain"},{"id":"http-and-apis/parsing-nested-responses/s07-07-fix-the-silent-none"},{"id":"http-and-apis/parsing-nested-responses/s08-08-write-extract-content"},{"id":"http-and-apis/parsing-nested-responses/s09-09-checkpoint"}]},{"slug":"api-ingestion-with-checkpoints","steps":[{"id":"http-and-apis/api-ingestion-with-checkpoints/s01-01-intro"},{"id":"http-and-apis/api-ingestion-with-checkpoints/s02-02-choose-the-risk"},{"id":"http-and-apis/api-ingestion-with-checkpoints/s03-03-predict-the-checks"},{"id":"http-and-apis/api-ingestion-with-checkpoints/s04-04-fill-the-return"},{"id":"http-and-apis/api-ingestion-with-checkpoints/s05-05-fix-the-overtrust"},{"id":"http-and-apis/api-ingestion-with-checkpoints/s06-06-write-count-ready"},{"id":"http-and-apis/api-ingestion-with-checkpoints/s07-07-checkpoint"}]}]},{"slug":"llm-apis","lessons":[{"slug":"before-the-api-what-the-model-is-doing","steps":[{"id":"llm-apis/before-the-api-what-the-model-is-doing/s01-01-intro"}]},{"slug":"the-messages-pattern","steps":[{"id":"llm-apis/the-messages-pattern/s01-01-intro"},{"id":"llm-apis/the-messages-pattern/s02-02-which-role-is-yours"},{"id":"llm-apis/the-messages-pattern/s03-03-reading-the-response"},{"id":"llm-apis/the-messages-pattern/s04-04-predict-the-text"},{"id":"llm-apis/the-messages-pattern/s05-05-fill-the-role"},{"id":"llm-apis/the-messages-pattern/s06-06-fix-the-message-shape"},{"id":"llm-apis/the-messages-pattern/s07-07-fix-the-response-access"},{"id":"llm-apis/the-messages-pattern/s08-08-write-ask-claude"},{"id":"llm-apis/the-messages-pattern/s09-09-checkpoint"}]},{"slug":"reading-the-response","steps":[{"id":"llm-apis/reading-the-response/s01-01-intro"},{"id":"llm-apis/reading-the-response/s02-02-which-block-do-you-read"},{"id":"llm-apis/reading-the-response/s03-03-the-five-stop-reasons"},{"id":"llm-apis/reading-the-response/s04-04-predict-the-text"},{"id":"llm-apis/reading-the-response/s05-05-fill-the-text-extract"},{"id":"llm-apis/reading-the-response/s06-06-fix-the-content-zero"},{"id":"llm-apis/reading-the-response/s07-07-fix-the-stop-assumption"},{"id":"llm-apis/reading-the-response/s08-08-write-the-extractor"},{"id":"llm-apis/reading-the-response/s09-09-checkpoint"}]},{"slug":"the-model-picker","steps":[{"id":"llm-apis/the-model-picker/s01-01-why-model-choice-is-a-product-decision"},{"id":"llm-apis/the-model-picker/s02-02-haiku-territory"},{"id":"llm-apis/the-model-picker/s03-03-which-model-here"},{"id":"llm-apis/the-model-picker/s04-04-sonnet-territory"},{"id":"llm-apis/the-model-picker/s05-05-opus-and-the-thinking-budget"},{"id":"llm-apis/the-model-picker/s06-06-pick-the-model"},{"id":"llm-apis/the-model-picker/s07-07-write-the-model-routing-memo"},{"id":"llm-apis/the-model-picker/s08-08-checkpoint"}]}]},{"slug":"structured-output","lessons":[{"slug":"getting-json-back","steps":[{"id":"structured-output/getting-json-back/s01-01-intro"},{"id":"structured-output/getting-json-back/s02-02-why-validate"},{"id":"structured-output/getting-json-back/s03-03-pydantic-models"},{"id":"structured-output/getting-json-back/s04-04-predict-the-validated"},{"id":"structured-output/getting-json-back/s05-05-fill-the-field-type"},{"id":"structured-output/getting-json-back/s06-06-fix-the-required-field"},{"id":"structured-output/getting-json-back/s07-07-fix-the-type-mismatch"},{"id":"structured-output/getting-json-back/s08-08-write-parse-extraction"},{"id":"structured-output/getting-json-back/s09-09-checkpoint"}]},{"slug":"schemas-eat-prompts","steps":[{"id":"structured-output/schemas-eat-prompts/s01-01-the-trust-boundary"},{"id":"structured-output/schemas-eat-prompts/s02-02-three-shipped-breakages"},{"id":"structured-output/schemas-eat-prompts/s03-03-which-boundary-broke"},{"id":"structured-output/schemas-eat-prompts/s04-04-schema-as-prompt-replacement"},{"id":"structured-output/schemas-eat-prompts/s05-05-where-validation-belongs"},{"id":"structured-output/schemas-eat-prompts/s06-06-the-pydantic-migration-as-war-story"},{"id":"structured-output/schemas-eat-prompts/s07-07-write-the-boundary-audit"},{"id":"structured-output/schemas-eat-prompts/s08-08-checkpoint"}]},{"slug":"dataset-schema-validation","steps":[{"id":"structured-output/dataset-schema-validation/s01-01-intro"},{"id":"structured-output/dataset-schema-validation/s02-02-choose-the-risk"},{"id":"structured-output/dataset-schema-validation/s03-03-predict-the-checks"},{"id":"structured-output/dataset-schema-validation/s04-04-fill-the-return"},{"id":"structured-output/dataset-schema-validation/s05-05-fix-the-overtrust"},{"id":"structured-output/dataset-schema-validation/s06-06-write-count-ready"},{"id":"structured-output/dataset-schema-validation/s07-07-checkpoint"}]}]},{"slug":"mcp","lessons":[{"slug":"what-mcp-is","steps":[{"id":"mcp/what-mcp-is/s01-01-intro"},{"id":"mcp/what-mcp-is/s02-02-which-side-is-the-server"},{"id":"mcp/what-mcp-is/s03-03-the-tool-list"},{"id":"mcp/what-mcp-is/s04-04-predict-the-tool-result"},{"id":"mcp/what-mcp-is/s05-05-fill-the-tool-name"},{"id":"mcp/what-mcp-is/s06-06-fix-the-missing-argument"},{"id":"mcp/what-mcp-is/s07-07-fix-the-result-shape"},{"id":"mcp/what-mcp-is/s08-08-write-call-tool"},{"id":"mcp/what-mcp-is/s09-09-checkpoint"}]},{"slug":"writing-a-tiny-mcp-server","steps":[{"id":"mcp/writing-a-tiny-mcp-server/s01-01-intro"},{"id":"mcp/writing-a-tiny-mcp-server/s02-02-which-message-is-call-tool"},{"id":"mcp/writing-a-tiny-mcp-server/s03-03-the-tool-registry"},{"id":"mcp/writing-a-tiny-mcp-server/s04-04-predict-the-list-tools-output"},{"id":"mcp/writing-a-tiny-mcp-server/s05-05-fill-the-input-schema"},{"id":"mcp/writing-a-tiny-mcp-server/s06-06-fix-the-missing-error-response"},{"id":"mcp/writing-a-tiny-mcp-server/s07-07-fix-the-validation-gap"},{"id":"mcp/writing-a-tiny-mcp-server/s08-08-write-the-call-tool-dispatcher"},{"id":"mcp/writing-a-tiny-mcp-server/s09-09-checkpoint"}]},{"slug":"why-mcp-won","steps":[{"id":"mcp/why-mcp-won/s01-01-the-graveyard-of-tool-protocols"},{"id":"mcp/why-mcp-won/s02-02-what-openai-plugins-got-wrong"},{"id":"mcp/why-mcp-won/s03-03-which-failure-mode"},{"id":"mcp/why-mcp-won/s04-04-the-mcp-bet"},{"id":"mcp/why-mcp-won/s05-05-the-network-effect"},{"id":"mcp/why-mcp-won/s06-06-spot-the-mcp-wedge"},{"id":"mcp/why-mcp-won/s07-07-write-the-protocol-postmortem"},{"id":"mcp/why-mcp-won/s08-08-checkpoint"}]}]},{"slug":"agent-loops","lessons":[{"slug":"the-loop","steps":[{"id":"agent-loops/the-loop/s01-01-intro"},{"id":"agent-loops/the-loop/s02-02-which-stop-reason"},{"id":"agent-loops/the-loop/s03-03-tool-result-shape"},{"id":"agent-loops/the-loop/s04-04-predict-the-next-call"},{"id":"agent-loops/the-loop/s05-05-fill-the-stop-condition"},{"id":"agent-loops/the-loop/s06-06-fix-the-tool-id"},{"id":"agent-loops/the-loop/s07-07-fix-the-loop-exit"},{"id":"agent-loops/the-loop/s08-08-write-the-loop"},{"id":"agent-loops/the-loop/s09-09-checkpoint"}]},{"slug":"multi-step-tools","steps":[{"id":"agent-loops/multi-step-tools/s01-01-intro"},{"id":"agent-loops/multi-step-tools/s02-02-which-tool-runs"},{"id":"agent-loops/multi-step-tools/s03-03-tool-registry"},{"id":"agent-loops/multi-step-tools/s04-04-predict-the-chain"},{"id":"agent-loops/multi-step-tools/s05-05-fill-the-dispatch"},{"id":"agent-loops/multi-step-tools/s06-06-fix-the-missing-tool"},{"id":"agent-loops/multi-step-tools/s07-07-fix-the-parallel-ids"},{"id":"agent-loops/multi-step-tools/s08-08-write-multi-tool-agent"},{"id":"agent-loops/multi-step-tools/s09-09-checkpoint"}]},{"slug":"routing","steps":[{"id":"agent-loops/routing/s01-01-intro"},{"id":"agent-loops/routing/s02-02-when-to-route"},{"id":"agent-loops/routing/s03-03-classifier-output"},{"id":"agent-loops/routing/s04-04-predict-the-route"},{"id":"agent-loops/routing/s05-05-fill-the-dispatch"},{"id":"agent-loops/routing/s06-06-fix-the-fallback"},{"id":"agent-loops/routing/s07-07-fix-the-loose-classifier"},{"id":"agent-loops/routing/s08-08-write-the-router"},{"id":"agent-loops/routing/s09-09-checkpoint"}]},{"slug":"evaluator-optimizer","steps":[{"id":"agent-loops/evaluator-optimizer/s01-01-intro"},{"id":"agent-loops/evaluator-optimizer/s02-02-when-it-earns-its-keep"},{"id":"agent-loops/evaluator-optimizer/s03-03-judge-output-shape"},{"id":"agent-loops/evaluator-optimizer/s04-04-predict-the-iterations"},{"id":"agent-loops/evaluator-optimizer/s05-05-fill-the-exit"},{"id":"agent-loops/evaluator-optimizer/s06-06-fix-the-judge-parsing"},{"id":"agent-loops/evaluator-optimizer/s07-07-fix-the-missing-cap"},{"id":"agent-loops/evaluator-optimizer/s08-08-write-the-evaluator-optimizer"},{"id":"agent-loops/evaluator-optimizer/s09-09-checkpoint"}]},{"slug":"why-every-framework-is-thirty-lines","steps":[{"id":"agent-loops/why-every-framework-is-thirty-lines/s01-01-the-frameworks-converged"},{"id":"agent-loops/why-every-framework-is-thirty-lines/s02-02-langgraph-walkthrough"},{"id":"agent-loops/why-every-framework-is-thirty-lines/s03-03-which-framework-fits"},{"id":"agent-loops/why-every-framework-is-thirty-lines/s04-04-vercel-ai-sdk-walkthrough"},{"id":"agent-loops/why-every-framework-is-thirty-lines/s05-05-the-five-canonical-patterns"},{"id":"agent-loops/why-every-framework-is-thirty-lines/s06-06-when-to-stay-framework-free"},{"id":"agent-loops/why-every-framework-is-thirty-lines/s07-07-write-the-tradeoff-memo"},{"id":"agent-loops/why-every-framework-is-thirty-lines/s08-08-checkpoint"}]},{"slug":"the-industry-map","steps":[{"id":"agent-loops/the-industry-map/s01-01-the-five-layers"},{"id":"agent-loops/the-industry-map/s02-02-which-layer-failed"},{"id":"agent-loops/the-industry-map/s03-03-the-shape-of-the-stack"}]}]},{"slug":"git-and-github","lessons":[{"slug":"the-three-states","steps":[{"id":"git-and-github/the-three-states/s01-01-intro"},{"id":"git-and-github/the-three-states/s02-02-which-command-stages"},{"id":"git-and-github/the-three-states/s03-03-status-and-diff"},{"id":"git-and-github/the-three-states/s04-04-predict-the-status"},{"id":"git-and-github/the-three-states/s05-05-fill-the-command"},{"id":"git-and-github/the-three-states/s06-06-fix-the-staged-files"},{"id":"git-and-github/the-three-states/s07-07-fix-the-pr-command"},{"id":"git-and-github/the-three-states/s08-08-write-status-summary"},{"id":"git-and-github/the-three-states/s09-09-checkpoint"}]},{"slug":"three-git-disasters-ai-shipped","steps":[{"id":"git-and-github/three-git-disasters-ai-shipped/s01-01-why-postmortems-teach-better"},{"id":"git-and-github/three-git-disasters-ai-shipped/s02-02-uber-2016-credentials-in-repo"},{"id":"git-and-github/three-git-disasters-ai-shipped/s03-03-which-control-would-have-caught-it"},{"id":"git-and-github/three-git-disasters-ai-shipped/s04-04-the-samsung-paste"},{"id":"git-and-github/three-git-disasters-ai-shipped/s05-05-the-cursor-git-add-dot-default"},{"id":"git-and-github/three-git-disasters-ai-shipped/s06-06-spot-the-failure-mode"},{"id":"git-and-github/three-git-disasters-ai-shipped/s07-07-write-your-pre-commit-checklist"},{"id":"git-and-github/three-git-disasters-ai-shipped/s08-08-checkpoint"}]},{"slug":"github-actions-for-model-checks","steps":[{"id":"git-and-github/github-actions-for-model-checks/s01-01-intro"},{"id":"git-and-github/github-actions-for-model-checks/s02-02-choose-the-risk"},{"id":"git-and-github/github-actions-for-model-checks/s03-03-predict-the-checks"},{"id":"git-and-github/github-actions-for-model-checks/s04-04-fill-the-return"},{"id":"git-and-github/github-actions-for-model-checks/s05-05-fix-the-overtrust"},{"id":"git-and-github/github-actions-for-model-checks/s06-06-write-count-ready"},{"id":"git-and-github/github-actions-for-model-checks/s07-07-checkpoint"}]}]},{"slug":"secrets-and-env","lessons":[{"slug":"keeping-keys-safe","steps":[{"id":"secrets-and-env/keeping-keys-safe/s01-01-intro"},{"id":"secrets-and-env/keeping-keys-safe/s02-02-which-line-leaks"},{"id":"secrets-and-env/keeping-keys-safe/s03-03-loading-env-vars"},{"id":"secrets-and-env/keeping-keys-safe/s04-04-predict-the-key"},{"id":"secrets-and-env/keeping-keys-safe/s05-05-fill-the-getenv"},{"id":"secrets-and-env/keeping-keys-safe/s06-06-fix-the-hardcoded-key"},{"id":"secrets-and-env/keeping-keys-safe/s07-07-fix-the-missing-default"},{"id":"secrets-and-env/keeping-keys-safe/s08-08-write-load-config"},{"id":"secrets-and-env/keeping-keys-safe/s09-09-checkpoint"}]},{"slug":"ml-service-secrets","steps":[{"id":"secrets-and-env/ml-service-secrets/s01-01-intro"},{"id":"secrets-and-env/ml-service-secrets/s02-02-choose-the-risk"},{"id":"secrets-and-env/ml-service-secrets/s03-03-predict-the-checks"},{"id":"secrets-and-env/ml-service-secrets/s04-04-fill-the-return"},{"id":"secrets-and-env/ml-service-secrets/s05-05-fix-the-overtrust"},{"id":"secrets-and-env/ml-service-secrets/s06-06-write-count-ready"},{"id":"secrets-and-env/ml-service-secrets/s07-07-checkpoint"}]}]},{"slug":"prompting","lessons":[{"slug":"builder-briefs-before-prompts","steps":[{"id":"prompting/builder-briefs-before-prompts/s01-01-intro"}]},{"slug":"the-prompt-craft","steps":[{"id":"prompting/the-prompt-craft/s01-01-intro"},{"id":"prompting/the-prompt-craft/s02-02-which-prompt-is-better"},{"id":"prompting/the-prompt-craft/s03-03-context-rot"},{"id":"prompting/the-prompt-craft/s04-04-predict-the-output"},{"id":"prompting/the-prompt-craft/s05-05-fill-the-constraint"},{"id":"prompting/the-prompt-craft/s06-06-fix-the-vague-prompt"},{"id":"prompting/the-prompt-craft/s07-07-fix-the-bloated-context"},{"id":"prompting/the-prompt-craft/s08-08-write-the-prompt"},{"id":"prompting/the-prompt-craft/s09-09-checkpoint"}]},{"slug":"few-shot-and-reasoning","steps":[{"id":"prompting/few-shot-and-reasoning/s01-01-intro"},{"id":"prompting/few-shot-and-reasoning/s02-02-when-few-shot-helps"},{"id":"prompting/few-shot-and-reasoning/s03-03-the-cot-trap"},{"id":"prompting/few-shot-and-reasoning/s04-04-predict-the-format-lock"},{"id":"prompting/few-shot-and-reasoning/s05-05-fill-the-example-format"},{"id":"prompting/few-shot-and-reasoning/s06-06-fix-the-cot-on-reasoning"},{"id":"prompting/few-shot-and-reasoning/s07-07-fix-the-format-mismatch"},{"id":"prompting/few-shot-and-reasoning/s08-08-write-the-prompt-builder"},{"id":"prompting/few-shot-and-reasoning/s09-09-checkpoint"}]},{"slug":"agent-config-files","steps":[{"id":"prompting/agent-config-files/s01-01-intro"},{"id":"prompting/agent-config-files/s02-02-which-belongs-in-the-file"},{"id":"prompting/agent-config-files/s03-03-anatomy-of-a-good-file"},{"id":"prompting/agent-config-files/s04-04-predict-which-rule-fires"},{"id":"prompting/agent-config-files/s05-05-fill-the-rule-block"},{"id":"prompting/agent-config-files/s06-06-fix-the-too-vague"},{"id":"prompting/agent-config-files/s07-07-fix-the-stale-pointer"},{"id":"prompting/agent-config-files/s08-08-write-a-claude-md"},{"id":"prompting/agent-config-files/s09-09-checkpoint"}]},{"slug":"what-aged-in-prompting","steps":[{"id":"prompting/what-aged-in-prompting/s01-01-prompting-as-a-moving-target"},{"id":"prompting/what-aged-in-prompting/s02-02-cot-then-vs-now"},{"id":"prompting/what-aged-in-prompting/s03-03-which-still-works"},{"id":"prompting/what-aged-in-prompting/s04-04-the-system-prompt-arms-race"},{"id":"prompting/what-aged-in-prompting/s05-05-spot-the-stale-technique"},{"id":"prompting/what-aged-in-prompting/s06-06-the-prompt-engineer-debate"},{"id":"prompting/what-aged-in-prompting/s07-07-write-the-cheatsheet-update"},{"id":"prompting/what-aged-in-prompting/s08-08-checkpoint"}]}]},{"slug":"agent-traces","lessons":[{"slug":"reading-the-trace","steps":[{"id":"agent-traces/reading-the-trace/s01-01-intro"},{"id":"agent-traces/reading-the-trace/s02-02-which-turn-failed"},{"id":"agent-traces/reading-the-trace/s03-03-the-stop-reasons"},{"id":"agent-traces/reading-the-trace/s04-04-predict-the-stop"},{"id":"agent-traces/reading-the-trace/s05-05-fill-the-tool-name"},{"id":"agent-traces/reading-the-trace/s06-06-fix-the-loop"},{"id":"agent-traces/reading-the-trace/s07-07-fix-the-arg-shape"},{"id":"agent-traces/reading-the-trace/s08-08-write-the-summarizer"},{"id":"agent-traces/reading-the-trace/s09-09-checkpoint"}]},{"slug":"trace-driven-debugging","steps":[{"id":"agent-traces/trace-driven-debugging/s01-01-intro"},{"id":"agent-traces/trace-driven-debugging/s02-02-which-pattern-is-this"},{"id":"agent-traces/trace-driven-debugging/s03-03-the-loop-signature"},{"id":"agent-traces/trace-driven-debugging/s04-04-predict-the-call-count"},{"id":"agent-traces/trace-driven-debugging/s05-05-fill-the-detector"},{"id":"agent-traces/trace-driven-debugging/s06-06-fix-the-tool-routing"},{"id":"agent-traces/trace-driven-debugging/s07-07-fix-the-prompt-bloat"},{"id":"agent-traces/trace-driven-debugging/s08-08-write-the-trace-summary"},{"id":"agent-traces/trace-driven-debugging/s09-09-checkpoint"}]},{"slug":"prediction-ids-and-inference-logs","steps":[{"id":"agent-traces/prediction-ids-and-inference-logs/s01-01-intro"},{"id":"agent-traces/prediction-ids-and-inference-logs/s02-02-choose-the-risk"},{"id":"agent-traces/prediction-ids-and-inference-logs/s03-03-predict-the-checks"},{"id":"agent-traces/prediction-ids-and-inference-logs/s04-04-fill-the-return"},{"id":"agent-traces/prediction-ids-and-inference-logs/s05-05-fix-the-overtrust"},{"id":"agent-traces/prediction-ids-and-inference-logs/s06-06-write-count-ready"},{"id":"agent-traces/prediction-ids-and-inference-logs/s07-07-checkpoint"}]}]},{"slug":"evals","lessons":[{"slug":"check-before-you-trust","steps":[{"id":"evals/check-before-you-trust/s01-01-intro"}]},{"slug":"writing-evals","steps":[{"id":"evals/writing-evals/s01-01-intro"},{"id":"evals/writing-evals/s02-02-which-eval-passes"},{"id":"evals/writing-evals/s03-03-eval-patterns"},{"id":"evals/writing-evals/s04-04-predict-the-pass-rate"},{"id":"evals/writing-evals/s05-05-fill-the-assertion"},{"id":"evals/writing-evals/s06-06-fix-the-flaky-eval"},{"id":"evals/writing-evals/s07-07-fix-the-overstrict-eval"},{"id":"evals/writing-evals/s08-08-write-the-suite"},{"id":"evals/writing-evals/s09-09-checkpoint"}]},{"slug":"llm-as-judge","steps":[{"id":"evals/llm-as-judge/s01-01-intro"},{"id":"evals/llm-as-judge/s02-02-pairwise-or-rubric"},{"id":"evals/llm-as-judge/s03-03-the-four-biases"},{"id":"evals/llm-as-judge/s04-04-predict-the-bias"},{"id":"evals/llm-as-judge/s05-05-fill-the-rubric-call"},{"id":"evals/llm-as-judge/s06-06-fix-the-likert-trap"},{"id":"evals/llm-as-judge/s07-07-fix-the-position-bias"},{"id":"evals/llm-as-judge/s08-08-write-the-rubric-judge"},{"id":"evals/llm-as-judge/s09-09-checkpoint"}]},{"slug":"the-rise-of-evals-as-a-discipline","steps":[{"id":"evals/the-rise-of-evals-as-a-discipline/s01-01-the-prompt-engineering-era"},{"id":"evals/the-rise-of-evals-as-a-discipline/s02-02-the-eval-turn"},{"id":"evals/the-rise-of-evals-as-a-discipline/s03-03-which-company-survives-a-model-swap"},{"id":"evals/the-rise-of-evals-as-a-discipline/s04-04-case-studies"},{"id":"evals/the-rise-of-evals-as-a-discipline/s05-05-eval-first-vs-prompt-first"},{"id":"evals/the-rise-of-evals-as-a-discipline/s06-06-why-the-eval-engineer-eats-the-prompt-engineer"},{"id":"evals/the-rise-of-evals-as-a-discipline/s07-07-write-the-eval-readiness-audit"},{"id":"evals/the-rise-of-evals-as-a-discipline/s08-08-checkpoint"}]},{"slug":"model-validation-gates","steps":[{"id":"evals/model-validation-gates/s01-01-intro"},{"id":"evals/model-validation-gates/s02-02-choose-the-risk"},{"id":"evals/model-validation-gates/s03-03-predict-the-checks"},{"id":"evals/model-validation-gates/s04-04-fill-the-return"},{"id":"evals/model-validation-gates/s05-05-fix-the-overtrust"},{"id":"evals/model-validation-gates/s06-06-write-count-ready"},{"id":"evals/model-validation-gates/s07-07-checkpoint"}]}]},{"slug":"context-and-retrieval","lessons":[{"slug":"chunking-that-respects-structure","steps":[{"id":"context-and-retrieval/chunking-that-respects-structure/s01-01-intro"},{"id":"context-and-retrieval/chunking-that-respects-structure/s02-02-which-content-shape-breaks"},{"id":"context-and-retrieval/chunking-that-respects-structure/s03-03-the-recursive-splitter"},{"id":"context-and-retrieval/chunking-that-respects-structure/s04-04-predict-the-shred"},{"id":"context-and-retrieval/chunking-that-respects-structure/s05-05-fill-the-separator-list"},{"id":"context-and-retrieval/chunking-that-respects-structure/s06-06-fix-the-mid-sentence-cut"},{"id":"context-and-retrieval/chunking-that-respects-structure/s07-07-fix-the-missing-overlap"},{"id":"context-and-retrieval/chunking-that-respects-structure/s08-08-write-the-recursive-splitter"},{"id":"context-and-retrieval/chunking-that-respects-structure/s09-09-checkpoint"}]},{"slug":"embedding-that-fits-the-budget","steps":[{"id":"context-and-retrieval/embedding-that-fits-the-budget/s01-01-intro"},{"id":"context-and-retrieval/embedding-that-fits-the-budget/s02-02-which-model-fits"},{"id":"context-and-retrieval/embedding-that-fits-the-budget/s03-03-the-vector-space"},{"id":"context-and-retrieval/embedding-that-fits-the-budget/s04-04-predict-the-closest"},{"id":"context-and-retrieval/embedding-that-fits-the-budget/s05-05-fill-the-cosine-formula"},{"id":"context-and-retrieval/embedding-that-fits-the-budget/s06-06-fix-the-wrong-embedding-model"},{"id":"context-and-retrieval/embedding-that-fits-the-budget/s07-07-fix-the-missing-normalization"},{"id":"context-and-retrieval/embedding-that-fits-the-budget/s08-08-write-the-similarity-ranker"},{"id":"context-and-retrieval/embedding-that-fits-the-budget/s09-09-checkpoint"}]},{"slug":"retrieval-that-finds-the-right-thing","steps":[{"id":"context-and-retrieval/retrieval-that-finds-the-right-thing/s01-01-intro"},{"id":"context-and-retrieval/retrieval-that-finds-the-right-thing/s02-02-which-result-is-good"},{"id":"context-and-retrieval/retrieval-that-finds-the-right-thing/s03-03-the-distance-threshold"},{"id":"context-and-retrieval/retrieval-that-finds-the-right-thing/s04-04-predict-the-top-k"},{"id":"context-and-retrieval/retrieval-that-finds-the-right-thing/s05-05-fill-the-cutoff"},{"id":"context-and-retrieval/retrieval-that-finds-the-right-thing/s06-06-fix-the-no-threshold"},{"id":"context-and-retrieval/retrieval-that-finds-the-right-thing/s07-07-fix-the-dup-results"},{"id":"context-and-retrieval/retrieval-that-finds-the-right-thing/s08-08-write-the-ranker"},{"id":"context-and-retrieval/retrieval-that-finds-the-right-thing/s09-09-checkpoint"}]},{"slug":"rag-vs-long-context-vs-fine-tune","steps":[{"id":"context-and-retrieval/rag-vs-long-context-vs-fine-tune/s01-01-the-three-way-fork"},{"id":"context-and-retrieval/rag-vs-long-context-vs-fine-tune/s02-02-rag-when-you-need-freshness"},{"id":"context-and-retrieval/rag-vs-long-context-vs-fine-tune/s03-03-which-fork-for-this-scenario"},{"id":"context-and-retrieval/rag-vs-long-context-vs-fine-tune/s04-04-long-context-when-the-corpus-fits"},{"id":"context-and-retrieval/rag-vs-long-context-vs-fine-tune/s05-05-fine-tune-when-style-or-format-is-the-product"},{"id":"context-and-retrieval/rag-vs-long-context-vs-fine-tune/s06-06-which-fork-dies-first"},{"id":"context-and-retrieval/rag-vs-long-context-vs-fine-tune/s07-07-the-rubric"},{"id":"context-and-retrieval/rag-vs-long-context-vs-fine-tune/s08-08-write-the-fork-decision"},{"id":"context-and-retrieval/rag-vs-long-context-vs-fine-tune/s09-09-checkpoint"}]},{"slug":"retrieval-metrics-and-vector-db-shape","steps":[{"id":"context-and-retrieval/retrieval-metrics-and-vector-db-shape/s01-01-intro"},{"id":"context-and-retrieval/retrieval-metrics-and-vector-db-shape/s02-02-choose-the-risk"},{"id":"context-and-retrieval/retrieval-metrics-and-vector-db-shape/s03-03-predict-the-checks"},{"id":"context-and-retrieval/retrieval-metrics-and-vector-db-shape/s04-04-fill-the-return"},{"id":"context-and-retrieval/retrieval-metrics-and-vector-db-shape/s05-05-fix-the-overtrust"},{"id":"context-and-retrieval/retrieval-metrics-and-vector-db-shape/s06-06-write-count-ready"},{"id":"context-and-retrieval/retrieval-metrics-and-vector-db-shape/s07-07-checkpoint"}]}]},{"slug":"production-tradeoffs","lessons":[{"slug":"prompt-caching-correctly","steps":[{"id":"production-tradeoffs/prompt-caching-correctly/s01-01-intro"},{"id":"production-tradeoffs/prompt-caching-correctly/s02-02-which-piece-caches"},{"id":"production-tradeoffs/prompt-caching-correctly/s03-03-the-cache-breakpoints"},{"id":"production-tradeoffs/prompt-caching-correctly/s04-04-predict-the-savings"},{"id":"production-tradeoffs/prompt-caching-correctly/s05-05-fill-the-cache-control"},{"id":"production-tradeoffs/prompt-caching-correctly/s06-06-fix-the-cache-buster"},{"id":"production-tradeoffs/prompt-caching-correctly/s07-07-fix-the-ttl-default"},{"id":"production-tradeoffs/prompt-caching-correctly/s08-08-write-the-cost-estimator"},{"id":"production-tradeoffs/prompt-caching-correctly/s09-09-checkpoint"}]},{"slug":"read-the-token-bill","steps":[{"id":"production-tradeoffs/read-the-token-bill/s01-01-intro"},{"id":"production-tradeoffs/read-the-token-bill/s02-02-which-prompt-costs-more"},{"id":"production-tradeoffs/read-the-token-bill/s03-03-the-shape-of-a-bill"},{"id":"production-tradeoffs/read-the-token-bill/s04-04-predict-the-monthly-spend"},{"id":"production-tradeoffs/read-the-token-bill/s05-05-fill-the-cost-formula"},{"id":"production-tradeoffs/read-the-token-bill/s06-06-fix-the-pricing-ratio-error"},{"id":"production-tradeoffs/read-the-token-bill/s07-07-fix-the-units-error"},{"id":"production-tradeoffs/read-the-token-bill/s08-08-write-the-cost-estimator"},{"id":"production-tradeoffs/read-the-token-bill/s09-09-checkpoint"}]},{"slug":"the-model-price-war","steps":[{"id":"production-tradeoffs/the-model-price-war/s01-01-the-price-curve"},{"id":"production-tradeoffs/the-model-price-war/s02-02-three-bets-and-how-they-aged"},{"id":"production-tradeoffs/the-model-price-war/s03-03-which-startup-survived"},{"id":"production-tradeoffs/the-model-price-war/s04-04-the-incumbent-trap"},{"id":"production-tradeoffs/the-model-price-war/s05-05-features-now-viable"},{"id":"production-tradeoffs/the-model-price-war/s06-06-pricing-for-a-war-you-cant-predict"},{"id":"production-tradeoffs/the-model-price-war/s07-07-write-the-cost-model"},{"id":"production-tradeoffs/the-model-price-war/s08-08-checkpoint"}]},{"slug":"batch-realtime-costs","steps":[{"id":"production-tradeoffs/batch-realtime-costs/s01-01-intro"},{"id":"production-tradeoffs/batch-realtime-costs/s02-02-choose-the-risk"},{"id":"production-tradeoffs/batch-realtime-costs/s03-03-predict-the-checks"},{"id":"production-tradeoffs/batch-realtime-costs/s04-04-fill-the-return"},{"id":"production-tradeoffs/batch-realtime-costs/s05-05-fix-the-overtrust"},{"id":"production-tradeoffs/batch-realtime-costs/s06-06-write-routing-brief"},{"id":"production-tradeoffs/batch-realtime-costs/s07-07-checkpoint"}]}]},{"slug":"debugging-output","lessons":[{"slug":"read-the-trace-not-the-chat","steps":[{"id":"debugging-output/read-the-trace-not-the-chat/s01-01-intro"},{"id":"debugging-output/read-the-trace-not-the-chat/s02-02-where-to-look-first"},{"id":"debugging-output/read-the-trace-not-the-chat/s03-03-the-four-failure-classes"},{"id":"debugging-output/read-the-trace-not-the-chat/s04-04-predict-the-bad-turn"},{"id":"debugging-output/read-the-trace-not-the-chat/s05-05-fill-the-find-bad-turn"},{"id":"debugging-output/read-the-trace-not-the-chat/s06-06-fix-the-fix-the-symptom"},{"id":"debugging-output/read-the-trace-not-the-chat/s07-07-fix-the-blame-the-model"},{"id":"debugging-output/read-the-trace-not-the-chat/s08-08-write-the-trace-summary"},{"id":"debugging-output/read-the-trace-not-the-chat/s09-09-checkpoint"}]},{"slug":"the-four-breakage-classes","steps":[{"id":"debugging-output/the-four-breakage-classes/s01-01-intro"},{"id":"debugging-output/the-four-breakage-classes/s02-02-classify-the-refund-bug"},{"id":"debugging-output/the-four-breakage-classes/s03-03-the-trace-anatomy"},{"id":"debugging-output/the-four-breakage-classes/s04-04-predict-the-class"},{"id":"debugging-output/the-four-breakage-classes/s05-05-fill-the-class-name"},{"id":"debugging-output/the-four-breakage-classes/s06-06-fix-the-misclassification"},{"id":"debugging-output/the-four-breakage-classes/s07-07-fix-the-parse-mangle"},{"id":"debugging-output/the-four-breakage-classes/s08-08-write-the-classifier"},{"id":"debugging-output/the-four-breakage-classes/s09-09-checkpoint"}]},{"slug":"five-real-postmortems","steps":[{"id":"debugging-output/five-real-postmortems/s01-01-the-postmortem-template"},{"id":"debugging-output/five-real-postmortems/s02-02-air-canada-refund-lawsuit"},{"id":"debugging-output/five-real-postmortems/s03-03-dpd-and-nyc-mycity"},{"id":"debugging-output/five-real-postmortems/s04-04-classify-the-five"},{"id":"debugging-output/five-real-postmortems/s05-05-recruiter-json-mangle"},{"id":"debugging-output/five-real-postmortems/s06-06-glean-style-retrieval-miss"},{"id":"debugging-output/five-real-postmortems/s07-07-which-fix-catches-the-most"},{"id":"debugging-output/five-real-postmortems/s08-08-write-the-postmortem"},{"id":"debugging-output/five-real-postmortems/s09-09-checkpoint"}]},{"slug":"prediction-failure-triage","steps":[{"id":"debugging-output/prediction-failure-triage/s01-01-intro"},{"id":"debugging-output/prediction-failure-triage/s02-02-choose-the-risk"},{"id":"debugging-output/prediction-failure-triage/s03-03-predict-the-checks"},{"id":"debugging-output/prediction-failure-triage/s04-04-fill-the-return"},{"id":"debugging-output/prediction-failure-triage/s05-05-fix-the-overtrust"},{"id":"debugging-output/prediction-failure-triage/s06-06-write-triage-receipt"},{"id":"debugging-output/prediction-failure-triage/s07-07-checkpoint"}]}]},{"slug":"capstone","lessons":[{"slug":"pick-a-project-that-ships","steps":[{"id":"capstone/pick-a-project-that-ships/s01-01-the-builders-graveyard"},{"id":"capstone/pick-a-project-that-ships/s02-02-which-project-survives-contact-with-reality"},{"id":"capstone/pick-a-project-that-ships/s03-03-the-three-shapes-that-ship"},{"id":"capstone/pick-a-project-that-ships/s04-04-the-wedge-checklist"},{"id":"capstone/pick-a-project-that-ships/s05-05-spot-the-missing-wedge"},{"id":"capstone/pick-a-project-that-ships/s06-06-scope-down-the-capstone"},{"id":"capstone/pick-a-project-that-ships/s07-07-checkpoint"}]},{"slug":"build-a-cli-agent","steps":[{"id":"capstone/build-a-cli-agent/s01-01-intro"},{"id":"capstone/build-a-cli-agent/s02-02-which-piece-is-missing"},{"id":"capstone/build-a-cli-agent/s03-03-the-tool-loop"},{"id":"capstone/build-a-cli-agent/s04-04-predict-the-final-answer"},{"id":"capstone/build-a-cli-agent/s05-05-fill-the-stop-check"},{"id":"capstone/build-a-cli-agent/s06-06-fix-the-tool-dispatch"},{"id":"capstone/build-a-cli-agent/s07-07-fix-the-message-append"},{"id":"capstone/build-a-cli-agent/s08-08-the-logging-pattern"},{"id":"capstone/build-a-cli-agent/s09-09-write-the-tool-runner"},{"id":"capstone/build-a-cli-agent/s10-10-fix-the-loop-exit"},{"id":"capstone/build-a-cli-agent/s11-11-write-the-agent"},{"id":"capstone/build-a-cli-agent/s12-12-checkpoint"}]},{"slug":"wire-the-real-model","steps":[{"id":"capstone/wire-the-real-model/s01-01-intro"},{"id":"capstone/wire-the-real-model/s02-02-where-the-key-lives"},{"id":"capstone/wire-the-real-model/s03-03-the-sdk-response-shape"},{"id":"capstone/wire-the-real-model/s04-04-predict-the-block-type"},{"id":"capstone/wire-the-real-model/s05-05-fill-the-key-load"},{"id":"capstone/wire-the-real-model/s06-06-fix-the-hardcoded-key"},{"id":"capstone/wire-the-real-model/s07-07-fix-the-block-access"},{"id":"capstone/wire-the-real-model/s08-08-write-the-shape-adapter"},{"id":"capstone/wire-the-real-model/s09-09-checkpoint"}]},{"slug":"validate-tool-inputs","steps":[{"id":"capstone/validate-tool-inputs/s01-01-intro"},{"id":"capstone/validate-tool-inputs/s02-02-which-input-needs-validation"},{"id":"capstone/validate-tool-inputs/s03-03-the-validator-surface"},{"id":"capstone/validate-tool-inputs/s04-04-predict-the-error"},{"id":"capstone/validate-tool-inputs/s05-05-fill-the-validate-call"},{"id":"capstone/validate-tool-inputs/s06-06-fix-the-untyped-tool"},{"id":"capstone/validate-tool-inputs/s07-07-fix-the-error-leak"},{"id":"capstone/validate-tool-inputs/s08-08-write-the-validator"},{"id":"capstone/validate-tool-inputs/s09-09-checkpoint"}]},{"slug":"add-evals-and-traces","steps":[{"id":"capstone/add-evals-and-traces/s01-01-intro"},{"id":"capstone/add-evals-and-traces/s02-02-which-eval-fits"},{"id":"capstone/add-evals-and-traces/s03-03-the-trace-shape"},{"id":"capstone/add-evals-and-traces/s04-04-predict-the-pass-rate"},{"id":"capstone/add-evals-and-traces/s05-05-fill-the-trace-append"},{"id":"capstone/add-evals-and-traces/s06-06-fix-the-trace-mutation"},{"id":"capstone/add-evals-and-traces/s07-07-fix-the-tautological-eval"},{"id":"capstone/add-evals-and-traces/s08-08-write-the-eval-runner"},{"id":"capstone/add-evals-and-traces/s09-09-checkpoint"}]},{"slug":"wire-an-mcp-tool","steps":[{"id":"capstone/wire-an-mcp-tool/s01-01-intro"},{"id":"capstone/wire-an-mcp-tool/s02-02-when-to-use-mcp"},{"id":"capstone/wire-an-mcp-tool/s03-03-the-mcp-protocol"},{"id":"capstone/wire-an-mcp-tool/s04-04-predict-the-tool-list"},{"id":"capstone/wire-an-mcp-tool/s05-05-fill-the-tools-call"},{"id":"capstone/wire-an-mcp-tool/s06-06-fix-the-result-shape"},{"id":"capstone/wire-an-mcp-tool/s07-07-fix-the-iserror-flag"},{"id":"capstone/wire-an-mcp-tool/s08-08-write-the-mcp-bridge"},{"id":"capstone/wire-an-mcp-tool/s09-09-checkpoint"}]},{"slug":"portfolio-ml-system-option","steps":[{"id":"capstone/portfolio-ml-system-option/s01-01-intro"},{"id":"capstone/portfolio-ml-system-option/s02-02-choose-the-risk"},{"id":"capstone/portfolio-ml-system-option/s03-03-predict-the-checks"},{"id":"capstone/portfolio-ml-system-option/s04-04-fill-the-return"},{"id":"capstone/portfolio-ml-system-option/s05-05-fix-the-overtrust"},{"id":"capstone/portfolio-ml-system-option/s06-06-write-count-ready"},{"id":"capstone/portfolio-ml-system-option/s07-07-checkpoint"}]}]},{"slug":"agent-harnesses","lessons":[{"slug":"project-workspaces-not-product-names","steps":[{"id":"agent-harnesses/project-workspaces-not-product-names/s01-01-intro"}]},{"slug":"what-a-harness-is","steps":[{"id":"agent-harnesses/what-a-harness-is/s01-01-intro"},{"id":"agent-harnesses/what-a-harness-is/s02-02-which-is-a-harness"},{"id":"agent-harnesses/what-a-harness-is/s03-03-the-four-layers"},{"id":"agent-harnesses/what-a-harness-is/s04-04-predict-the-layer"},{"id":"agent-harnesses/what-a-harness-is/s05-05-fill-the-dispatch"},{"id":"agent-harnesses/what-a-harness-is/s06-06-fix-the-no-retry"},{"id":"agent-harnesses/what-a-harness-is/s07-07-fix-the-hardcoded-registry"},{"id":"agent-harnesses/what-a-harness-is/s08-08-write-mini-harness"},{"id":"agent-harnesses/what-a-harness-is/s09-09-checkpoint"}]},{"slug":"architecting-an-ai-native-workflow","steps":[{"id":"agent-harnesses/architecting-an-ai-native-workflow/s01-01-intro"},{"id":"agent-harnesses/architecting-an-ai-native-workflow/s02-02-which-workflow-fits"},{"id":"agent-harnesses/architecting-an-ai-native-workflow/s03-03-mapping-the-workflow"},{"id":"agent-harnesses/architecting-an-ai-native-workflow/s04-04-predict-the-readiness-score"},{"id":"agent-harnesses/architecting-an-ai-native-workflow/s05-05-fill-the-required-fields"},{"id":"agent-harnesses/architecting-an-ai-native-workflow/s06-06-fix-the-implicit-policy"},{"id":"agent-harnesses/architecting-an-ai-native-workflow/s07-07-fix-the-wrong-metric"},{"id":"agent-harnesses/architecting-an-ai-native-workflow/s08-08-write-the-workflow-validator"},{"id":"agent-harnesses/architecting-an-ai-native-workflow/s09-09-checkpoint"}]},{"slug":"five-industries-walked-through","steps":[{"id":"agent-harnesses/five-industries-walked-through/s01-01-the-case-study-method"},{"id":"agent-harnesses/five-industries-walked-through/s02-02-home-services"},{"id":"agent-harnesses/five-industries-walked-through/s03-03-which-task-belongs-to-the-agent"},{"id":"agent-harnesses/five-industries-walked-through/s04-04-insurance-brokerage"},{"id":"agent-harnesses/five-industries-walked-through/s05-05-recruiting-firm"},{"id":"agent-harnesses/five-industries-walked-through/s06-06-which-industry-wedges-fastest"},{"id":"agent-harnesses/five-industries-walked-through/s07-07-the-incumbent-trap"},{"id":"agent-harnesses/five-industries-walked-through/s08-08-write-the-process-debt-audit"},{"id":"agent-harnesses/five-industries-walked-through/s09-09-checkpoint"}]}]},{"slug":"ai-image-generation","lessons":[{"slug":"the-image-model-landscape","steps":[{"id":"ai-image-generation/the-image-model-landscape/s01-01-the-six-families"},{"id":"ai-image-generation/the-image-model-landscape/s02-02-flux-vs-midjourney"},{"id":"ai-image-generation/the-image-model-landscape/s03-03-pick-the-model"},{"id":"ai-image-generation/the-image-model-landscape/s04-04-nano-banana-and-the-batch-economics"},{"id":"ai-image-generation/the-image-model-landscape/s05-05-text-in-image"},{"id":"ai-image-generation/the-image-model-landscape/s06-06-which-model-for-text"},{"id":"ai-image-generation/the-image-model-landscape/s07-07-the-decision-tree"},{"id":"ai-image-generation/the-image-model-landscape/s08-08-write-pick-image-model"},{"id":"ai-image-generation/the-image-model-landscape/s09-09-checkpoint"}]},{"slug":"prompting-for-real-output","steps":[{"id":"ai-image-generation/prompting-for-real-output/s01-01-why-the-girl-with-red-hair-fails"},{"id":"ai-image-generation/prompting-for-real-output/s02-02-the-eight-knobs"},{"id":"ai-image-generation/prompting-for-real-output/s03-03-fix-the-vague-prompt"},{"id":"ai-image-generation/prompting-for-real-output/s04-04-controlnet-and-reference"},{"id":"ai-image-generation/prompting-for-real-output/s05-05-which-knob-is-missing"},{"id":"ai-image-generation/prompting-for-real-output/s06-06-failure-modes"},{"id":"ai-image-generation/prompting-for-real-output/s07-07-write-score-image-prompt"},{"id":"ai-image-generation/prompting-for-real-output/s08-08-checkpoint"}]},{"slug":"the-image-pipeline","steps":[{"id":"ai-image-generation/the-image-pipeline/s01-01-the-six-stages"},{"id":"ai-image-generation/the-image-pipeline/s02-02-batch-and-filter"},{"id":"ai-image-generation/the-image-pipeline/s03-03-which-stage-is-broken"},{"id":"ai-image-generation/the-image-pipeline/s04-04-format-and-platform"},{"id":"ai-image-generation/the-image-pipeline/s05-05-fill-the-pipeline"},{"id":"ai-image-generation/the-image-pipeline/s06-06-write-cost-for-batch"},{"id":"ai-image-generation/the-image-pipeline/s07-07-checkpoint"}]}]},{"slug":"ai-video-generation","lessons":[{"slug":"the-video-model-lineup","steps":[{"id":"ai-video-generation/the-video-model-lineup/s01-01-the-landscape"},{"id":"ai-video-generation/the-video-model-lineup/s02-02-sora-and-veo"},{"id":"ai-video-generation/the-video-model-lineup/s03-03-higgsfield-and-camera-control"},{"id":"ai-video-generation/the-video-model-lineup/s04-04-the-second-wave"},{"id":"ai-video-generation/the-video-model-lineup/s05-05-text-vs-image-vs-video"},{"id":"ai-video-generation/the-video-model-lineup/s06-06-which-lane"},{"id":"ai-video-generation/the-video-model-lineup/s07-07-pick-the-model"},{"id":"ai-video-generation/the-video-model-lineup/s08-08-write-pick-video-model"},{"id":"ai-video-generation/the-video-model-lineup/s09-09-checkpoint"}]},{"slug":"camera-control-and-motion","steps":[{"id":"ai-video-generation/camera-control-and-motion/s01-01-the-static-camera-problem"},{"id":"ai-video-generation/camera-control-and-motion/s02-02-shot-vocabulary"},{"id":"ai-video-generation/camera-control-and-motion/s03-03-which-shot-is-which"},{"id":"ai-video-generation/camera-control-and-motion/s04-04-camera-moves"},{"id":"ai-video-generation/camera-control-and-motion/s05-05-the-i2v-keyframe-pattern"},{"id":"ai-video-generation/camera-control-and-motion/s06-06-spot-the-generic-prompt"},{"id":"ai-video-generation/camera-control-and-motion/s07-07-write-audit-shot-list"},{"id":"ai-video-generation/camera-control-and-motion/s08-08-checkpoint"}]},{"slug":"the-cost-math","steps":[{"id":"ai-video-generation/the-cost-math/s01-01-the-unit-economics"},{"id":"ai-video-generation/the-cost-math/s02-02-the-retake-rate-reality"},{"id":"ai-video-generation/the-cost-math/s03-03-which-tier-for-this-budget"},{"id":"ai-video-generation/the-cost-math/s04-04-the-60-second-promo"},{"id":"ai-video-generation/the-cost-math/s05-05-write-cost-per-minute"},{"id":"ai-video-generation/the-cost-math/s06-06-checkpoint"}]}]},{"slug":"programmatic-design","lessons":[{"slug":"why-programmatic","steps":[{"id":"programmatic-design/why-programmatic/s01-01-intro"},{"id":"programmatic-design/why-programmatic/s02-02-the-three-jobs"},{"id":"programmatic-design/why-programmatic/s03-03-which-task-is-code"},{"id":"programmatic-design/why-programmatic/s04-04-parametric-vs-one-shot"},{"id":"programmatic-design/why-programmatic/s05-05-the-classifier"},{"id":"programmatic-design/why-programmatic/s06-06-write-classify-video-task"},{"id":"programmatic-design/why-programmatic/s07-07-checkpoint"}]},{"slug":"hyperframes-and-remotion","steps":[{"id":"programmatic-design/hyperframes-and-remotion/s01-01-the-two-tools"},{"id":"programmatic-design/hyperframes-and-remotion/s02-02-hyperframes-tour"},{"id":"programmatic-design/hyperframes-and-remotion/s03-03-hyperframes-cli"},{"id":"programmatic-design/hyperframes-and-remotion/s04-04-remotion-tour"},{"id":"programmatic-design/hyperframes-and-remotion/s05-05-the-decision-tree"},{"id":"programmatic-design/hyperframes-and-remotion/s06-06-pick-the-tool"},{"id":"programmatic-design/hyperframes-and-remotion/s07-07-claude-design-html"},{"id":"programmatic-design/hyperframes-and-remotion/s08-08-write-pick-tool"},{"id":"programmatic-design/hyperframes-and-remotion/s09-09-checkpoint"}]},{"slug":"the-ai-native-design-pipeline","steps":[{"id":"programmatic-design/the-ai-native-design-pipeline/s01-01-the-seven-steps"},{"id":"programmatic-design/the-ai-native-design-pipeline/s02-02-where-ai-fits-each-step"},{"id":"programmatic-design/the-ai-native-design-pipeline/s03-03-which-model-which-step"},{"id":"programmatic-design/the-ai-native-design-pipeline/s04-04-website-to-hyperframes"},{"id":"programmatic-design/the-ai-native-design-pipeline/s05-05-the-cost-model"},{"id":"programmatic-design/the-ai-native-design-pipeline/s06-06-score-the-pipeline"},{"id":"programmatic-design/the-ai-native-design-pipeline/s07-07-write-score-pipeline"},{"id":"programmatic-design/the-ai-native-design-pipeline/s08-08-checkpoint"}]}]},{"slug":"harness-engineering","lessons":[{"slug":"the-harness-engineering-mindset","steps":[{"id":"harness-engineering/the-harness-engineering-mindset/s01-01-the-model-is-only-one-input"},{"id":"harness-engineering/the-harness-engineering-mindset/s02-02-the-six-pieces-of-a-harness"},{"id":"harness-engineering/the-harness-engineering-mindset/s03-03-which-piece-is-missing"},{"id":"harness-engineering/the-harness-engineering-mindset/s04-04-the-skill-issue-reframe"},{"id":"harness-engineering/the-harness-engineering-mindset/s05-05-pick-the-fix"},{"id":"harness-engineering/the-harness-engineering-mindset/s06-06-the-harness-gap"},{"id":"harness-engineering/the-harness-engineering-mindset/s07-07-write-the-harness-inventory"},{"id":"harness-engineering/the-harness-engineering-mindset/s08-08-checkpoint"}]},{"slug":"the-ratchet","steps":[{"id":"harness-engineering/the-ratchet/s01-01-failures-are-signals-not-flukes"},{"id":"harness-engineering/the-ratchet/s02-02-every-line-traces-to-a-failure"},{"id":"harness-engineering/the-ratchet/s03-03-which-rule-earned-its-line"},{"id":"harness-engineering/the-ratchet/s04-04-working-backwards-from-behavior"},{"id":"harness-engineering/the-ratchet/s05-05-the-ratchet-log"},{"id":"harness-engineering/the-ratchet/s06-06-spot-the-cruft"},{"id":"harness-engineering/the-ratchet/s07-07-write-the-ratchet-update"},{"id":"harness-engineering/the-ratchet/s08-08-audit-the-ratchet"},{"id":"harness-engineering/the-ratchet/s09-09-checkpoint"}]},{"slug":"context-engineering","steps":[{"id":"harness-engineering/context-engineering/s01-01-the-context-rot"},{"id":"harness-engineering/context-engineering/s02-02-compaction"},{"id":"harness-engineering/context-engineering/s03-03-tool-call-offloading"},{"id":"harness-engineering/context-engineering/s04-04-progressive-disclosure"},{"id":"harness-engineering/context-engineering/s05-05-which-technique-fits"},{"id":"harness-engineering/context-engineering/s06-06-the-memory-hierarchy"},{"id":"harness-engineering/context-engineering/s07-07-classify-the-data"},{"id":"harness-engineering/context-engineering/s08-08-write-the-context-budget"},{"id":"harness-engineering/context-engineering/s09-09-checkpoint"}]},{"slug":"long-horizon-execution","steps":[{"id":"harness-engineering/long-horizon-execution/s01-01-the-40-step-problem"},{"id":"harness-engineering/long-horizon-execution/s02-02-loops-intercepting-early-stops"},{"id":"harness-engineering/long-horizon-execution/s03-03-planning-as-a-separate-step"},{"id":"harness-engineering/long-horizon-execution/s04-04-splits-generation-vs-evaluation"},{"id":"harness-engineering/long-horizon-execution/s05-05-which-pattern-fixes-the-failure"},{"id":"harness-engineering/long-horizon-execution/s06-06-hooks-as-the-enforcement-layer"},{"id":"harness-engineering/long-horizon-execution/s07-07-the-type-check-backpressure-pattern"},{"id":"harness-engineering/long-horizon-execution/s08-08-write-the-hook-router"},{"id":"harness-engineering/long-horizon-execution/s09-09-write-the-completion-guard"},{"id":"harness-engineering/long-horizon-execution/s10-10-checkpoint"}]},{"slug":"the-haas-shift","steps":[{"id":"harness-engineering/the-haas-shift/s01-01-llm-apis-to-harness-apis"},{"id":"harness-engineering/the-haas-shift/s02-02-the-convergence"},{"id":"harness-engineering/the-haas-shift/s03-03-which-haas-fits"},{"id":"harness-engineering/the-haas-shift/s04-04-harnesses-dont-shrink-they-move"},{"id":"harness-engineering/the-haas-shift/s05-05-the-overfitting-feedback-loop"},{"id":"harness-engineering/the-haas-shift/s06-06-mcp-security-as-prompt-injection"},{"id":"harness-engineering/the-haas-shift/s07-07-which-component-is-debt"},{"id":"harness-engineering/the-haas-shift/s08-08-write-the-build-vs-buy"},{"id":"harness-engineering/the-haas-shift/s09-09-checkpoint"}]},{"slug":"pipeline-boundaries","steps":[{"id":"harness-engineering/pipeline-boundaries/s01-01-intro"},{"id":"harness-engineering/pipeline-boundaries/s02-02-choose-the-risk"},{"id":"harness-engineering/pipeline-boundaries/s03-03-predict-the-checks"},{"id":"harness-engineering/pipeline-boundaries/s04-04-fill-the-return"},{"id":"harness-engineering/pipeline-boundaries/s05-05-fix-the-overtrust"},{"id":"harness-engineering/pipeline-boundaries/s06-06-write-count-ready"},{"id":"harness-engineering/pipeline-boundaries/s07-07-checkpoint"}]}]},{"slug":"intro-to-terminal","lessons":[{"slug":"what-the-terminal-is","steps":[{"id":"intro-to-terminal/what-the-terminal-is/s01-01-not-a-cockpit"},{"id":"intro-to-terminal/what-the-terminal-is/s02-02-what-it-actually-is"},{"id":"intro-to-terminal/what-the-terminal-is/s03-03-open-it"},{"id":"intro-to-terminal/what-the-terminal-is/s04-04-which-app"},{"id":"intro-to-terminal/what-the-terminal-is/s05-05-the-prompt"},{"id":"intro-to-terminal/what-the-terminal-is/s06-06-what-a-command-is"},{"id":"intro-to-terminal/what-the-terminal-is/s07-07-checkpoint"}]},{"slug":"moving-around","steps":[{"id":"intro-to-terminal/moving-around/s01-01-the-filesystem-is-a-tree"},{"id":"intro-to-terminal/moving-around/s02-02-pwd-and-ls"},{"id":"intro-to-terminal/moving-around/s03-03-where-am-i"},{"id":"intro-to-terminal/moving-around/s04-04-read-this-listing"},{"id":"intro-to-terminal/moving-around/s05-05-cd"},{"id":"intro-to-terminal/moving-around/s06-06-follow-the-moves"},{"id":"intro-to-terminal/moving-around/s07-07-pick-the-command"},{"id":"intro-to-terminal/moving-around/s08-08-checkpoint"}]},{"slug":"making-things","steps":[{"id":"intro-to-terminal/making-things/s01-01-mkdir-and-touch"},{"id":"intro-to-terminal/making-things/s02-02-folder-or-file"},{"id":"intro-to-terminal/making-things/s03-03-cat-and-echo"},{"id":"intro-to-terminal/making-things/s04-04-reading-a-file"},{"id":"intro-to-terminal/making-things/s05-05-rm-carefully"},{"id":"intro-to-terminal/making-things/s06-06-the-no-undo-rule"},{"id":"intro-to-terminal/making-things/s07-07-string-it-together"},{"id":"intro-to-terminal/making-things/s08-08-checkpoint"}]}]},{"slug":"intro-to-claude-cli","lessons":[{"slug":"what-the-claude-cli-is","steps":[{"id":"intro-to-claude-cli/what-the-claude-cli-is/s01-01-the-glass-is-gone"},{"id":"intro-to-claude-cli/what-the-claude-cli-is/s02-02-chat-vs-cli"},{"id":"intro-to-claude-cli/what-the-claude-cli-is/s03-03-which-tool"},{"id":"intro-to-claude-cli/what-the-claude-cli-is/s04-04-the-agent-loop"},{"id":"intro-to-claude-cli/what-the-claude-cli-is/s05-05-read-the-loop"},{"id":"intro-to-claude-cli/what-the-claude-cli-is/s06-06-permission-and-limits"},{"id":"intro-to-claude-cli/what-the-claude-cli-is/s07-07-checkpoint"}]},{"slug":"install-and-sign-in","steps":[{"id":"intro-to-claude-cli/install-and-sign-in/s01-01-two-ways-to-install"},{"id":"intro-to-claude-cli/install-and-sign-in/s02-02-run-the-installer"},{"id":"intro-to-claude-cli/install-and-sign-in/s03-03-install-check"},{"id":"intro-to-claude-cli/install-and-sign-in/s04-04-sign-in"},{"id":"intro-to-claude-cli/install-and-sign-in/s05-05-what-sign-in-did"},{"id":"intro-to-claude-cli/install-and-sign-in/s06-06-auth-check"},{"id":"intro-to-claude-cli/install-and-sign-in/s07-07-account-not-api-key"},{"id":"intro-to-claude-cli/install-and-sign-in/s08-08-checkpoint"}]},{"slug":"your-first-session","steps":[{"id":"intro-to-claude-cli/your-first-session/s01-01-start-in-a-folder"},{"id":"intro-to-claude-cli/your-first-session/s02-02-your-first-instruction"},{"id":"intro-to-claude-cli/your-first-session/s03-03-watch-the-loop"},{"id":"intro-to-claude-cli/your-first-session/s04-04-read-what-it-did"},{"id":"intro-to-claude-cli/your-first-session/s05-05-one-shot-or-interactive"},{"id":"intro-to-claude-cli/your-first-session/s06-06-chat-or-cli-decision"},{"id":"intro-to-claude-cli/your-first-session/s07-07-where-this-goes"},{"id":"intro-to-claude-cli/your-first-session/s08-08-checkpoint"}]}]},{"slug":"intro-to-codex-cli","lessons":[{"slug":"another-tool-same-idea","steps":[{"id":"intro-to-codex-cli/another-tool-same-idea/s01-01-why-a-second-tool"},{"id":"intro-to-codex-cli/another-tool-same-idea/s02-02-what-carries-over"},{"id":"intro-to-codex-cli/another-tool-same-idea/s03-03-what-carries-over-check"},{"id":"intro-to-codex-cli/another-tool-same-idea/s04-04-what-is-different"},{"id":"intro-to-codex-cli/another-tool-same-idea/s05-05-spot-the-difference"},{"id":"intro-to-codex-cli/another-tool-same-idea/s06-06-when-to-reach-for-which"},{"id":"intro-to-codex-cli/another-tool-same-idea/s07-07-checkpoint"}]},{"slug":"install-and-sign-in","steps":[{"id":"intro-to-codex-cli/install-and-sign-in/s01-01-the-install-paths"},{"id":"intro-to-codex-cli/install-and-sign-in/s02-02-installing-it"},{"id":"intro-to-codex-cli/install-and-sign-in/s03-03-verify-it"},{"id":"intro-to-codex-cli/install-and-sign-in/s04-04-install-check"},{"id":"intro-to-codex-cli/install-and-sign-in/s05-05-sign-in"},{"id":"intro-to-codex-cli/install-and-sign-in/s06-06-account-vs-api-key"},{"id":"intro-to-codex-cli/install-and-sign-in/s07-07-auth-check"},{"id":"intro-to-codex-cli/install-and-sign-in/s08-08-checkpoint"}]},{"slug":"your-first-session","steps":[{"id":"intro-to-codex-cli/your-first-session/s01-01-start-and-instruct"},{"id":"intro-to-codex-cli/your-first-session/s02-02-the-loop-again"},{"id":"intro-to-codex-cli/your-first-session/s03-03-read-what-it-did"},{"id":"intro-to-codex-cli/your-first-session/s04-04-usage-patterns"},{"id":"intro-to-codex-cli/your-first-session/s05-05-usage-scenario"},{"id":"intro-to-codex-cli/your-first-session/s06-06-running-both"},{"id":"intro-to-codex-cli/your-first-session/s07-07-where-this-goes"},{"id":"intro-to-codex-cli/your-first-session/s08-08-checkpoint"}]}]},{"slug":"team-skills","lessons":[{"slug":"what-a-skill-is","steps":[{"id":"team-skills/what-a-skill-is/s01-01-the-re-explaining-problem"},{"id":"team-skills/what-a-skill-is/s02-02-a-skill-defined"},{"id":"team-skills/what-a-skill-is/s03-03-a-real-skill"},{"id":"team-skills/what-a-skill-is/s04-04-what-a-skill-is-check"},{"id":"team-skills/what-a-skill-is/s05-05-skill-vs-prompt-vs-agent"},{"id":"team-skills/what-a-skill-is/s06-06-skill-or-agent"},{"id":"team-skills/what-a-skill-is/s07-07-checkpoint"}]},{"slug":"skills-as-a-team-thing","steps":[{"id":"team-skills/skills-as-a-team-thing/s01-01-the-team-problem"},{"id":"team-skills/skills-as-a-team-thing/s02-02-org-wide-skills"},{"id":"team-skills/skills-as-a-team-thing/s03-03-sharing-and-the-directory"},{"id":"team-skills/skills-as-a-team-thing/s04-04-deployment-check"},{"id":"team-skills/skills-as-a-team-thing/s05-05-collaborating-through-skills"},{"id":"team-skills/skills-as-a-team-thing/s06-06-sharing-scenario"},{"id":"team-skills/skills-as-a-team-thing/s07-07-checkpoint"}]},{"slug":"role-playbooks","steps":[{"id":"team-skills/role-playbooks/s01-01-hr-skills"},{"id":"team-skills/role-playbooks/s02-02-legal-skills"},{"id":"team-skills/role-playbooks/s03-03-operations-skills"},{"id":"team-skills/role-playbooks/s04-04-match-the-skill"},{"id":"team-skills/role-playbooks/s05-05-skill-or-one-off"},{"id":"team-skills/role-playbooks/s06-06-skill-or-one-off-check"},{"id":"team-skills/role-playbooks/s07-07-checkpoint"}]},{"slug":"governance-and-trust","steps":[{"id":"team-skills/governance-and-trust/s01-01-a-skill-can-do-real-things"},{"id":"team-skills/governance-and-trust/s02-02-treat-it-like-software"},{"id":"team-skills/governance-and-trust/s03-03-evaluate-before-you-deploy"},{"id":"team-skills/governance-and-trust/s04-04-access-control"},{"id":"team-skills/governance-and-trust/s05-05-governance-scenario"},{"id":"team-skills/governance-and-trust/s06-06-a-rollout-checklist"},{"id":"team-skills/governance-and-trust/s07-07-checkpoint"}]}]},{"slug":"dataframes-numpy-pandas","lessons":[{"slug":"array-and-series-shapes","steps":[{"id":"dataframes-numpy-pandas/array-and-series-shapes/s01-01-intro"},{"id":"dataframes-numpy-pandas/array-and-series-shapes/s02-02-choose-the-risk"},{"id":"dataframes-numpy-pandas/array-and-series-shapes/s03-03-predict-the-checks"},{"id":"dataframes-numpy-pandas/array-and-series-shapes/s04-04-fill-the-return"},{"id":"dataframes-numpy-pandas/array-and-series-shapes/s05-05-fix-the-overtrust"},{"id":"dataframes-numpy-pandas/array-and-series-shapes/s06-06-write-count-ready"},{"id":"dataframes-numpy-pandas/array-and-series-shapes/s07-07-checkpoint"}]},{"slug":"dataframe-selection-and-cleaning","steps":[{"id":"dataframes-numpy-pandas/dataframe-selection-and-cleaning/s01-01-intro"},{"id":"dataframes-numpy-pandas/dataframe-selection-and-cleaning/s02-02-choose-the-risk"},{"id":"dataframes-numpy-pandas/dataframe-selection-and-cleaning/s03-03-predict-the-checks"},{"id":"dataframes-numpy-pandas/dataframe-selection-and-cleaning/s04-04-fill-the-return"},{"id":"dataframes-numpy-pandas/dataframe-selection-and-cleaning/s05-05-fix-the-overtrust"},{"id":"dataframes-numpy-pandas/dataframe-selection-and-cleaning/s06-06-write-count-ready"},{"id":"dataframes-numpy-pandas/dataframe-selection-and-cleaning/s07-07-checkpoint"}]},{"slug":"missing-values-and-types","steps":[{"id":"dataframes-numpy-pandas/missing-values-and-types/s01-01-intro"},{"id":"dataframes-numpy-pandas/missing-values-and-types/s02-02-choose-the-risk"},{"id":"dataframes-numpy-pandas/missing-values-and-types/s03-03-predict-the-checks"},{"id":"dataframes-numpy-pandas/missing-values-and-types/s04-04-fill-the-return"},{"id":"dataframes-numpy-pandas/missing-values-and-types/s05-05-fix-the-overtrust"},{"id":"dataframes-numpy-pandas/missing-values-and-types/s06-06-write-count-ready"},{"id":"dataframes-numpy-pandas/missing-values-and-types/s07-07-checkpoint"}]},{"slug":"groupby-joins-and-features","steps":[{"id":"dataframes-numpy-pandas/groupby-joins-and-features/s01-01-intro"},{"id":"dataframes-numpy-pandas/groupby-joins-and-features/s02-02-choose-the-risk"},{"id":"dataframes-numpy-pandas/groupby-joins-and-features/s03-03-predict-the-checks"},{"id":"dataframes-numpy-pandas/groupby-joins-and-features/s04-04-fill-the-return"},{"id":"dataframes-numpy-pandas/groupby-joins-and-features/s05-05-fix-the-overtrust"},{"id":"dataframes-numpy-pandas/groupby-joins-and-features/s06-06-write-count-ready"},{"id":"dataframes-numpy-pandas/groupby-joins-and-features/s07-07-checkpoint"}]},{"slug":"mission-api-to-dataframe","steps":[{"id":"dataframes-numpy-pandas/mission-api-to-dataframe/s01-01-intro"},{"id":"dataframes-numpy-pandas/mission-api-to-dataframe/s02-02-choose-the-risk"},{"id":"dataframes-numpy-pandas/mission-api-to-dataframe/s03-03-predict-the-checks"},{"id":"dataframes-numpy-pandas/mission-api-to-dataframe/s04-04-fill-the-return"},{"id":"dataframes-numpy-pandas/mission-api-to-dataframe/s05-05-fix-the-overtrust"},{"id":"dataframes-numpy-pandas/mission-api-to-dataframe/s06-06-write-count-ready"},{"id":"dataframes-numpy-pandas/mission-api-to-dataframe/s07-07-checkpoint"}]}]},{"slug":"sql-for-ml-datasets","lessons":[{"slug":"select-filter-aggregate","steps":[{"id":"sql-for-ml-datasets/select-filter-aggregate/s01-01-intro"},{"id":"sql-for-ml-datasets/select-filter-aggregate/s02-02-choose-the-risk"},{"id":"sql-for-ml-datasets/select-filter-aggregate/s03-03-predict-the-checks"},{"id":"sql-for-ml-datasets/select-filter-aggregate/s04-04-fill-the-return"},{"id":"sql-for-ml-datasets/select-filter-aggregate/s05-05-fix-the-overtrust"},{"id":"sql-for-ml-datasets/select-filter-aggregate/s06-06-write-count-ready"},{"id":"sql-for-ml-datasets/select-filter-aggregate/s07-07-checkpoint"}]},{"slug":"joins-and-label-windows","steps":[{"id":"sql-for-ml-datasets/joins-and-label-windows/s01-01-intro"},{"id":"sql-for-ml-datasets/joins-and-label-windows/s02-02-choose-the-risk"},{"id":"sql-for-ml-datasets/joins-and-label-windows/s03-03-predict-the-checks"},{"id":"sql-for-ml-datasets/joins-and-label-windows/s04-04-fill-the-return"},{"id":"sql-for-ml-datasets/joins-and-label-windows/s05-05-fix-the-overtrust"},{"id":"sql-for-ml-datasets/joins-and-label-windows/s06-06-write-count-ready"},{"id":"sql-for-ml-datasets/joins-and-label-windows/s07-07-checkpoint"}]},{"slug":"sql-quality-checks","steps":[{"id":"sql-for-ml-datasets/sql-quality-checks/s01-01-intro"},{"id":"sql-for-ml-datasets/sql-quality-checks/s02-02-choose-the-risk"},{"id":"sql-for-ml-datasets/sql-quality-checks/s03-03-predict-the-checks"},{"id":"sql-for-ml-datasets/sql-quality-checks/s04-04-fill-the-return"},{"id":"sql-for-ml-datasets/sql-quality-checks/s05-05-fix-the-overtrust"},{"id":"sql-for-ml-datasets/sql-quality-checks/s06-06-write-count-ready"},{"id":"sql-for-ml-datasets/sql-quality-checks/s07-07-checkpoint"}]},{"slug":"mission-sql-feature-query-lab","steps":[{"id":"sql-for-ml-datasets/mission-sql-feature-query-lab/s01-01-intro"},{"id":"sql-for-ml-datasets/mission-sql-feature-query-lab/s02-02-choose-the-risk"},{"id":"sql-for-ml-datasets/mission-sql-feature-query-lab/s03-03-predict-the-checks"},{"id":"sql-for-ml-datasets/mission-sql-feature-query-lab/s04-04-fill-the-return"},{"id":"sql-for-ml-datasets/mission-sql-feature-query-lab/s05-05-fix-the-overtrust"},{"id":"sql-for-ml-datasets/mission-sql-feature-query-lab/s06-06-write-count-ready"},{"id":"sql-for-ml-datasets/mission-sql-feature-query-lab/s07-07-checkpoint"}]}]},{"slug":"dataset-pipelines","lessons":[{"slug":"csv-jsonl-parquet-tradeoffs","steps":[{"id":"dataset-pipelines/csv-jsonl-parquet-tradeoffs/s01-01-intro"},{"id":"dataset-pipelines/csv-jsonl-parquet-tradeoffs/s02-02-choose-the-risk"},{"id":"dataset-pipelines/csv-jsonl-parquet-tradeoffs/s03-03-predict-the-checks"},{"id":"dataset-pipelines/csv-jsonl-parquet-tradeoffs/s04-04-fill-the-return"},{"id":"dataset-pipelines/csv-jsonl-parquet-tradeoffs/s05-05-fix-the-overtrust"},{"id":"dataset-pipelines/csv-jsonl-parquet-tradeoffs/s06-06-write-count-ready"},{"id":"dataset-pipelines/csv-jsonl-parquet-tradeoffs/s07-07-checkpoint"}]},{"slug":"pagination-retries-checkpoints","steps":[{"id":"dataset-pipelines/pagination-retries-checkpoints/s01-01-intro"},{"id":"dataset-pipelines/pagination-retries-checkpoints/s02-02-choose-the-risk"},{"id":"dataset-pipelines/pagination-retries-checkpoints/s03-03-predict-the-checks"},{"id":"dataset-pipelines/pagination-retries-checkpoints/s04-04-fill-the-return"},{"id":"dataset-pipelines/pagination-retries-checkpoints/s05-05-fix-the-overtrust"},{"id":"dataset-pipelines/pagination-retries-checkpoints/s06-06-write-count-ready"},{"id":"dataset-pipelines/pagination-retries-checkpoints/s07-07-checkpoint"}]},{"slug":"schema-drift-and-contracts","steps":[{"id":"dataset-pipelines/schema-drift-and-contracts/s01-01-intro"},{"id":"dataset-pipelines/schema-drift-and-contracts/s02-02-choose-the-risk"},{"id":"dataset-pipelines/schema-drift-and-contracts/s03-03-predict-the-checks"},{"id":"dataset-pipelines/schema-drift-and-contracts/s04-04-fill-the-return"},{"id":"dataset-pipelines/schema-drift-and-contracts/s05-05-fix-the-overtrust"},{"id":"dataset-pipelines/schema-drift-and-contracts/s06-06-write-count-ready"},{"id":"dataset-pipelines/schema-drift-and-contracts/s07-07-checkpoint"}]},{"slug":"pipeline-orchestration-shape","steps":[{"id":"dataset-pipelines/pipeline-orchestration-shape/s01-01-intro"},{"id":"dataset-pipelines/pipeline-orchestration-shape/s02-02-choose-the-risk"},{"id":"dataset-pipelines/pipeline-orchestration-shape/s03-03-predict-the-checks"},{"id":"dataset-pipelines/pipeline-orchestration-shape/s04-04-fill-the-return"},{"id":"dataset-pipelines/pipeline-orchestration-shape/s05-05-fix-the-overtrust"},{"id":"dataset-pipelines/pipeline-orchestration-shape/s06-06-write-count-ready"},{"id":"dataset-pipelines/pipeline-orchestration-shape/s07-07-checkpoint"}]},{"slug":"mission-api-to-dataset-pipeline","steps":[{"id":"dataset-pipelines/mission-api-to-dataset-pipeline/s01-01-intro"},{"id":"dataset-pipelines/mission-api-to-dataset-pipeline/s02-02-choose-the-risk"},{"id":"dataset-pipelines/mission-api-to-dataset-pipeline/s03-03-predict-the-checks"},{"id":"dataset-pipelines/mission-api-to-dataset-pipeline/s04-04-fill-the-return"},{"id":"dataset-pipelines/mission-api-to-dataset-pipeline/s05-05-fix-the-overtrust"},{"id":"dataset-pipelines/mission-api-to-dataset-pipeline/s06-06-write-count-ready"},{"id":"dataset-pipelines/mission-api-to-dataset-pipeline/s07-07-checkpoint"}]}]},{"slug":"ml-math-and-stats","lessons":[{"slug":"vectors-matrices-dot-products","steps":[{"id":"ml-math-and-stats/vectors-matrices-dot-products/s01-01-intro"},{"id":"ml-math-and-stats/vectors-matrices-dot-products/s02-02-choose-the-risk"},{"id":"ml-math-and-stats/vectors-matrices-dot-products/s03-03-predict-the-checks"},{"id":"ml-math-and-stats/vectors-matrices-dot-products/s04-04-fill-the-return"},{"id":"ml-math-and-stats/vectors-matrices-dot-products/s05-05-fix-the-overtrust"},{"id":"ml-math-and-stats/vectors-matrices-dot-products/s06-06-write-count-ready"},{"id":"ml-math-and-stats/vectors-matrices-dot-products/s07-07-checkpoint"}]},{"slug":"probability-and-baselines","steps":[{"id":"ml-math-and-stats/probability-and-baselines/s01-01-intro"},{"id":"ml-math-and-stats/probability-and-baselines/s02-02-choose-the-risk"},{"id":"ml-math-and-stats/probability-and-baselines/s03-03-predict-the-checks"},{"id":"ml-math-and-stats/probability-and-baselines/s04-04-fill-the-return"},{"id":"ml-math-and-stats/probability-and-baselines/s05-05-fix-the-overtrust"},{"id":"ml-math-and-stats/probability-and-baselines/s06-06-write-count-ready"},{"id":"ml-math-and-stats/probability-and-baselines/s07-07-checkpoint"}]},{"slug":"distributions-sampling-variance","steps":[{"id":"ml-math-and-stats/distributions-sampling-variance/s01-01-intro"},{"id":"ml-math-and-stats/distributions-sampling-variance/s02-02-choose-the-risk"},{"id":"ml-math-and-stats/distributions-sampling-variance/s03-03-predict-the-checks"},{"id":"ml-math-and-stats/distributions-sampling-variance/s04-04-fill-the-return"},{"id":"ml-math-and-stats/distributions-sampling-variance/s05-05-fix-the-overtrust"},{"id":"ml-math-and-stats/distributions-sampling-variance/s06-06-write-variance"},{"id":"ml-math-and-stats/distributions-sampling-variance/s07-07-checkpoint"}]},{"slug":"bias-variance-and-leakage","steps":[{"id":"ml-math-and-stats/bias-variance-and-leakage/s01-01-intro"},{"id":"ml-math-and-stats/bias-variance-and-leakage/s02-02-choose-the-risk"},{"id":"ml-math-and-stats/bias-variance-and-leakage/s03-03-predict-the-checks"},{"id":"ml-math-and-stats/bias-variance-and-leakage/s04-04-fill-the-return"},{"id":"ml-math-and-stats/bias-variance-and-leakage/s05-05-fix-the-overtrust"},{"id":"ml-math-and-stats/bias-variance-and-leakage/s06-06-write-diagnose"},{"id":"ml-math-and-stats/bias-variance-and-leakage/s07-07-checkpoint"}]},{"slug":"mission-statistical-sanity-check","steps":[{"id":"ml-math-and-stats/mission-statistical-sanity-check/s01-01-intro"},{"id":"ml-math-and-stats/mission-statistical-sanity-check/s02-02-choose-the-risk"},{"id":"ml-math-and-stats/mission-statistical-sanity-check/s03-03-predict-the-checks"},{"id":"ml-math-and-stats/mission-statistical-sanity-check/s04-04-fill-the-return"},{"id":"ml-math-and-stats/mission-statistical-sanity-check/s05-05-fix-the-overtrust"},{"id":"ml-math-and-stats/mission-statistical-sanity-check/s06-06-write-count-ready"},{"id":"ml-math-and-stats/mission-statistical-sanity-check/s07-07-checkpoint"}]}]},{"slug":"supervised-learning-workflows","lessons":[{"slug":"labels-features-splits","steps":[{"id":"supervised-learning-workflows/labels-features-splits/s01-01-intro"},{"id":"supervised-learning-workflows/labels-features-splits/s02-02-choose-the-risk"},{"id":"supervised-learning-workflows/labels-features-splits/s03-03-predict-the-checks"},{"id":"supervised-learning-workflows/labels-features-splits/s04-04-fill-the-return"},{"id":"supervised-learning-workflows/labels-features-splits/s05-05-fix-the-overtrust"},{"id":"supervised-learning-workflows/labels-features-splits/s06-06-write-split-audit"},{"id":"supervised-learning-workflows/labels-features-splits/s07-07-checkpoint"}]},{"slug":"baselines-before-fancy-models","steps":[{"id":"supervised-learning-workflows/baselines-before-fancy-models/s01-01-intro"},{"id":"supervised-learning-workflows/baselines-before-fancy-models/s02-02-choose-the-risk"},{"id":"supervised-learning-workflows/baselines-before-fancy-models/s03-03-predict-the-checks"},{"id":"supervised-learning-workflows/baselines-before-fancy-models/s04-04-fill-the-return"},{"id":"supervised-learning-workflows/baselines-before-fancy-models/s05-05-fix-the-overtrust"},{"id":"supervised-learning-workflows/baselines-before-fancy-models/s06-06-write-baseline-comparison"},{"id":"supervised-learning-workflows/baselines-before-fancy-models/s07-07-checkpoint"}]},{"slug":"training-and-prediction-loop","steps":[{"id":"supervised-learning-workflows/training-and-prediction-loop/s01-01-intro"},{"id":"supervised-learning-workflows/training-and-prediction-loop/s02-02-choose-the-risk"},{"id":"supervised-learning-workflows/training-and-prediction-loop/s03-03-predict-the-checks"},{"id":"supervised-learning-workflows/training-and-prediction-loop/s04-04-fill-the-return"},{"id":"supervised-learning-workflows/training-and-prediction-loop/s05-05-fix-the-overtrust"},{"id":"supervised-learning-workflows/training-and-prediction-loop/s06-06-write-train"},{"id":"supervised-learning-workflows/training-and-prediction-loop/s07-07-checkpoint"}]},{"slug":"overfitting-and-regularization","steps":[{"id":"supervised-learning-workflows/overfitting-and-regularization/s01-01-intro"},{"id":"supervised-learning-workflows/overfitting-and-regularization/s02-02-choose-the-risk"},{"id":"supervised-learning-workflows/overfitting-and-regularization/s03-03-predict-the-checks"},{"id":"supervised-learning-workflows/overfitting-and-regularization/s04-04-fill-the-return"},{"id":"supervised-learning-workflows/overfitting-and-regularization/s05-05-fix-the-overtrust"},{"id":"supervised-learning-workflows/overfitting-and-regularization/s06-06-write-overfit-audit"},{"id":"supervised-learning-workflows/overfitting-and-regularization/s07-07-checkpoint"}]},{"slug":"mission-baseline-model-showdown","steps":[{"id":"supervised-learning-workflows/mission-baseline-model-showdown/s01-01-intro"},{"id":"supervised-learning-workflows/mission-baseline-model-showdown/s02-02-choose-the-risk"},{"id":"supervised-learning-workflows/mission-baseline-model-showdown/s03-03-predict-the-checks"},{"id":"supervised-learning-workflows/mission-baseline-model-showdown/s04-04-fill-the-return"},{"id":"supervised-learning-workflows/mission-baseline-model-showdown/s05-05-fix-the-overtrust"},{"id":"supervised-learning-workflows/mission-baseline-model-showdown/s06-06-write-classifier-brief"},{"id":"supervised-learning-workflows/mission-baseline-model-showdown/s07-07-checkpoint"}]}]},{"slug":"unsupervised-learning-and-embeddings","lessons":[{"slug":"clustering-without-labels","steps":[{"id":"unsupervised-learning-and-embeddings/clustering-without-labels/s01-01-intro"},{"id":"unsupervised-learning-and-embeddings/clustering-without-labels/s02-02-choose-the-risk"},{"id":"unsupervised-learning-and-embeddings/clustering-without-labels/s03-03-predict-the-checks"},{"id":"unsupervised-learning-and-embeddings/clustering-without-labels/s04-04-fill-the-return"},{"id":"unsupervised-learning-and-embeddings/clustering-without-labels/s05-05-fix-the-overtrust"},{"id":"unsupervised-learning-and-embeddings/clustering-without-labels/s06-06-write-cluster-summary"},{"id":"unsupervised-learning-and-embeddings/clustering-without-labels/s07-07-checkpoint"}]},{"slug":"dimensionality-and-distance","steps":[{"id":"unsupervised-learning-and-embeddings/dimensionality-and-distance/s01-01-intro"},{"id":"unsupervised-learning-and-embeddings/dimensionality-and-distance/s02-02-choose-the-risk"},{"id":"unsupervised-learning-and-embeddings/dimensionality-and-distance/s03-03-predict-the-checks"},{"id":"unsupervised-learning-and-embeddings/dimensionality-and-distance/s04-04-fill-the-return"},{"id":"unsupervised-learning-and-embeddings/dimensionality-and-distance/s05-05-fix-the-overtrust"},{"id":"unsupervised-learning-and-embeddings/dimensionality-and-distance/s06-06-write-euclidean"},{"id":"unsupervised-learning-and-embeddings/dimensionality-and-distance/s07-07-checkpoint"}]},{"slug":"embeddings-as-features","steps":[{"id":"unsupervised-learning-and-embeddings/embeddings-as-features/s01-01-intro"},{"id":"unsupervised-learning-and-embeddings/embeddings-as-features/s02-02-choose-the-risk"},{"id":"unsupervised-learning-and-embeddings/embeddings-as-features/s03-03-predict-the-checks"},{"id":"unsupervised-learning-and-embeddings/embeddings-as-features/s04-04-fill-the-return"},{"id":"unsupervised-learning-and-embeddings/embeddings-as-features/s05-05-fix-the-overtrust"},{"id":"unsupervised-learning-and-embeddings/embeddings-as-features/s06-06-write-retrieval-receipt"},{"id":"unsupervised-learning-and-embeddings/embeddings-as-features/s07-07-checkpoint"}]},{"slug":"mission-tiny-recommender","steps":[{"id":"unsupervised-learning-and-embeddings/mission-tiny-recommender/s01-01-intro"},{"id":"unsupervised-learning-and-embeddings/mission-tiny-recommender/s02-02-choose-the-risk"},{"id":"unsupervised-learning-and-embeddings/mission-tiny-recommender/s03-03-predict-the-checks"},{"id":"unsupervised-learning-and-embeddings/mission-tiny-recommender/s04-04-fill-the-return"},{"id":"unsupervised-learning-and-embeddings/mission-tiny-recommender/s05-05-fix-the-overtrust"},{"id":"unsupervised-learning-and-embeddings/mission-tiny-recommender/s06-06-write-recommendation-card"},{"id":"unsupervised-learning-and-embeddings/mission-tiny-recommender/s07-07-checkpoint"}]}]},{"slug":"metrics-and-error-analysis","lessons":[{"slug":"confusion-matrix","steps":[{"id":"metrics-and-error-analysis/confusion-matrix/s01-01-intro"},{"id":"metrics-and-error-analysis/confusion-matrix/s02-02-choose-the-risk"},{"id":"metrics-and-error-analysis/confusion-matrix/s03-03-predict-the-checks"},{"id":"metrics-and-error-analysis/confusion-matrix/s04-04-fill-the-return"},{"id":"metrics-and-error-analysis/confusion-matrix/s05-05-fix-the-overtrust"},{"id":"metrics-and-error-analysis/confusion-matrix/s06-06-write-count-ready"},{"id":"metrics-and-error-analysis/confusion-matrix/s07-07-checkpoint"}]},{"slug":"precision-recall-thresholds","steps":[{"id":"metrics-and-error-analysis/precision-recall-thresholds/s01-01-intro"},{"id":"metrics-and-error-analysis/precision-recall-thresholds/s02-02-choose-the-risk"},{"id":"metrics-and-error-analysis/precision-recall-thresholds/s03-03-predict-the-checks"},{"id":"metrics-and-error-analysis/precision-recall-thresholds/s04-04-fill-the-return"},{"id":"metrics-and-error-analysis/precision-recall-thresholds/s05-05-fix-the-overtrust"},{"id":"metrics-and-error-analysis/precision-recall-thresholds/s06-06-write-count-ready"},{"id":"metrics-and-error-analysis/precision-recall-thresholds/s07-07-checkpoint"}]},{"slug":"slice-based-error-analysis","steps":[{"id":"metrics-and-error-analysis/slice-based-error-analysis/s01-01-intro"},{"id":"metrics-and-error-analysis/slice-based-error-analysis/s02-02-choose-the-risk"},{"id":"metrics-and-error-analysis/slice-based-error-analysis/s03-03-predict-the-checks"},{"id":"metrics-and-error-analysis/slice-based-error-analysis/s04-04-fill-the-return"},{"id":"metrics-and-error-analysis/slice-based-error-analysis/s05-05-fix-the-overtrust"},{"id":"metrics-and-error-analysis/slice-based-error-analysis/s06-06-write-count-ready"},{"id":"metrics-and-error-analysis/slice-based-error-analysis/s07-07-checkpoint"}]},{"slug":"regression-metrics-and-residuals","steps":[{"id":"metrics-and-error-analysis/regression-metrics-and-residuals/s01-01-intro"},{"id":"metrics-and-error-analysis/regression-metrics-and-residuals/s02-02-choose-the-risk"},{"id":"metrics-and-error-analysis/regression-metrics-and-residuals/s03-03-predict-the-checks"},{"id":"metrics-and-error-analysis/regression-metrics-and-residuals/s04-04-fill-the-return"},{"id":"metrics-and-error-analysis/regression-metrics-and-residuals/s05-05-fix-the-overtrust"},{"id":"metrics-and-error-analysis/regression-metrics-and-residuals/s06-06-write-count-ready"},{"id":"metrics-and-error-analysis/regression-metrics-and-residuals/s07-07-checkpoint"}]},{"slug":"mission-confusion-matrix-triage","steps":[{"id":"metrics-and-error-analysis/mission-confusion-matrix-triage/s01-01-intro"},{"id":"metrics-and-error-analysis/mission-confusion-matrix-triage/s02-02-choose-the-risk"},{"id":"metrics-and-error-analysis/mission-confusion-matrix-triage/s03-03-predict-the-checks"},{"id":"metrics-and-error-analysis/mission-confusion-matrix-triage/s04-04-fill-the-return"},{"id":"metrics-and-error-analysis/mission-confusion-matrix-triage/s05-05-fix-the-overtrust"},{"id":"metrics-and-error-analysis/mission-confusion-matrix-triage/s06-06-write-count-ready"},{"id":"metrics-and-error-analysis/mission-confusion-matrix-triage/s07-07-checkpoint"}]}]},{"slug":"pytorch-tensors-and-autograd","lessons":[{"slug":"tensors-and-shapes","steps":[{"id":"pytorch-tensors-and-autograd/tensors-and-shapes/s01-01-intro"},{"id":"pytorch-tensors-and-autograd/tensors-and-shapes/s02-02-choose-the-risk"},{"id":"pytorch-tensors-and-autograd/tensors-and-shapes/s03-03-predict-the-checks"},{"id":"pytorch-tensors-and-autograd/tensors-and-shapes/s04-04-fill-the-return"},{"id":"pytorch-tensors-and-autograd/tensors-and-shapes/s05-05-fix-the-overtrust"},{"id":"pytorch-tensors-and-autograd/tensors-and-shapes/s06-06-write-count-ready"},{"id":"pytorch-tensors-and-autograd/tensors-and-shapes/s07-07-checkpoint"}]},{"slug":"broadcasting-and-vectorization","steps":[{"id":"pytorch-tensors-and-autograd/broadcasting-and-vectorization/s01-01-intro"},{"id":"pytorch-tensors-and-autograd/broadcasting-and-vectorization/s02-02-choose-the-risk"},{"id":"pytorch-tensors-and-autograd/broadcasting-and-vectorization/s03-03-predict-the-checks"},{"id":"pytorch-tensors-and-autograd/broadcasting-and-vectorization/s04-04-fill-the-return"},{"id":"pytorch-tensors-and-autograd/broadcasting-and-vectorization/s05-05-fix-the-overtrust"},{"id":"pytorch-tensors-and-autograd/broadcasting-and-vectorization/s06-06-write-count-ready"},{"id":"pytorch-tensors-and-autograd/broadcasting-and-vectorization/s07-07-checkpoint"}]},{"slug":"autograd-mental-model","steps":[{"id":"pytorch-tensors-and-autograd/autograd-mental-model/s01-01-intro"},{"id":"pytorch-tensors-and-autograd/autograd-mental-model/s02-02-choose-the-risk"},{"id":"pytorch-tensors-and-autograd/autograd-mental-model/s03-03-predict-the-checks"},{"id":"pytorch-tensors-and-autograd/autograd-mental-model/s04-04-fill-the-return"},{"id":"pytorch-tensors-and-autograd/autograd-mental-model/s05-05-fix-the-overtrust"},{"id":"pytorch-tensors-and-autograd/autograd-mental-model/s06-06-write-count-ready"},{"id":"pytorch-tensors-and-autograd/autograd-mental-model/s07-07-checkpoint"}]},{"slug":"mission-tensor-playground","steps":[{"id":"pytorch-tensors-and-autograd/mission-tensor-playground/s01-01-intro"},{"id":"pytorch-tensors-and-autograd/mission-tensor-playground/s02-02-choose-the-risk"},{"id":"pytorch-tensors-and-autograd/mission-tensor-playground/s03-03-predict-the-checks"},{"id":"pytorch-tensors-and-autograd/mission-tensor-playground/s04-04-fill-the-return"},{"id":"pytorch-tensors-and-autograd/mission-tensor-playground/s05-05-fix-the-overtrust"},{"id":"pytorch-tensors-and-autograd/mission-tensor-playground/s06-06-write-count-ready"},{"id":"pytorch-tensors-and-autograd/mission-tensor-playground/s07-07-checkpoint"}]}]},{"slug":"training-loops-and-optimizers","lessons":[{"slug":"loss-forward-backward-step","steps":[{"id":"training-loops-and-optimizers/loss-forward-backward-step/s01-01-intro"},{"id":"training-loops-and-optimizers/loss-forward-backward-step/s02-02-choose-the-risk"},{"id":"training-loops-and-optimizers/loss-forward-backward-step/s03-03-predict-the-checks"},{"id":"training-loops-and-optimizers/loss-forward-backward-step/s04-04-fill-the-return"},{"id":"training-loops-and-optimizers/loss-forward-backward-step/s05-05-fix-the-overtrust"},{"id":"training-loops-and-optimizers/loss-forward-backward-step/s06-06-write-count-ready"},{"id":"training-loops-and-optimizers/loss-forward-backward-step/s07-07-checkpoint"}]},{"slug":"gradient-descent-by-hand","steps":[{"id":"training-loops-and-optimizers/gradient-descent-by-hand/s01-01-intro"},{"id":"training-loops-and-optimizers/gradient-descent-by-hand/s02-02-choose-the-risk"},{"id":"training-loops-and-optimizers/gradient-descent-by-hand/s03-03-predict-the-checks"},{"id":"training-loops-and-optimizers/gradient-descent-by-hand/s04-04-fill-the-return"},{"id":"training-loops-and-optimizers/gradient-descent-by-hand/s05-05-fix-the-overtrust"},{"id":"training-loops-and-optimizers/gradient-descent-by-hand/s06-06-write-count-ready"},{"id":"training-loops-and-optimizers/gradient-descent-by-hand/s07-07-checkpoint"}]},{"slug":"optimizers-and-schedulers","steps":[{"id":"training-loops-and-optimizers/optimizers-and-schedulers/s01-01-intro"},{"id":"training-loops-and-optimizers/optimizers-and-schedulers/s02-02-choose-the-risk"},{"id":"training-loops-and-optimizers/optimizers-and-schedulers/s03-03-predict-the-checks"},{"id":"training-loops-and-optimizers/optimizers-and-schedulers/s04-04-fill-the-return"},{"id":"training-loops-and-optimizers/optimizers-and-schedulers/s05-05-fix-the-overtrust"},{"id":"training-loops-and-optimizers/optimizers-and-schedulers/s06-06-write-count-ready"},{"id":"training-loops-and-optimizers/optimizers-and-schedulers/s07-07-checkpoint"}]},{"slug":"checkpoints-and-reproducibility","steps":[{"id":"training-loops-and-optimizers/checkpoints-and-reproducibility/s01-01-intro"},{"id":"training-loops-and-optimizers/checkpoints-and-reproducibility/s02-02-choose-the-risk"},{"id":"training-loops-and-optimizers/checkpoints-and-reproducibility/s03-03-predict-the-checks"},{"id":"training-loops-and-optimizers/checkpoints-and-reproducibility/s04-04-fill-the-return"},{"id":"training-loops-and-optimizers/checkpoints-and-reproducibility/s05-05-fix-the-overtrust"},{"id":"training-loops-and-optimizers/checkpoints-and-reproducibility/s06-06-write-count-ready"},{"id":"training-loops-and-optimizers/checkpoints-and-reproducibility/s07-07-checkpoint"}]},{"slug":"mission-overfit-then-recover","steps":[{"id":"training-loops-and-optimizers/mission-overfit-then-recover/s01-01-intro"},{"id":"training-loops-and-optimizers/mission-overfit-then-recover/s02-02-choose-the-risk"},{"id":"training-loops-and-optimizers/mission-overfit-then-recover/s03-03-predict-the-checks"},{"id":"training-loops-and-optimizers/mission-overfit-then-recover/s04-04-fill-the-return"},{"id":"training-loops-and-optimizers/mission-overfit-then-recover/s05-05-fix-the-overtrust"},{"id":"training-loops-and-optimizers/mission-overfit-then-recover/s06-06-write-count-ready"},{"id":"training-loops-and-optimizers/mission-overfit-then-recover/s07-07-checkpoint"}]}]},{"slug":"deep-learning-architectures","lessons":[{"slug":"cnns-and-local-patterns","steps":[{"id":"deep-learning-architectures/cnns-and-local-patterns/s01-01-intro"},{"id":"deep-learning-architectures/cnns-and-local-patterns/s02-02-choose-the-risk"},{"id":"deep-learning-architectures/cnns-and-local-patterns/s03-03-predict-the-checks"},{"id":"deep-learning-architectures/cnns-and-local-patterns/s04-04-fill-the-return"},{"id":"deep-learning-architectures/cnns-and-local-patterns/s05-05-fix-the-overtrust"},{"id":"deep-learning-architectures/cnns-and-local-patterns/s06-06-write-count-ready"},{"id":"deep-learning-architectures/cnns-and-local-patterns/s07-07-checkpoint"}]},{"slug":"tokenizers-and-context-budget","steps":[{"id":"deep-learning-architectures/tokenizers-and-context-budget/s01-01-intro"},{"id":"deep-learning-architectures/tokenizers-and-context-budget/s02-02-choose-the-risk"},{"id":"deep-learning-architectures/tokenizers-and-context-budget/s03-03-predict-the-checks"},{"id":"deep-learning-architectures/tokenizers-and-context-budget/s04-04-fill-the-return"},{"id":"deep-learning-architectures/tokenizers-and-context-budget/s05-05-fix-the-overtrust"},{"id":"deep-learning-architectures/tokenizers-and-context-budget/s06-06-write-count-ready"},{"id":"deep-learning-architectures/tokenizers-and-context-budget/s07-07-checkpoint"}]},{"slug":"attention-and-transformer-blocks","steps":[{"id":"deep-learning-architectures/attention-and-transformer-blocks/s01-01-intro"},{"id":"deep-learning-architectures/attention-and-transformer-blocks/s02-02-choose-the-risk"},{"id":"deep-learning-architectures/attention-and-transformer-blocks/s03-03-predict-the-checks"},{"id":"deep-learning-architectures/attention-and-transformer-blocks/s04-04-fill-the-return"},{"id":"deep-learning-architectures/attention-and-transformer-blocks/s05-05-fix-the-overtrust"},{"id":"deep-learning-architectures/attention-and-transformer-blocks/s06-06-write-count-ready"},{"id":"deep-learning-architectures/attention-and-transformer-blocks/s07-07-checkpoint"}]},{"slug":"decoding-kv-cache-quantization","steps":[{"id":"deep-learning-architectures/decoding-kv-cache-quantization/s01-01-intro"},{"id":"deep-learning-architectures/decoding-kv-cache-quantization/s02-02-choose-the-risk"},{"id":"deep-learning-architectures/decoding-kv-cache-quantization/s03-03-predict-the-checks"},{"id":"deep-learning-architectures/decoding-kv-cache-quantization/s04-04-fill-the-return"},{"id":"deep-learning-architectures/decoding-kv-cache-quantization/s05-05-fix-the-overtrust"},{"id":"deep-learning-architectures/decoding-kv-cache-quantization/s06-06-write-count-ready"},{"id":"deep-learning-architectures/decoding-kv-cache-quantization/s07-07-checkpoint"}]},{"slug":"mission-architecture-tradeoff-note","steps":[{"id":"deep-learning-architectures/mission-architecture-tradeoff-note/s01-01-intro"},{"id":"deep-learning-architectures/mission-architecture-tradeoff-note/s02-02-choose-the-risk"},{"id":"deep-learning-architectures/mission-architecture-tradeoff-note/s03-03-predict-the-checks"},{"id":"deep-learning-architectures/mission-architecture-tradeoff-note/s04-04-fill-the-return"},{"id":"deep-learning-architectures/mission-architecture-tradeoff-note/s05-05-fix-the-overtrust"},{"id":"deep-learning-architectures/mission-architecture-tradeoff-note/s06-06-write-count-ready"},{"id":"deep-learning-architectures/mission-architecture-tradeoff-note/s07-07-checkpoint"}]}]},{"slug":"feature-experiments-registries","lessons":[{"slug":"feature-pipeline-contracts","steps":[{"id":"feature-experiments-registries/feature-pipeline-contracts/s01-01-intro"},{"id":"feature-experiments-registries/feature-pipeline-contracts/s02-02-choose-the-risk"},{"id":"feature-experiments-registries/feature-pipeline-contracts/s03-03-predict-the-checks"},{"id":"feature-experiments-registries/feature-pipeline-contracts/s04-04-fill-the-return"},{"id":"feature-experiments-registries/feature-pipeline-contracts/s05-05-fix-the-overtrust"},{"id":"feature-experiments-registries/feature-pipeline-contracts/s06-06-write-count-ready"},{"id":"feature-experiments-registries/feature-pipeline-contracts/s07-07-checkpoint"}]},{"slug":"train-inference-skew","steps":[{"id":"feature-experiments-registries/train-inference-skew/s01-01-intro"},{"id":"feature-experiments-registries/train-inference-skew/s02-02-choose-the-risk"},{"id":"feature-experiments-registries/train-inference-skew/s03-03-predict-the-checks"},{"id":"feature-experiments-registries/train-inference-skew/s04-04-fill-the-return"},{"id":"feature-experiments-registries/train-inference-skew/s05-05-fix-the-overtrust"},{"id":"feature-experiments-registries/train-inference-skew/s06-06-write-count-ready"},{"id":"feature-experiments-registries/train-inference-skew/s07-07-checkpoint"}]},{"slug":"experiment-tracker-lite","steps":[{"id":"feature-experiments-registries/experiment-tracker-lite/s01-01-intro"},{"id":"feature-experiments-registries/experiment-tracker-lite/s02-02-choose-the-risk"},{"id":"feature-experiments-registries/experiment-tracker-lite/s03-03-predict-the-checks"},{"id":"feature-experiments-registries/experiment-tracker-lite/s04-04-fill-the-return"},{"id":"feature-experiments-registries/experiment-tracker-lite/s05-05-fix-the-overtrust"},{"id":"feature-experiments-registries/experiment-tracker-lite/s06-06-write-count-ready"},{"id":"feature-experiments-registries/experiment-tracker-lite/s07-07-checkpoint"}]},{"slug":"model-registry-shape","steps":[{"id":"feature-experiments-registries/model-registry-shape/s01-01-intro"},{"id":"feature-experiments-registries/model-registry-shape/s02-02-choose-the-risk"},{"id":"feature-experiments-registries/model-registry-shape/s03-03-predict-the-checks"},{"id":"feature-experiments-registries/model-registry-shape/s04-04-fill-the-return"},{"id":"feature-experiments-registries/model-registry-shape/s05-05-fix-the-overtrust"},{"id":"feature-experiments-registries/model-registry-shape/s06-06-write-count-ready"},{"id":"feature-experiments-registries/model-registry-shape/s07-07-checkpoint"}]},{"slug":"mission-feature-pipeline-and-tracker","steps":[{"id":"feature-experiments-registries/mission-feature-pipeline-and-tracker/s01-01-intro"},{"id":"feature-experiments-registries/mission-feature-pipeline-and-tracker/s02-02-choose-the-risk"},{"id":"feature-experiments-registries/mission-feature-pipeline-and-tracker/s03-03-predict-the-checks"},{"id":"feature-experiments-registries/mission-feature-pipeline-and-tracker/s04-04-fill-the-return"},{"id":"feature-experiments-registries/mission-feature-pipeline-and-tracker/s05-05-fix-the-overtrust"},{"id":"feature-experiments-registries/mission-feature-pipeline-and-tracker/s06-06-write-count-ready"},{"id":"feature-experiments-registries/mission-feature-pipeline-and-tracker/s07-07-checkpoint"}]}]},{"slug":"model-serving-and-mlops","lessons":[{"slug":"fastapi-inference-shape","steps":[{"id":"model-serving-and-mlops/fastapi-inference-shape/s01-01-intro"},{"id":"model-serving-and-mlops/fastapi-inference-shape/s02-02-choose-the-risk"},{"id":"model-serving-and-mlops/fastapi-inference-shape/s03-03-predict-the-checks"},{"id":"model-serving-and-mlops/fastapi-inference-shape/s04-04-fill-the-return"},{"id":"model-serving-and-mlops/fastapi-inference-shape/s05-05-fix-the-overtrust"},{"id":"model-serving-and-mlops/fastapi-inference-shape/s06-06-write-count-ready"},{"id":"model-serving-and-mlops/fastapi-inference-shape/s07-07-checkpoint"}]},{"slug":"batch-vs-realtime-serving","steps":[{"id":"model-serving-and-mlops/batch-vs-realtime-serving/s01-01-intro"},{"id":"model-serving-and-mlops/batch-vs-realtime-serving/s02-02-choose-the-risk"},{"id":"model-serving-and-mlops/batch-vs-realtime-serving/s03-03-predict-the-checks"},{"id":"model-serving-and-mlops/batch-vs-realtime-serving/s04-04-fill-the-return"},{"id":"model-serving-and-mlops/batch-vs-realtime-serving/s05-05-fix-the-overtrust"},{"id":"model-serving-and-mlops/batch-vs-realtime-serving/s06-06-write-count-ready"},{"id":"model-serving-and-mlops/batch-vs-realtime-serving/s07-07-checkpoint"}]},{"slug":"docker-runtime-and-config","steps":[{"id":"model-serving-and-mlops/docker-runtime-and-config/s01-01-intro"},{"id":"model-serving-and-mlops/docker-runtime-and-config/s02-02-choose-the-risk"},{"id":"model-serving-and-mlops/docker-runtime-and-config/s03-03-predict-the-checks"},{"id":"model-serving-and-mlops/docker-runtime-and-config/s04-04-fill-the-return"},{"id":"model-serving-and-mlops/docker-runtime-and-config/s05-05-fix-the-overtrust"},{"id":"model-serving-and-mlops/docker-runtime-and-config/s06-06-write-count-ready"},{"id":"model-serving-and-mlops/docker-runtime-and-config/s07-07-checkpoint"}]},{"slug":"github-actions-validation-gates","steps":[{"id":"model-serving-and-mlops/github-actions-validation-gates/s01-01-intro"},{"id":"model-serving-and-mlops/github-actions-validation-gates/s02-02-choose-the-risk"},{"id":"model-serving-and-mlops/github-actions-validation-gates/s03-03-predict-the-checks"},{"id":"model-serving-and-mlops/github-actions-validation-gates/s04-04-fill-the-return"},{"id":"model-serving-and-mlops/github-actions-validation-gates/s05-05-fix-the-overtrust"},{"id":"model-serving-and-mlops/github-actions-validation-gates/s06-06-write-failing-checks"},{"id":"model-serving-and-mlops/github-actions-validation-gates/s07-07-checkpoint"}]},{"slug":"continuous-training-and-rollout","steps":[{"id":"model-serving-and-mlops/continuous-training-and-rollout/s01-01-intro"},{"id":"model-serving-and-mlops/continuous-training-and-rollout/s02-02-choose-the-risk"},{"id":"model-serving-and-mlops/continuous-training-and-rollout/s03-03-predict-the-checks"},{"id":"model-serving-and-mlops/continuous-training-and-rollout/s04-04-fill-the-return"},{"id":"model-serving-and-mlops/continuous-training-and-rollout/s05-05-fix-the-overtrust"},{"id":"model-serving-and-mlops/continuous-training-and-rollout/s06-06-write-count-ready"},{"id":"model-serving-and-mlops/continuous-training-and-rollout/s07-07-checkpoint"}]},{"slug":"mission-fastapi-model-server","steps":[{"id":"model-serving-and-mlops/mission-fastapi-model-server/s01-01-intro"},{"id":"model-serving-and-mlops/mission-fastapi-model-server/s02-02-choose-the-risk"},{"id":"model-serving-and-mlops/mission-fastapi-model-server/s03-03-predict-the-checks"},{"id":"model-serving-and-mlops/mission-fastapi-model-server/s04-04-fill-the-return"},{"id":"model-serving-and-mlops/mission-fastapi-model-server/s05-05-fix-the-overtrust"},{"id":"model-serving-and-mlops/mission-fastapi-model-server/s06-06-write-count-ready"},{"id":"model-serving-and-mlops/mission-fastapi-model-server/s07-07-checkpoint"}]}]},{"slug":"monitoring-cloud-portfolio","lessons":[{"slug":"structured-logs-and-alerts","steps":[{"id":"monitoring-cloud-portfolio/structured-logs-and-alerts/s01-01-intro"},{"id":"monitoring-cloud-portfolio/structured-logs-and-alerts/s02-02-choose-the-risk"},{"id":"monitoring-cloud-portfolio/structured-logs-and-alerts/s03-03-predict-the-checks"},{"id":"monitoring-cloud-portfolio/structured-logs-and-alerts/s04-04-fill-the-return"},{"id":"monitoring-cloud-portfolio/structured-logs-and-alerts/s05-05-fix-the-overtrust"},{"id":"monitoring-cloud-portfolio/structured-logs-and-alerts/s06-06-write-count-ready"},{"id":"monitoring-cloud-portfolio/structured-logs-and-alerts/s07-07-checkpoint"}]},{"slug":"data-and-concept-drift","steps":[{"id":"monitoring-cloud-portfolio/data-and-concept-drift/s01-01-intro"},{"id":"monitoring-cloud-portfolio/data-and-concept-drift/s02-02-choose-the-risk"},{"id":"monitoring-cloud-portfolio/data-and-concept-drift/s03-03-predict-the-checks"},{"id":"monitoring-cloud-portfolio/data-and-concept-drift/s04-04-fill-the-return"},{"id":"monitoring-cloud-portfolio/data-and-concept-drift/s05-05-fix-the-overtrust"},{"id":"monitoring-cloud-portfolio/data-and-concept-drift/s06-06-write-count-ready"},{"id":"monitoring-cloud-portfolio/data-and-concept-drift/s07-07-checkpoint"}]},{"slug":"retraining-triggers","steps":[{"id":"monitoring-cloud-portfolio/retraining-triggers/s01-01-intro"},{"id":"monitoring-cloud-portfolio/retraining-triggers/s02-02-choose-the-risk"},{"id":"monitoring-cloud-portfolio/retraining-triggers/s03-03-predict-the-checks"},{"id":"monitoring-cloud-portfolio/retraining-triggers/s04-04-fill-the-return"},{"id":"monitoring-cloud-portfolio/retraining-triggers/s05-05-fix-the-overtrust"},{"id":"monitoring-cloud-portfolio/retraining-triggers/s06-06-write-count-ready"},{"id":"monitoring-cloud-portfolio/retraining-triggers/s07-07-checkpoint"}]},{"slug":"cloud-gpu-kubernetes-cost","steps":[{"id":"monitoring-cloud-portfolio/cloud-gpu-kubernetes-cost/s01-01-intro"},{"id":"monitoring-cloud-portfolio/cloud-gpu-kubernetes-cost/s02-02-choose-the-risk"},{"id":"monitoring-cloud-portfolio/cloud-gpu-kubernetes-cost/s03-03-predict-the-checks"},{"id":"monitoring-cloud-portfolio/cloud-gpu-kubernetes-cost/s04-04-fill-the-return"},{"id":"monitoring-cloud-portfolio/cloud-gpu-kubernetes-cost/s05-05-fix-the-overtrust"},{"id":"monitoring-cloud-portfolio/cloud-gpu-kubernetes-cost/s06-06-write-count-ready"},{"id":"monitoring-cloud-portfolio/cloud-gpu-kubernetes-cost/s07-07-checkpoint"}]},{"slug":"portfolio-architecture-docs","steps":[{"id":"monitoring-cloud-portfolio/portfolio-architecture-docs/s01-01-intro"},{"id":"monitoring-cloud-portfolio/portfolio-architecture-docs/s02-02-choose-the-risk"},{"id":"monitoring-cloud-portfolio/portfolio-architecture-docs/s03-03-predict-the-checks"},{"id":"monitoring-cloud-portfolio/portfolio-architecture-docs/s04-04-fill-the-return"},{"id":"monitoring-cloud-portfolio/portfolio-architecture-docs/s05-05-fix-the-overtrust"},{"id":"monitoring-cloud-portfolio/portfolio-architecture-docs/s06-06-write-count-ready"},{"id":"monitoring-cloud-portfolio/portfolio-architecture-docs/s07-07-checkpoint"}]},{"slug":"mission-final-portfolio-ml-system","steps":[{"id":"monitoring-cloud-portfolio/mission-final-portfolio-ml-system/s01-01-intro"},{"id":"monitoring-cloud-portfolio/mission-final-portfolio-ml-system/s02-02-choose-the-risk"},{"id":"monitoring-cloud-portfolio/mission-final-portfolio-ml-system/s03-03-predict-the-checks"},{"id":"monitoring-cloud-portfolio/mission-final-portfolio-ml-system/s04-04-fill-the-return"},{"id":"monitoring-cloud-portfolio/mission-final-portfolio-ml-system/s05-05-fix-the-overtrust"},{"id":"monitoring-cloud-portfolio/mission-final-portfolio-ml-system/s06-06-write-decide"},{"id":"monitoring-cloud-portfolio/mission-final-portfolio-ml-system/s07-07-checkpoint"}]}]}]},"detail":{"number":21,"slug":"evals","title":"eval-driven ai development","blurb":"if you can't test it, you can't ship it. learn the simple-but-strict eval patterns that separate ai features that work from ones that just feel like they do.","overview":"$20","status":"live","lessons":[{"slug":"check-before-you-trust","title":"Check before you trust","estMinutes":8,"prerequisites":[],"status":"live","steps":[{"type":"read","id":"evals/check-before-you-trust/s01-01-intro","xp":2,"hint":[],"personalize":false,"phase":"warmup","estSeconds":300,"concept":"check-before-you-trust","body":"$21","cta":"Got it","runnable":true}],"xpTotal":2},{"slug":"writing-evals","title":"Assertions on AI output, not vibes","estMinutes":9,"prerequisites":[],"status":"live","steps":[{"type":"read","id":"evals/writing-evals/s01-01-intro","xp":1,"hint":[],"personalize":false,"phase":"warmup","estSeconds":110,"concept":"eval-mindset","body":"$22","cta":"Got it","code":"# the canonical eval shape — input, expected, actual, pass/fail.\ncases = [\n {\"name\": \"extract_email_simple\",\n \"input\": \"ping me at sam@example.com\",\n \"expected\": \"sam@example.com\"},\n {\"name\": \"extract_email_with_punct\",\n \"input\": \"(maya@example.com)\",\n \"expected\": \"maya@example.com\"},\n]\n\n# in real life this would call an LLM. here it's a stub.\ndef fake_extract(text):\n import re\n m = re.search(r\"[\\w.+-]+@[\\w-]+\\.[\\w.-]+\", text)\n return m.group(0) if m else None\n\npassed = 0\nfor c in cases:\n actual = fake_extract(c[\"input\"])\n ok = actual == c[\"expected\"]\n passed += ok\n print(f\"{c['name']}: {'PASS' if ok else 'FAIL'}\")\nprint(f\"{passed}/{len(cases)}\")\n","runnable":true},{"type":"mc","id":"evals/writing-evals/s02-02-which-eval-passes","xp":2,"hint":[{"level":1,"body":"`actual` is a full sentence. Which check is True for any sentence that contains the word `Paris`?","cost":0},{"level":2,"body":"`\"Paris\" in actual` is the substring check — it passes whenever the answer is somewhere in the output. That's the most robust pattern for free-form AI output.","cost":0}],"personalize":false,"phase":"warmup","estSeconds":50,"concept":"assertion-styles","prompt":"Cursor wrote three evals against the same string output. Two are\nFalse, one is True. Which assertion *passes*?\n","options":[{"id":"a","label":"`e1` — exact equality with `\"Paris\"`","explain":"False — `actual` is the full sentence, not just the word `Paris`."},{"id":"b","label":"`e2` — `\"Paris\" in actual`"},{"id":"c","label":"`e3` — `actual.startswith(\"Paris\")`","explain":"False — the sentence starts with `The`, not `Paris`."},{"id":"d","label":"None of them pass.","explain":"One of them passes. Look at the substring check."}],"answerIds":["b"],"shuffle":false,"code":"actual = \"The capital of France is Paris.\"\n\n# three candidate evals, each checking a different way.\ne1 = actual == \"Paris\"\ne2 = \"Paris\" in actual\ne3 = actual.startswith(\"Paris\")\n\nprint(e1, e2, e3)\n","runnable":true},{"type":"read","id":"evals/writing-evals/s03-03-eval-patterns","xp":1,"hint":[],"personalize":false,"phase":"build","estSeconds":130,"concept":"eval-patterns","body":"$23","cta":"Got it","code":"# four eval patterns, in order from strict to loose.\nimport json\n\n# pattern 1: exact match\ndef eval_exact(actual, expected):\n return actual == expected\n\n# pattern 2: substring\ndef eval_contains(actual, expected_substring):\n return expected_substring.lower() in actual.lower()\n\n# pattern 3: shape check on JSON\ndef eval_shape(actual_str, required_keys):\n try:\n data = json.loads(actual_str)\n except json.JSONDecodeError:\n return False\n return all(k in data for k in required_keys)\n\n# pattern 4: regex (still rule-based, just more flexible)\nimport re\ndef eval_regex(actual, pattern):\n return re.search(pattern, actual) is not None\n\nprint(eval_exact(\"yes\", \"yes\"))\nprint(eval_contains(\"The answer is 42.\", \"42\"))\nprint(eval_shape('{\"name\":\"maya\",\"score\":7}', [\"name\", \"score\"]))\nprint(eval_regex(\"order 1234\", r\"order \\d+\"))\n","runnable":true},{"type":"predict","id":"evals/writing-evals/s04-04-predict-the-pass-rate","xp":3,"hint":[{"level":1,"body":"Lowercase both sides, then check substring. Cases 1, 2, and 4 all contain `paris` after lowercasing. Case 3 has `lyon`.","cost":0},{"level":2,"body":"Three pass, one fails. Output is `3/4`.","cost":0}],"personalize":false,"phase":"build","estSeconds":70,"concept":"pass-rate-counting","code":"cases = [\n {\"actual\": \"Paris\", \"expected\": \"Paris\"},\n {\"actual\": \"The capital is Paris.\",\"expected\": \"Paris\"},\n {\"actual\": \"Lyon\", \"expected\": \"Paris\"},\n {\"actual\": \"PARIS\", \"expected\": \"Paris\"},\n]\n\npassed = 0\nfor c in cases:\n if c[\"expected\"].lower() in c[\"actual\"].lower():\n passed += 1\n\nprint(f\"{passed}/{len(cases)}\")\n","prompt":"Read the eval loop on the right. It uses the case-insensitive\n*contains* check. How many of the four cases pass?\n","grader":{"kind":"stdout-equality","expected":"3/4","normalize":"collapse-trailing-newline"}},{"type":"fill","id":"evals/writing-evals/s05-05-fill-the-assertion","xp":3,"hint":[{"level":1,"body":"It's the Python keyword that raises `AssertionError` when the expression is False.","cost":0},{"level":2,"body":"It's `assert`. The line becomes `assert actual == \"sam@example.com\"`.","cost":0}],"personalize":false,"phase":"build","estSeconds":60,"concept":"assert-keyword","prompt":"Cursor wrote a pytest-style eval, but the assertion line is missing\nthe keyword that turns a comparison into a passing or failing test.\nFill in the keyword.\n\nKeyword: ___\n","code":"def fake_extract(text):\n return \"sam@example.com\"\n\ndef test_extract_email():\n actual = fake_extract(\"ping me at sam@example.com\")\n ___ actual == \"sam@example.com\"\n\ntest_extract_email()\nprint(\"ok\")\n","blanks":[{"id":"keyword","accept":["assert"],"caseSensitive":true,"normalize":"trim"}]},{"type":"fix","id":"evals/writing-evals/s06-06-fix-the-flaky-eval","xp":4,"hint":[{"level":1,"body":"AI output capitalization is unreliable. Lowercase both sides before comparing.","cost":0},{"level":2,"body":"Change line 4 to `ok = actual.lower() == expected.lower()`.","cost":0}],"personalize":false,"phase":"build","estSeconds":90,"concept":"case-insensitive-eval","brokenCode":"expected = \"yes\"\nactual = \"YES\"\n\nok = actual == expected\nprint(\"pass\" if ok else \"fail\")\n","prompt":"Claude wrote a yes/no classifier eval. The `actual` answer is `\"YES\"`\nbut the eval expects `\"yes\"` and fails. The fix is to make the\ncomparison case-insensitive. Change line 4 so the script prints\n`pass`.\n\nExpected output:\n```\npass\n```\n","grader":{"kind":"stdout-equality","expected":"pass","normalize":"collapse-trailing-newline"},"bugLines":[4],"revealAfter":4},{"type":"fix","id":"evals/writing-evals/s07-07-fix-the-overstrict-eval","xp":4,"hint":[{"level":1,"body":"Exact equality is too strict for prose. Use `in` for a substring check.","cost":0},{"level":2,"body":"Change line 4 to `ok = expected in actual`. That checks whether `Paris` appears anywhere in the sentence.","cost":0}],"personalize":false,"phase":"build","estSeconds":95,"concept":"substring-eval","brokenCode":"expected = \"Paris\"\nactual = \"The capital of France is Paris.\"\n\nok = actual == expected\nprint(\"pass\" if ok else \"fail\")\n","prompt":"Cursor wrote an eval that checks whether the model mentioned `Paris`\nsomewhere in its answer. It used exact equality, which fails because\nthe model returned a full sentence. Switch the check to a *contains*\ncomparison so it passes.\n\nExpected output:\n```\npass\n```\n","grader":{"kind":"stdout-equality","expected":"pass","normalize":"collapse-trailing-newline"},"bugLines":[4],"revealAfter":4},{"type":"write","id":"evals/writing-evals/s08-08-write-the-suite","xp":5,"hint":[{"level":1,"body":"Loop over cases, call `classify` on each `input`, compare to `expected` (case-insensitive), count passes.","cost":0},{"level":2,"body":"Initialize `passed = 0`. In the loop, increment when `actual.lower() == c[\"expected\"].lower()`. Return `(passed, len(cases))`.","cost":0}],"personalize":false,"phase":"build","estSeconds":140,"concept":"build-eval-suite","prompt":"Build a small eval runner. The starter has a list of cases (each with\n`name`, `input`, and `expected`) and a stub `classify(text)` that\nreturns `\"yes\"` or `\"no\"` based on whether the text contains\n`\"cancel\"`. Write a function `run_suite(cases)` that:\n\n- Calls `classify(case[\"input\"])` for each case.\n- Compares the lowercased actual to the lowercased expected.\n- Returns a tuple `(passed, total)`.\n\nThen call `run_suite(cases)` and print the result as\n`passed/total`.\n\nExpected output:\n```\n3/3\n```\n","starter":"cases = [\n {\"name\": \"explicit_cancel\",\n \"input\": \"please cancel my order\",\n \"expected\": \"yes\"},\n {\"name\": \"no_intent\",\n \"input\": \"what's the status of my order\",\n \"expected\": \"no\"},\n {\"name\": \"shouty_cancel\",\n \"input\": \"CANCEL THIS NOW\",\n \"expected\": \"yes\"},\n]\n\ndef classify(text):\n return \"yes\" if \"cancel\" in text.lower() else \"no\"\n\n# define run_suite(cases) below\n\npassed, total = run_suite(cases)\nprint(f\"{passed}/{total}\")\n","grader":{"kind":"stdout-equality","expected":"3/3","normalize":"collapse-trailing-newline"},"solution":"cases = [\n {\"name\": \"explicit_cancel\",\n \"input\": \"please cancel my order\",\n \"expected\": \"yes\"},\n {\"name\": \"no_intent\",\n \"input\": \"what's the status of my order\",\n \"expected\": \"no\"},\n {\"name\": \"shouty_cancel\",\n \"input\": \"CANCEL THIS NOW\",\n \"expected\": \"yes\"},\n]\n\ndef classify(text):\n return \"yes\" if \"cancel\" in text.lower() else \"no\"\n\ndef run_suite(cases):\n passed = 0\n for c in cases:\n actual = classify(c[\"input\"])\n if actual.lower() == c[\"expected\"].lower():\n passed += 1\n return passed, len(cases)\n\npassed, total = run_suite(cases)\nprint(f\"{passed}/{total}\")\n","hiddenTests":[]},{"type":"checkpoint","id":"evals/writing-evals/s09-09-checkpoint","xp":8,"hint":[],"personalize":false,"phase":"check","estSeconds":130,"concept":"regression-eval","prompt":"Last one. The starter has a `regression_cases` list and a stub\nclassifier that has a known bug — it returns `\"no\"` for the input\n`\"cancel my account\"`, which should be `\"yes\"`. Add the missing case\nto `regression_cases` so it would catch this exact bug. Run the suite\nand confirm the output is `2/3` (two pass, one — the new regression\ncase — fails).\n\nAdd one entry: input `\"cancel my account\"`, expected `\"yes\"`, name\n`\"account_cancel\"`.\n\nExpected output:\n```\n2/3\n```\n","starter":"def classify(text):\n # buggy: misses \"account\" cancel intent\n if \"order\" in text.lower() and \"cancel\" in text.lower():\n return \"yes\"\n return \"no\"\n\nregression_cases = [\n {\"name\": \"explicit_cancel\",\n \"input\": \"please cancel my order\",\n \"expected\": \"yes\"},\n {\"name\": \"no_intent\",\n \"input\": \"what's the status of my order\",\n \"expected\": \"no\"},\n # add the regression case here\n]\n\ndef run_suite(cases):\n passed = 0\n for c in cases:\n if classify(c[\"input\"]).lower() == c[\"expected\"].lower():\n passed += 1\n return passed, len(cases)\n\np, t = run_suite(regression_cases)\nprint(f\"{p}/{t}\")\n","grader":{"kind":"stdout-equality","expected":"2/3","normalize":"collapse-trailing-newline"},"solution":"def classify(text):\n if \"order\" in text.lower() and \"cancel\" in text.lower():\n return \"yes\"\n return \"no\"\n\nregression_cases = [\n {\"name\": \"explicit_cancel\",\n \"input\": \"please cancel my order\",\n \"expected\": \"yes\"},\n {\"name\": \"no_intent\",\n \"input\": \"what's the status of my order\",\n \"expected\": \"no\"},\n {\"name\": \"account_cancel\",\n \"input\": \"cancel my account\",\n \"expected\": \"yes\"},\n]\n\ndef run_suite(cases):\n passed = 0\n for c in cases:\n if classify(c[\"input\"]).lower() == c[\"expected\"].lower():\n passed += 1\n return passed, len(cases)\n\np, t = run_suite(regression_cases)\nprint(f\"{p}/{t}\")\n","passThreshold":0.66}],"xpTotal":31},{"slug":"llm-as-judge","title":"LLM-as-judge — when the judge is another model","estMinutes":12,"prerequisites":["writing-evals"],"status":"live","steps":[{"type":"read","id":"evals/llm-as-judge/s01-01-intro","xp":1,"hint":[],"personalize":false,"phase":"warmup","estSeconds":120,"concept":"llm-as-judge-pattern","body":"$24","cta":"Got it","code":"# the judge is just a model with a critique prompt and a binary verdict.\n\ndef judge(question, answer, rubric):\n # in real code: a model call constrained to {\"passed\": bool, \"critique\": str}.\n # here: a deterministic mock that returns canned verdicts.\n if \"Paris\" in answer and len(answer) < 50:\n return {\"passed\": True, \"critique\": \"Concise and correct.\"}\n if \"Paris\" in answer:\n return {\"passed\": True, \"critique\": \"Correct but verbose.\"}\n return {\"passed\": False, \"critique\": \"Did not name Paris.\"}\n\ncases = [\n (\"What's the capital of France?\", \"Paris.\"),\n (\"What's the capital of France?\", \"Paris is the capital, located in northern France.\"),\n (\"What's the capital of France?\", \"I think maybe Lyon?\"),\n]\nfor q, a in cases:\n v = judge(q, a, \"Reward correct, concise answers.\")\n print(f\"{a!r}: pass={v['passed']} — {v['critique']}\")\n","runnable":true},{"type":"mc","id":"evals/llm-as-judge/s02-02-pairwise-or-rubric","xp":2,"hint":[{"level":1,"body":"Pairwise needs *two outputs to compare* and returns a relative winner. Which scenarios actually have two outputs to compare?","cost":0},{"level":2,"body":"Only C has two prompt drafts to compare. A, B, D all need absolute pass/fail or scores.","cost":0}],"personalize":false,"phase":"warmup","estSeconds":60,"concept":"pairwise-vs-rubric-fit","prompt":"Pairwise asks \"A or B better?\" → returns A/B/tie. Rubric asks\n\"does this output pass?\" → returns pass/fail. Which TWO scenarios\nabove are the strongest fit for **pairwise**?\n","options":[{"id":"a_and_d","label":"A (regression) and D (production gate)","explain":"Both need an absolute quality floor — 'pass or fail per case' — and run on every input. Pairwise gives you a relative winner, not a floor."},{"id":"b_and_c","label":"B (daily dashboard) and C (A/B prompt test)","explain":"B needs an absolute number for a trend line. Pairwise gives you 'A vs B,' not 'how good is today.'"},{"id":"a_and_c","label":"A (regression) and C (A/B prompt test)","explain":"A needs a per-case pass/fail to gate CI. Pairwise can't say 'this case got worse' — only 'this output beat that output.'"},{"id":"c_and_d","label":"C (A/B prompt test) and D (production gate)","explain":"D needs an absolute floor per output. Pairwise compares two outputs, doesn't grade one."},{"id":"c_only_paired_correctly","label":"C only — the others all fit rubric"},{"id":"pairwise_for_c_and_b_dashboard","label":"Both A/B testing scenarios — C is the only A/B test here","explain":"B is a dashboard, not an A/B test."}],"answerIds":["c_only_paired_correctly"],"shuffle":false,"code":"# four eval scenarios. picking the wrong shape costs you reliability.\nscenarios = {\n \"A\": \"Regression eval: same prompt v1 vs v2, run on 50 fixed cases, fail CI on regression.\",\n \"B\": \"Daily quality dashboard: one number per day showing 'how well is the model doing'.\",\n \"C\": \"A/B prompt test: which of two prompt drafts produces better support replies on a 30-case set?\",\n \"D\": \"Production gate: every output must meet a minimum quality bar before being shown to users.\",\n}\nfor k, v in scenarios.items():\n print(f\"{k}: {v}\")\n","runnable":true},{"type":"read","id":"evals/llm-as-judge/s03-03-the-four-biases","xp":2,"hint":[],"personalize":false,"phase":"build","estSeconds":110,"concept":"judge-bias-taxonomy","body":"$25","cta":"Got it","code":"# the bias hall of fame, with what the literature actually reports.\n\nBIASES = {\n \"position\": \"Judge prefers whichever response appears first.\",\n \"length\": \"Judge prefers longer outputs even when shorter is correct.\",\n \"self_preference\": \"Judge favors outputs from its own model family.\",\n \"style\": \"Judge over-weights confident, well-formatted prose.\",\n}\n\n# what the literature actually reports (observed effects, not Cohen's d):\nEVIDENCE = {\n \"position\": \"GPT-4 order-swap consistency: 65% -> 77.5% (Zheng et al. 2023)\",\n \"length\": \"MT-Bench 'repetitive list attack' wins on padding (Zheng et al. 2023)\",\n \"self_pref\": \"GPT-4 favors GPT-4 outputs; Claude favors Claude (Zheng et al. 2023)\",\n \"style\": \"Style correlates ~1.0 with judge score, above correctness ~0.88 (Feuer/Goldblum 2024)\",\n}\n\nfor name, desc in BIASES.items():\n print(f\"{name}: {desc}\")\n","runnable":true},{"type":"predict","id":"evals/llm-as-judge/s04-04-predict-the-bias","xp":3,"hint":[{"level":1,"body":"The judge always returns \"A\". So v1==\"A\" AND v2==\"A\". The `consistent` check requires `(v1==\"A\" AND v2==\"B\")` OR `(v1==\"B\" AND v2==\"A\")` — i.e., one slot wins round 1 and the OTHER slot wins round 2. Both being \"A\" means neither branch matches.","cost":0},{"level":2,"body":"`consistent? False` — bias detected. The both-orders mitigation just caught a judge that ignores content entirely.","cost":0}],"personalize":false,"phase":"build","estSeconds":80,"concept":"position-bias-detection","code":"# a position-biased mock judge: always picks whatever sits in slot A,\n# regardless of which physical output is in there.\n\ndef fake_judge_pairwise(input_text, a, b):\n return \"A\" # bias: always picks slot A, ignores content\n\nX = \"Output X (correct, concise)\"\nY = \"Output Y (verbose, partially wrong)\"\n\n# round 1: X in slot A, Y in slot B\nv1 = fake_judge_pairwise(\"Which is better?\", X, Y)\n# round 2: Y in slot A, X in slot B\nv2 = fake_judge_pairwise(\"Which is better?\", Y, X)\n\n# consistent? if the SAME physical output won both rounds, yes.\n# round 1 winner=A means X won. round 2 winner=A means Y won.\n# different physical outputs won → not consistent → bias detected.\nconsistent = (v1 == \"A\" and v2 == \"B\") or (v1 == \"B\" and v2 == \"A\")\n\nprint(f\"order 1: winner={v1}\")\nprint(f\"order 2: winner={v2}\")\nprint(f\"consistent? {consistent}\")\n","prompt":"Read the editor. The fake judge has pure position bias — it always\npicks slot A, no matter what content sits there. We run the same\npair in both orders, then check whether the same physical output\nwon both times.\n\nWhat does the script print on the third line?\n","grader":{"kind":"stdout-equality","expected":"order 1: winner=A\norder 2: winner=A\nconsistent? False\n","normalize":"collapse-trailing-newline"}},{"type":"fill","id":"evals/llm-as-judge/s05-05-fill-the-rubric-call","xp":3,"hint":[{"level":1,"body":"The verdict dict has a `\"passed\"` key whose value is `True` or `False`. Read that field directly — no comparison against a number.","cost":0},{"level":2,"body":"`verdict[\"passed\"]` — the boolean is the whole answer. No `>= 4` threshold, no `== \"PASS\"` string compare.","cost":0}],"personalize":false,"phase":"build","estSeconds":65,"concept":"rubric-judge-binary-verdict","prompt":"The judge is a function `judge_rubric(question, answer, rubric)`\nthat returns a dict like `{\"passed\": True, \"critique\": \"...\"}`.\nYour code needs to read the verdict and branch on the binary\n`passed` field — never on a numeric score.\n\nFill in the missing condition so the loop prints `pass` for\npassing cases and `fail` for failing ones.\n\nExpected output:\n```\npass\nfail\n```\n","code":"def judge_rubric(question, answer, rubric):\n if \"Paris\" in answer:\n return {\"passed\": True, \"critique\": \"Names Paris.\"}\n return {\"passed\": False, \"critique\": \"Does not name Paris.\"}\n\ncases = [\n (\"Paris.\"),\n (\"Lyon, I think.\"),\n]\nrubric = \"Reward correct city name.\"\nfor answer in cases:\n verdict = judge_rubric(\"capital of France?\", answer, rubric)\n if ___:\n print(\"pass\")\n else:\n print(\"fail\")\n","blanks":[{"id":"pass_check","accept":["verdict[\"passed\"]","verdict['passed']","verdict.get(\"passed\")","verdict.get('passed')"],"caseSensitive":true,"normalize":"trim"}]},{"type":"fix","id":"evals/llm-as-judge/s06-06-fix-the-likert-trap","xp":4,"hint":[{"level":1,"body":"Read `v[\"passed\"]` instead of computing a threshold over `v[\"score\"]`. The binary field is what the rubric is calibrated against; the score is incidental.","cost":0},{"level":2,"body":"Change line 8 to:\n\n```python\nship = v[\"passed\"]\n```\n","cost":0}],"personalize":false,"phase":"build","estSeconds":100,"concept":"binary-not-likert","brokenCode":"verdicts = [\n {\"score\": 4, \"passed\": False, \"critique\": \"Well-written but factually wrong.\"},\n {\"score\": 3, \"passed\": True, \"critique\": \"Correct, plain prose.\"},\n]\n\nfor i, v in enumerate(verdicts, start=1):\n # bug: numeric threshold ignores the binary verdict the rubric was designed for.\n ship = v[\"score\"] >= 4\n print(f\"case {i}: ship={ship}\")\n","prompt":"The judge returns both a 1-5 numeric score AND a binary `passed`\nflag (some teams emit both during transition). The code is using\nthe score with a hardcoded threshold — `score >= 4` — but the\nrubric was designed for binary pass/fail and the 1-5 axis is\nuncalibrated. As Hamel Husain puts it: averaging Likert scores\nproduces false precision.\n\nTwo cases come in: one where the model judged 4/5 with `passed: False`\n(the answer is technically wrong but well-written — style bias),\nand one where the model judged 3/5 with `passed: True` (the answer\nis correct but plain).\n\nFix the gate so it reads the binary verdict instead of the numeric\nscore.\n\nExpected output:\n```\ncase 1: ship=False\ncase 2: ship=True\n```\n","grader":{"kind":"stdout-equality","expected":"case 1: ship=False\ncase 2: ship=True\n","normalize":"collapse-trailing-newline"},"bugLines":[8],"revealAfter":4},{"type":"fix","id":"evals/llm-as-judge/s07-07-fix-the-position-bias","xp":4,"hint":[{"level":1,"body":"Run the judge twice: once as `(question, output_x, output_y)`, once as `(question, output_y, output_x)`. If both rounds pick the same physical output, that's a real winner. If they disagree, return `\"tie\"`.","cost":0},{"level":2,"body":"Replace line 11 with the both-orders mitigation:\n\n```python\nv1 = biased_judge(question, output_x, output_y) # X in A, Y in B\nv2 = biased_judge(question, output_y, output_x) # Y in A, X in B\nx_won_both = (v1 == \"A\" and v2 == \"B\")\ny_won_both = (v1 == \"B\" and v2 == \"A\")\nif x_won_both: result = \"X\"\nelif y_won_both: result = \"Y\"\nelse: result = \"tie\"\n```\n","cost":0}],"personalize":false,"phase":"build","estSeconds":110,"concept":"both-orders-mitigation","brokenCode":"# the judge is biased toward whichever option sits in slot A.\ndef biased_judge(question, a, b):\n return \"A\" # always picks A, regardless of content\n\nquestion = \"Which output is better?\"\noutput_x = \"Output X\"\noutput_y = \"Output Y\"\n\n# bug: only one order tested, so we ship whatever sits in slot A.\nwinner = biased_judge(question, output_x, output_y)\nresult = \"X\" if winner == \"A\" else \"Y\"\nprint(f\"result: {result}\")\n","prompt":"The pairwise judge is calling itself only ONCE per pair — order\n`(a, b)`. Position bias means whichever output sits in slot `A`\ngets ~35% more votes than it deserves on close pairs.\n\nFix the code to run BOTH orders and only declare a winner when the\njudge is *consistent* (same physical output wins regardless of\nposition). On disagreement, return `\"tie\"`.\n\nExpected output:\n```\nresult: tie\n```\n","grader":{"kind":"stdout-equality","expected":"result: tie","normalize":"collapse-trailing-newline"},"bugLines":[11],"revealAfter":5},{"type":"write","id":"evals/llm-as-judge/s08-08-write-the-rubric-judge","xp":5,"hint":[{"level":1,"body":"Loop over cases, call the judge, increment a counter on `v[\"passed\"]`. After the loop compute `total - passed` for failures and `passed / total` for the rate.","cost":0},{"level":2,"body":"```python\ndef run_judge_suite(cases):\n passed_count = 0\n for c in cases:\n v = judge_rubric(c[\"question\"], c[\"answer\"], c[\"rubric\"])\n if v[\"passed\"]:\n passed_count += 1\n total = len(cases)\n return {\n \"total\": total,\n \"passed\": passed_count,\n \"failed\": total - passed_count,\n \"pass_rate\": round(passed_count / total, 2),\n }\n```\n","cost":0}],"personalize":false,"phase":"build","estSeconds":180,"concept":"rubric-judge-suite","prompt":"Build a rubric-style eval suite. Write `run_judge_suite(cases)` that:\n\n- Takes a list of dicts, each shaped\n `{\"question\": str, \"answer\": str, \"rubric\": str}`.\n- For each case, calls `judge_rubric(question, answer, rubric)`.\n The judge returns `{\"passed\": bool, \"critique\": str}`.\n- Counts how many cases pass.\n- Returns a dict `{\"total\": , \"passed\": , \"failed\": , \"pass_rate\": <0.0-1.0 rounded to 2 places>}`.\n\nThe script will run a 4-case suite. Expected output:\n```\ntotal=4 passed=2 failed=2 pass_rate=0.5\n```\n","starter":"def judge_rubric(question, answer, rubric):\n # mock judge: passes if \"Paris\" appears in answer.\n if \"Paris\" in answer:\n return {\"passed\": True, \"critique\": \"Names Paris.\"}\n return {\"passed\": False, \"critique\": \"Does not name Paris.\"}\n\ncases = [\n {\"question\": \"capital of France?\", \"answer\": \"Paris.\", \"rubric\": \"Reward correct answer.\"},\n {\"question\": \"capital of France?\", \"answer\": \"Lyon, I think.\", \"rubric\": \"Reward correct answer.\"},\n {\"question\": \"capital of France?\", \"answer\": \"Paris, in northern France.\", \"rubric\": \"Reward correct, concise answer.\"},\n {\"question\": \"capital of France?\", \"answer\": \"Some city in the south, maybe.\", \"rubric\": \"Reward correct answer.\"},\n]\n\n# define run_judge_suite(cases) below\n\nresult = run_judge_suite(cases)\nprint(f\"total={result['total']} passed={result['passed']} failed={result['failed']} pass_rate={result['pass_rate']}\")\n","grader":{"kind":"stdout-equality","expected":"total=4 passed=2 failed=2 pass_rate=0.5","normalize":"collapse-trailing-newline"},"solution":"$26","hiddenTests":[]},{"type":"checkpoint","id":"evals/llm-as-judge/s09-09-checkpoint","xp":8,"hint":[],"personalize":false,"phase":"check","estSeconds":220,"concept":"position-bias-aware-pairwise","prompt":"Final drill. Build a pairwise judge that detects its OWN position\nbias and returns `tie` when it disagrees with itself across orders.\n\nWrite `pairwise(question, output_a, output_b)` that:\n\n- Calls `fake_judge(question, first, second)` twice:\n - Once with `first=output_a, second=output_b` — read result as\n \"the output named output_a sits in slot A this round.\"\n - Once with `first=output_b, second=output_a` — output_b is now\n in slot A.\n- Each call returns `\"A\"` or `\"B\"` (the *slot* that won).\n- Translate slot wins back to which physical output won that round.\n- If the SAME physical output won both rounds, return that output's\n label (`\"a\"` or `\"b\"`).\n- If the rounds disagreed, return `\"tie\"`.\n\nThen run a multi-case suite. Expected output:\n```\ncase 1: a wins (consistent)\ncase 2: tie (position-biased — judge disagrees with itself)\ncase 3: b wins (consistent)\n```\n","starter":"# the fake judge has a known length bias: always picks slot with the longer string.\ndef fake_judge(question, first, second):\n return \"A\" if len(first) > len(second) else \"B\"\n\n# define pairwise(question, output_a, output_b) below\n\ncases = [\n # case 1: output_a is genuinely longer → wins both orders → \"a wins\"\n (\"Pick the better answer.\", \"A long detailed correct answer here.\", \"Short.\"),\n # case 2: equal length → judge fluctuates → \"tie\"\n (\"Pick the better answer.\", \"Same exact length here\", \"Same exact length here\"),\n # case 3: output_b is genuinely longer → wins both orders → \"b wins\"\n (\"Pick the better answer.\", \"Brief.\", \"A long detailed correct answer here.\"),\n]\n\nfor i, (q, a, b) in enumerate(cases, start=1):\n result = pairwise(q, a, b)\n label = \"a wins (consistent)\" if result == \"a\" else \\\n \"b wins (consistent)\" if result == \"b\" else \\\n \"tie (position-biased — judge disagrees with itself)\"\n print(f\"case {i}: {label}\")\n","grader":{"kind":"stdout-equality","expected":"case 1: a wins (consistent)\ncase 2: tie (position-biased — judge disagrees with itself)\ncase 3: b wins (consistent)\n","normalize":"collapse-trailing-newline"},"solution":"$27","passThreshold":0.66}],"xpTotal":32},{"slug":"the-rise-of-evals-as-a-discipline","title":"How evals went from research curiosity to the only thing that ships — a five-year history","estMinutes":20,"prerequisites":["llm-as-judge"],"status":"live","steps":[{"type":"read","id":"evals/the-rise-of-evals-as-a-discipline/s01-01-the-prompt-engineering-era","xp":1,"hint":[],"personalize":false,"phase":"warmup","estSeconds":220,"concept":"prompt-engineering-era","body":"$28","cta":"Got it","runnable":true},{"type":"read","id":"evals/the-rise-of-evals-as-a-discipline/s02-02-the-eval-turn","xp":1,"hint":[],"personalize":false,"phase":"warmup","estSeconds":220,"concept":"the-eval-turn","body":"$29","cta":"Got it","runnable":true},{"type":"mc","id":"evals/the-rise-of-evals-as-a-discipline/s03-03-which-company-survives-a-model-swap","xp":2,"hint":[{"level":1,"body":"The question 'does my product still work after a model swap?' is only answerable if you can run a known set of inputs through the new model and compare to known-good outputs. That's an eval suite. Only one company has one.","cost":0}],"personalize":false,"phase":"build","estSeconds":70,"concept":"model-swap-survival","prompt":"Tomorrow, GPT-5 ships. Every team's existing prompts will behave\ndifferently — some better, some worse, some catastrophically wrong\non inputs they used to handle.\n\nWhich company still has a working product on day 2 with high\nconfidence?\n","options":[{"id":"a","label":"A — VibeCo (founder tunes by feel)","explain":"No test cases means no way to detect that GPT-5 broke anything until customers complain. The vibes era's exact failure mode."},{"id":"b","label":"B — TestsBeforePromptCo (200 cases in CI)"},{"id":"c","label":"C — DemoDrivenCo (three README examples)","explain":"Three hand-picked inputs is better than zero, but it's still demo-driven evals. The cases that break on a model swap are almost never the ones in your README — they're the long-tail inputs your customers actually send."},{"id":"d","label":"D — PromptLibraryCo (60 prompts, no programmatic check)","explain":"Lots of prompts but no way to verify them. A library of unverified prompts is just a bigger pile of vibes. Quantity doesn't replace measurement."}],"answerIds":["b"],"shuffle":false,"code":"# four companies. each has shipped an LLM-powered feature.\n# GPT-5 launches tomorrow. which one still has a working product on day 2?\ncompanies = {\n \"A\": {\n \"name\": \"VibeCo\",\n \"process\": \"founder tunes the prompt by hand, ships when it 'feels right', no test cases on disk\",\n },\n \"B\": {\n \"name\": \"TestsBeforePromptCo\",\n \"process\": \"200 input/expected pairs in a YAML file; CI runs them on every prompt change AND every model version; release blocked on regressions\",\n },\n \"C\": {\n \"name\": \"DemoDrivenCo\",\n \"process\": \"three hand-picked example inputs in the README; runs them manually before every deploy\",\n },\n \"D\": {\n \"name\": \"PromptLibraryCo\",\n \"process\": \"60 prompts in a notion doc, color-coded by which model they were tuned for; no programmatic check\",\n },\n}\nfor k, v in companies.items():\n print(f\"{k}: {v['name']:25} — {v['process']}\")\n","runnable":true},{"type":"read","id":"evals/the-rise-of-evals-as-a-discipline/s04-04-case-studies","xp":1,"hint":[],"personalize":false,"phase":"build","estSeconds":220,"concept":"eval-case-studies","body":"$2a","cta":"Got it","runnable":true},{"type":"mc","id":"evals/the-rise-of-evals-as-a-discipline/s05-05-eval-first-vs-prompt-first","xp":2,"hint":[{"level":1,"body":"Eval-first means the cases that define 'success' exist BEFORE the prompt that's supposed to produce success. The prompt is written to pass the evals, not the other way around. Only one flow does this in that order.","cost":0}],"personalize":false,"phase":"build","estSeconds":70,"concept":"eval-first-vs-prompt-first","prompt":"Anthropic's \"Building Effective Agents\" mantra: **evals come first,\nprompts come second**. Which of these four workflows is genuinely\neval-first?\n","options":[{"id":"a","label":"A — write prompt, then write evals 'eventually'","explain":"'Evals eventually' means evals never. By the time the prompt is shipped, the team has moved on and the evals never get written. This is the vibes era with extra steps."},{"id":"b","label":"B — write 30 cases first, then write the prompt to pass them"},{"id":"c","label":"C — ship, wait for complaints, add eval per complaint","explain":"Reactive eval growth is better than no evals, but it's still prompt-first. The customer is your QA loop. This is the Klarna pattern."},{"id":"d","label":"D — let GPT-4 grade its own output, no ground truth on disk","explain":"This is the tautological-eval trap from chapter 21 lesson 01 step 4. The model judging itself is not an eval. Without ground truth, you're measuring nothing."}],"answerIds":["b"],"shuffle":false,"code":"$2b","runnable":true},{"type":"read","id":"evals/the-rise-of-evals-as-a-discipline/s06-06-why-the-eval-engineer-eats-the-prompt-engineer","xp":1,"hint":[],"personalize":false,"phase":"build","estSeconds":220,"concept":"eval-engineer-vs-prompt-engineer","body":"$2c","cta":"Got it","runnable":true},{"type":"write","id":"evals/the-rise-of-evals-as-a-discipline/s07-07-write-the-eval-readiness-audit","xp":5,"hint":[{"level":1,"body":"Five conditionals, each adds a fixed number of points. Use `.get()` with defaults so missing fields don't crash. Then a four-way threshold for the verdict (>=80, >=50, >=20, else).","cost":0},{"level":2,"body":"EvalMatureCo hits all five: 25 + 15 + 25 + 15 + 20 = 100. VibeCo hits zero conditions, score 0. Thresholds: >=80 mature, >=50 aware, >=20 curious, else vibes era.","cost":0}],"personalize":false,"phase":"build","estSeconds":200,"concept":"eval-readiness-audit","prompt":"Write `eval_readiness(team)` that takes a team profile (dict) and\nreturns a dict with two fields:\n\n- `score`: integer 0-100, higher means MORE eval discipline (good)\n- `verdict`: string, one of:\n - `\"eval-mature\"` if score >= 80\n - `\"eval-aware\"` if score >= 50\n - `\"eval-curious\"` if score >= 20\n - `\"vibes era\"` if score < 20\n\nScore the team on these signals (each adds points to the readiness\ntotal):\n\n- `has_test_set` is True: add 25 (ground-truth cases exist)\n- `has_judge_prompt` is True: add 15 (rubric or LLM-as-judge defined somewhere)\n- `ci_runs_evals` is True: add 25 (the regression gate)\n- `tracks_eval_history` is True: add 15 (can compare runs over time)\n- `eval_count_per_feature` >= 20: add 20 (enough cases to catch obvious regressions)\n\nTwo teams run. Expected output:\n```\nEvalMatureCo: {'score': 100, 'verdict': 'eval-mature'}\nVibeCo: {'score': 0, 'verdict': 'vibes era'}\n```\n","starter":"eval_mature_co = {\n \"name\": \"EvalMatureCo\",\n \"has_test_set\": True,\n \"has_judge_prompt\": True,\n \"ci_runs_evals\": True,\n \"tracks_eval_history\": True,\n \"eval_count_per_feature\": 50,\n}\n\nvibe_co = {\n \"name\": \"VibeCo\",\n \"has_test_set\": False,\n \"has_judge_prompt\": False,\n \"ci_runs_evals\": False,\n \"tracks_eval_history\": False,\n \"eval_count_per_feature\": 0,\n}\n\n# define eval_readiness(team) below\n\nprint(f\"EvalMatureCo: {eval_readiness(eval_mature_co)}\")\nprint(f\"VibeCo: {eval_readiness(vibe_co)}\")\n","grader":{"kind":"stdout-equality","expected":"EvalMatureCo: {'score': 100, 'verdict': 'eval-mature'}\nVibeCo: {'score': 0, 'verdict': 'vibes era'}\n","normalize":"collapse-trailing-newline"},"solution":"$2d","hiddenTests":[]},{"type":"checkpoint","id":"evals/the-rise-of-evals-as-a-discipline/s08-08-checkpoint","xp":8,"hint":[],"personalize":false,"phase":"check","estSeconds":240,"concept":"triage-teams-for-model-launch","prompt":"$2e","starter":"teams = [\n {\"name\": \"CursorClone\", \"has_test_set\": True, \"has_judge_prompt\": True, \"ci_runs_evals\": True, \"tracks_eval_history\": True, \"eval_count_per_feature\": 80},\n {\"name\": \"SeriesBAI\", \"has_test_set\": True, \"has_judge_prompt\": True, \"ci_runs_evals\": True, \"tracks_eval_history\": False, \"eval_count_per_feature\": 30},\n {\"name\": \"SeedAI\", \"has_test_set\": True, \"has_judge_prompt\": False, \"ci_runs_evals\": False, \"tracks_eval_history\": False, \"eval_count_per_feature\": 10},\n {\"name\": \"ConsultingShop\", \"has_test_set\": True, \"has_judge_prompt\": True, \"ci_runs_evals\": False, \"tracks_eval_history\": False, \"eval_count_per_feature\": 5},\n {\"name\": \"PromptShopCo\", \"has_test_set\": False, \"has_judge_prompt\": False, \"ci_runs_evals\": False, \"tracks_eval_history\": False, \"eval_count_per_feature\": 0},\n]\n\n# define triage_teams(teams) below\n\nresult = triage_teams(teams)\nprint(f\"verdicts: {result['verdicts']}\")\nprint(f\"most at risk: {result['most_at_risk']}\")\n","grader":{"kind":"stdout-equality","expected":"verdicts: {'CursorClone': 'eval-mature', 'SeriesBAI': 'eval-mature', 'SeedAI': 'eval-curious', 'ConsultingShop': 'eval-curious', 'PromptShopCo': 'vibes era'}\nmost at risk: PromptShopCo\n","normalize":"collapse-trailing-newline"},"solution":"$2f","passThreshold":0.66}],"xpTotal":21},{"slug":"model-validation-gates","title":"Model validation gates before deploy","estMinutes":7,"prerequisites":[],"status":"live","steps":[{"type":"read","id":"evals/model-validation-gates/s01-01-intro","xp":1,"hint":[],"personalize":false,"phase":"warmup","estSeconds":80,"concept":"model-validation-gates","body":"# Model validation gates: compare before you ship\n\nA model that improves one metric can still be a worse product. Validation gates make the tradeoff explicit before a candidate ships.\n\nA simple gate might require accuracy to stay above baseline, latency to stay under a threshold, and schema checks to pass. The key is deciding the line before you look at the candidate.\n\nThis is eval-driven development applied to models: define the gate, run the candidate, compare, then decide.","cta":"Got it","code":"baseline = {\"accuracy\": 0.82, \"latency_ms\": 90}\ncandidate = {\"accuracy\": 0.84, \"latency_ms\": 140}\n\naccuracy_ok = candidate[\"accuracy\"] >= baseline[\"accuracy\"]\nlatency_ok = candidate[\"latency_ms\"] <= 120\nprint(accuracy_ok, latency_ok)","runnable":true},{"type":"mc","id":"evals/model-validation-gates/s02-02-choose-the-risk","xp":2,"hint":[{"level":1,"body":"The gate is there to catch regressions before users do.","cost":0}],"personalize":false,"phase":"build","estSeconds":45,"concept":"model-validation-gates","prompt":"What should the gate do if the candidate accuracy drops below baseline?","options":[{"id":"a","label":"Block the candidate or require review."},{"id":"b","label":"Ship it because it is new.","explain":"New is not a quality signal."},{"id":"c","label":"Ignore the baseline.","explain":"The baseline is the comparison point."},{"id":"d","label":"Delete the metric.","explain":"Removing the metric hides the regression."}],"answerIds":["a"],"shuffle":false,"code":"baseline = {\"accuracy\": 0.82}\ncandidate = {\"accuracy\": 0.80}\nprint(baseline, candidate)","runnable":true},{"type":"predict","id":"evals/model-validation-gates/s03-03-predict-the-checks","xp":3,"hint":[{"level":1,"body":"Two scores are at least 0.82.","cost":0}],"personalize":false,"phase":"build","estSeconds":60,"concept":"model-validation-gates","code":"baseline = 0.82\nscores = [0.83, 0.81, 0.85]\npassing = [score for score in scores if score >= baseline]\nprint(len(passing))","prompt":"Read the code. What does it print?","grader":{"kind":"stdout-equality","expected":"2","normalize":"collapse-trailing-newline"}},{"type":"fill","id":"evals/model-validation-gates/s04-04-fill-the-return","xp":3,"hint":[{"level":1,"body":"The candidate can tie or beat the baseline.","cost":0}],"personalize":false,"phase":"build","estSeconds":50,"concept":"model-validation-gates","prompt":"Fill the comparison operator so candidate accuracy must meet or beat baseline.","code":"baseline = 0.82\ncandidate = 0.84\nprint(candidate ___ baseline)","blanks":[{"id":"blank","accept":[">="],"caseSensitive":true,"normalize":"trim"}]},{"type":"fix","id":"evals/model-validation-gates/s05-05-fix-the-overtrust","xp":4,"hint":[{"level":1,"body":"Accuracy is not the only gate. Latency must be <= 120.","cost":0},{"level":2,"body":"Add `and candidate['latency_ms'] <= 120`.","cost":0}],"personalize":false,"phase":"build","estSeconds":85,"concept":"model-validation-gates","brokenCode":"candidate = {\"accuracy\": 0.84, \"latency_ms\": 140}\n\npassed = candidate[\"accuracy\"] >= 0.82\nprint(\"ship\" if passed else \"review\")","prompt":"The validation gate ignores latency. Fix it so the slow candidate prints `review`.","grader":{"kind":"stdout-equality","expected":"review","normalize":"collapse-trailing-newline"},"bugLines":[3],"revealAfter":4},{"type":"write","id":"evals/model-validation-gates/s06-06-write-count-ready","xp":5,"hint":[{"level":1,"body":"Build two booleans, then combine them with `and`.","cost":0}],"personalize":false,"phase":"build","estSeconds":115,"concept":"model-validation-gates","prompt":"Write `passes_gate(candidate, baseline)` so accuracy must improve and latency must stay under 120.","starter":"candidate = {\"accuracy\": 0.84, \"latency_ms\": 110}\nbaseline = {\"accuracy\": 0.82}\n\n# define passes_gate(candidate, baseline) below\n\nprint(passes_gate(candidate, baseline))","grader":{"kind":"stdout-equality","expected":"True","normalize":"collapse-trailing-newline"},"solution":"candidate = {\"accuracy\": 0.84, \"latency_ms\": 110}\nbaseline = {\"accuracy\": 0.82}\n\ndef passes_gate(candidate, baseline):\n accuracy_ok = candidate[\"accuracy\"] >= baseline[\"accuracy\"]\n latency_ok = candidate[\"latency_ms\"] <= 120\n return accuracy_ok and latency_ok\n\nprint(passes_gate(candidate, baseline))","hiddenTests":[]},{"type":"checkpoint","id":"evals/model-validation-gates/s07-07-checkpoint","xp":8,"hint":[],"personalize":false,"phase":"check","estSeconds":130,"concept":"model-validation-gates","prompt":"Checkpoint: write `gate_report(candidates, baseline)` so it returns the names that pass the gate.","starter":"candidates = [\n {\"name\": \"small\", \"accuracy\": 0.83, \"latency_ms\": 80},\n {\"name\": \"large\", \"accuracy\": 0.86, \"latency_ms\": 180},\n]\nbaseline = {\"accuracy\": 0.82}\n\n# define gate_report(candidates, baseline) below\n\nprint(gate_report(candidates, baseline))","grader":{"kind":"stdout-equality","expected":"['small']","normalize":"collapse-trailing-newline"},"solution":"candidates = [\n {\"name\": \"small\", \"accuracy\": 0.83, \"latency_ms\": 80},\n {\"name\": \"large\", \"accuracy\": 0.86, \"latency_ms\": 180},\n]\nbaseline = {\"accuracy\": 0.82}\n\ndef gate_report(candidates, baseline):\n passing = []\n for candidate in candidates:\n accuracy_ok = candidate[\"accuracy\"] >= baseline[\"accuracy\"]\n latency_ok = candidate[\"latency_ms\"] <= 120\n if accuracy_ok and latency_ok:\n passing.append(candidate[\"name\"])\n return passing\n\nprint(gate_report(candidates, baseline))","passThreshold":0.66}],"xpTotal":26}],"liveLessonCount":5,"liveStepCount":34,"xpTotal":112}},"chapter":"$15:1:props:tree:detail","lesson":"$15:1:props:tree:detail:lessons:2","step":"$15:1:props:tree:detail:lessons:2:steps:1","stepIndex":1,"next":{"chapterSlug":"evals","lessonSlug":"llm-as-judge","stepIndex":2}}]]

LLM-as-judge — when the judge is another model — step 2 of 9