Skip to content
// The Evidence

What AI got wrong, and what AI caught.

Two lists, one bar. /lessons indexes the failures — fabrications, unit errors, confident-wrong answers. /catches indexes the moments AI caught something I missed — a language tell, an asymmetry, a sharper reframe. Every entry is specific, observable, and falsifiable. "AI was helpful" does not qualify; "Claude was the only tool to flag the word 'underestimate' as one-sided phrasing in the META Q1 call" does.

// Totals — 6 failures · 1 catch All failures → All catches →
// The method

Four prompts that turn the failures into catches.

Read the Prompt Stack →

The two lists grow as new posts document new moments. Same evidence bar applies — a catch is no easier to log than a failure. A catch has to be specific enough that a sceptical reader can re-run the prompt and check.