Skip to content
// The Lessons

What AI got wrong.

A running index of every AI fabrication, unit error, and confident-wrong answer caught in dixon.ai tests. Tool, output, what was actually true, the screenshot. The Prompt Stack is the antidote.

// Most recent first
ChatGPT · Web confabulation

Returned a specific earnings date for an upcoming W4 release, sourced from MarketBeat via web search, with no uncertainty qualifier on whether the fiscal calendar had shifted. The confidence was inherited from the source's format, not earned by the model.

Screenshot — ChatGPT web confabulation
Claude · Inferred input

Estimated BMNR $23 call assignment probability via Black-Scholes N(d2) with a sigma of 90–110% it had inferred from historical references found via web search. The formula was correctly named, the inputs were imagined, and the output was presented with false precision.

Screenshot — Claude inferred input
// The antidote

Four prompts that stop AI inventing the answer.

Read the method →

Updated as new posts document new errors. If a test on a post catches a fresh AI failure, it's tagged in the post's frontmatter and lands here on the next build. No separate maintenance.