The original is one click away. Open original ↗
Three prompt rules that stop AI from guessing when extracting data
Executive overview
Smarter AI models guess more confidently instead of admitting uncertainty. This creates an honesty gap: intelligence rises, but honesty stays flat. Combined with automation bias — users trusting AI output more and checking it less — undetected errors compound.
Three prompt rules close that gap when using AI to extract information from documents.
A wrong answer is 3x worse than a blank answer — build that into every extraction prompt.
The two compounding problems
- More capable models are more likely to fabricate plausible answers than to say "I don't know"
- Automation bias means users check AI output less as confidence increases
- The loop self-reinforces: smarter model → more confidence → less checking → more errors
Where AI guessing most commonly occurs
- Extracting data from contracts, invoices, receipts, or legal documents
- Pulling action items from meeting transcripts (AI infers owner and date rather than flagging ambiguity)
- Reviewing contracts with conflicting clauses — AI picks one silently
- Scoring vendors, building CRMs, any structured extraction task
Rule 1: Force blank answers with reasons
- Instruct the AI to extract only values explicitly stated in the document (grounding)
- If a value is ambiguous, missing, or unclear: leave the field blank
- Add a "Reason" column — for every blank field, require a one-sentence explanation
- Reaffirm at the end that all values must be sourced from the document with specific section references
- Benefit: scan for blanks quickly; the reason column pinpoints exactly where to check
Rule 2: Change the incentive
- By default, AI treats a wrong answer and a blank answer as equally acceptable — so it defaults to giving you something
- Add a single line: a wrong answer is 3x worse than a blank answer
- Framing it as a cost shifts the model toward admitting uncertainty
- "When in doubt, leave it blank"
Rule 3: Show the source
- On complex tasks, AI drifts from instructions and starts inferring even after being told not to
- Add a "Source" column with only two allowed values: Extracted (word-for-word from document) or Inferred (AI derived or interpreted it)
- For any "Inferred" value, require an adjacent "Evidence" column with a one-sentence explanation of what was inferred and from where
- Lets you ignore extracted fields and focus review only on inferred ones
Using the three rules together
- Every extraction prompt gets all three rules applied
- Review workflow: check blanks → check inferred fields → trust extracted fields
- Reduces checking burden without reducing accuracy
- Prompts are available in the video description for copy-paste use
More like this — when you're ready for early access.
Join the waitlist for a personal account and content recommendations based on what you're working on.
No spam. Unsubscribe at any time.
You're on the list. We'll be in touch before launch.