Three prompt rules that stop AI from guessing when extracting data

Executive overview

Smarter AI models guess more confidently instead of admitting uncertainty. This creates an honesty gap: intelligence rises, but honesty stays flat. Combined with automation bias — users trusting AI output more and checking it less — undetected errors compound.

Three prompt rules close that gap when using AI to extract information from documents.

A wrong answer is 3x worse than a blank answer — build that into every extraction prompt.

The two compounding problems

  • More capable models are more likely to fabricate plausible answers than to say "I don't know"
  • Automation bias means users check AI output less as confidence increases
  • The loop self-reinforces: smarter model → more confidence → less checking → more errors

Where AI guessing most commonly occurs

  • Extracting data from contracts, invoices, receipts, or legal documents
  • Pulling action items from meeting transcripts (AI infers owner and date rather than flagging ambiguity)
  • Reviewing contracts with conflicting clauses — AI picks one silently
  • Scoring vendors, building CRMs, any structured extraction task

Rule 1: Force blank answers with reasons

  • Instruct the AI to extract only values explicitly stated in the document (grounding)
  • If a value is ambiguous, missing, or unclear: leave the field blank
  • Add a "Reason" column — for every blank field, require a one-sentence explanation
  • Reaffirm at the end that all values must be sourced from the document with specific section references
  • Benefit: scan for blanks quickly; the reason column pinpoints exactly where to check

Rule 2: Change the incentive

  • By default, AI treats a wrong answer and a blank answer as equally acceptable — so it defaults to giving you something
  • Add a single line: a wrong answer is 3x worse than a blank answer
  • Framing it as a cost shifts the model toward admitting uncertainty
  • "When in doubt, leave it blank"

Rule 3: Show the source

  • On complex tasks, AI drifts from instructions and starts inferring even after being told not to
  • Add a "Source" column with only two allowed values: Extracted (word-for-word from document) or Inferred (AI derived or interpreted it)
  • For any "Inferred" value, require an adjacent "Evidence" column with a one-sentence explanation of what was inferred and from where
  • Lets you ignore extracted fields and focus review only on inferred ones

Using the three rules together

  • Every extraction prompt gets all three rules applied
  • Review workflow: check blanks → check inferred fields → trust extracted fields
  • Reduces checking burden without reducing accuracy
  • Prompts are available in the video description for copy-paste use

More like this — when you're ready for early access.

Join the waitlist for a personal account and content recommendations based on what you're working on.

No spam. Unsubscribe at any time.

You're on the list. We'll be in touch before launch.

Get early access to the full library.

Join the waitlist for a personal account and content recommendations based on what you're working on.

No spam. Unsubscribe at any time.

You're on the list. We'll be in touch before launch.

Be among the first to get personalised recommendations tailored to your stage in business.

No spam.

You're on the list. We'll be in touch before launch.

Be among the first to get personalised recommendations tailored to your stage in business.

No spam.

You're on the list. We'll be in touch before launch.