How AI search engines find, evaluate, and cite content

Executive overview

Most SEO advice for AI search consists of tactics disconnected from how the technology actually works. AI search engines draw on two distinct sources, static training data and real-time web retrieval, and their citations are probabilistic rather than fixed ranked positions.

The foundation of AI search visibility is the same as traditional SEO, but topic breadth now matters as much as keyword ranking.

Two sources of information

  • Training data: a static snapshot of the web used to answer factual, stable queries (e.g. "Who is the CEO of Apple?")
  • Training data is refreshed roughly every 6 months, so newly launched products won't appear in it yet
  • Real-time retrieval via RAG (retrieval-augmented generation): the AI fetches live web pages for fresh or complex queries
  • Two levers to influence AI: broad web mentions (which feed training data) and pages that rank well enough to be fetched in real-time retrieval (SEO)
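
The retrieval step can be illustrated with a toy sketch. This is not how any particular AI search engine is implemented: the in-memory `pages` dictionary, the example.com URLs, and the token-overlap scoring are all assumptions made for illustration, standing in for a real search index and ranking system. No model is called; the sketch stops at building the prompt a model would receive.

```python
import re


def tokens(text: str) -> set[str]:
    """Lowercase the text and return its set of alphanumeric word tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def retrieve(query: str, pages: dict[str, str], k: int = 2) -> list[str]:
    """Return up to k URLs whose text shares the most tokens with the query.

    Token overlap is a stand-in (assumption) for a real ranking system.
    Pages with no overlap at all are never returned.
    """
    query_tokens = tokens(query)
    scored = [(len(query_tokens & tokens(body)), url) for url, body in pages.items()]
    scored = [(score, url) for score, url in scored if score > 0]
    # Highest overlap first; ties broken alphabetically by URL.
    scored.sort(key=lambda item: (-item[0], item[1]))
    return [url for _, url in scored[:k]]


def build_prompt(query: str, pages: dict[str, str], k: int = 2) -> str:
    """Assemble the retrieved pages into a numbered context block for a model."""
    sources = retrieve(query, pages, k)
    context = "\n".join(
        f"[{i}] {url}: {pages[url]}" for i, url in enumerate(sources, start=1)
    )
    return (
        "Answer using only these sources and cite them by number.\n"
        f"{context}\n\nQuestion: {query}"
    )


# Hypothetical pages; the URLs and text are made up for this example.
pages = {
    "https://example.com/kyoto-weather": "Kyoto weather in November is cool and dry",
    "https://example.com/tokyo-areas": "Best neighbourhoods in Tokyo for first-time visitors",
    "https://example.com/recipes": "Easy weeknight pasta recipes",
}

print(build_prompt("November weather in Kyoto", pages))
```

Only pages that make it into the retrieved set can be cited in the answer, which is why ranking in retrieval is one of the two levers.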

Query fanout: one prompt becomes many searches

  • AI expands a single user query into multiple sub-queries running simultaneously — this is called query fanout
  • A prompt like "plan a 5-day trip to Japan in November" triggers queries like "best neighbourhoods in Tokyo," "November weather in Kyoto," "Japan rail pass worth it"
  • The average prompt triggers 9–11 fanout queries; in one case, ChatGPT's deep research mode ran 420 searches for a single query
  • Over 95% of fanout queries have zero search volume — they're synthetic, generated in the moment
  • Don't treat fanout queries as a keyword list; treat them as a map of what topics AI considers relevant to a question
  • To be cited, content must cover an entire topic, not just one target keyword
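
One way to use fanout queries as a topic map rather than a keyword list is to check which sub-queries a page already addresses. The sketch below is a minimal illustration under stated assumptions: `fanout_coverage` is a hypothetical helper, the 0.6 token-overlap threshold is arbitrary, and plain word matching is a crude proxy for whether a page really covers a topic. The sub-queries are the Japan trip examples from above.

```python
import re


def tokenize(text: str) -> set[str]:
    """Lowercase the text and return its set of alphanumeric word tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def fanout_coverage(page_text: str, fanout_queries: list[str], threshold: float = 0.6) -> dict:
    """Split sub-queries into those the page appears to address and those it misses.

    A sub-query counts as covered when at least `threshold` of its tokens
    appear somewhere in the page. This heuristic is an assumption, not a
    measure any AI search engine is known to use.
    """
    page_tokens = tokenize(page_text)
    covered, missing = [], []
    for query in fanout_queries:
        query_tokens = tokenize(query)
        if not query_tokens:
            continue  # skip empty or punctuation-only queries
        overlap = len(query_tokens & page_tokens) / len(query_tokens)
        if overlap >= threshold:
            covered.append(query)
        else:
            missing.append(query)
    total = len(covered) + len(missing)
    return {
        "covered": covered,
        "missing": missing,
        "coverage": len(covered) / total if total else 0.0,
    }


page = (
    "Our guide to the best neighbourhoods in Tokyo, plus whether the "
    "Japan Rail Pass is worth it for a five-day trip."
)
queries = [
    "best neighbourhoods in Tokyo",
    "November weather in Kyoto",
    "Japan rail pass worth it",
]

report = fanout_coverage(page, queries)
print(report["missing"])  # sub-topics the page does not yet address
```

The `missing` list is the useful output: it points at sub-topics to add to the page, not at keywords to repeat.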

Probabilistic citations, not fixed rankings

  • AI citations are probabilistic: ask the same question five times and you may be cited three times, your competitor twice
  • There is no fixed position — AI visibility is a probability distribution, not a leaderboard
  • Consensus: the more sources consistently mention your brand the same way, the higher the citation probability
  • Freshness: AI-cited content is ~25% fresher than traditional SERP results
  • Authority: 76% of AI Overviews citations come from pages already in Google's top 10, so SEO remains the foundation
  • Yet 14% of cited pages don't rank in Google's top 100, which is a real opportunity for brands without traditional search dominance
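
Because citations are probabilistic, visibility is better expressed as an estimated rate with an uncertainty range than as a position. The sketch below takes the "cited three times out of five" example and computes a Wilson score interval, a standard statistical method for a proportion from a small sample. The function name `citation_rate` and the choice of a 95% interval (z = 1.96) are assumptions for illustration, and the method treats each run as an independent trial.

```python
import math


def citation_rate(cited_runs: int, total_runs: int, z: float = 1.96) -> tuple[float, float, float]:
    """Return (observed rate, lower bound, upper bound) for a citation rate.

    Bounds come from the Wilson score interval; z = 1.96 corresponds to
    roughly 95% confidence.
    """
    if total_runs <= 0:
        raise ValueError("total_runs must be positive")
    if not 0 <= cited_runs <= total_runs:
        raise ValueError("cited_runs must be between 0 and total_runs")

    p = cited_runs / total_runs
    denom = 1 + z**2 / total_runs
    centre = (p + z**2 / (2 * total_runs)) / denom
    half_width = (
        z * math.sqrt(p * (1 - p) / total_runs + z**2 / (4 * total_runs**2)) / denom
    )
    return p, max(0.0, centre - half_width), min(1.0, centre + half_width)


# Same prompt asked five times, brand cited in three of the answers.
rate, low, high = citation_rate(3, 5)
print(f"observed {rate:.0%}, plausible range {low:.0%} to {high:.0%}")
```

With only five runs the range is very wide (roughly 23% to 88%), so a single batch of prompts says little; repeating the same prompt many more times narrows it.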
