AI agents and the end of inbox overload in knowledge work

Executive overview

The real productivity crisis in knowledge work is not job automation — it is the hyperactive hive mind: constant context-switching to manage back-and-forth messages that fragment attention all day. AI that could handle email like a chief of staff would deliver far larger gains than any task-speedup tool.

Today's large language models can summarise and draft emails, but they still require human direction for every message. The architectural reason they cannot act autonomously is that feed-forward language models cannot simulate the future — a prerequisite for deciding what to say and to whom.

The path forward is not GPT-5; it is ensembles pairing language models with planning engines, a direction already demonstrated by the Cicero diplomacy bot — and being pursued inside OpenAI.

Truly autonomous AI email management would not merely save time; it would eliminate the cognitive tax that drives burnout and suppresses the quality of knowledge work.

Why current AI cannot manage your inbox

  • Language models (GPT-4 class) are feed-forward: input moves through fixed layers, one token out per pass — no looping, no memory, no iteration.
  • They cannot simulate future states: they fail at Tower of Hanoi, mid-game chess, math modification tasks, and poems requiring look-ahead.
  • Managing email requires future simulation: predicting how a reply affects a relationship, a project, a schedule — GPT-4 cannot do this.
  • When given real emails, GPT-4 summarises well and drafts reasonable replies — but the human still directs every step, so context-switching is unchanged.
  • Chess analogy: GPT-4 plays at ~ELO 1000 because hard-coded heuristics handle opening and early moves; when the board becomes novel, play collapses — exactly as email decisions collapse when they require novel judgment.

Why planning AI can simulate the future

  • Deep Blue beat Kasparov by evaluating hundreds of millions of future positions — pure simulation, no language model.
  • AlphaGo combined self-trained board-evaluation networks with future-move search, discovering strategies no human had conceived.
  • Noam Brown's Pluribus (poker AI) beat top professionals by simulating other players' beliefs rather than hard-coding card probabilities; it ran on a laptop versus tens of thousands of dollars of compute for the pure neural-net approach.
  • Brown's Cicero (Diplomacy AI) paired a language model with a game-strategy simulator: the language model parsed natural conversation, the planner ran multi-step strategy, the language model rendered the output — it beat human players who did not know they were facing a bot.
  • Cicero is the architectural template for inbox AI: language model for understanding and generation, planning engine for deciding what to do.

What full inbox automation would require

  • A language model to read and generate natural language.
  • A future-simulation planner to evaluate the downstream impact of each possible reply.
  • Additional models representing project states, relationship context, and personal objectives.
  • Ensemble coordination between all components in real time.
  • OpenAI hired Noam Brown and reportedly assigned him to Q*, a project adding planning (A* search) to language models — a concrete signal that this direction is being resourced.

Programming jobs and the efficiency paradox

  • Every wave of programmer productivity tools — punch cards, interactive terminals, debuggers, IDEs, Stack Overflow — made each programmer far more efficient.
  • Each efficiency gain led to more programmers being hired, not fewer, because the complexity and value of software systems scaled faster than individual efficiency.
  • A programmer today is roughly 1,000× more efficient than in 1955; software applications have multiplied by far more than 1,000×.
  • AI coding tools (Copilot etc.) are the next step in this curve — making programmers more capable, not redundant.
  • Advice: keep learning programming, stay current with AI coding tools, push into higher complexity — the jobs will be there.

AI and writing

  • Professional writers use language models for brainstorming and intelligent research, not for generating their prose — voice and craft are not outsourceable.
  • For non-professional writers, AI-assisted drafting improves clear communication broadly; the biggest gains are for non-native speakers where language has obscured otherwise strong ideas.
  • The open pedagogical question: is AI-assisted writing the calculator (learn fundamentals first, automate later) or centaur chess (human + AI together outperforms both from the start)?
  • Educational institutions have not yet converged on an answer.

Deep work setup and location

  • The same desk used for email, Zoom calls, and admin tasks becomes cognitively associated with shallow work — deep work mode becomes hard to enter there.
  • Separate physical spaces for deep and shallow work are used consistently by high-output knowledge workers: McCullough wrote in a garden shed, Oliver composed walking in woods, Wiles solved Fermat's Last Theorem in an attic.
  • The deep-work location does not need to be expensive or elaborate — a different nook, a picnic table, a pub — but it must be distinct and psychologically linked to thinking.
  • Switching to a dedicated space produces faster entry into deep work and higher-quality output.

Working at a natural pace across life seasons

  • Annual seasonal variation is normal and healthy: summers slower, term-time more structured — knowledge work should follow natural rhythms, not simulate factory uniformity.
  • Decade-scale seasons have distinct goals: twenties = skill-building foundations; thirties = stability and family infrastructure; forties = legacy and depth.
  • Couples with high-achieving careers should define a shared vision of the ideal family life and work backwards from it together — rather than each independently optimising a career and experiencing the partner as an impediment.
  • Working from a shared vision opens configurations that neither person would find by optimising individually; resentment and tally-keeping are symptoms of the absence of a shared plan.

Slow productivity and the Grant principle

  • General Grant sat for hours appearing to do nothing — "the laziest man in camp" — while subordinates handled every petty detail.
  • He reserved his time for concentrated thinking, then acted with decisive force at the moments that mattered.
  • Contrast with General McClellan: constant activity, endless busyness, no decisive action — eventually replaced.
  • The lesson for knowledge workers: activity is not output. Fewer things done with full attention and deliberate execution beats high-volume busyness.
  • Slowing down is only a trap if it lacks a clear vision of what the reclaimed time is for — a more remarkable life, not just filled hobbies.

More like this — when you're ready for early access.

Join the waitlist for a personal account and content recommendations based on what you're working on.

No spam. Unsubscribe at any time.

You're on the list. We'll be in touch before launch.

Get early access to the full library.

Join the waitlist for a personal account and content recommendations based on what you're working on.

No spam. Unsubscribe at any time.

You're on the list. We'll be in touch before launch.

Be among the first to get personalised recommendations tailored to your stage in business.

No spam.

You're on the list. We'll be in touch before launch.

Be among the first to get personalised recommendations tailored to your stage in business.

No spam.

You're on the list. We'll be in touch before launch.