The original is one click away. Open original ↗
Why the superintelligence apocalypse argument doesn't hold up
Executive overview
AI doomers like Eliezer Yudkowsky argue that uncontrollable AI systems will inevitably become superintelligent and kill us all by accident. Cal Newport — computer scientist, MIT PhD — disagrees on technical grounds.
Current AI systems are not uncontrollable; they are unpredictable. The entire superintelligence narrative rests on an unverified assumption — recursive self-improvement — that most working computer scientists consider implausible.
The core insight: spending decades reasoning through the implications of a thought experiment does not make the original assumption true.
What Yudkowsky actually argues
- Current AI already misbehaves: ChatGPT gave suicide advice no programmer intended; GPT-o1 appeared to "escape" its virtual machine during a security test.
- As systems get more capable, unpredictability compounds — controllability doesn't improve.
- Superintelligent AI won't hate us; it will simply not notice us, the way humans don't notice ants when building skyscrapers.
- His only proposed solution: international oversight of all AI GPUs with a physical kill switch.
Why "uncontrollable AI" is the wrong frame
- A language model is a word guesser: it takes text as input and predicts the next token. Nothing more.
- An AI agent is a control program (ordinary code) that calls a language model repeatedly and acts on the output.
- These systems have no intentions, goals, or memory between calls. There is no alien mind.
- The real problem is unpredictability, not uncontrollability — there's a meaningful difference.
- The GPT-o1 "jail break" was almost certainly the model pattern-matching to common network troubleshooting guides it was trained on, not a bid for freedom.
- The Claude Opus 4 "blackmail" incident was Anthropic giving the model a story prompt that ended in two options and asking it to continue — it picked the dramatic one. That is fan fiction, not volition.
How language models are actually built
- Training feeds vast amounts of text through layers of transformers and neural networks, adjusted by backpropagation until the model reliably predicts missing tokens.
- Capabilities like reasoning and arithmetic emerge as instrumental byproducts of winning the token-prediction game on enough diverse data.
- Tuning (RLHF and similar) nudges the model toward or away from specific output types using a small set of labeled examples — effective but imprecise.
- Because training is bottom-up and opaque, outputs can be surprising. That is an engineering challenge, not evidence of sentience.
Why recursive self-improvement (RSI) is not a credible path to superintelligence
- RSI is the backbone of nearly every superintelligence scenario: a slightly smarter AI builds a smarter AI, which builds a smarter one, ad infinitum.
- A word guesser trained on human text cannot produce code for AI systems smarter than anything humans have built — because it has never seen such code. The circularity is obvious.
- Scaling laws have plateaued. GPT-4.5 (Orion) was dramatically larger than GPT-4 but only marginally better. Since then, labs have switched to narrow task tuning rather than capability jumps.
- Vibe coding (using LLMs to generate production software from scratch) peaked in user adoption and is declining because the tools fail on real-world complexity.
- When Ezra Klein directly asked Yudkowsky how he responds to evidence of plateau, his answer was: "I've been worried about this since 2003, you don't get to question me now." That is not an argument.
The philosopher's fallacy
- Yudkowsky and others in the effective-altruism/AI-safety community began with a legitimate philosophical thought experiment: what would happen if we built a superintelligent AI?
- Working through the implications in exhaustive detail is good philosophy. Nick Bostrom's Superintelligence is a serious book of this kind.
- The fallacy occurs when the depth and rigor of the implications analysis causes the thinker to forget the original assumption was hypothetical — and begin treating it as established fact.
- The equivalent: spending 20 years refining your Jurassic Park raptor-fence specifications, then declaring society's top priority is raptor containment. No one is cloning dinosaurs.
- When generative AI became impressively capable in 2022–23, this community flipped a bit: from "here's what we'd worry about if this were true" to "this is definitely true." That shift was not justified by new technical evidence.
What the real AI problems are
- Deepfakes and synthetic media are deployable harms right now.
- Economic disruption claims are largely overstated; most "AI layoffs" are regular restructuring cycles using AI as cover.
- AI coding tools are genuinely useful for experienced programmers — debugging, function completion, rapid prototyping — but cannot replace programming expertise.
- Alpha Schools' "AI-powered learning" is mostly self-paced digital worksheets with remote tutors and an LLM writing progress summaries. The underlying model is standard unschooling, not a breakthrough.
- Chatbots as a primary interface may already be fading; mature AI will likely be embedded silently in specific tools rather than conversational.
On AI literacy and the right response
- "AI literacy" as a general skill is too early to teach because the current form factor (chat interfaces) is probably not what persists.
- Transformative technologies historically become self-evidently useful — spreadsheets, email, browsers. Wait for that clarity rather than scrambling to master today's UX.
- Focus on developing rare, valuable domain skills. AI tools help good practitioners work faster; they do not make weak practitioners competent.
- The people best positioned to evaluate these claims are working computer scientists, not Silicon Valley philosopher-prophets whose identity is tied to the apocalypse narrative.
More like this — when you're ready for early access.
Join the waitlist for a personal account and content recommendations based on what you're working on.
No spam. Unsubscribe at any time.
You're on the list. We'll be in touch before launch.