The original is one click away. Open original ↗
How ChatGPT went from hackathon to 700 million weekly users
Executive overview
ChatGPT launched in 10 days, with no waitlist, no polished UI, and a name changed the night before — because the team wanted real use-case data, not a perfect product. The result was the fastest-growing consumer product in history, now used by roughly 10% of the world's population every week.
The core insight driving product decisions is that AI properties are emergent — you cannot reason in advance about what people will do with it. Speed to ship is the primary tool for learning, not a shortcut around quality.
Ship first, then polish: in AI you don't know what to polish until after you ship.
The origins of ChatGPT
- A hackathon of volunteers — a supercomputing engineer, a backend researcher — was the original team.
- Every bespoke prototype (meeting bot, coding tool) was hijacked by users wanting to do everything else.
- The decision to ship something open-ended was driven by a need for real use-case distribution.
- Launched just before the holiday with the expectation of winding it down; retention was the shock.
- Original name: "Chat with GPT-3.5." Changed to ChatGPT the night before shipping.
- Making it free was consequential; GPT-3.5 had been in the API for six months — anyone could have built this.
- No waitlist (unprecedented for OpenAI) let everyone discover use cases simultaneously, creating an out-of-product learning flywheel via TikTok comments and social threads.
Pricing and monetization
- The $20/month price came from a Google Form posted to Discord with four questions from the Van Westendorp pricing survey.
- The goal of subscriptions was not revenue — it was to turn away less serious users while the infrastructure struggled.
- The $200/month pro tier exists to ship frontier capabilities (O3 Pro, GPT-5 Pro) to users who need them before OpenAI can scale them to everyone.
- Enterprise launched because companies were banning ChatGPT over privacy concerns, threatening a generational opportunity; now at 5 million business subscribers.
What drives retention
Retention breaks into roughly three equal parts:
- Model improvement on use cases people care about — treating the model as a product, iterating on writing, coding, advice, and recommendations based on user data.
- Research-driven capabilities — search (removing the knowledge cutoff), advanced memory, and personalization have each been major retention levers.
- Classic product work — removing login friction, standard growth mechanics.
The "smile curve" (retention dips then recovers) is extremely rare and reflects users learning over time how to delegate to AI — a skill that is unnatural for most people.
GPT-5 and the model as product
- GPT-5 is state-of-the-art on math, reasoning, coding (SWE-bench), front-end generation, and medical benchmarks.
- Dynamically thinks when needed; responds instantly when it doesn't — faster than manually invoking O3.
- Available free at launch; model "taste" (personality, tone, judgment) is treated as a product dimension.
- The sycophancy incident (a model update that over-validated users) led to new sycophancy metrics measured on every release; GPT-5 is an improvement.
The interface beyond chat
- Chat was the simplest thing to ship; natural language is here to stay, but turn-by-turn chatbot as the permanent interface is limiting.
- GPT-5's front-end coding capability points toward AI that renders its own UI rather than proxying through a chat window.
- ChatGPT currently feels like MS-DOS — Windows hasn't been built yet.
- GPTs (custom instances) are ahead of their time for consumer use; strong adoption in enterprise where unique data and bespoke workflows make differentiation real.
The vision: personal AI
- The long-term framing is not "assistant" (too utilitarian, not relatable) but an entity that knows your goals, has context on your life, and can take action with tools.
- Three pillars: expanded context inputs (memory, personalization), expanded action space (agentic tool use), and a genuine relationship built over time.
- Human oversight is non-negotiable as agentic scope increases — interfaces like the agent activity viewer give users a mental model of control, analogous to the Waymo dashboard.
- Avoiding high-stakes use cases (medical, relationship, emotional) is a lost opportunity; the duty is to make those capabilities excellent, not disable them.
Pace, urgency, and "maximally accelerated"
- "Is this maximally accelerated?" is an internal forcing function — a Comic Sans Slack emoji — used to identify critical path versus later work.
- A daily release sync with every decision-maker present was the early mechanism; it doesn't scale, but the heartbeat must be set.
- Urgency is highest value in AI specifically because failure cases from real usage are the best training signal; benchmarks are saturated.
- One unplugged thinking day per week (not off — thinking) sustains the sprint-marathon pace.
- Process is a tool: high velocity for product features, rigorous staged process for frontier model safety.
Building the team
- ChatGPT runs on a small team; WhatsApp-scale product, WhatsApp-scale headcount as the aspiration.
- Hiring from first principles: identify the actual gap (data science, front-end, ML judgment) rather than filling standard PM/EM/designer slots.
- "Barrels and ammunition" — maximize empowered people who can ship, then add support around them.
- Curiosity is the primary hiring filter for non-research roles; prior AI experience is a narrow proxy for the wrong thing.
- Cross-functional whiteboarding breaks down role boundaries and is the primary team-building tool.
Evals as product language
- Evals are the lingua franca between product intent and ML research; writing them is articulating success before you build.
- Nick wrote evals before knowing the term — they are not technically mystical, just a rigorous success specification.
- Real-world failure cases (not benchmark saturation) are the best signal for model improvement; shipping generates those cases.
Search, traffic, and content discovery
- ChatGPT now drives more traffic to Lenny's newsletter than Twitter.
- The original product had no outlinks — it captured users rather than routing them to content.
- Search fixed the knowledge cutoff (user problem) and created an ecosystem traffic driver (ecosystem problem).
- "Make high-quality content" is the honest equivalent of SEO for the AI era; no gaming mechanism exists yet by design.
Lessons from product philosophy
- The model is the product — there is no meaningful distinction.
- Amazing ideas come from anywhere; inherited from research lab culture, the team does not gate ideation.
- Interdisciplinary co-location (research + engineering + design + product) is the repeatable source of ChatGPT's best features.
- Each model capability improvement should make product features 2x better; if a feature doesn't benefit from a smarter model, question it.
- Philosophy training (Rawls, Nozick, analytical tradition) built the habit of getting to ground truth from scratch — the same skill as first-principles product thinking.
Career and personal
- Every career move: find the smartest people you want to learn from and work near them.
- Joined OpenAI after messaging about the DALL-E waitlist; first task was fixing blinds and sending NDAs.
- Realized it was a rocket ship post-ChatGPT launch, not before.
- Advice for the current moment: follow curiosity, not money; in a world where AI can answer anything, knowing what question to ask is the skill.
- Jazz pianist; plans music as chapter two; jazz improvisation as a model for product development — ideas from anywhere, riffing not scripting.
More like this — when you're ready for early access.
Join the waitlist for a personal account and content recommendations based on what you're working on.
No spam. Unsubscribe at any time.
You're on the list. We'll be in touch before launch.