The original is one click away. Open original ↗
How better AI models create more startup opportunity
Executive overview
Founders panic at every major model release, fearing OpenAI will kill their startup. The real competitive threat is other startups, not the model providers. Better models raise the capability floor for everyone — each generation makes existing products smarter by default.
The more powerful and numerous the frontier models, the more room there is for startups to build, differentiate, and capture margin.
GPT-4o vs Gemini 1.5: what actually changed
- GPT-4o adds speech and video by bolting Whisper and DALL-E modules onto GPT-4 — it is not a reasoning leap over GPT-4
- Gemini 1.5 is a true mixture-of-experts model, trained from scratch on text, image, and audio; different network paths activate per data type
- Gemini's architecture is more energy-efficient; Google's TPU infrastructure made training it feasible
- Gemini 1.5 offers a 1M-token context window vs GPT-4o's 128k; research prototypes have reached 10M tokens
- Multiple competitive frontier models is the best outcome for startups — it prevents monopoly pricing and preserves gross margin
RAG is not dead
- Large context windows do not replace RAG (retrieval-augmented generation) — retrieval accuracy inside a million-token window is still unreliable in practice
- Enterprise customers need permissioned, logged, auditable data retrieval — a giant shared context window does not satisfy this
- Privacy-sensitive verticals (fintech, healthcare) require data stored and retrieved under their own control
- RAG will layer like database caching — Redis, browser cache, and long-term storage coexist; context windows are one more layer
- Persistent memory in ChatGPT (visible in settings under GPT-4o) is an early concrete version of long-term context in production
Lessons from the Google era
- Startups that went head-on against Google in general search were crushed; most vertical search engines also failed
- The winners built things Google would never build: Dropbox, Zillow, Kayak, Airbnb, Instacart, DoorDash, LinkedIn, Twitter
- Bundling threat is real — Google Docs killed many standalone productivity tools, just as Microsoft killed Netscape via bundling
- Dropbox survived Google Drive because distribution, product nuance, and user trust mattered more than the feature overlap
- New markets are routinely underestimated: Twitter survived Facebook's "status update" feature; LinkedIn survived being called a Facebook tab
How to avoid being roadkill
- Map what OpenAI is obviously building next — the sci-fi general-purpose voice assistant on the desktop — and do not compete with it directly
- Build the valuable but unsexy: things OpenAI will never demo on stage because they do not capture the sci-fi imagination
- Legal and PR risk creates startup space — Google did not release image generation of human faces; that gap opened the door for competitors
- Edgy or niche consumer products (e.g., Replika, Character AI) can build deep retention that incumbents will not touch
- B2B is structurally protected: big model companies build mass consumer software; they do not build sales machines, edge-case workflows, or regulated-industry tooling
B2B AI is the largest underexploited opportunity
- Using LLMs to automate jobs is potentially as large as all of SaaS — SaaS gives workers tools; AI does the work itself
- B2B AI upsell mechanics are straightforward: every model improvement becomes a premium feature upgrade; customers pay for output, not the model
- YC batch companies went from $6M ARR to $30M ARR in three to four months on the back of focused B2B AI products
- Regulated verticals (fintech, healthcare) have acute pain and willingness to pay: KYC automation, compliance workflows, AR processing
- The human element — sales, listening to complaints, handling edge cases — is what big model companies will never replicate
- Construction permit filing, compliance checks, AR automation: the more boring the workflow, the safer from OpenAI competition
What the partners are excited about
- Emotional range in GPT-4o's voice output — the first text-to-speech that sounds like it understands what it is saying
- Real-time translation as a universal pocket interpreter — profound consequences for travel, immigration, and global communication
- OpenAI's internal reorg may align teams toward a unified desktop assistant, which could eventually extend into robotics
- Cost reduction (half the price per token) signals model maturation and opens the door for custom silicon and low-power on-device AI
- A $16,000 humanoid robot from Unitree hints that unified multimodal models may make practical robotics closer than expected
More like this — when you're ready for early access.
Join the waitlist for a personal account and content recommendations based on what you're working on.
No spam. Unsubscribe at any time.
You're on the list. We'll be in touch before launch.