The original is one click away. Open original ↗
Spotify CEO Daniel Ek on building an audio platform and creator economy
Executive overview
Spotify started as a music company, then deliberately expanded into podcasting and audiobooks by recognising that the underlying user needs — discovery, ubiquity, freemium access — were identical across all audio formats. The decision to put podcasts inside the music app, not a separate one, was the most contrarian and most obvious move at once.
The core insight: exponential growth is not a single curve — it is many different linear strategies stacked on top of each other, each targeting a new audience or geography.
Streaming did not simply take off; Spotify had to reinvent its expansion playbook at every stage. Culture, not features, is what determines whether a company can keep doing that.
From music to audio: the platform decision
- Spotify discovered podcasting organically: German music labels uploaded audiobooks to the platform, revealing that users just wanted to consume audio content, regardless of format.
- The team was personally frustrated by having to switch apps to listen to podcasts — all the infrastructure built for music (discovery, ubiquity, freemium) applied equally to podcasts.
- The biggest internal resistance was combining podcasts and music in one app; prevailing wisdom was that separate apps were correct.
- The decisive logic: if you search for audio content, the format boundary is arbitrary. Radio always combined talk, music, and sports on one device.
- The TAM could not expand if new listeners had to opt into a dedicated "podcast app" before ever encountering the medium.
Stacking S-curves: how Spotify scaled to 500 million users
- Growth was not a single macro tailwind — each phase required a distinct strategy to reach the next cohort of users.
- Geographic constraints turned into an advantage: being forced to launch in Europe first allowed Spotify to saturate markets, learn, and iterate before entering the US.
- At early Spotify, a user needed to hear about the product from eight other people before signing up; this made geographic density (college cities) the core marketing lever.
- Ek's rule: nearly every company benefits from constraining itself to a geography, a demographic, or a niche first — "global from day one" is almost always wrong.
- Definitive claim: Spotify would not exist today if it had launched in the US as its first market.
The podcasting business model: higher ceiling, higher complexity
- Podcasting offers better potential margins than music streaming because rights are more fragmented and Spotify can own more of the value chain.
- Music streaming: Spotify keeps ~30%, pays ~70% to labels. Podcasting: no equivalent structural constraint.
- The offset: podcasting requires content moderation at scale — a genuinely large variable cost. Meta at one point employed over 100,000 content moderators.
- Moderation costs don't behave like fixed infrastructure; as bad actors use more sophisticated AI, platform defences must match them, adding ongoing cost.
- The podcast ad business required building a full ad network from scratch against user-generated content, not the curated catalogue Spotify was used to.
Audio formats and the blurring of boundaries
- The meaningful distinction between podcasting and audiobooks is the business model, not the format: podcasting is ad-supported audio; audiobooks are paid audio.
- Ek's prediction: the audiobook market (currently tens of millions of listeners) should reach hundreds of millions — it resembles music in 2008, constrained by a broken business model and a discovery problem.
- The format line is already blurring: long-form podcasts like Acquired's deep dives function more like conversational audiobooks than traditional episodes.
- AI voice cloning makes real-time multilingual podcast distribution technically feasible now; cost-per-minute is the only current barrier.
- Platforms will need to play a role in verifying content authenticity as AI-generated audio becomes indistinguishable from real recordings.
Creator economy and monetisation
- Ek's goal for Spotify: be the best partner for creators, not the only one — win through low friction and flexible business model options, not lock-in.
- Many creators treat monetisation as binary; the next decade of music and podcasting will offer far more customisation — windowing, subscriptions, ad-supported tiers.
- Attribution across platforms remains the biggest unsolved problem for creators: platform-level metrics exist, but cross-platform causality is opaque.
- Risk of platform concentration: creators who build entirely on one platform are exposed to algorithm changes even when those changes are user-beneficial, not adversarial.
- The creator economy's "middle" will likely be wiped out by AI-lowered barriers to entry; the very top will increase in value — the same pattern as fine art photography after Instagram.
Global music, local culture
- Reggaeton's rise to Spotify's largest genre was visible in the data years before it became broadly recognised; Ek invested in it globally when it was still breaking out of regional clusters.
- The Hispanic diaspora in the US was an early signal of crossover potential — genre growth typically follows diaspora geography first.
- K-pop operates as an industrial system: hundreds of songwriters, dedicated development pipelines, proprietary fan platforms. Taylor Swift operates lean with a tiny core team. Both reach comparable scale.
- Artists at mega-scale today are effectively CEOs of their own enterprises; business acumen is no longer optional at that level.
- Rihanna becoming the first female recording artist billionaire through Fenty Beauty illustrates that streaming-era fame is a platform for adjacent businesses in a way CD-era fame was not.
Culture as the compounding asset
- Ek's most important realisation, only truly internalised in the last two to three years: being intentional about culture is the primary lever for sustained growth.
- Early Spotify copied cultural expressions from Google (20% time), Facebook (move fast), and Amazon (long-term, bottoms-up) — ending up a "Frankenstein monster" with the downsides of each.
- Cultural expressions (20% time, six-pagers) are not culture itself; the underlying assumptions and priorities are what actually shape decisions.
- Spotify's non-Silicon-Valley origin forced first-principles thinking; iconic companies often built distinct cultures partly because they started outside existing tech clusters.
- Spotify's practice of releasing imperfect products intentionally (e.g., early podcasting, audiobooks at StreamOn) is itself a cultural choice — valid, but only if understood and committed to.
- AI DJ is the counter-example: launched only when genuinely high quality, because the stakes of getting it wrong (misrepresenting a creator's voice) were too high. It is now one of Spotify's most impactful product metrics, on track to exceed Discover Weekly.
On the emotional reality of founding
- Every successful entrepreneur Ek has known has had at least three near-death experiences with their company.
- Spotify's US launch was originally projected at three months out; it took three years due to label negotiations. The stress of that gap caused visible physical consequences.
- Ek's honest admission: had he known in advance how hard it would be, he would not have started. He is glad he went through it anyway.
- Media and founder narratives systematically understate the chaos and luck involved; the mythology of day-one clarity is almost always retrospective.
More like this — when you're ready for early access.
Join the waitlist for a personal account and content recommendations based on what you're working on.
No spam. Unsubscribe at any time.
You're on the list. We'll be in touch before launch.