The original is one click away. Open original ↗
How Stripe builds a culture of engineering excellence
Executive overview
Stripe's core thesis — the internet economy will be far larger than today — shapes everything: who it hires, how it builds, and what it measures. The engineering culture is deliberately product-minded, meaning engineers own user insight, not just code.
Three practices operationalise this at scale: friction logging (structured UX empathy runs), engineer occasions (CTOs embedded in teams), and walking the store (whole-company product reviews). Each converts abstract values into repeatable behaviour.
The core insight: a value without a practice behind it is just a slogan — the practice is what makes the culture real.
Hiring and talent
- Mission specificity attracts self-selecting candidates; Stripe's focus on infrastructure for the internet economy filters for long-term, collaborative builders.
- Patience is a deliberate strategy: Stripe will maintain a relationship with a candidate for months or years until timing aligns.
- Hiring is personal — managers invest heavily in understanding candidates' motivations, not just their skills.
- References are treated as high-signal data: a reference caller benefits from thousands of hours of observation vs. eight hours in an interview loop.
- Structured loops apply consistent evaluation criteria so interviewers can calibrate against a broad range of responses.
- Interviews simulate real work: engineers pair-program with Google access; PMs complete take-home written exercises on real-problem types.
Building product-minded engineers
- Stripe's first PM joined around employee 200; until then, every engineer was expected to act as PM.
- Co-creation with early users is the default: shared Slack channels, regular product previews, and shipping only when the alpha group is "super, super happy."
- Engineers surface user insight directly — the relationship with users is not intermediated through a PM layer.
- PMs now serve as cross-functional linchpins: synthesising user learning, driving product strategy, and providing locomotion across legal, partnerships, risk, and design.
- Developer-focused products reward technical PMs; Stripe's early team embodied this naturally.
Friction logging
- A friction log is a stream-of-consciousness record of the user experience, written from the perspective of a named, specific user persona.
- Every product team runs a recurring friction log; the PM or engineering manager typically owns it.
- David Singleton runs a personal onboarding friction log monthly, tagging owners across the company on specific issues.
- Praise is explicitly part of the log — recognising what works well is as important as flagging problems.
- Senior leaders do recursive versions across their whole area to maintain coherence as parallel teams diverge.
- A public template exists at stripe.dev.
UX reviews and walking the store
- UX reviews combine async friction logging with a synchronous walkthrough involving the team, cross-functional partners, and executives.
- A shared live document captures issues in real time while the PM walks through the product flow; the group then debates and triages each item.
- Walking the store extends this to the entire company via Stripe's weekly Friday Fireside: critical product flows reviewed with the whole organisation.
- The chef analogy: just as a chef tastes soup with the kitchen, product leaders must experience the product together to establish a shared bar for craft.
Engineer occasions
- An engineer occasion is 3–4 consecutive days where a manager clears their calendar, joins a team, and ships a small feature end-to-end.
- The goal is direct experience of developer tooling, build infrastructure, review processes, documentation quality, and deploy latency.
- A buddy on the team provides onboarding support; the manager is expected to go through normal code review.
- The output is a friction log shared with the team — demonstrating understanding and feeding directly into prioritisation.
- New engineering managers are advised to do one in their first quarter; ongoing cadence is roughly once a year.
Reliability at scale
- Stripe deploys to production 16.4 times per day on average with 99.999% uptime — achieved by combining automation with a learning culture, not by slowing down.
- Every change runs a full automated test suite (~15 min) in parallel with human code review; on merge, tests run again, then auto-deploy completes within ~30 min total.
- Changes ramp from a small traffic percentage upward, allowing early detection before broad exposure.
- Selective test execution targets only the tests relevant to a given change, keeping the pre-merge suite fast even as coverage grows.
- Incident response goes beyond remediation: Stripe mandates root cause analysis and prioritises systemic fixes above roadmap items.
- Chaos testing (injecting errors to verify no user impact) is a newer addition to the reliability stack.
AI and developer productivity
- Stripe has used ML at the core of Radar (fraud detection) for years; LLMs are a newer layer on top of this foundation.
- GPT-4 embeddings across Stripe's documentation now power natural-language Q&A inside the developer docs.
- Sigma (Stripe's SQL analytics product) is being extended with LLM-driven natural-language query generation.
- An internal GPT-4 UI with shared prompt presets democratises AI access across marketing, support, and engineering — without exposing sensitive data to public APIs.
- GitHub Copilot is rolled out broadly; early finding: it is especially valuable for generating test boilerplate, freeing engineers to focus on what the test should actually assert.
- Auto-deploy and an auto-merge checkbox on PRs were cited as the two highest-impact developer productivity wins in recent years.
- The crying octopus button — a one-click friction reporter in every internal tool — feeds directly into the developer productivity team's roadmap.
Planning and prioritisation
- Planning does not carry a high internal NPS at Stripe, in part because the process is redesigned from first principles each cycle as the company grows.
- Annual deep planning plus a mid-year lighter pass; teams reserve explicit bandwidth for polish and operational remediation alongside feature work.
- An inverted-W process: teams surface priorities → leadership synthesises into a draft company strategy → teams pressure-test and adjust → final strategy is distributed with context.
- Teams are expected to define the right user metrics for their area and review them on a predictable cadence; everything else follows from that.
Leadership and management lessons
- At scale, the most important job is hiring people who can be trusted with significant autonomy — most consequential decisions happen without the leader present.
- Default to generous trust early; hold people accountable enough that they prove they can handle it.
- Delegate slightly beyond comfort level — that is the only way to operate at significant scale.
- Manage your own time proactively: a Sunday-evening weekly planning ritual identifies what "good" looks like for the coming week before the inbox takes over.
- How you show up sets the culture around you — consistency matters most when things are going wrong.
- Manage energy, not just time: some tasks that give personal energy justify doing them even if they are not the highest-priority item.
More like this — when you're ready for early access.
Join the waitlist for a personal account and content recommendations based on what you're working on.
No spam. Unsubscribe at any time.
You're on the list. We'll be in touch before launch.