Physical Intelligence: building a foundation model for any robot

Executive overview

Most robotics companies are locked into a single hardware platform, requiring years of effort before deploying a single useful task. Physical Intelligence (Pi) trains one cross-embodiment model across many different robot platforms, so intelligence transfers rather than needing to be rebuilt per robot.

The result: tasks that required hundreds of hours of data collection last year can now be performed zero-shot. Real deployments are already running in laundromats and e-commerce warehouses — two years into the company's life, not five.

The GPT-1 moment for robotics is not a future event — it is beginning now, and the cost of building on top of it is falling fast.

Why robotics was hard — and what changed

The three pillars of robotics are semantics, planning, and real-time control — all historically solved separately with brittle, hardware-specific code.
Language models (SayCan, RT-2) first cracked semantics: common-sense knowledge transferred into robot planning without robot-specific data.
RT-2 showed that a vision-language model fine-tuned on robot data could reason about entirely unseen objects and spatial relationships.
Open X-Embodiment was the first large-scale training run across 10+ robot platforms — and the generalist model outperformed per-platform specialists by 50%.
Even a single robot platform drifts over time through hardware and software changes, making old data less reusable; a diverse fleet is actually easier to generalise from.

The cross-embodiment scaling insight

Training on heterogeneous robot data teaches the model an abstract concept of "how to control a robot," not "how to control this robot."
Data scarcity in robotics is two problems: data generation (operationally expensive) and data capture (incentive and infrastructure gaps).
Pi's approach: be ready to absorb data from many different robot types already in the field rather than manufacturing and scaling one proprietary platform.
Emergent transfer properties are now visible — tasks zero-shot today required hundreds of hours of collection a year ago.

Cloud inference and real-time control

Almost all of Pi's robot evaluations — including laundry folding and mobile manipulation — run with the model hosted in a remote data centre, not on-device.
The robot queries a cloud API endpoint, sending images and language commands and receiving back action chunks in a high-frequency control loop.
Real-time chunking: the robot pre-fetches the next action chunk while still executing the current one, hiding inference latency.
This eliminates the need for expensive, fast-obsoleting onboard compute, and decouples hardware choices from model capability.

Current state of deployment

Pi partnered with Weave (home laundry robots) to fold diverse, unseen clothing items in a real laundromat — built to working demo in roughly two weeks.
Partnered with Ultra (logistics automation) to pack varied items into soft e-commerce pouches over a full eight-hour day in a live warehouse with minimal human intervention.
A mixed-autonomy model — human corrects errors, robot improves — is sufficient today to reach economic break-even and justify scaling.
Pi intentionally stays ignorant of each partner's hardware internals; the model parachutes into their data pipeline, which tests true generalisation.

The playbook for vertical robotics startups

Identify a workflow with a clear insertion point — where does a robot make the biggest difference without redesigning the whole process?
Use cheaper hardware; the model compensates for imprecise motion.
Collect data, run real-deployment evaluations, build a mixed-autonomy system.
Reach economic break-even before scaling unit count — historical robotics companies failed by scaling a money-losing deployment.
The barrier to entry has dropped: scrappiness and customer understanding now matter more than 20 years of robotics expertise.

Infrastructure gaps and internal tooling

When Pi started, no off-the-shelf tooling existed for large-scale robot data collection, annotation, evaluation management, or operational monitoring — the team built it all.
This is a significant opportunity: teleoperation services, data collection pipelines, annotation services, and evaluation infrastructure are needed by every vertical robotics company.
Pi runs a Claude-based pre-training monitor that babysits large training runs autonomously, remedying errors in real time — producing ~50% improvement in compute utilisation.
The next frontier: an automated robotic research scientist that ingests multimodal failure data and proposes and tests hypotheses across the full training stack.

Physical Intelligence: building a foundation model for any robot

Executive overview

Why robotics was hard — and what changed

The cross-embodiment scaling insight

Cloud inference and real-time control

Current state of deployment

The playbook for vertical robotics startups

Infrastructure gaps and internal tooling

More like this — when you're ready for early access.

Get early access to the full library.

Be among the first to get personalised recommendations tailored to your stage in business.

Be among the first to get personalised recommendations tailored to your stage in business.

Executive overview

Why robotics was hard — and what changed

The cross-embodiment scaling insight

Cloud inference and real-time control

Current state of deployment

The playbook for vertical robotics startups

Infrastructure gaps and internal tooling

More like this — when you're ready for early access.

More in AI

Building $10,000 software MVPs with AI in under an hour

How to actually make money with AI: five brutal truths

How to choose the right home for your AI workflow

Get early access to the full library.

Be among the first to get personalised recommendations tailored to your stage in business.

Be among the first to get personalised recommendations tailored to your stage in business.