Manus AI agent: multi-agent architecture and the wrapper debate

Executive overview

Most AI agent products are "wrappers" — combining existing models with tool integrations. The debate is whether that's a valid approach or a dead end. Manus shows wrappers can deliver real value through thoughtful architecture, transparency, and targeted fine-tuning.

A wrapper's competitive durability depends entirely on what it builds that competitors can't easily replicate.

How Manus works

  • A planner agent breaks the user's prompt into a master plan of subtasks before any execution begins
  • Subtasks are handed to specialised sub-agents, each with its own domain (memory, knowledge, execution)
  • Sub-agents draw on 29 integrated tools — web navigation, code execution, file parsing
  • An executor agent synthesises all outputs into a final result
  • Chain-of-Thought Injection keeps agents stable across dozens of reasoning rounds by forcing active plan reflection

Benchmark performance

  • Humans score ~92% on the Gaia benchmark; OpenAI Deep Research peaks at 74%
  • Manus scored 86.5% — the highest any AI agent has achieved on Gaia
  • Gaia tests reasoning, multimodal handling, web browsing, and tool proficiency

The wrapper question

  • Manus uses Claude 3.7 Sonnet as its core model, plus open-source tools (browser-use, E2B sandbox)
  • Cursor, Harvey, and Windsurf are also wrappers — the category label doesn't determine quality
  • Manus co-founder's strategy: work orthogonally to model development, so each new model release is an opportunity, not a threat
  • What separates good wrappers: intuitive UI, proprietary evals, careful fine-tuning, well-designed multi-agent architecture

Strengths and limitations

  • Per-task cost ~$2 — significantly lower than Deep Research
  • Exposes the file system so users can watch agents work in real time; competitors are largely opaque
  • Users can inspect, customise, or replace individual sub-agents and tool integrations
  • Coordination breaks down as task complexity scales
  • Core advantages (UX, fine-tuning, integrations) are replicable by well-resourced competitors
  • Vulnerable to API pricing changes or provider policy shifts

What durable differentiation looks like

  • Invest early in proprietary evals that are expensive or slow to replicate
  • Embed workflows deeply into specific user routines to raise switching costs
  • Secure integrations with platforms or datasets competitors can't easily access

More like this — when you're ready for early access.

Join the waitlist for a personal account and content recommendations based on what you're working on.

No spam. Unsubscribe at any time.

You're on the list. We'll be in touch before launch.

Get early access to the full library.

Join the waitlist for a personal account and content recommendations based on what you're working on.

No spam. Unsubscribe at any time.

You're on the list. We'll be in touch before launch.

Be among the first to get personalised recommendations tailored to your stage in business.

No spam.

You're on the list. We'll be in touch before launch.

Be among the first to get personalised recommendations tailored to your stage in business.

No spam.

You're on the list. We'll be in touch before launch.