The original is one click away. Open original ↗
AI browser agents are replacing manual web workflows
Executive overview
Every major tech company is building an AI layer between users and the internet. OpenAI's Atlas browser is the clearest early example: an agent that logs into your accounts, remembers your history, and takes actions on your behalf.
The agents work today for simple, constrained tasks. They break when tasks chain too many steps together.
The gap between what agents can do today and what they will do in 6-12 months is enormous — prepare now.
What Atlas browser does
- Agent mode logs into your SaaS tools and acts autonomously on your behalf
- Memory tracks browsing history so you can recall past articles, emails, or actions
- System instructions let you set persistent custom behaviour per website
- Currently Mac-only; agent mode limited to Pro and Plus subscribers
Scheduling 50 YouTube shorts: what worked
- Agent completed the task with a detailed, step-by-step prompt specifying every click
- Required periodic nudges — it stopped every 5-10 videos and needed manual prompting to continue
- Finished correctly: all 50 shorts scheduled for daily publishing
- AI-driven agents handle UI changes automatically; code-based automation breaks on every UI update
Post-meeting automation: where it struggled
- Task: ingest transcript, draft follow-up email, find attendees, send email, update CRM
- Agent stopped mid-flow after 3-4 of 5 steps without completing the sequence
- Action items were sometimes misattributed; email tone did not match preferred style
- Key lesson: keep tasks micro and individual — complex chained sequences fail today
LinkedIn comment replies: iteration required
- Prompt needed 3-5 rewrites before the agent reliably replied to comments
- Limiting scope (last 48 hours, unanswered only) was essential to keep the task achievable
- Critical fix: explicitly instruct the agent to press Reply after drafting, one comment at a time — without this it drafted but never submitted
Prompting principles that improved results
- State context and task clearly at the top
- Specify each step explicitly — name the exact buttons to click
- Constrain timeframes and scope aggressively
- Start with one item to confirm success before running the full batch
- Emphasise completion actions (e.g. "press submit") at the end of the prompt
The capability roadmap
- Today: simple, single-thread tasks — scheduling, summarising pages, basic replies
- 1-3 months: chained workflows such as full post-meeting automation in one run
- 9-16 months: outsourcing entire activity areas (e.g. expense management) to agents with minimal intervention
- Long term: voice-driven agents that remember context and act without detailed prompts — the "Jarvis" model
More like this — when you're ready for early access.
Join the waitlist for a personal account and content recommendations based on what you're working on.
No spam. Unsubscribe at any time.
You're on the list. We'll be in touch before launch.