The original is one click away. Open original ↗
AI interface design patterns for the next decade
Executive overview
Software interfaces have always been built around static nouns — buttons, forms, dropdowns. AI shifts the paradigm toward verbs: workflows, autonomous actions, adaptive responses. Designing for verbs requires entirely new UI patterns that don't yet have established conventions.
The reviews here cover voice agents, autonomous agents, prompt-to-output generators, adaptive email UIs, and AI video — each surfacing recurring design challenges around latency feedback, human-in-the-loop control, iterative refinement, and trust signals.
The central challenge of AI interfaces is keeping the human in control while the machine acts autonomously.
Voice AI: latency is the interface
- VAPI demo showed voice AI for developers — fast, natural-sounding responses, but no visual feedback while speaking or listening.
- Pairing multimodal cues (visual indicators) with voice is essential when a screen is present.
- VAPI's killer feature: a millisecond latency label on every response, building intuition for what feels human vs. robotic.
- Latency is not a technical detail — it is the primary determinant of whether a voice interaction feels natural.
- Retail AI demo: live phone call from a debt-collection AI agent; realistic voice, but higher latency broke the illusion.
- The agent adapted mid-call when the caller changed their stated name — a meaningful capability, but it still shut down when the conversation deviated further.
- Hybrid model — AI handles ~50% of calls, humans handle the rest with AI-generated transcripts — reduces grunt work while preserving quality.
Agent workflows: making autonomous decisions visible
- Gumloop uses a canvas-based visual workflow editor to expose what an agent will do at each step before it runs.
- Canvas interfaces are not new (flowcharts, chip design tools) but are now interactive — the old paradigm resurfaced for AI orchestration.
- Color coding distinguishes node types (input, action, output); a legend would improve readability.
- Progressive zoom fidelity — hiding small text at low zoom, collapsing nodes to colored blocks — would improve usability at scale.
- The real power of canvas interfaces is modeling branching, non-linear decision trees, not just linear pipelines.
- Templates with inline explanatory text blocks help new users onboard without separate documentation.
Spreadsheet agents: trust through sourcing
- Answer Grid takes a free-text prompt and returns structured spreadsheet data, with each cell populated by its own agent.
- Collapsing example prompts into single-click buttons lowers the blank-canvas problem for new users.
- Inline sources per cell — click any result to see where it came from — is the key trust mechanism when AI retrieves external data.
- The footnote/citation pattern (pioneered in academic papers, refined by Perplexity) is now a standard AI design primitive.
- Units were missing from numeric results — a small but impactful gap when presenting financial data.
- Dynamic column addition lets users extend the dataset with new agent queries on demand, not just static predefined columns.
Prompt-to-output: feedback loops for generation
- Polymet generates production-ready UI code from a free-text prompt, including multimodal input (sketch upload, microphone).
- Pre-built prompt examples reduce the barrier but the core interface is fully open-ended — both the strength and weakness.
- For long-running generation tasks, showing early low-fidelity results (like flight search engines) is better than a spinner alone.
- A richer prompt builder — design terms as selectable pills or LEGO bricks — would lower the expertise barrier without removing flexibility.
- Feedback overlay showing which parts of the prompt the model acted on (and which it ignored) would accelerate iteration.
- Sub-element prompting — editing a single component without regenerating the full output — is the frontier for iterative workflows.
Adaptive UI: context-driven interfaces
- Zuni (AI email for founders) generates suggested replies dynamically per email, replacing a static compose button with context-aware options.
- Abstraction level sits between full autopilot and manual drafting: predefined adaptive prompts the user selects, not writes.
- The UI knows which follow-up input is needed per response type (e.g., confirm a time triggers a time-picker, not a text field).
- Single-letter hotkeys keep hands on keyboard, but the lack of modifier keys creates a risk of accidental triggers when focus is unclear.
- A key design insight: the button labels change per email, but the key bindings stay constant — consistency at the interaction layer, not the visual layer.
AI video: fidelity-latency trade-offs
- Arjel generates deepfake video from a typed script, using a few minutes of source footage per person.
- Body language and camera angle are manually selected per script block; auto-detection would be a natural next step.
- The blurred preview pattern: audio generates quickly, video takes ~12 minutes; show blurred video with audio immediately so users can validate the script before committing to full render.
- Trading fidelity for immediacy keeps humans in the loop at the right moment — before a slow, expensive generation step.
- Inline text selection for body language cues (highlight a phrase, pick a gesture from a dropdown) would tighten the text-to-video editing loop.
Cross-cutting design principles
- Show the machine's work: latency labels, source citations, canvas nodes, and blurred previews all give users visibility into what the AI is doing.
- Human-in-the-loop checkpoints before expensive or irreversible steps are better than single-shot generation.
- Adaptive interfaces require a new consistency contract: visual elements can change if interaction patterns stay predictable.
- The current AI moment parallels the 2010 shift to touch-first design — every software component is being reimagined from scratch.
More like this — when you're ready for early access.
Join the waitlist for a personal account and content recommendations based on what you're working on.
No spam. Unsubscribe at any time.
You're on the list. We'll be in touch before launch.