Cursor's Third Era: Cloud Agents

67 min

•Mar 6, 20264 months ago

Summary

Cursor's founders discuss their latest Cloud Agents launch, which gives AI agents full computer access in cloud VMs to write, test, and deploy code autonomously. They explore the shift from tab completion to agentic coding, parallel agent workflows, and how coding patterns are fundamentally changing as agents take on larger development tasks.

Insights

Cloud agents with full VM access represent a paradigm shift from local coding assistance to autonomous development workflows
Video recordings of agent work are becoming crucial for code review, making it easier to evaluate large diffs and agent decisions
The future of coding involves parallel agent workflows where developers manage multiple concurrent tasks rather than writing code directly
Agent labs like Cursor are building routing systems to automatically select optimal models, reducing user cognitive load
Self-aware agents that can modify their own system prompts and understand their environment constraints will be key to scaling autonomous development

Trends

Shift from individual coding to collaborative agent-human development workflowsRise of parallel agent architectures for increased development throughputVideo-first code review replacing traditional diff-based review processesAgent memory systems using file-based annotations and dynamic context managementIntegration of development tools with communication platforms like SlackEmergence of sub-agent architectures for complex task delegationGrowing demand for multi-model routing and best-of-N agent selectionCloud-based development environments becoming primary coding interfaceVoice coding interfaces gaining traction for natural language programmingAgent self-auditability and environment awareness becoming critical capabilities

Topics

Cloud Agents Autonomous Code Testing Parallel Agent Workflows Video-Based Code Review Agent Memory Systems Multi-Model Routing Sub-Agent Architectures Development Environment Setup Agent Self-Awareness Voice Coding Interfaces MCP Integration Stack Management Agent Collaboration Long-Running Agents Development Throughput

Companies

Cursor

AI coding assistant company discussing their new Cloud Agents product launch

Autotab

Browser automation company acquired by Cursor to build cloud agent capabilities

OpenAI

AI model provider whose models are used in Cursor's agent routing system

Anthropic

Creator of Claude models including Opus and Sonnet used in Cursor's platform

Datadog

Monitoring platform that provides MCP integration for agent debugging workflows

GitHub

Code hosting platform integrated with Cursor's development and review workflows

Vercel

Deployment platform mentioned as potential integration for full-stack development

Graphite

Code review and stacked diff tool mentioned for scaling development workflows

Shopify

Example enterprise customer for discussing deployment integration challenges

Supabase

Backend platform mentioned as part of common development stack

People

Sam

Cursor co-founder discussing cloud agents and development workflow changes

Jonas

Cursor team member mentioned in context of collaborative development workflows

Wilson

Cursor engineer who built experimental browser and long-running agent systems

Andrej Karpathy

AI researcher who highlighted Cursor's transition from tab completion to agents

Tanner

Creator of TanStack Router mentioned in technical discussion

Quotes

"We think that over the months the big unlock is not going to be one person with a model getting more done. Like the water flowing faster, it will be making the pipe much wider and so paralyzing more."

Sam

"If you put yourself in the model shoes and you are seeing tokens stream by and all you could do was site read code and spit out tokens and hope that you had done the right thing. No chance I'd be so bad."

Sam

"We have found that in this new world where agents can end to end write much more code, reviewing the code is one of these new bottlenecks that crop up."

Sam

"10 person startups need the DEVEX and pipelines that a 10,000 person company used to need."

Sam

"I think honestly it's 1%. If I just am like, get frustrated and I'm like, I don't want to go have it tell an agent to change this one thing."

Sam

Full Transcript

3 Speakers

Speaker A

So this is another experiment that we ran last year and didn't decide to ship at that time, but may come back to you LLM judge, but one that was also agentic and could write code. So it wasn't just picking, but also taking the learnings from two models or N models that it was looking at and writing a new diff. And what we found was that there were strengths to using models from different model providers as the base level of this process. Basically you could get almost like a synergistic output that was better than having a very unified like bottom model tier.