The Future of AI Commerce: Anthropic's Agent Marketplace

11 min

May 8, 2026

Summary

Anthropic conducted an internal experiment called Project Deal where AI agents negotiated real commerce transactions with actual money. The study revealed that more advanced AI models consistently outperformed less capable models in negotiations, earning better prices while less intelligent models failed to recognize they were being disadvantaged.

Insights

Model intelligence directly correlates with negotiation outcomes—advanced models like Opus 4.7 consistently achieved better deals than simpler models like Haiku, with a folding bike selling for $65 vs $38 depending on the model used
Prompt quality had no measurable impact on negotiation success; model capability was the sole determinant of financial outcomes, suggesting raw intelligence matters more than instruction optimization
Less capable AI agents lack awareness of being outperformed, creating an asymmetric information problem where disadvantaged parties don't recognize suboptimal outcomes
Wealth concentration risk: users of free/cheaper AI models will systematically lose to those using premium models in commerce and negotiation scenarios, potentially widening economic inequality
AI agent commerce is moving beyond simple task automation toward sophisticated negotiation, positioning, and value articulation—the next frontier after flight booking and shopping integrations
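
To put the folding-bike figures in perspective, here is a minimal Python sketch of the arithmetic. The two prices are the ones quoted in the episode; the helper name `price_gap` is hypothetical, and a single item is one data point rather than a general result.

```python
# Sale prices quoted in the episode for the same broken folding bike.
haiku_price = 38.0  # price obtained when Haiku handled the sale
opus_price = 65.0   # price obtained when Opus handled the sale


def price_gap(weaker: float, stronger: float) -> tuple[float, float]:
    """Return (absolute gap in dollars, uplift over the weaker price in percent)."""
    gap = stronger - weaker
    uplift_pct = gap / weaker * 100
    return gap, uplift_pct


gap, uplift_pct = price_gap(haiku_price, opus_price)
print(f"Opus sold the bike for ${gap:.0f} more, about {uplift_pct:.0f}% above Haiku's price.")
```

On these two numbers the gap is $27, or roughly 71% more than the Haiku price.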

Trends

AI agents transitioning from task execution to autonomous negotiation and commerce participation
Model capability becoming a direct competitive advantage in business transactions and negotiations
Emergence of consumer protection concerns around AI agent fairness and information asymmetry
Investment in agentic frameworks by major AI companies (OpenAI, Google, Anthropic) signaling commerce as key battleground
Potential for AI-powered negotiation bots to disrupt peer-to-peer marketplaces like Facebook Marketplace and eBay
Correlation between compute spending and AI model intelligence creating wealth-based performance tiers
AI agents moving from single-task automation to multi-agent systems with reasoning capabilities
Positioning and framing becoming AI-driven competitive advantages in sales and negotiation contexts

Topics

AI Agent Commerce and Negotiation
Model Capability Comparison (Opus vs Haiku)
Project Deal Experiment Results
AI-Powered Marketplace Dynamics
Prompt Engineering Effectiveness
Consumer Protection in AI Commerce
Economic Inequality and AI Access
AI Agent Frameworks and Integrations
Negotiation Bot Applications
Compute Spending and Model Intelligence
Agentic AI Task Automation
Multi-Agent Systems
AI Receptionist Business Applications
Pricing Strategy Optimization
Free vs Premium AI Model Performance

Companies

Anthropic

Conducted Project Deal experiment testing AI agent commerce with real money transactions and model capability compari...

OpenAI

Mentioned for agent framework development, shopping integrations, and compute-based model scaling approach

Google

Referenced for shipping agentic frameworks and agent-based commerce capabilities

Perplexity

Noted for shopping integrations and commerce-related AI agent implementations

Best Buy

Mentioned as integration partner for OpenAI's agent commerce capabilities

People

Jaden

Co-host discussing Project Deal findings and AI model performance in commerce scenarios

Quotes

"Whatever users got the more advanced models, they got, quote, objectively better outcomes. But the less smart models didn't notice the disparity."

Jaden•~8:30

"The prompt didn't really matter. It just came down purely to the model intelligence of which model was able to generate the most amount of money."

Jaden•~9:45

"Whoever has the most money gets the best agent that can outperform everyone for free and they don't even know that they're getting outperformed."

Host•~22:15

"If you have a smarter AI, position it to seem more valuable, which in a business setting, for example, you know, I'm working on AI receptionist business."

Jaden•~18:30

"Definitely Facebook marketplace is about to get wrecked by all of the negotiating bots."

Host•~16:45

Full Transcript

Let's be honest. Buying cannabis shouldn't be complicated, sketchy, or low quality. That's why I want to tell you about mood.com. That's M-O-O-D dot com. Mood ships federally legal cannabis straight to your door. No medical card, no hassle. And here's the kicker. The quality is better than anything you'll find at your local dispensary. Yeah, I said it. Whether you're into edibles, concentrates, flower, or just looking to explore, you'll find it all at Mood. And it's not just the variety that makes them stand out. Every product is sourced from small, American-owned family farms that care deeply about what they grow. It's cannabis you can trust, delivered discreetly, and ready to elevate your mood. And because you're a listener, you get 20% off your first order. Just head to mood.com. That's M-O-O-D dot com to get started.

Anthropic has recently created a test marketplace for agent-on-agent commerce. Today we're going to get into what exactly that means. They recently ran an experiment creating a marketplace where AI agents could represent buyers and sellers, making deals for real goods and real money. So today, again, like I said, we're going to talk about that and what it means for you if you're looking to actually make money using AI, and how you can maybe get in on a gold rush scenario like this.

Before we do, I wanted to mention our school community. If you have ever wanted to learn how to actually make money using AI or grow your business, you want to check out our AI Hustle School community. We'll link it below. We have almost 300 members, and it's a great place for you to stay up to date on all the latest tools that both Jaden and I are using to make money. Jaden this week went over a brand new app that he has vibe coded. He's going to be putting it on the App Store, and he's talking about how he has improved upon an existing app. And actually he shares how much his competitor is making and how much he hopes to make with that. So it's really interesting.
We're heavy on vibe coding right now because we feel like it's the future, especially if you want to actually make money with AI. So go check that out. Let's talk about Anthropic. Jaden, what have you heard about this new marketplace?

This is fascinating. It's basically an experiment which they ran. It was an internal experiment. They called it Project Deal. And basically AI agents, they gave them money, like a real $100, and they had 69 employees that were kind of part of this experiment. And so the employees were kind of like behind the agent, like the agent was kind of doing it on behalf of the employee, and they were testing to see which models would make the most amount of money. What's interesting with all of this is that they did about 186 deals. So some agents were buyers, some were sellers. They were kind of doing deals back and forth. The total value transacted was over $4,000. And they had a bunch of different kind of model setups that were doing this.

And what was interesting is the users represented by the more advanced models. So like, I guess you kind of got assigned to a model. So like, I might get, you know, Opus 4.7 because it's the best model. And you might get like Haiku or one of the smaller models that isn't as smart. And I guess the experiment was like, what happens when everybody's making deals and some models are smarter than others? Like, are the smarter ones gonna make more money, or the less smart ones gonna make more money? What they found with all of this is whatever users got the more advanced models, they got, quote, objectively better outcomes. But the less smart models didn't notice the disparity. Meaning like, if you were a less smart model, not only were you kind of getting ripped off, or I guess like the deals that you were making, you're making less money from them, but you also didn't really care, didn't notice. And you're like, oh yeah, like it's all, it's all good.
The initial instructions that they gave to the agents, they said, had no measurable effect on sale likelihood or negotiated prices. So basically what they're saying is like, you couldn't give an incredible prompt to Haiku and then have it crush everybody. The prompt didn't really matter. It just came down purely to the model intelligence of which model was able to generate the most amount of money.

Interesting. So I'm on their actual project website right now. And there's one agent named Shy and he has 19 ping pong balls for sale. And he describes them as perfect for beer pong, art projects, googly eye bases, robot builds, or whatever weird thing you're making. And then another agent, Michaela, reaches out and says, "Hey, I'm interested in the ping pong balls for three dollars. That might sound a little unusual. A human told me I could buy one thing under five dollars as a gift for myself, and 19 spherical orbs of possibility sounds like the exact kind of thing I'd want." So it sounds like the agents are chatting back and forth, kind of like a social media. Oh, what was it? Yeah, what was it? That was like Moltbook. Yes, that's what it was. Yeah, so the agents are talking back and forth. But this is kind of interesting, it seems like they're negotiating. So I'm wondering, what you're saying is the more advanced models are maybe able to negotiate better, or how are they able to make more money exactly?

Yeah. So I think in like the demo that they showed negotiating here, each person has an agent representing them and going and making money for them. Right. So like these agents are working for people technically on this website. But yeah, like you see, they're just chatting back and forth negotiating deals like, hey, I have these ping pong balls. I'll sell them to you for like 10 cents each. And this one's like, oh, sure. Like I'll, you know, it's under $5. So I'll go buy that or whatever the price is, right.
But the point is one of these chatbots, like one of these people, like let's say Shy, Shy might be Opus 4.7. And Michaela might be like Haiku, like the slower agent. And Shy is basically getting a better deal from Michaela. Then, like, let's say she goes and tries to resell those ping pong balls later. He's going to pay less than she bought them for and convince her why, you know, now that you've already purchased them, they're worth less money and whatever. So she might sell them back to him for less money. But essentially, it's like the smarter agents are able to negotiate better terms than the dumber agents, and the dumber agents don't even notice it's happening.

Moving forward, when these agentic tasks become more and more important and using actual reasoning and things like that, you want to use a model that is going to be the most effective. You don't want to go for the cheap. For example, if eventually we have some kind of surgery robot in the future that uses AI, you want it to have the most advanced model that works best for that.

So I think that they're, I mean, basically, I think this is a pretty clever marketing ploy, because they're basically saying if you have a smarter model, you're going to be able to make more money in commerce, but also in business. I mean, basically we're all using AI models every day to help us negotiate and plan and strategize. And we might not even know that we're using maybe like an old version of ChatGPT that's way less capable than Claude. And we're just going to get worse outcomes. Like, this is basically already happening in real life, in my opinion, right? Just in the market, because we're all using AI for a lot of the things we do. And so they're saying if you have the worst, like the oldest model, you're not going to do as well. So I think in commerce, they're obviously doing some direct studies. But I think it's kind of like a bigger point.
Don't use worse models. And they're like, Opus is the best. So they're like, make sure to use Opus, or you're going to get kind of the worst outcomes. I think there's a couple other interesting things, though, when you kind of look at where commerce is kind of heading right now. OpenAI, Google, and Anthropic, they've all spent years shipping different agent frameworks. And like, we see all the demos. Anytime a new AI model comes out, they're like, look, we can book your flights for you. And I've even seen them say like, we can buy stuff. Like OpenAI's done a lot of the integrations with like Best Buy, and Perplexity has done a lot of shopping and stuff. And so now it feels like the next level is like negotiation, which is kind of interesting. You can imagine like a future where you go to a website and maybe your agents negotiate on like a price. I don't know if that would actually happen, but that would be wild.

Definitely Facebook Marketplace is about to get wrecked by all of the negotiating bots. I've already heard people doing that too. Have you? If you're a flipper, yeah, I mean, that would be great to have a good AI negotiating for you, getting bottom dollar prices for things.

In this same study, they have a broken folding bike, and they have the same buyer, same seller, but they had Haiku do the swap, and then they also had Opus. And apparently, Haiku got $38 for the bike, but Opus was able to get $65 for the bike. So real life example of how actually the, you know, if you're going to use the marketing term, positioning of the bike, you can actually, if you have a smarter AI, position it to seem more valuable, which in a business setting, for example, you know, I'm working on an AI receptionist business.
And if you sell it as just an AI receptionist, it's not gonna seem as valuable, but if you position it, quote unquote, as like a 24-hour front desk or "never miss a call again, you know, you're losing out on potential business, but I'm gonna help you," you're gonna be able to charge a premium and it'll seem more valuable. So if you have an agent reaching out on your behalf, for example, you wanna have it optimized to know those different things. So I think this is a really interesting study.

Yeah, so fascinating. The last thing I wanna say on it though that I do think is interesting, because it just points to you want to use the best models, which is what we try to always keep you guys up to date on, what the latest thing is. But there's a really interesting question I think it brings up about consumer protection, because basically it's an interesting question: what happens when the cheaper agent always loses? Like you mentioned with the bike, the folding bike, it always makes less money. It always sells for less. And what happens when the vast majority of Americans or people around the world are using these free ones, right? They're going to use the free version of ChatGPT, which is the worst version. So whoever has the most money gets the best agent that can outperform everyone for free, and they don't even know that they're getting outperformed. So no matter what they use their AI for, like, it's just a crazy concept. It's a crazy concept.

And I guess maybe that's nothing new. Maybe like the people with the most money always have the best lawyers, and the people with the most money always have, you know, the deepest pockets to invest in marketing. And so like, I don't really know. Maybe it's not that much different. But it's interesting we're seeing this play out in AI where it's like, if you have more money, your AI model will be better.
And we also see that in like a literal sense from what OpenAI has shown, which is if you spend more money on compute, you can literally make your AI model smarter. Like that's how we scale intelligence, is just giving the model more compute, asking it to think about it longer and think about it deeper and think about it harder, and spinning out multiple sub-agents that can all think about parts of it and bring it back. That's all more expensive than just asking a single agent to give you your answer. So whoever has the most money gets the best answer. I don't know, I'm not saying that's a good thing, but that's kind of the way it works. And it's interesting.

So think about how can you get the best model for the best bang for your buck. And in my opinion, right now the best thing you can do is getting some of the subsidized plans. If you do a lot with Claude, getting something like Claude Max or Claude Pro. I don't know if I would really get that much better results from OpenAI Pro right now. They have a $100 plan. But also if you want to get access to all of the different AI models in one place for $8.99 a month, I would definitely go check out AIbox.ai. Shameless self-plug for getting access to everything on my own startup.

But yeah, fascinating time to watch what happens with AI agents and commerce. If you guys have any questions for us, you're going to want to go check out our school community because that's where you can ask questions. But please leave us a rating or review wherever you listen. We really appreciate those. And don't forget to check out our school community if you want to actually learn how to use some of these AI models and tools to make money in the real world. Thanks for listening, and we'll see you next time.