How agents will change banking forever | E2260
Jason Calacanis and Alex Wilhelm discuss Andrej Karpathy's Auto Research tool that enables AI models to improve themselves recursively, the growing popularity of OpenClaw in China versus negative sentiment toward AI in the US, and demonstrate three AI agent applications for banking, mobile automation, and website testing.
- AI recursive self-improvement is becoming accessible to non-researchers, democratizing AI development beyond the traditional 3,000 PhD cohort
- The social contract between companies and workers has broken in America - profits surge while compensation and headcount decrease, fueling AI skepticism
- Americans should be cautious about AI adoption given the track record of tech companies prioritizing profits over worker welfare
- Open source AI tools like OpenClaw are enabling rapid experimentation and practical applications across banking, mobile automation, and business optimization
- The productivity gains from AI agents managing routine tasks like bill payments and social media could be transformative for knowledge workers
"The horse has left the barn in my mind. If Toby Lutke, the CEO of Shopify, is like, you know what? I'll try using this. I have no experience in this. Let me give it a shot."
"Americans should not trust AI or the AI industry, because until that social contract is fixed, they should assume the worst."
"For the first time in the history of America, we're seeing that flipped. Now profits are surging because we're lowering comp and moving jobs offshore or we're cutting headcount."
"This is the damn cracking from the developers owning the world to everybody building the future. And I'm here for it."
"The pace of improvement in AI this year should be insane. And that's very bullish for startups, for VCs, for everyone who's put money behind a data center."
All right everybody, welcome back to twist. Monday, March 9, 2020 Six lots going on. Alex, how you doing?
0:00
I am fantastic. The snow's melting. I'm finally coming out of winter. I'm tank top. Season's on the around the corner. So Jason, I'm happy. This week in startups is brought to you by Northwest Registered Agent. Get more when you start your business with northwest, in 10 clicks and 10 minutes, you can form your company and walk away with a real business identity. Learn more AtNorth registered agent.com sl twist quo founders move fast. Their phone systems should too. Quo formerly open phone gives you a clean, modern way to handle every customer, call, text and thread all in one place. Try it free@quo.com Twist and Gusto. Check out the online payroll and benefits experts with software built specifically for small businesses and startups. Try Gusto today and get three months free@gusto.com twist
0:06
what's the top story in our world? Startups, Venture capital. Technology.
1:00
You know, I think it has to be the Andrej Karpathy Auto research story. We've talked so much as an industry about the future of AI models eventually being able to improve themselves, getting that loop going and then at that point we have real takeoff towards superintelligence. But in this case what Andre has done, and if you don't know him, he's a former AI head over at Tesla. Jason, he's just one of the best and most followed developers, I would say, in the world fair.
1:05
Yeah. For AI specifically. Obviously worked for Elon for a long time. So yeah, that he would be, you know, top 20 recognizable names in the space.
1:30
Yeah.
1:40
So when I see his stuff, I immediately take a look at it. And in this case he has put a tool called Auto research over on GitHub and what this is, it's a really stripped down LLM training loop and it runs in five minute increments. So you bring your own AI model to be an agent essentially and then you give it a prompt and then what the system does is try to improve its own code over a five minute training period. Then it retests itself and then if the code is improved, if the result is improved, it keeps the changes and then continues to iterate. So it's a very simple loop of AI actually improving itself across certain tasks that you give it. So it's not the full meal deal. Jason, we have, we haven't solved AI recursive self improvement, but we have shown that it's simple and possible in some contexts. Very cool.
1:41
Yeah. And he worked at OpenAI, I think twice. He was like in between stints at Tesla or maybe Tesla was between the two OpenAI stints. So he knows what he's talking about. I think he started, he was one of the founders over at OpenAI in 2015 timeframe. So I think without being an AI researcher myself, what we're seeing here is as the models get smaller, as people get more comfortable with running a local model, this isn't like an abstraction, there's tons of open source models and then people are building skills, people are running tests and people are interested in playing with this technology. As they become interested in playing with technology, you'll have somebody like Toby from Shopify who on the weekend is like, I'll try. And the next person tries. And this is all claw pilling and just having folks get excited and tinkering with this technology. So it's kind of like the horse has left the barn in my mind. If Toby Lutke, the CEO of Shopify, is like, you know what? I'll try using this. I have no experience in this. Let me give it a shot. And as you can see here from the tweet, he had some success.
2:29
Yeah. So he went ahead and gave it a try over the weekend. By the way, all CEOs are dangerous now because they have Claude code and they can tinker. If you're a developer who like to have the CEO far from the code base, it's going to be a tough couple of years. Anyways, he ran some tests with Auto Research, the tool, and he says, I'll just quote here, woke up to a plus 19% score on a 08B model. So 800 million parameters, Jason, very small, higher than the previous 1.6 billion parameter models result after 8 hours and 37 experiments. So essentially his auto research setup ran 37 different tries of 5 minutes apiece and over 8 hours managed to find nearly a 1/5 improvement in results. He says, look, I'm not an ML researcher. I presume people are doing this in the labs at a much more sophisticated level. But he learned so much from doing this because you can watch how the AI thinks as it goes through the process. The other thing to keep in mind is the gains that we see. Here is an image from Andre himself showing Progress. This is 83 experiments of which 15 had improvements. The gains, Jason, as measured by declines in this particular axis do seem to get smaller over time. So there is a diminishing level of return here. But it works, it works.
3:45
I think people are going to keep experimenting with this and the idea that there's only 3,000 or so PhDs in AI who are fought over for $10 million a year. A million dollars a year, a billion dollar package, whatever it happens to be. We've seen these crazy numbers from Meta trying to catch up and people poaching each other back and forth. That 3,000 will turn into 300,000 people who understand how LLMs work and who can make meaningful progress on them. And that's great. We used to live in a world where only a small cohort of people could make iPhone apps. Only a small cohort of people understood how to use at scale databases. So it's an interesting time to be in. And if you are an AI researcher, getting in there and playing with it, just like you have knowledge workers, you and I, Alex, who maybe weren't developers or maybe had toiled a little bit, being able to vibe code, being able to use Claude Cowork to make a reoccurring job, being able to install OpenClau, being able to write a skill, maybe publish it to GitHub. This is the damn cracking from the developers owning the world to everybody building the future. And I'm here for it. It's exciting.
4:54
It's very exciting. It is democratizing. And I hope also that what we're seeing here on the edges of public AI work is happen happening inside AI labs at like twice the speed. Because if this simple setup can show, you know, AI driven improvement of AI outputs, then we must be cooking really, really fast in XAI, OpenAI, Anthropic, et cetera. So to me this shows that yes, we can tinker, yes, it is democratizing, but also the pace of improvement in AI this year should be insane. And that's very bullish for startups, for VCs, for everyone who's put money behind a data center. So I think this is one of the most bullish stories that I've seen in. In weeks, if not months.
6:17
Yeah, since Open Claw, I guess really quickly.
6:52
I want to talk about the US public AI and then China and AI. There was a big boom in Open Claw over in China. I thought this was just a thing that I saw over on Reddit, people posting pictures of GitHub, sorry, OpenClaw meetups. Here's a screen, an image, Jason, of one of those meetups. In case you were curious what that looks like. Here are people outside in Shenzhen teaching each other how to set up OpenClaw. You can even see OpenCloud right over there in English. I thought it was kind of crazy. You Know, this is so cool. People are really getting into it. It turns out also there are certain governments inside of China that are setting up incentive structures to get more people to use open claw. All of that to one side. Over here in the US there's a new poll that came out that showed that AI here in the states is incredibly unpopular. So we get kind of a split screen here between people in China, even older folks really diving into openclaw. And here in the states, poll numbers that are pretty terrible. According to the NBC poll, Jason, 26% of people in the US are pro AI and 46% are opposed to it for a minus 20% differential. I was surprised by this.
6:54
All right, I'm going to give my opinion on both of these stories after we talk about Plod.
7:57
Yes, Jason, we've been talking about the Plod Notepen tools. Essentially a great way to get all your audio from your ambient area into your device or into your personal cloud. You are a big note taking guy. You absolutely love a checklist and tell people why Plod has a place in their life.
8:03
Yeah, it's pretty straightforward. I have the Plod pin here. It records what I'm doing. Obviously, you know, you want to do this with disclosing to people you're doing it. And when you're in a meeting, instead of like fumbling with your phone and trying to find a recording app, you put this on the back. See this? This is the pro. And when you press the button, hold it down, it gives you a nice haptic, tells you how many, how much battery life is. It's got a little LED on how much battery life is available. And you hold it and now it does a little recording stream. I don't know if you can see that audio stream going there. Just to let you know it's working. And that means one click, you're recording. Then it syncs it with your device, whether it's the pin, whether it's the pro, and you're all set, you're ready to then transcribe it, put it in the cloud and it will do a straight up summary. It will do a mind map. There's a whole library of different ways for you to use your plod. And it's a game changer for me because I frequently forget things or I get inspired in a flurry because I'm hiking or doing some ranch work or, you know, the kids are getting dropped off at school. And then I'm like, oh, I remember I have an idea for the show. Boom. I just say action Item boom. I say, hey, book this person for the show. So go to Plod, AI Twist. They'll give you 10% off.
8:17
We do a lot of notes here at Twist, so we live in deny our ability to keep our information straight. So shout out to Plod for sponsoring the show now, Jason.
9:34
Yes.
9:41
Over in China. Open. Claude.
9:42
Yes. Here in the us, Gusto is a musto. You've heard me say it a million times. It's true. Now more than ever. Gusto is a must, though. They're going to save you so much time on your hr. Gusto is the essential payroll and benefits software built for your startup or small business. It's all in one remote, friendly and incredibly easy to use, so you can pay, hire onboard and support your team from anywhere. We use it. We love it. My team loves it. Everything in HR settled down when we went to Gusto. No more drama. People want a perfect HR platform, and Gusto is that platform. They make it fast and easy to move your data over. They're an ideal place to get started with basic payroll for your new business. They can also provide more comprehensive services to help you scale up. I think we have 22 people on payroll now, and Twist listeners are going to get three months free when they run their first payroll. So try gusto today@gusto.com twist and get three months free when you run your first payroll. That's three months of free payroll at gusto.com twist G-U-S-T o.com twist two separate issues. The claw thing is happening all over America, and it's open source. And what's great about open source is when something trends, everybody embraces it. We've never seen a project get embraced as quickly, as violently, as lovingly as Openclaw. It's number one on GitHub. It has the largest number of stars. Why is that happening? I would say it's two reasons. First and foremost, people are getting value from it. Second, it's inspiring for people because if they're starting a company or running a business or they want to stand out at work, they can use this tool to be a better performer. So high performers are drawn to this. And there are tons of high performers around the world. Obviously, in China, you've got a lot of aspirational people. If you're aspirational, you're pulled to this. Now, there are normies, there are citizens, there are human beings who might not be looking to make a career and they might not live to work. They may work to have a great life and there's nothing wrong with that. So you're not seeing like these open claw things happening in France. I don't think like people are like, you know what, instead of going to the bistro tonight, I'm gonna go do a Claude meetup. Maybe they are. Who knows? I don't need to give a hard time to the French.
9:43
March 19 Open Claw meetup Paris. It's at 5P Rue de Bouss, which is French, apparently.
12:13
We'll see if like six people show up or whatever. But anyway, the second piece to this does dovetail with it nicely, which is people in China feel like AI is going to make their lives better. They've recently seen their lives get better. They have an immediate history that they can look back on and say, hey, for 20 or 30 years we've watched people go from living on a farm without running water or sharing a cold spigot between three or four people. This is literally reality in China. No electricity or limited electricity in the village to working in a factory, living in a dormitory. The dormitory has air conditioning or heat. They have some spending money. They can go to a bar, they can go have lunch somewhere. They can start to build a life for themselves, eventually get an apartment. So they've seen this, so they're excited about the future. Now you go compare that to America. People have for the last two generations said, I've gotten 400k in debt, 200k in debt with my degree, et cetera. And since I'm in debt, then I can't buy a home. And if I can't buy a home and my job out of school is paying me 50 grand a year and I'm 250k in debt, I'll never get out from under. So there is like a resentment that's brewed over 20 or 30 years. And if you pull up the chart of what people don't trust in America, you know, it has gotten really acute for a couple of groups of people. Obviously the government, obviously journalism and media, both parties seem to have had their reputations, just become completely untrustworthy and obviously AI.
12:19
So Jason, here is the polling data that we're discussing. This is from NBC and I've gone ahead and highlighted a couple things so people can read it. Essentially here in the lower red box is AI. As you can tell, based on this poll in March of 2026, 5% of people are very positive in the US about AI, 21% somewhat positive, neutral, 27% more people than I thought were neutral. But on the somewhat negative and very negative it's 24 and 22%. So many more people on the negative side. And people are pointing out on Twitter and other places that ice, Immigrations and Customs Enforcement, a very controversial. I think it's fair to say agency of the government has a 38% return to people saying very or somewhat positive sentiments about it. So AI in the US is somehow less popular than ice.
14:00
Well, if you. There are some extremes going on here. So let's look at the extremes here. If we put the two numbers together, 24 and 22, somewhat negative, very negative. You get 46% of people are very negative or somewhat negative. They're negative about AI here in the United States. Now, if you look at the ice ratings at their peak, because you have five dates there. But March 2026, when we go back to this data here and you look at March 26, 47% very negative. 9%, somewhat negative. You put those two numbers together, you get to 56. So the ice is still in terms of total negativity eking out AI, but it should be concerning to people. And then if you look at. I'm assuming when they say Iran, they're looking at it negatively. They're talking about either the war or I'm gonna guess it's more the government of Iran. I'm not sure.
14:47
I think it's the government. I forget the exact phrasing of the question, but it was what is your opinion about item? And then it's somewhat positive, very positive, negative.
15:42
Those two numbers together, 61% are negative or somewhat negative by Iran. So why is AI as unpopular as a brutal dictatorship that murdered tens of thousands of peaceful protesters and ICE agents? An agency that has, you know, had two deaths, whether you consider them murder or not. I'll leave that to the investigations. I have a theory of why AI is so unpopular in this country, and it's that the industry has done itself no favors in terms of explaining itself. The second is we have seen a social contract in America be broken in big tech and in large corporations in America. It's just not limited to big tech. And some big tech companies are not just tech companies. Like Amazon is a E commerce company, right?
15:51
Sure.
16:48
There was an explicit agreement that if profits were going up, compensation, bonuses and headcount would go up. These two things were tied to each other for the history of modernity. The company's doing well, raises. The company's not doing well. Okay, we're gonna freeze raises. Oh, the company did well this year. You get a bonus. Oh, the company lost money this Year, hey, the bonus program's on pause. And everybody was like, hey, that's reasonable. Right? We're all working together, swimming in the right direction. And they said, hey, you know, if things go well, yeah, your group. Oh, you're in the sales group. You're in the customer support group. Yeah, we're going to get you a couple of extra headcount. You won't have to work the weekends as much or burn the midnight oil. We understand you need more headcount to hit the goals. And hey, why not? The company's growing. For the first time in the history of America, we're seeing that flipped. And it used to be if profits are surging, if profits are surging, bonuses and headcount will increase. Now profits are surgering. And now profits are surging because we're lowering comp and moving jobs offshore or we're cutting headcount. This is apparent to Americans now. They see it themselves and they also. The gig economy is a major contributor to this. Everybody in their family knows somebody who maybe they didn't fit in in corporate America. Maybe they weren't able to get up in time and do the nine to five thing. And this magical little thing, press a button, get a job came out. It got criticized. Oh, my God. You know, you don't get benefits, whatever, but let's call it what it is. Alex. We all knew somebody. I don't want to call them an F up, but they just didn't fit into the 9 to 5 gig. And when they were able to support, or maybe they were a stay at home parent and only had four hours and they wanted to spend time with their kids, which is, that's actually laudable. But we have somebody in our life who's like always down on their luck. They get fired, whatever. And all of a sudden they became their own boss. They started doing doordashing. They started doing Uber. They might have done Wonolo. All these other services where you can do shift work or make your own hours, gig work. They see that going away. So you see two things happening concurrently. Hey, your aunt who worked at Microsoft or Amazon and was in a senior position was making six figures, just got automated. Oh, your cousin who couldn't keep, who couldn't keep that job and wasn't as aspirational. Maybe as Aunt Susan, Waymo's in their town. And Doordash has robots that they made themselves zipping around. And there's other bots on the street. Writings on the wall for cousin Sal. So now Aunt Susan and cousin Sal are hitting it and by the way, you know your Uncle Joe, who is the programmer, he's now wondering if he's going to have a job. This social contract's been broken. And Americans should not trust AI or the AI industry, because until that social contract is fixed, they should assume the worst. You can have the most brilliant team members in the world and a great product, but if your team is not communicating well with one another or with your customers, more importantly, it could all be for nothing. You want to make sure your team remains totally in sync and responsive to new customers and returning customers. Right? The best new customer is keeping your last customer. And that's why you need to check out Quo. That's Q U O. They're the modern alternative for running your business, communication. And they're already being used by more than 90,000 companies. I use Quo. Why do I use Quo? Well, we use it for our syndicate, where we have thousands of members calling us. And these are VIPs. We want to make sure somebody picks up the phone for them. It's so easy. Your entire team can share that phone number or they can have their own, and you just install an app on your desktop, on your Android, on your iPhone, or through the web interface. No more missed messages, no more missed phone calls. It's the greatest. So try quo for free. Plus get 20% off your first six months by going to quo.com twist. That's Q U O.com twist quo. No missed calls, no missed customers.
16:49
I do not disagree. And I've always struggled to match my AI enthusiasm, my belief in the progress of technology, lifting boats over the longer term with what I consider to be the potential for quicker job loss than we can replace with new jobs. And I think the technology industry, if they want to avoid a political crisis in the next four years, maybe a couple of elections, need to start doing more than they are to ameliorate the public's discontent. Because this is not just people saying no data centers in my backyard. This is people potentially voting people into Congress who want to put relatively strict guards and guardrails around what AI can and cannot do, which could disembowel the industry, which I think you and I both agree would be a net negative for the country over the long term. So I don't know what the solution here is, but there appears to be a political cloud forming in the future. People are.
21:08
It's not forming. It's here. I would argue it's here. You know, there's a group of people who are denying it. There's a Group of people saying, hey, it doesn't matter, we'll create new jobs. And then there's a group of people who are being labeled doomers. And you can always tell when people lose an argument because they want to label you. This happens to me all the time on All In. I was just about to say co
22:01
host who doesn't like it when you bring up AI job loss.
22:23
Yeah, well, yeah, I mean it's obviously if you're in charge of that, you probably want to try to create jobs. Now I'm not saying Sachs is wrong or I'm right. I just think the American people should have their guard up. They should assume the worst. Why? Because they gotta hit rent. Because they gotta, you know, get groceries. Because they have bills to pay. Assume the worst and then be delighted if your job playing the violin, you know, at a cafe. Because everything because there's no jobs becomes a reality. Let's hope that this becomes the Star Trek version. It could. There's a third of a chance of that, a third of chance of massive job loss and a third of something in between. But a lot of good questions coming in. I think this is a very emotional issue. And our noti gang, these are the people who have notifications turned on on YouTube. They get an alert on their phone. Hey, Jake, Al and Alex are talking. Here's the topic. And then they click the link and they come into the chat room. They watch us live Monday, Wednesday, Fridays about Texas time, 12pm Texas time, 10am Pacific time, 1pm East coast time. What questions do our noties have for us?
22:25
So Niall Roche says, why so negative on openclaw doctor Adoption in Europe. Don't forget openclaw was started by a European Fair point, Jason. You now must defend your anti French agitation.
23:34
I'm just talking about how delightful the lifestyle is in Europe and how it's turned into a retirement community. Epcot Center. You know, anybody who is in Europe and who wants to build a great company is probably looking at going to another region because you need to have a bunch of people working for you. Hardcore. Doesn't mean it can't be done. Obviously the Scandinavians in Berlin stand out as places where interesting things are happening. But I think we all can call it what it is. Europe's a retirement community, full stop.
23:45
I wonder about that because we're seeing you and I have had this conversation every three months for several years now. But I keep my eyes open and in scale. The UK based neoclass just raised $2 billion. The UK government just rolled out. They rolled out a 50 million pound fund which I made fun of. Then they rolled out a 500 million pound fund. Okay. A little bit progress.
24:18
Yeah. The fact that the government's doing that tells you everything.
24:39
They have large companies. So I think using. Actually Jason, I think using public funds to build out sovereign compute is a good idea.
24:43
You know, most people would say that's socialism or it's putting your thumb on the free market and you really want more entrepreneurs battling it out. And the government having nothing to do with that. Typically when the government intervenes like that, it's because the private sector's not getting it done. But we'll put that aside. You know, the state of Europe and you know, having a couple of, you know, unicorns here or there. Yeah, sure. It just feels to me like with the education level and the resources, they greatly underperform where they should be at. They should be producing much more and they have work to do. In terms of the taxation there on startups, on investors, if you put up too many roadblocks for founders in terms of hiring and at will employment and you put up too high taxes, then people will invest less and they'll move to other places. Europeans have a very unique ability to live in Dubai. Insert. Yeah, okay, bombs are going off, but they can move to Singapore, Dubai and other places and pay no taxes or pay under 10% taxes. Any logical investor in Europe faced with 50, 60, 70% taxes will get out of dodge and that's what they've done.
24:49
We had a couple questions about AI driven job loss. Jason. Before we jump into the Jason productivity hack, I'm just curious what your advice is for people out there who are concerned. We talked about, you know, AI won't take your job. People using AI will take your job. I think the agentic world has shown that to be different. So if you're a normie, what's the best way to have armor on your body to keep your job secure?
26:06
Short version, if you're working in corporate America, being the person who knows how to manage, being a maestro who manages AI is the key person. If you're in any other field, moving up the stack to things that a robot can't do and outrunning the robot is the best piece of advice. So what can a robot do? It can deliver a burrito and it can drive full self driving. It's going to be a decade long deployment of those technologies. So what's the next thing up from there? Being a carpenter, being a handyman, being a plumber, being an electrician and if you look at that generation tool belt, if you were cousin Sal who is not a nine to fiver and loves his Uber doordash Winolo draw, you know, on demand economy, you might need to buckle down and say, you know what, I'm going to learn a trade. I'm going to take out YouTube, I'm going to take out chat GPT. I'm going to learn how to be a carpenter. I'm going to learn how to build a fence. Because the robot ain't building no fence. I can tell you, I just built the fence. It cost me like six grand to build a tiny little fence. It was a two day job, you know, do you know what handyman who can build the fence get paid? It's like, yes, 100 bucks an hour.
26:27
I, I've done fencing. Yes, I paid a lot of money.
27:37
You know what an Uber driver gets paid? 30 bucks an hour. You know, in a city or a doordasher, 20 bucks, 30 bucks. Sometimes they hit it, make 40 bucks on the weekend. It's easier than ever to build a great new product. And launching a startup, even as a solopreneur, is getting easier and easier. But as my experienced founders already know, there's a lot more to starting a company than just putting up a website or even building a product. If you're serious about going into business, you need a Delaware C corp that's going to give you a serious leg up on your competition and you can be taken seriously by investors. That's where Northwest Registered Agent comes in. Just a few clicks, they're gonna give you the perfect start to your new enterprise. A real identity, a domain, a custom website, a business email, or all the public filings done, and even a phone number. And you'll complete this process in under 10 minutes. Plus, you're gonna get all sorts of free tools and resources from step to step guides, compliance reminders, and an online account that's going to keep everything organized even as your business grows. So get all the advantages of a Delaware C corp, regardless of where in the US you're operating out of. Northwestregisteredagent.com twist. You can just build a fence and that's not going away. And that's a hundred bucks an hour, 75 bucks an hour. So you're going to have to outrun the robot. The robot will build defense and five years, seven years. Then you're going to need to learn how to make the beautiful desk and carve it that the robot can't. And then in 10 years, forget it. It's over anyway. You'll have a robot and then you'll just be managing robots.
27:40
I'm just gonna. I'm on the moon in 10 years, so don't call me. It's not my problem. Jason, one thing that you do is a lot. You do a lot of stuff. You keep us all on our toes here. And I was curious if you could drop some gems on us of knowledge. But how you are productive.
29:15
Oh, I have a productivity hack. Okay. This is a really easy one. You know, I work with my Athena assistant on this. I call it super distribution. Okay. There's a ton of platforms out there. When you have a great piece of content for your company, for yourself, what people do is they're like, oh, I had a great piece of content. What you need to do is take that piece of content and then go super distributed. What does it mean to super distribute it? It means you post it two or three times in two or three different versions across all platforms. So if you have an EA like I do from Athena, I will tell them, hey, take this, put it onto my Twitter, put it onto my, you know, put it in the drafts, maybe put it onto LinkedIn, put it in all these different places. Hey, we have these different accounts. And go find me other pieces of content that people in my network like say in my case portfolio company founders are doing and give me a list of that so I can engage them them. This social media 101 work, if you were to hire a social media person would be again 50 bucks an hour, 75 bucks an hour for a freelancer. But you can do it with an Athena assistant. If you want to get into these hacks, just go to Athena.com jcal and you will get $2,000 off your Athena assistant. They're also learning how to use I don't want to tip anybody's cards here, but they know how to use agents at Athena. So Athena plus agents plus your agent. Put them all in the same Slack instance and you will get incredibly productive. You have a human in the loop with your Athena assistant who can make phone calls. You'll have your agent who is openclaw or you know, your Google agent, your Microsoft launched agents, your notion agent, your Slack agent. There's all kinds of different agents out there that you can use multiple ones. But having a human in aloof loop is going to make you even more productive.
29:29
Can I tell you my, my, my recent Athena brainwave?
31:19
Oh, go ahead.
31:22
All right, so right now we have a nanny which is given what we we pay top of market. Because we care about our children.
31:23
Yes.
31:30
The moment. The moment we don't have a nanny, I'm hiring an assistant. And I was like, we can just get Nathan assistant. Because. Because it's so much cheaper than a nanny. And it's going to be so helpful because we have so many things now going on constantly. I can't wait. This is my weekend. Juggling children, diapers, chaos zoo, you know,
31:32
also, it's a luxury item, I'll give you that. But it's also a productivity hack for mom. Here's another one. I like a certain bakery in the Austin area. It's called Abby Jane. They make incredible bread. They use the whole grains, that artisanal grains. So it's just. This is literally the best bakery in Texas, in one of the 10 best in the country. Problem is, they're open Thursday to Sunday and they sell out and they don't deliver. They don't need to. The best. Right. But they do take orders and they do open at 8am Athena assistant calls at 7:59. 8am There it is. Abby Jane. Oh, man.
31:50
These are their bagels getting ready to go.
32:31
I mean, it's just unbelievable. The chocolate croissant is, like, literally from France. So, you know, I was talking to the woman there, and I was like, hey, you know, you should open up, like, other days of the week. Maybe you should get an investor, you know, Hint, hint. She's not interested. She's got her thing. It's dialed in. And I was like, why aren't you on Uber Eats? I don't want to be on Uber Eats. I sell out. Okay, great. Fair enough. Uber has a courier service. The courier service is like 10 bucks, but you have to coordinate. So what do we do? My Athena assistant looks at their menu at 7am finds out what's new on the menu because they change it every week. Sends me it with a number list. I say four of these, two of these, three of these. I would forget to do this. Then he puts the order in, puts it on the credit card, tells them he's coming to pick it up, tells them the name of the Uber driver who's coming to pick it up. This is for Jason. The person picking it up is Joe. They pick it up. Then they call the Ranch, tell the ranch, hey, this person's coming. Here's the link to the Uber courier. Boom. All done. All done every week. But, you know, there's like 10 steps to this. It's not gonna be done by an agent anytime soon, but it is done by my Athena Assistant My girls eat healthy. We stopped ordering Schlocky Croissant from H E B Schlocky Bread. And now they have Chef's kiss. The best in Texas. Little hack there for you. There's another product. It's a double productivity hack. People don't know about the courier service. Do you know about the courier service?
32:33
I had read about it and then it had fallen immediately out of the back of my head. Just instant forgetting.
33:58
The best thing that you want to get for your. For your family. It's not available on Doordash or UberEats, but they are available for pickup. Yeah. So there you have it, folks.
34:03
All right, let's do some demos. Let's do some stuff. All right, so first up, I'm going to bring up Cresh Ramamurthy. He's from NetX. Hey man. Suresh is going to show us moving money and interacting with bank accounts securely using OpenClaw and a skill from his company, Net XD. So Suresh, talk us through it and let us enjoy.
34:14
Hi, Alex.
34:34
Hi, Jason. I hope you guys can hear me.
34:35
Yes, sir.
34:37
So we have a full stack for banks and we decided that we will connect it to OpenCloud and we built cryptographic security for that so that obviously I can't let OpenCloud run away with my money. So we integrated right into our bank mobile app. So whatever OpenCloud does, finally, I approve it from my bank app and my bank runs on a blockchain.
34:39
You were one of the lunatics brave enough to give OpenClaw access to your bank?
35:04
Yes.
35:11
Which means you gave it your password in two factor and authenticated it or you used a password manager. Explain that step in plain English.
35:12
I built a banking platform myself. I have 20 plus banks using my platform and I built a special open banking API so agents can access read only as well as set up transactions that I as a user can use my secure banking app, which has my private keys, ECDSA keys, to approve it.
35:22
Got it. So what's the name of your private company that does this? The platform netxt.net x t. This existed pre OpenCloud because today OpenCloud, today's AO44. It's been 44 days since we started talking about OpenClaw here on the program.
35:42
Of course.
35:59
Yes.
36:00
I built it to handle 100 billion machine IDs about six years ago. I'm at Google. I've always been thinking about machine to machine commerce. So when I built this ledger, I planned for 100 billion machine IDs which seems kind of small right now.
36:01
Okay, so Net XD existed before openclaw and it's a banking ledger. Now openclaw users can open an account on nextaxd, no, on a bank that
36:16
supports Net XD platform.
36:29
Ah, got it. Then the queries go from your openclaw to xd, correct? Yes, got it, got it. So that will only allow it read only access. But if I want to pay somebody like, you know, I often will get, you know, here's the gardener, here's the pool cleaner, you know, here's the garbage service, whatever it happens to be. Here's the, you know, septic tank. You got a ranch. You probably have 20 different vendors to maintain a ranch. I could have. When their email comes in with their bill, sometimes they just send an email with a document assigned to it. I could have my openclaw file that queue up the payment through Zelle or whatever and then not let my openclaw pay it, but queue it up for me to then approve it. Yes.
36:31
It'll pop up in your bank mobile app for approval. And your bank mobile app has a secret element with a private key which approves it. What the bank sees is assigned transaction from you, Jason, to the bank, which is the only thing they would approve, which is the only thing they would accept.
37:21
I mean this is Alex, the productivity we need. This is the power of an open source. You would not have been able to do this if this wasn't open source. Right. You would have had to do all this manually. And the fact that there's an open source agent makes it so easy. Right.
37:36
We had to build so many other plugins. We first did it on Claude a year and a half ago we did it, we tried to build all the intention discovery ourselves and we stopped and said, I know where the developers are going. Instead we built agentic memory because you may want to pay your bills in a certain way and you want the memory to remember, hey, I want to pay this only on the last day of the due date. This one I want to pay early so that he doesn't stop me from, doesn't supply. So you can have your own long term memory on how to do things. We have a memory that goes along with this as well.
37:50
Suresh, can you show us how it works?
38:23
Sure. Openclass got a little jealous of bricks and ramps. Every time I ask you to list capabilities it says how much better it is. So I'm going to ask three questions at the same time. I'm going to say, check my bank balances, check my memory. For any rules. I have to optimize my savings and checking and check. Have I paid the late edge invoice, which is what Jason was asking, kind of. So now that I've sent that message through Telegram to openclaw, I'm also keeping my mobile app handy. As you can see, I have two controls, Jason. One to control openclaw. Anyway, you can see the request coming. I'm going to.
38:25
Okay, so we see that happening on the right side. You've got 3700 bucks in your checking. You got almost 10k in your savings. You're doing okay, kid. You've. Oh, no, wait, that's 9 million. You've almost got 10 million in the bank.
38:58
It's a startup that just got a Series A. Let's think of it that way. It's a hypothetical startup. Okay, so now it says, hey, you're off by $1,300 on your checking because you plan to keep 5,000 there for optimization. So I'm going to say, go ahead, do it. Optimize it. Move the money.
39:10
This is something everybody's got to deal with. Because you might. If you dip below a certain amount, the banks, the scumbags at the banks may give you a fee for going under 5,000.
39:26
Yes.
39:36
I mean, why do they do that? Like, this should be something they do automatically.
39:38
You can see that it's moving. $1,300 to optimize. I choose it. I approve it with my private key, which is linked to my biometrics. And only then the bank knows it will process the transaction.
39:41
You have given it optimization rules in memory. Keep a 5000 buffer in checking, sweep excess into savings. We all want that. Weekly auto transfer, $100 to savings. Let's say if you were on the savings tip, windfall capture, 50% of large inflows, savings buckets, emergency 50 investments, 30, yada, yada, monthly go, 20% income save triggers paycheck, large inflows, end of the month, boom. So this is something everybody has. You might say, hey, every month I want to move anything above this amount in savings into a 529 for my kids, into my 401k. But my 401k could only accept Alex, this amount. So this is where these things can get super powerful. Then you know what I really want? I wanted to go find better deals from other banks for my savings or for my deposits. Right? You put a deposit into Robinhood, you get like some bonus or something. So I love the idea of it checking my deals with my bank against other best offers. And then I would give it permission to queue up an email in my drafts folders to send to my bankers. Because this is what I have. Operations people like Heidi, who does operations
39:51
for me, Shout out, Heidi.
41:04
Shout out to Heidi. Like, I'm like, are we getting the best deal here? And like, I realized I had like a million dollars in some accounts not getting interest. I'm like, why aren't these in interest accounts? Oh, we're supposed to have them in interest account. I'm like, what's the interest rate? It's like, oh, 4, 3%. I'm like, well, why don't we have 5%? Other places have 5%. Okay, we got it up to 4.25. I'm like, you realize that's like $45,000 a year. Over three years we've left 150K. Like, why do I have accountants and all this operations people if we can't get this right? It's like, oh, it's tedious. It's a chore. People forget. That's the key to openclaw is humans forget. Humans are fallible, especially when it comes to chores. I forget to do my chores. You think the dishwasher is clean here? I may have forgot to put stuff in the dishwasher. I may have forgot to bring my dry clean into the dry cleaner. That's where AI shines, isn't it?
41:05
Are you going to take any of the underlying net XD tech that was used and open source it or make it available for other folks to play with and Tinker?
41:50
Make XT control available to anybody using OpenCloud. That is the one that is right on the right hand side. That is the XT control app which you can use to control XT so that it doesn't used to control OpenCloud, the banking platform we are opening. One bank's gonna announce in three months they're gonna have an open banking API that'll connect to any AI, not only OpenCloud, Cloud, whatever it is, at which point anybody can open account at that bank and connect all these things on their own.
41:57
All right, well done.
42:20
Netxd.com Suresh thank you so much for your time, man.
42:22
Thank you.
42:25
I really appreciate it. That was freaking awesome. Next up, we're gonna talk to the Phone Claw. This is Rohan Arun. It's an openclaw extension, if you will, that allows Asians to operate smartphones like humans, which is incredibly cool. And he's gonna show us a demo of multi step phone automation, essentially doing some video work and posting it to multiple phones at once. And Rohan, are you gonna show us with the glasses on or the glasses off?
42:25
I wanna show you with the glasses on. Cause we've been focusing on wearables and mobile devices, so I think that'll be exciting. Yeah.
42:48
All right, so we're gonna do this from your first person view through your Android AR glasses, right?
42:54
Yes.
43:00
Yeah. We also love the iOS app that was just launched, but we're focusing on mobile and AR devices.
43:01
All right, let's do it.
43:05
The window on the left hand side of the screen is our app. You can go to getsupers.com to try it. Right now you can essentially connect Android devices. You can also connect a Mac and you can connect multiple different agents. You can see that I've connected three different Android devices here in parallel that I'll automate in a second. On top of it, we have a real time assistant. Pete steinberger said that OpenClaw is not intended for non technical people. So we've been trying to approach it from the angle of mobile devices and wearables. And so our assistant allows you to, basically you can connect it in maybe 30 seconds. You can start automating entirely with voice.
43:07
So what's happening with your hands there? I see when you show your hand in front of the glasses, it's putting some notes on each finger and it's, it understands what your thumb is and your index finger, et cetera. What's the point of all that? Is that just it? Yeah, good question.
43:42
So basically, I suspect the reason Amazon, Alexa and other devices, AI devices previously failed is because voice alone doesn't solve the problem, because it doesn't solve discovery and navigation. So what I'm seeing here in my hand is actually all the different options that I have for voice commands. So we see prowers, browsers. We can automate the Android device, we can automate the chat interface.
43:58
Those are buttons you can press with your hands.
44:19
These are actually voice commands. So we're going to be automating four different Androids and they're going to be posting to social, posting to Twitter. We can automate all your social apps and things like that.
44:21
Okay, so you're a social media manager and you're going to, with these four different phones, post to your Twitter, Instagram, whatever.
44:30
Yes, correct.
44:38
This is an exciting demo we've got ready to go here. He's going to tell his agent across three virtual phones what to do. So this is particularly important. We have an election coming up in 2026 and Putin and Xi, they need more Phones, Alex. More phones and more social accounts to cause chaos in our elections in 2026. So here we're gonna see how Putin's gonna do this next version of Election where she's gonna do election interference here in the uk.
44:39
That's the worst pitch I've ever heard. Hey. Hey.
45:14
Do you wanna take. You're thinking it. I'm just saying it out loud. Hey, super.
45:16
Can you hear me?
45:23
Yes, I can hear you loud and clear.
45:24
Can you go to device 2 and can you open Twitter and can you post to Twitter about this week's In Startups podcast? Okay, so what it's doing is it went to my second Android, it posted and it made a post. Check out the latest episode of this week's In Startups podcast. Always insightful, inspiring.
45:26
So you basically take an old Android phone and then you give your assistant the keys to that kingdom and let them rock and roll on apps and they can do things. Now, if I had my Mac Mini, it does have native iPhone mirroring, so I can mirror my phone, but I wonder if I have mirrored my phone, if my open claw could then take over my desktop and do things in my phone.
45:45
So you can click things on your phone. So that's also one of the things we solved is we're using the Moon Dream API. We figured out how to solve this kind of computer use problem with really free models. So you can go to getsupers.com and connect your phone or a Mac, and you don't have to pay $200 a month, for example. And so there are other tools that allow you to kind of hack your way to this. We allow you to go directly to the phone. And the problem with automating certain things on the phone, that screen sharing it with iOS actually prevents you from automating certain things like calls.
46:16
And for security reasons, this is absolutely the future. Being able to use ar. You really have two startups here. One is using ar, which is years in the future, but then actually giving your openclaw, your assistant, its own phone to do things because of the app ecosystem. Kind of interesting, kind of compelling. Very cool. And I'll look forward to see where you're taking this next. All right, let's keep moving.
46:46
Rohan, thank you so much. Appreciate it, man.
47:13
But I was joking about the Putin stuff. But if you were actually trying to do research on apps, let's say being able to have OpenClaw or another agenting technology, provision 10 different Android phones, download the apps, install them, play with them, authenticate with them, like even just for research as a firm to understand the changes happening, competitive intelligence. I mean, I could see a lot of interesting use cases here. I never thought, what if the agents had unlimited access to app stores and apps? It's a pretty interesting idea.
47:15
Go back to the top of the show. Andrej Karpathy's idea of auto research and letting LLMs train themselves. Now take that. Build an app, put it on your dummy device, use phoneclaw to let the agents interact with it, run tests. I mean, you could build a self reinforcing, like mobile development loop here with just a couple of pieces kind of duct taped together. And I wonder if that's going to do away with certain testing because then you don't need to have a human in the loop at all. You can just run it autonomously.
47:53
It'll certainly accelerate tests. Right. So if you were com.com and you wanted to test different aspects of the app, you need to have humans testing it. But you might also want to test responsiveness and speed. Speed and other things. And yeah, you could create a test flight and just say, go, here's your metric. And that was the key to Andrew's innovation this weekend, is you need to have a metric that the recursive loop can key off of. So you'd have to really set a North Star. I want the app to be faster. I want the app to be, gosh, I don't know, less buggy. I want the app to work like if you said, I want it to work better on iPhone 12s and earlier. So make me a light version of the app that detects you're on an iPhone 12 and just downgrades everything to make it as fast as an iPhone 17. Like that kind of benchmarking and instruction would work. But if you don't have a good benchmark, you don't have a North Star, it's not gonna work. So I think that's a key to the end of the rails.
48:21
All right, we have one last demo for us, and that is from someone that you know, Jason. We're gonna talk to Eugene Stuckless of Aehr Inc. Aehr is a founder, UX Japan company. You met him in Tokyo, and he's going to show us an AI native testing tool that helps agents test. It's a little complicated. We spent a lot of time talking about it before the show. So, Eugene, maybe we should just start with explaining this to Jason.
49:22
Why don't I show you?
49:49
Even better. Much better.
49:50
So Air Inc. Is an AI native automation platform. Our first product is Aehr Tests it's using agents to test websites, right? So you provide a URL and then with this URL, about 15 minutes later you get a link in this link or you get an email. And this email is a deep, deep dive into your website, analyzing it from 10 different criteria. SEO, performance, trust, UX, friction, security, right? This happens in about 15, 20 minutes. This is a free report. This is what it looks like when you go to the site, you see and you drill down the evidence about what went wrong, how you fall short. This is free, right? So you pay 20 bucks to buy 150 credits and then you can run this report maybe four or five more times. What's pretty cool about this is that you see there's, it's very, very in depth. You'll see you drill down into one specific area, let's say SEO. This particular founder has been focused on SEO. So he has put a lot of effort in and you can see that this test report has tracked his changes over time translates into business impact that he can understand. This is AI testing top to bottom, right? So this generating of a site readiness report is one workflow we have, right? So agents run workflows. So I would like to show you what it looks like to run and train agents running workflows. This is how I deal with the workload that I have around agent testing. I have built a proprietary piece of software that self trains at maybe four or five different levels. And you see here a dashboard that tracks a particular workflow and its evolution over time, how many tokens it reduced each loop, what kind of change was made to allow for that to happen. You could see as well a graph that shows me the workflow distribution of which models got which tasks within that workflow and how we learned how to feather more expensive tasks to the expensive models and the cheap stuff to the cheap ones. And importantly, you'll see here on the governance page, this is my AI asking me for permission to evolve for high risk that are, let's say adding a new tool. It asks me if it can evolve because that's a very important thing, adding a new tool. And then one more. I think you'll like this one, Jason. So this is the workflow view. On this workflow view you see the phases broken down. My favorite part is that it has a blast radius. So if you have. Yes. So if you consider when you get signal from the outside world that goes into your open claw, that agent is compromised, prompt injection is impossible to prevent. So instead of focusing on trying to, let's say secure the whole system or secure on prompt injection, you focus on isolating the blast radius of that bad actor. So you see here in this workflow, it flagged that this phase is potential for bad actors. So it's flagging it. To me say, hey, maybe they should have less tools. Hey, maybe this agent should have not had the ability to do this thing because it's getting signal from an outside source. So this is what it looks like to build internal tooling to manage and train agents to improve efficiency for the workloads that they provide to the customer.
49:52
Okay, so just to separate this out and explain to the audience, you are building a startup that helps people optimize their websites. You have to build tools for those website owners and those tools need to get better over time. So you build agents to make each tool get better and be recursive. So if it one of those might be SEO advice. So when you run the SEO advice tool, you want to make sure that it doesn't cause damage and that it is actually getting better and that you're controlling the costs.
53:40
Right. So there's a couple of cool things that are feathered in here. The top of the news, the guy said his recursive loop, this is applying this at scale. So that recursive loop is happening across many, many agents that you see in here. Not only agents, the workflows, the tools they have. It's multi layered.
54:16
Right.
54:34
So my job is more like helping it learn how to run my business. That's the point at which this is now peak efficiency. I no longer have a job. Crazy.
54:35
Yeah. So it's being like a product manager, but it's got to go get intelligence from the open web. So you're using yes protocol to get better at SEO. So you have to make sure that it's not garbage in, garbage out. So how do you do that step that the LLM agent you have?
54:45
Sure.
55:05
Specifically for SEO or maybe for, I don't know if you have like call to actions on the website or copy.
55:06
Really good question.
55:13
So how do you ground data in reality? There's two ways. One, deep research is really, really effective. Two, you allow the AI to model its research in a way that it can understand so that it understands the context better. It's a better data model, easier to talk to. The focus that I was going for, the reason why I put all this engineering effort in to get this self training is because my mind has been blown since openclaw dropped. Like you, I've been listening to jazz nonstop. I can't do Anything else? Because my mind is so expansive. So this, this codex that I invented to allow myself to encode this learning, this is called Coltrane.
55:14
Okay.
55:52
Because the only thing I can use to express myself as. As he can. Because the way the agents learn, the way I can talk to the system, it learns so fast. So the way it models domains, it learns better how to build something in its language rather than relying on me to do it. It's very interesting.
55:53
Yeah.
56:10
So applying this infrastructure to dominate the QA testing market and maybe something else later on.
56:10
Oh, I was just thinking about taking the idea of self improving agents that have a stricter kind of permission structure around them so they're safer to other areas. Eugene. So if you could just throw some ideas out, like what are the top four or five areas where you think this is going to go next? I think all the founders would like to know where agents are going to get better, faster.
56:18
So I think this is something. There's two things. One, inference efficiency. So LLM's magic thing is they can do inference really well. Right. So the well trained frontier models, they have the ability to reason through deep stuff and come with very good insights.
56:36
Right.
56:52
But we are burning 95, maybe 98% of our tokens on stuff that doesn't require reasoning.
56:53
Right.
57:01
So what we'll find is that people will start to encode their knowledge about how to be more efficient with these things. What's cool about my system, the way it's coded, is that if someone makes a contribution to an agent on my system, if that agent is invoked elsewhere on another customer's, let's say, platform, they get attribution for that because they participated in the encoding of their knowledge into the AI.
57:02
Right.
57:27
So it creates attribution. It allows for you to pay somebody for knowledge work as they transfer it into the AI.
57:28
Right, I see. Well, it's super cool, man. And if people want to learn more. AIR Inc. But it's spelled E I R. Thank
57:37
you very much, EIR Inc. All right, we appreciate it.
57:46
Thanks for G. All right. Well done. And Alex, as you know, all jobs are being replaced with AI. There'll be no more employment. I've said this over and over again. That being said, we have three open job requests, so take that for what it's worth. We're doing pretty well over here and we're adding Headcount. If you want to be the community manager and you love founders and you love Discord and X communities and Slack and Circle and all these different platforms. But most of all, you love founders, you love angel investors, you love venture capitalists. You want and tech enthusiasts. You want to build that community and help us figure out, hey, of these millions of users, who are the top 1 or 2% who engage with the content the most? Email us what you're good at and what you've done. Communityaunch co. Communityaunch co. Just tell us. We don't care about your resume. We do care about your experience, but tell us what you've done in plain English. Just email us. Communityaunch.co. we're also hiring two researchers. Researchers at our venture firm are looking for. They're hunting for great companies to invest in or have on the podcasts. That's one job function. Then they're writing coverage of those companies. That's another job function. And then eventually after they're a researcher, they become an analyst and they get to get on the phone and maybe do some calls. So there's two different aspects to this job. One is hunting for companies and finding them out there. And then two is sorting through the ones we already have writing coverage. And this is the onboarding. This is the year that will tell you if you're qualified to eventually get that seat at a venture capital firm. So instead of hiring venture capitalists from the existing pool of them, we said, let's make our own venture capitalists. Let's invest 3, 4, 5 years in professional development. Rung one on the ladder. Researchersaunch co. The email, just tell us about yourself, your analytical ability, why you're passionate about this, what you think you bring to the table. Researchersaunch co. We're hiring people out of school for that position. Entry level, 50, 60, 70 hours a week, seven days a week. You're gonna try to crush it and fight to get that job in venture capital. Finally, we have a producer role here at this Week in Startups. The shows are doing so well. We launched this week in AI. We're gonna bring three other podcasts into the fold shortly and we need a producer. Produceraunch Co produceraunch co. If you have produced podcasts before and you have experience. Produceraunch co. This is not an out of school one like researchers at launch. This is a. We need you to have experience and bring something to the table. We'll see you all next time on this Week in Startups. Bye bye, bye bye.
57:49