The Artificial Intelligence Show

#189: Is Claude AGI?, AI Change Management, Nvidia-Groq Deal, Meta Acquires Manus, Yann LeCun Speaks Out & OpenAI Preps AI Device

84 min
Jan 6, 20265 months ago
Listen to Episode
Summary

Episode 189 explores whether Claude Opus 4.5 represents AGI, discusses the critical people-management challenges blocking AI adoption, and analyzes major industry moves including Nvidia's $20B Groq deal, Meta's $2B Manus acquisition, and Yann LeCun's departure from Meta to launch a new AI research venture focused on world models.

Insights
  • AI capability improvements are accelerating dramatically—task horizons doubling every 7 months—but organizational adoption is blocked by change management and workforce anxiety, not technology limitations
  • The gap between AI-forward leaders and laggards is widening at unprecedented speed; those not actively experimenting with current models risk obsolescence within 1-2 years
  • Sustainable AI businesses require genuine product-market fit beyond 'vibe revenue'; many startups mask churn problems with novelty-driven adoption that fades within 3-6 months
  • Near-term job displacement will be severe and concentrated (3-5 years), regardless of long-term abundance narratives; society lacks concrete reskilling infrastructure and entry-level role definitions
  • Major tech companies (Meta, Salesforce, OpenAI) are making aggressive bets on AI infrastructure before models are fully reliable, creating technical debt and organizational tension
Trends
Agentic AI moving from research to production: coding agents now reliably handle week-long tasks; business leaders using AI as collaborative thought partners for strategic decisionsShift from LLM-centric to world-model-centric AI research; industry leaders questioning whether pure language models are sufficient for superintelligenceDeterministic automation re-emerging as fallback strategy when LLMs prove unreliable; companies hand-coding logic to compensate for model inconsistencySupply chain leaks revealing hardware form factors (pen-based AI devices, screenless audio companions) as next frontier beyond softwareConsolidation through acqui-hire: Nvidia, Meta, OpenAI aggressively hiring research talent and acquiring startups to fill capability gaps rather than building internallyReward engineering emerging as new job category; need for specialists to define success metrics and prevent AI reward hacking in long-horizon tasks10-person unicorn companies predicted to achieve billion-dollar valuations through AI-driven efficiency; traditional scaling models becoming obsoleteJevons Paradox debate intensifying: tech leaders optimistic about new job creation vs. practitioners skeptical about near-term displacement and demand elasticityAudio AI becoming critical differentiator; OpenAI investing heavily in audio models for hardware device launch expected 2026Regulatory and political backlash anticipated as AI-driven automation increases corporate profits while reducing headcount; automation tax and reskilling mandates gaining traction
Topics
Claude Opus 4.5 AGI DebateAI Change Management and People ProblemsAgentic AI and Long-Horizon Task CompletionAI-Driven Job Displacement and ReskillingJevons Paradox and Labor Market ImpactDeterministic vs. Generative AutomationAI Hardware Device DevelopmentWorld Models vs. Large Language ModelsVibe Revenue and AI Startup SustainabilityAcqui-Hire Strategy in AI IndustryAudio AI and Voice Interface DevelopmentAI Safety and Preparedness FrameworksReward Engineering and AI AlignmentOpen Source AI Competition (DeepSeek Impact)AI Productivity Tools and Enterprise Integration
Companies
Anthropic
Claude Opus 4.5 model achieving near-5-hour task horizons; Jackson Kernion claims it represents AGI; company building...
OpenAI
Hiring head of preparedness for frontier model risks; developing audio AI architecture for screenless hardware device...
Google
Principal engineer revealed Claude Code replicated year-long distributed agent work in one hour; Gemini struggling wi...
Meta
Acquired Manus for $2B+ to boost AI productivity tools; Yann LeCun revealed Llama 4 was 'dud' with fudged benchmarks;...
Nvidia
Executed $20B non-exclusive licensing deal with Groq; acquiring LPU (Language Processing Unit) IP and talent to compe...
Groq
Specialized in low-latency inference processors; acquired by Nvidia in $20B deal; remains independent entity focused ...
Salesforce
Promoting deterministic automation over LLMs in Agent Force due to reliability concerns; reintroducing if-then logic ...
Manus
Singapore-based AI startup acquired by Meta for $2B+; crossed $100M ARR in 8 months; specializes in autonomous agents...
DeepSeek
Chinese AI lab's open-source success shifted competitive dynamics; impacted Meta's market position more than other la...
Khan Academy
CEO Sal Khan published NYT essay proposing 1% corporate profit tax for worker reskilling; early OpenAI partner integr...
Scale AI
Meta acquired 49% stake as part of broader talent acquisition and AI capability building strategy
HubSpot
CRM platform with more measured AI integration approach; additive AI features for internal use rather than core workf...
SmartRx
Host Paul Raitzer's AI transformation company; used AI co-CEO GPT to analyze strategic business paths; case study in ...
Xai
Elon Musk's AI company; co-founder Igor Babuskin tweeted about Claude Opus 4.5 capabilities; referenced in singularit...
You.com
CEO Richard Socher released 2026 AI labor forecast predicting reward engineering jobs and 10-person unicorn companies
Advanced Machine Intelligence Labs (AMI Labs)
Yann LeCun's new research startup focused on world models and fundamental AI research beyond LLMs
Epoch AI
Research group indicating AI progress rate nearly doubled in last 2 years; identified inflection point in early 2024 ...
Meter
Evaluation group measuring AI task horizons; found Claude Opus 4.5 has nearly 5-hour time horizon, highest published ...
People
Paul Raitzer
Founder/CEO of SmartRx; host of The Artificial Intelligence Show; shared personal experience using AI co-CEO GPT for ...
Mike Kaput
Co-host of The Artificial Intelligence Show; Chief Content Officer at SmartRx; discussed AI adoption barriers and pro...
Andrej Karpathy
Former OpenAI co-founder and renowned AI researcher; tweeted that Opus 4.5 is 'very good' and people with outdated wo...
Dario Amodei
OpenAI CEO; predicted in March 2025 that AI would write 90% of code within 3-6 months and essentially all code within...
Jackson Kernion
Anthropic researcher; tweeted that Opus 4.5 represents AGI as he envisioned it; motivated by building AGI and now que...
John Doogan
Principal engineer at Google; revealed Claude Code replicated year-long distributed agent orchestrator work in one ho...
Ronan Anil
Former Google engineer who led Gemini model work; now at Anthropic; claimed agentic coding would have compressed 6 ye...
Dave Holtz
Founder of Midjourney; tweeted about doing more personal coding projects over Christmas break than in 10 years using ...
Igor Babuskin
Xai co-founder and former Google DeepMind/OpenAI researcher; tweeted about Opus 4.5 capabilities; referenced singular...
Elon Musk
Replied to singularity discussion tweet saying 'we have entered the singularity' in response to AI capability acceler...
Yann LeCun
Meta's Chief AI Scientist stepping down; launching AMI Labs; revealed Llama 4 was 'dud,' benchmarks were 'fudged,' an...
Mark Zuckerberg
Meta CEO; lost confidence in Gen AI team after Llama 4 failure; hired Alexander Wang and began aggressive acqui-hire ...
Alexander Wang
28-year-old hired by Meta from Scale AI for $15B acqui-hire; LeCun criticized him as inexperienced in research leader...
Jensen Huang
Nvidia CEO; stated company intends to integrate Groq's low-latency processors into products for real-time AI workloads
Sal Khan
Khan Academy CEO; published NYT essay proposing 1% corporate profit tax for worker reskilling to address AI-driven jo...
Aaron Levy
Box CEO; published essay arguing Jevons Paradox explains AI labor impact; predicts new work will offset job displacement
Greg Eisenberg
Entrepreneur warning about 'vibe revenue' in AI startups; argues many companies mask churn by relying on novelty rath...
Richard Socher
You.com CEO and former Salesforce Chief Scientist; released 2026 AI forecast predicting reward engineering jobs and 1...
Jony Ive
Former Apple designer; championing OpenAI's hardware device design philosophy focused on reducing screen addiction
Jack Saslow
CEO of Kiraday AI transformation firm; emphasized that AI adoption success requires human change management, not just...
Quotes
"There are decades where nothing happens and there are weeks where decades happen."
Igor Babuskin (Xai co-founder)January 3rd discussion
"We have entered the singularity."
Elon MuskJanuary 3rd response
"I'm not going to change my mind because some dude thinks I'm wrong. I'm not wrong. My integrity as a scientist can now not allow me to do this."
Yann LeCunFinancial Times interview
"Technology is the easy part, finding the problem is harder. But the hardest part, the part almost nobody wants to do, is the human work of driving change."
Jack Saslow (Kiraday CEO)LinkedIn post
"If your company isn't generating significant ROI from AI adoption, then you have a people problem."
Paul RaitzerLinkedIn post
"What jobs are they training for? If you know that their technology is basically designed to replace all cognitive labor, what are they training for?"
Paul RaitzerEpisode opening
Full Transcript
2 Speakers
Speaker A

What jobs are they training for? If you know that their technology is basically designed to replace all cognitive labor, what are they training for? What are they providing $10 billion to prepare them for when no one seems to know what the jobs three, five years from now look like? Welcome to the Artificial Intelligence show, the podcast that helps your business grow smarter by making AI approachable and actionable. My name is Paul Raitzer. I'm the founder and CEO of SmartRx and marketing a Institute and I'm your host. Each week I'm joined by my co host and Smarter X Chief Content Officer, Mike Kaput, as we break down all the AI news that matters and give you insights and perspectives that you can use to advance your company and your career. Join us as we accelerate AI literacy for all. Welcome to episode 189 of the Artificial Intelligence Show. I'm your host Paul Raitzer, along with my co host, Mike Kaput. We are back after, I guess, a week away. Like we were technically kind of away for two weeks, but we did drop an episode right before Christmas. So we are recording this on Monday, January 5th. It is good to be back. This was like, I tried real hard not to work a ton over Break Mike, but then you like wake up. I get up at like on days ago. I'll get up at like 5am and start prepping for the podcast. And it was just like, all right, like back in it. But there was so much I wanted to talk about while we were away. So so many, like new models weren't dropping and stuff, you know, in the 10 days or so in between these episodes. But a lot happened and just a lot of like conversations online in AI circles that we want to get into. So we're going to do a variation of our weekly here usually. So if you're new to the podcast, this is, I'll explain kind of the format. If you're not new, you're familiar with what we do. But basically each week we do three main topics, the big three things that we want to talk about, and then we do rapid fire items and usually it's about 7 to 10 of those. So today is going to be a bit of a hybrid. We were going to do just all rapid fire to try and get through everything from the break. But there's one topic up front that just sort of took on a life of its own in some ways as I was prepping today. So we're going to have kind of one main topic, but you'll see throughout the episode how everything else sort of connects to this lead off topic. I guess so. So that's gonna be the format today. We'll have one main topic and then a couple of the other ones will go a little bit longer. But mostly we're just gonna do rapid fire style here and it probably will run a little bit longer than our usual, you know, 1:10, 1:1:5 I think is what they usually run. All right, so this episode is brought to you by AI Academy by SmarterX. AI Academy helps individuals and businesses accelerate their AI literacy and transformation through personalized learning journeys and an AI powered learning platform. There are nine professional certificate course series available on demand right now with more being added each month. There's also over 20 gen AI app reviews. We drop a new app review every Friday, so definitely check those out. If you're already a member, make sure you're checking out those new ones as they drop each week. This week we want to spotlight the AI for Industries collection. So each collection is made up of series and certificates and so there are three currently available. AI for Professional Services is again a certificate series, AI for healthcare, AI for software and technology, and then coming very soon, AI for insurance. So again, the idea is each month we will be dropping one or two AI for industry collection series and certificates. So check those out as we add more. If any of those sound great to you. If you are already an AI Mastery member, they are on demand for you right now. If you are not, you can buy an individual series. So you can just go get AI for Professional Services as a standalone series or just become an AI Mastery member or, you know, buy memberships for your team. And all of these certificates and series are right there so you can learn more about AI Academy and our AI mastery membership program at Academy SmartRx AI again, that is Academy SmartRx AI. Okay, um, we also each week we do an AI pulse survey where we ask questions of our audience. So these are informal polls to see how our audience feels about topics that we talk about. And so the questions are a prelude to a couple of the things we're going to get into today. So there is two questions this week. You can go to SmartRx AI forward slash pulse to to participate in this poll. The first question is to what extent are people problems which we're going to talk about. Fear, resistance to change, lack of buy in hindering your organization's AI adoption. So again, how much are people problems affecting your organization's AI adoption? And then the second question is how concerned are you about a fast takeoff in which AI begins to significantly impact jobs before society and the economy have time to adapt. That is going to be a big focus of today's conversation. Mike. So again, go to SmartRx AI forward slash pulse if you want to participate in that poll and then join us for episode 190 at the start of it. We will kind of give you a breakdown of how that poll plays out. All right, Mike, so like I said, not a lot of, like, breaking news. I mean, a couple of crazy acquisitions that no one was sort of like predicting at the end of the year, including one on Christmas Eve that was like, I'm sitting there like, oh, geez, look at acquisitions. Was not ready for that one. But again, just big picture topics. So let's, let's get into the first main topic of the day.

0:00

Speaker B

Yeah, for sure, Paul. It's good to be back. We're starting off with, you know, over the break there was a bunch of chatter and we'll get into this about certain model capabilities and how closely we're kind of approaching what some might call AGI. So a few different stories here and some commentary around this topic. So first, the evaluation group Meter M E T R estimated that the new Claude Opus 4. 5 model has what they call a time horizon of nearly five hours. We've talked about this before, but this metric measures the max duration of a task based on human expert speed that AI can complete successfully at least half the time. So researchers noted that their current testing suite is actually reaching its limit for measuring these upper bounds, and the result is the highest time horizon the group has published to date. So claudopus 4.5 is essentially miles ahead of their previous benchmarks in doing these long time horizon tasks. Now, a separate analysis at the same time from Epoch AI indicated that the overall rate of AI progress has nearly doubled in the last two years. The group's capabilities index identifies a sharp inflection point in early 2024 where improvement rates jump significantly, driven largely by the emergence of reasoning models and reinforcement learning. So we talked about before, that meter's data specifically shows task horizons are doubling roughly every every seven months. Which means basically if these trends are holding, researchers are predicting that AI agents could reliably handle tasks as long as a week long within the next two to four years. Now, Paul, this kind of kicked off because there was all this buzz around Claude Opus4.5 Claude codes specifically in these AI circles on X over the holidays. Some of it was spurred by these findings, others were just people. Several notable people basically saying that they felt these tools indicate that we're nearing or we're already at some type of AGI. I don't know if this was just coordinated conversation or what, but when you looked at that, what can you kind of break down for us? What were people talking about and what does it mean for where we're headed?

5:41

Speaker A

Yeah, so just to recap, like, Opus 4.5 came out November 24th, right around Thanksgiving, and we talked about it on episode 183, December 2nd. So it wasn't news that Opus4.5 was out, but when they released it, they also released an updated version of Claude code, which is basically like the coding assistant. So if you're, if you're not a coder, you're probably not experiencing Claude code and sort of feeling what other people are feeling in this space. So, you know, it's interesting. As I said, I tried not to work much during the holidays. My work usually consists of I still get up at like 6, 6:30 in the morning, and then I would, while I'm drinking my coffee, you're like checking Twitter and going through, you know, making note, flagging things, liking things, putting them in our sandbox of stuff to talk about before the kids are kind of up and moving. So I would get my couple hours of just, you know, regular monitoring of what's going on. And right around Christmas, you just started to notice this weird, like, bubbling about Claude code. Like it was just everywhere all of a sudden in, in the circles of people that I follow. And so the way I sort of went back this morning and tried to piece the timeline together is I just went back and looked at my likes because I'll usually like, and sometimes I'll retweet things or share, you know, context. But a lot of times I'm just liking things almost as like a bookmark for me, for the show. So best I can tell, I'm going to walk you through the timeline because I think it's really interesting how this did sort of emerge. So, Igor Babuskin, I'm probably saying the name wrong, but Xai co founder. So he's a co founder of Xai with Elon Musk and former Google DeepMind and OpenAI. So on December 26th, day after Christmas, he tweets, opus4.5 is pretty good. Then Andrej Karpathy, who we've talked about many times on the show, a co founder of, of OpenAI and a renowned AI researcher. He replies and said it's very good. People who aren't keeping up even over the last 30 days already have a deprecated worldview on this topic. So that one, to me, was already interesting. Like, just to have him saying this and reinforcing this idea of how quickly these things are moving. The thing we often say, Mike, about, like, today's version of AI is the dumbest form we're ever going to have in human history, reminding people all the time that the labs have smarter, more powerful versions of the models than you do. And so a lot of the times that we spend talking on the show about what these people are saying and doing, part of the reason we do that is they are seeing and experiencing the world differently than you are. They have access to things that you and I do not. And so trying to piece together what they're saying, what they're doing, what research papers they're putting out gives us the ability to look around the corner. Now, back when I started researching AI in 2011, 2012, you could see around the corner, like, 18 to 24 months. Like, there was a. There was a pretty good sense. And in some cases, you could argue we were sitting around the corner by a few years. That timeline has shortened dramatically. So now when we see around the corner, quote, unquote, it's probably like three to six months, maybe. Like, if we're lucky, that's for the people who are tuned in now, for the rest of the business world who aren't paying attention. You may still be seeing a year or two in advance of what they're seeing. But so Andres Karpathy, who everyone in the AI world pays attention to, for him to be saying it's really good, makes people stand up and listen. So now, context, if we rewind to March of 2025. So 10 months ago, Dario Amadei gave an interview at the Council on Foreign Relations. It was an event and I'm sure we talked about on the podcast at the time, but there was a quote that kind of captured the headlines from Dario. He said at that conference, if I look at coding programming, which is one area where AI is making the most progress, what we are finding is we are not far from the world. I think we'll be there in three to six months where AI is writing 90% of the code, and then in 12 months, we may be in a world where AI is writing essentially all of the code. Okay? So at that time, most people thought that was crazy. Now, we always caution people that Dario doesn't tend to exaggerate. So usually if he says something like that, he has already seen it or sees a very clear path to something like that occurring. Okay, so having this context on December 26, also day after Christmas, Jackson Kernion, who is an Anthropic researcher, tweets, I'm trying to figure out what to care about next. I joined Anthropic four plus years ago, motivated by the dream of building AGI. I was convinced from studying philosophy of mind that we're approaching sufficient scale and that anything can be learned in a reinforcement learning environment. And so now I feel like Opus 4.5 is as much AGI as I ever hoped for. And I'm not sure I know what I want to spend my waking hours focused on. And then he goes into, like, some of the things he's thinking about, like specifically around, like, alignment and safety is one of the key things. So the next day after this tweet blows up, because now you have someone at Anthropic saying, yeah, we basically kind of got the AGI, I think, with Opus 4.5, like, what am I going to do next? So he said. Some reactions to my AGI framing are a refreshing reminder of people's current experience with chatbots. To use Claude code is to see Claude write arbitrary software, run into errors, reliably fix them, make helpful suggestions, and perfectly follow any given instructions. We don't yet have a great experience like this. For non coders, try Claude code in the Claude app on non code tasks though, is what he said. But I think that's largely an app problem, not a machine intelligence problem. So, Mike, I'm going to stop there for a second. I'll keep going because it got more interesting. But this is where the AGI thing started to sort of emerge again from best I can tell. And I don't know if you were paying attention as this was going along throughout the break too, but this is where I really started to kind of. My ears started to perk up. I'm like, okay, this is going to be an interesting break.

7:56

Speaker B

Yeah, exactly. I started hearing these signals too. And honestly, over the last couple months, hearing people talk about Claude Opus 4. 5 led me to really dive back into it. And I don't know what it is.

14:09

Speaker A

It is.

14:21

Speaker B

Again, like we've said a lot of times, it's really just kind of going on vibes. But there's something different about this model. And maybe it's just simply the personality or the way it works with how my brain works, but I had a few moments where I took a step back and was like, whoa, okay, that did a lot of things that I didn't exactly expect it to do in a really, really high level of competence. So I can assume on the coding side it's supercharged and that's what's driving this discussion.

14:21

Speaker A

Yeah, and as we've said, the precursor to the disruption of all knowledge work is coding and AI research, because that is the most valuable thing for these labs to build. So they're fine tuning models to be great at doing this. So if 90% of all code is being written by the AI, that enables them to generate more code, take more shots on gold from an AI research perspective. So this is the first domino. So then we go to January 2nd. So you know, there's other things happening, there's other tweets to note, but the one that then jumps out and just kind of seems to blow up is John Dogan, principal engineer at Google. So this as of this morning has 7 million plus views on X. So she tweets, I'm not joking and this isn't funny. We have been trying to build distributed agent orchestrators at Google since last year. There are various options. Not everyone is aligned. I gave Claude code a description of the problem it generated, what we built last year in an hour. So now this is a principal engineer at Google, one admitting that they're using Claude code, which they should be, they should be testing the other models. But then two, just straight up saying like, it did what we just spent a year doing. So then she continued, it wasn't a very detailed prompt. So someone said like, well what did you give it basically? So she replied, said, it wasn't a very detailed prompt and it contained no real details given. I cannot share anything proprietary. I was building a toy version on top of some of the existing ideas to evaluate Claude code. It was a three paragraph description. Someone then replied and said, when will Gemini get to this point? And she replied, we are working hard right now. The models and the harness. Now I got to be honest, I didn't actually know what the harness meant. So here's a little tip. If you are an X Twitter user, Grok is phenomenal as, as an integrated element of X. So anything you don't understand, you can literally just click the Groq icon and it'll like pull up and then you can just talk to it. So I said like explain harness for me. So harness refers to the agent harness or orchestration harness, which is the software infrastructure and frameworks that surround a foundational model like Gemini or Claude to turn it into an effective AI agent system. So key aspects include tool calling, memory management, multi step reasoning loops, context persistence, execution, control, verification. So basically what this Google researcher is Saying is Anthropic crushed this. They managed to take the base model and then build this infrastructure around it, which everyone is trying to do. So that was January 2nd. Then also on January 2nd, Ronan Anil and I don't know if that. That's his actual name or if that's just the Twitter name. I didn't have time to check into that. But this is also January 2nd. He says, My current theory is that everyone was secretly using Claude code to do real work and seeing improvements from Sonnet over Opus. Then once Karpathi said he used it over the holidays. So going back to our first Tweet on the 26th, everyone else followed, my entire timeline is Claude code heaven. So this then blows up because someone says, even Google Engineer, and then retweets the JHNA post that we just talked about. So then Ronan replies, I used to be a Google Engineer tool too. Leveled up all the way and feel if I had agentic coding and particularly opus, I would have saved myself first six years of work compressed into a few months that then blows up. So then he comes out, he's like, hey, full disclosure, I work at Anthropic now. But like I was at DeepMind in January. He led the work on the Gemini models and worked prior to that in the Google Brain team on foundational research around training algorithms. So this is not some like random person on X. So now we have a current Google Engineer, we have a former Google Engineer who is a lead on building Gemini, who is now saying, this is different. Like we have entered a different realm. Then on January 3rd, so just two days ago, we have Dave Holtz, who is the founder of Midjourney. So he tweets, I've done more personal coding projects over Christmas break than I have in the last 10 years. It's crazy. I can sense the limitations, but I know nothing is going to be the same any anymore. To which Igor, who we talked about earlier is the co founder of xai, said, there are decades where nothing happens and there are weeks where decades happen. To which Elon Musk replies, we have entered the singularity. So we've talked about, you know, Ray Kurzweil's books on the singularity and the singular is near and near. So the singularity is again, I'll pull Grok into this. Like said, hey, Grok, explain the singularity. It's the hypothetical future point where technological growth, driven primarily by AI, becomes uncontrollable and irreversible, leading to profound and unpredictable changes in human civilization. So we've talked about the Singularity on this podcast many times. The basic premise was first we would get to AGI where it's basically, you know, human level, average human level at most cognitive tasks. Then we would get to superintelligence where it's beyond human level at all tasks. And then we end up in this self improving realm of the Singularity. So going back to Amade in March, when asked about the impact of, well, if you're doing 90% of your code with AI, like, what happens? And he said, I think that eventually all little islands, meaning all forms of work, will get picked off by AI systems. And then we will eventually reach the point where the AIs can do everything that humans can do. And I think that will happen in every industry. I think it's actually better that it happens to all of us than to kind of pick people randomly. I actually think the most societally divisive outcome is if randomly 50% of jobs are suddenly done by AI. Because what that means the societal message is we're picking half, we're randomly picking these people and saying, you are useless, you are devalued, you are unnecessary. So then the interviewer says, and instead we're going to say, you're all useless. Laughing. Amadeus says, well, we're all going to have to have that conversation. Like we're going to have to look at what is technologically possible and say we need to think about usefulness and uselessness in a different way than we have before. Our current way of thinking has not been tenable. I don't know what the solution is, but it's going to be different. We're all useless. Right? We're all useless is a nihilistic answer. We're not going to get anywhere with that answer. We're going to have to come up with something else. So, Mike, I'm going to share a. To kind of wrap the session. I'm going to share a personal experience to take this outside of the coding realm. But did you have any thoughts on any, any of those notes or tweets or anything before?

14:50

Speaker B

No. I mean, I think you're about to get to this, but I do. It just feels like a weird tipping point in using at least opus 4.5 specifically for some of the knowledge work and complex documents and strategies I've been working on, especially over break. Yeah, I think the models are all within what, three to six months of each other. So this isn't Even like an Opus 45 like promotion here. It's this idea that something feels like it changed a little bit. And it sounds like you might have experienced that too.

21:42

Speaker A

Yeah, so I will. And again, like, I, I get. And I know our whole staff listens to this podcast and sometimes our staff learns about things through the podcast. And so like, I'm going to preface this with I haven't talked to anybody about this. Like, literally, I haven't. I really interacted with the team much over the last 10 days. I've been pretty much, you know, just kind of hanging with my family and, and playing games and going to the gym and, and reading stuff online. So again, in my quiet moments though, in the mornings, I was in business planning. And so I've shared this story before, but I have a co CEO GPT I built. I actually, the template to do this is online. I shared it publicly. I did a whole webinar on how to do this. There's a class in our academy of how to do this. So I'm just going to preface this by saying this co CEO is a GPT I built that has knowledge about our company, our revenue plans, our growth plans, our structure, what matters to us, our value system, things like that. So the co CEO already has a knowledge base of some context. So, so I would say our, our business, SmartRx, is at a bit of an inflection point. We consider ourselves sort of an AI transformation company. Our, our job is to drive transformation for individuals and for businesses. We do that, you know, as the, the foundation is we are a media and research company. So we, we create content, create value through things like this podcast, through doing research and sharing that with people, builds an audience. That then our revenue model is largely education and events. That's how, that's how we grow the business. So it, it, the business is going very well. It's growing very quickly. This business is already two to three times the size of my agency was when I sold the agency. So this is a much bigger business already than what we built back then. So right before break, I had meetings with my accountant and my attorney to explore a few paths forward of directions we may go with the organization. When I look at the future of our company again, I'm putting my CEO hat on here, but translate this to what you do. Whatever the problems you're trying to solve is the growth challenges you have in front of you, the strategies you have to build. So as I'm explaining my scenario, put, put this, put yourself in, you know, your situation as you go into this year. So there are unknowns and complexities to both paths forward. So I would Say there's like two these two paths that fall outside of my expertise. So for me as the CEO, I have to like think about finance, I have to think about HR, operations, legal, customer success, sales, etc. Like I don't have leaders in place in all those areas. And so I have to be the Chief X in many of the cases for our company. Plus my experience and the experience of our team is a bit of a limiting factor because we're scaling a company to potentially hundreds of thousands of customers is not something I or anyone on our team has ever done. So I am, I am in every day I wake up in a place I've never been in before to have to figure out how to build a company. So without getting into a bunch of proprietary detail, we'll call Path a scale and path 2 or path B hyperscale. Both are really good scenarios to be in, but one scales pretty quick and one scales much faster. So I honestly like I'm at a crossroads and I'm trying to make a lot of really important decisions simultaneously and I'm not sometimes sure which domino has to fall first. And so who do you talk to about this stuff? I mean I have great attorney, I have amazing accountants, I have advisors in other areas, I have a lot of friends who've done a lot of amazing things in business I can reach out to for specific things. But I do not have a co CEO like someone who's been through this. But I do have ChatGPT 5.2 thinking in my co CEO. So I started. I won't get into all the proprietary context but I basically said here's Path A scale, Path B hyperscale. Let's analyze this together as a starting point, create a list of all the questions I should be asking when comparing the two paths. Then we will go through each of those questions one at a time. This was like Christmas Eve morning. I want to say that I started this thread so it then in about a minute or so comes back with 56 questions. It breaks it into 10 categories of North Star and personal objectives, strategy, clarity, market and competitive dynamics, growth model and unit economics, financial realities, partner alignment, stakeholder politics, funding, governance, business structure and tax considerations and execution readiness. So I would estimate I spent over break maybe 10 to 15 hours talking to CO CEO in this thread each morning and then periodically throughout the day when I would have another thought I had but I had not co CEO. Like if I hadn't had this GPT we're talking about 200 plus hours of manual research, note taking Summarization writing. I mean, there are. I haven't counted, but I would guess there's somewhere north of 20,000 words in this thread, which, for equivalent purposes, the book Mike and I wrote is 50,000 words. So, like, over break, I had this conversation that lasted probably half the equivalent of half of a book in the end. And I'm not done yet. But like, in the end, as of my stopping point yesterday, I produced six documents that each would have taken me five to ten hours to create. Once I edit and finalize those documents, Co CEO will have written 95% plus of the final words. So I guess my point here, Mike, is the state of AI is not just a story of automating AI research and coding. Like we're talking about coding because that is the thing all the labs are focused on, right? But the business world and the future of work has fundamentally changed already. Most practitioners and leaders just aren't aware of how much. But I know you and I both pursue this in a very similar path, Mike. And so what I'm saying is when I produce these final documents for my team, I'm the expert, I'm the CEO, I'm the leader. It has to be my voice, my tone, it has to be me. And it will be, because I'm the one that pushed the model. I'm the one that thought of the questions to ask and then took it down different paths and said, you know what, let's not go to the next 10 questions, let's push on this one. Because now I'm thinking differently about something. And when I started five days ago, I thought path A was this. And now, now I'm actually thinking path B is possible. But now I need to solve these five things There chat. GBT would have never gotten to the end game without me, without a collaboration, I guess, is what I'm saying. But in the end, the final output is 95% plus what ChatGPT wrote, not words I personally wrote. And I don't see anything wrong with that from a leadership perspective. Like, the whole point is to get to the end game and to do this the right way and to solve for the customer and to build the right team and to put the right funding in place, like make all these decisions, not spend the first quarter thinking about it and researching it. I want to spend the first quarter doing the thing. And so the faster I can get to the point where I can do something, the better it is for everyone involved, all stakeholders. And so I. I guess what I'm saying, Mike, is I'm making the argument that I think the same that we're seeing in coding probably happens in business if you know how to use these tools.

22:11

Speaker B

Yeah, couldn't agree more. I have very similar experience with a few things I've been working on. Nothing nearly as complex as what you're talking about, but I found that my end result was basically 95% written by an AI model. But not only did save me 90% of the time it would have taken, but the output is a million times better because I spent a huge amount of time on what I should actually be doing, which is thinking, reasoning, verifying AI outputs rather than figuring out how to do the research, figuring out which questions to even ask. It truly was. I mean, I've always been impressed over the last few years by the technology's ability to act as a thought partner. Something. Something changed in the last three to six months, maybe. It's breathtaking.

29:47

Speaker A

Yeah. And I think, like, the. Again, the, the point we're making here is not to, like, just let AI do everything. Like, what this did is I couldn't call my attorney during break, or I couldn't call my banker. I couldn't call these people at, like, random times. I was thinking of stuff. But if, If I didn't have this tool, if I didn't have this CO CEO, I'd be here on January 5th, like, all right, like, reach back out to my attorney. Like, okay, can you re. Explain that concept? I don't need that anymore. I did my own work on it. I understand the concept now. Now I can give my attorney direction and say, okay, here's. Here's what I've decided. This is where we're going to go. And this is what I now need you to do, because now I need your expertise that me and co CEO don't have. So you're shortcutting to get to the final thing. And like, what is the final thing? And sometimes when it's publicly created, like this podcast, the final thing is very human and it is not being done by the AI. And so, like, I want to. I don't want to spend three months figuring out who to hire and job descriptions and all that. I want to spend three months interviewing people face to face and meeting these people and finding the right people to put into place to enable path A or path B. So I want to get to the human part of this. I don't want to spend three months on the stuff that the AI is just really good at.

30:30

Speaker B

Yeah, it's. It's a really exciting time. I'M super excited to see how this evolves over the next quarter even.

31:40

Speaker A

Yeah.

31:47

Speaker B

All right, so next up, we are starting to see some signals that companies are failing to generate significant ROI from AI, likely because they are not facing a lack of technology, but more of a people problem. So, Paul, you actually wrote this out really well in a recent post about how pressing this issue is. He said, quote, if your company isn't generating significant ROI from AI adoption, then you have a people problem. You also argued that employees often view AI as a threat to their jobs or a replacement for or fulfilling work, which creates resistance that can only be overcome through education and empathetic change management, not just through handing employees better AI technology. Now, this was backed up by a really interesting post from Jack Saslow, CEO of an AI transformation firm called Kiraday, who wrote, quote, this is what Silicon Valley doesn't understand about AI transformation. Technology is the easy part, finding the problem is harder. But the hardest part, the part almost nobody wants to do, is the human work of driving change, sitting with people, earning trust, refining the product until it fits their hands, pushing until adoption actually happens. So, Paul, I love the post you put out. This obviously just hits really close to home because we've seen this pattern play out and experienced it for nearly 20 years in the marketing world, right? You give people marketing automation software or CRM systems. We saw this on the agency and consulting side, and nothing happens unless you actually shepherd them through the change, which is why transformation is so hard and often so expensive. So I'm curious about your thoughts here, why this is so important now. What is Silicon Valley missing here?

31:48

Speaker A

Yeah, and I think in the context of what we just shared, this, this makes a lot of sense as to why I put this on LinkedIn and why it sort of hit home for me at the moment. So I was in the midst of going through this co CEO conversation over break and in the midst of watching these Claude code posts happen. So Jack's post was on January 1st. So this is New Year's Day. I'm, I'm reading this post. I'm like, man, this nails it. This is exactly what we've been talking about. He, he breaks down three things working together to get meaningful ROI of business acumen to find the real problem, technical skill to build a solution that works, and then people skills to drive behavior change. And he said, these capabilities rarely exist in one person. They barely exist in most teams. So I, I, again I, I am the example here. Like, I don, have all these expert skills. I have general capabilities in a Lot of areas required of me to run this company and grow this company, but I'm not an expert in those things. And even across our team, we don't necessarily have the full expertise needed. But because we understand what AI is capable of, because we know to ask the questions, we understand what problems could potentially be solved more intelligently, we can drive that change. And we have a company of people willing to use the tools to drive that change. But Mike, you and I interact with organizations every day that are filled with people who do not want to do this. They, they do not want, for different reasons to embrace AI. Maybe they're just not convinced it's important enough yet. Maybe they do fear for their job. Maybe the leadership just hasn't committed yet and so there's no motivation for them to do it. Whatever the reasons are, it basically comes back to a people problem we have demonstrated already. Again, just the co CEO example, some of the things you're sharing, like the technology is already good enough. If it stops today, it is good enough to completely transform your work. Regardless of what your role is or what industry you are in. If you do knowledge work, it is already at the transformational period. If you are not seeing that transformation and the ROI that is a people problem, either the leaders are the wrong leaders who don't get it and aren't pushing hard enough, or your people within the organization are not understanding the significance and what's going to happen to their careers and they're not having a sense of urgency to do it. So it is not a tech problem unless your company refuses to provide the technology to the people. So that was again, like, I'm like over the break, I'm just wanting to explode because it's like you're seeing this. Like I'm living it in the moment, seeing the transformation in our own business. What's possible to do, I couldn't have done a year or two ago until we had reasoning models. I couldn't have done what I was doing over break, right? And so like we see it and like we look at all these other companies and all these other people's jobs, it's like they don't see it yet. Like they're just missing it. And it goes back to that example of, you know, Andre's quote about like sometimes like 30 days is all it takes and it's like the world changed and you just, you didn't know. So if over break you literally just like went offline and weren't paying attention to anything, like, good, like I'm all For that. Like, I, I hope our team did that and unplugged over break. I, I don't have that ability. Like, I can't mentally do it because I know sometimes that it just takes a day, two days, three days. I mean, like, half of what I just shared happened on New Year's Eve, Christmas Eve, Christmas Day, New Year's Day. Like, I still spent 20 hours of my family those days, but, like, I learned the things I needed to learn in the quiet moments.

33:29

Speaker B

Yeah, for sure. And I wonder too, if this is why you see individuals racing ahead with AI. Right, because there's no real barriers in your own life to be experimenting with this, except time. I mean, you don't have any change management to do except changing your mind and your habits, which are big, but you don't have a lot of barriers. So you see these people saying, like, look, there's these wonders possible that you're not seeing in your company because it's just a very different level of velocity that you're able to do if you're a solo programmer, an entrepreneur, a leader, whatever.

37:00

Speaker A

Yeah. And I, I just, I'm more convinced than ever that the AI Forward professionals and leaders, the gap between them and everybody else is going to get so significant so fast. And the AI native and AI emergent companies, or like the AI Forward companies have, have just a massive, massive advantage right now. And I think it's going to be a very, very disruptive year for the people and for the companies that continue to sit on the sidelines and not go full steam ahead again, just what we're living ourselves each day. I, I don't know how you compete with people who have the knowledge of what these things are capable of and apply them to build their businesses. Like, I, I just don't, I don't get it. Like, how, how they don't become obsolete in like one to two years.

37:30

Speaker B

All right, next up, OpenAI has opened a search for a head of preparedness to manage the emerging risks associated with its most advanced AI models. CEO Sam Altman came out describing the position as a, quote, stressful job that requires jumping into the deep end immediately. And he noted that while models are improving quickly, they are beginning to present real challenges. So this role offers actually a base salary of over half a million dollars plus equity, and involves leading the company's technical strategy for tracking frontier capabilities that create risks of what they would call, quote, severe harm. So according to the job listing, this new executive will oversee the preparedness framework at the company, designing evaluations and safeguards for all these high stakes areas where AI can present troubling capabilities, including things like cybersecurity, biosecurity and deceptive behavior. So Altman explained the company needs a more nuanced understanding of how these expanding capabilities could be abused. And he cited specific concerns regarding systems that can self improve or find critical computer security vulnerabilities, as well as some of the early signs that AI can have an impact on mental health, which the company was dealing with in 2025. So Paul, I guess what struck me here is like you don't create a role like this at this salary unless you genuinely believe the next models that you're releasing or have already could cause physical or systemic harm. So whether you agree with OpenAI's assessment or not, it seems like they are putting their money where their mouth is. How are you looking at this?

38:19

Speaker A

I definitely think it's a sign that they are very confident that they're very close and that they need to take their preparedness framework much more seriously. So you can go read the framework, but it was at least the current version was released in April of 2025 and the one thing I'll call out that I bold faced in revisiting it was of the tracked categories you mentioned to two of the critical ones, the biological chemical capabilities, cybersecurity capabilities. But one of the other ones they highlight is AI self improvement capabilities that in addition to unlocking helpful capabilities faster could also create new challenges for human control of AI systems. So this is something we've talked about especially in Q4 of last year and you'll hear a lot more about this year is self improvement meaning the models can improve themselves and the other component is continual learning. And there was actually quite a bit of buzz over break in AI circles that Google has in particular has made some advancements that have not been publicly acknowledged yet around continual learning meaning and again this is a topic we talk about on the podcast numerous times. But when these models sort of come out of the oven, when they're, when they come out trained through their pre training phase, they don't then have the ability to learn. They're kind of like frozen in time. And the way they then provide answers to you or help you is by using tools like search and things like that. If the models can come out and then continually learn and make updates to their own knowledge base that is a major unlock and it a lot it leads to other things like memory and this ability to do self improvement. So there's, there's a lot of chatter around Self improvement, continual learning that tell me that the, the labs are much farther along in both of those areas than we might be aware of. And so I think all the labs are starting to take this area very seriously.

39:57

Speaker B

Next up, in a new guest essay for the New York Times, Khan Academy CEO Sal Khan argues that AI is poised to displace workers at a scale the public does not yet fully realize. So he opens this editorial or essay with a report from a venture capitalist he knows regarding a major call center in the Philippines. They basically recently deployed AI agents capable of replacing 80% of their workforce. And Khan argues this rapid automation threatens a sector that in that country currently generates up to 10% of the GDP in the Philippines. Now, beyond call centers, he warns that autonomous vehicles and robotics will soon reduce human labor demands in diverse fields ranging from long haul trucking to software engineering. Now, these are all examples, but really the crux of this is that he, he proposes what he calls a quote, 1% solution. To address this, he's calling on companies that benefit from AI and automation to dedicate 1% of their profits, not revenues, their profits, to worker retraining. He estimates that if the world's largest corporations participated, this initiative could create a 10 billion annual fund. This capital would support a centralized nonprofit platform designed to finance apprenticeships and verify skills for high demand roles in industries like healthcare, construction, and education. So Paul, what struck me here is kind of his framing. He said, you know, if companies don't do this, they're going to face backlash once we start seeing job loss, they might face backlash from regulation, taxes, bans on AI. And he also notes that if AI causes mass unemployment, there won't be anyone left to buy their products anyway. So what did you take away from this? I did find it notable, like Khan Academy is a huge education partner. They use AI plenty. They're not anti AI at all. It was pretty striking.

42:00

Speaker A

Yeah, they're one of the early OpenAI partners. They infused ChatGPT and Conmigo, which again is like a learning assistant built right into the platform. Yeah, I mean, obviously I'm a big proponent of AI literacy and reskilling and upskilling. I like the idea of a very tangible, like 1% tax. You know, he's referencing specifically the world's largest corporations have combined profit over a trillion dollars and you know, 1% would be nothing to them. I do agree that there's going to be backlash as profits skyrocket and staffing levels are reduced. So take your big tech companies. We've already seen, you know, Amazon has alluded to many more layoffs coming meta others like they're going to keep reducing staff regardless of what some of these other topics we'll talk about believe staffs will be reduced and profits will increase. Wall street will be very happy when that happens, which means more profits, you know, for these companies. And that eventually will cause, you know, political upheaval and societal revolt against these tools. Like this is. Again, I don't want that to be the outcome, but I don't see how it's not the outcome in the near term. I had mentioned, I think in fall something about an automation tax. So as these layoffs happen and as robotics and AI agent automation replaces people that these companies need to be paying probably some sort of tax. Again, I don't know where this ends up falling politically, but they're, they're not going to be able to just benefit from reducing the human workforce in favor of profits forever without backlash. So I, I think like, I will say like I've had, I see how I can frame this. I have made proposals to some organizations about the need for very high level funding of reskilling and upskilling workforces and economists. And I would say the response has been pretty lukewarm at best in the past. Like, I think it was maybe just too abstract or like people didn't want to admit the amount of disruption the technology was going to have. And so they just weren't ready to have these conversations. But I do think at some point this is an inevitable, like some version of government funded or, you know, private funded. You're going to have to get into this. We see it like in Ohio there's like a $40 million called Tech Cred which provides, you know, reimbursements to companies that invest in technology education. I know of like eight states that have something similar. I think Canada actually has a program somewhat like that. So I, whether it's state done or federally done, I think you're going to see a lot of movement around this. And I think these tech companies are going to wake up this year to the fact that they need to be putting hundreds of millions of dollars behind this. The question becomes training for what? What jobs are they training for? If, if you know that their technology is basically designed to replace, I mean we'll use the term augment for now, but like slash replace all cognitive labor. What are they training for? What are they providing $10 billion to prepare them for when no one seems to know what the jobs three, five years from now look like? That, that's, that's the big challenge, I would say is a space I think deeply about and I, I'll probably have more to say down the road, but I'm glad he's. I'm, I'm glad Salkhan is saying these things and proposing ideas like I think we have to have more of this kind of discussion.

43:53

Speaker B

All right, next up we're going to talk about something called Jevons Paradox, which sounds like kind of a fancy term, but real quick. It's a 19th century economic theory that basically argues that historically efficiency improvements in resources in Jevons Paradox, the example is coal have consistently led to massive increases in demand rather than reduction. So the reason we're talking about this is that Box CEO Aaron Levy, we've talked about a bunch, basically published an essay saying that Jevons Paradox explains the future impact of AI agents on the labor market. So he argues that that AI agents are bringing this dynamic to what he calls non deterministic knowledge work. They're automating complex tasks like coding, market research, contract review. And by dramatically lowering the cost of investment for these activities, Levy Produce predicts that small teams will soon possess the capabilities of a Fortune 500 company from a decade ago. Now, here's the crux of this. He says that based on the paradox displacement concerns about jobs are overblown. He concludes that the vast majority of future AI computing power will actually expand demand dramatically for all sorts of jobs and work, and it will end up being spent on work that doesn't even currently happen yet. So, for instance, software projects or medical research, they're currently too expensive to justify. So Paul, this idea that is one side of the job discussion or debate is that AI is going to create so much demand and so many new roles that we don't have to worry at all about losing jobs. Do you agree with that?

47:30

Speaker A

I don't. That's not going to come as a surprise to anyone who listens to the podcast. I'm a big fan of Aaron's. Like I, I love what he's doing. He's one of the most intellectually engaged technology leaders. I would say on this, I would love for him to be right. This is a talking point that's echoed by others like Satya Nadella is a big fan of referencing the paradox. It comes up all the time with the tech leaders. I, I just don't think that it's going to play out the way they hope it does this time. And I don't like you and I talked offline Mike about this before we got on it's like I, I don't understand Aaron's angle here, like where his optimism comes from. And I, I want to like, I, again, I love reading these other perspectives that you know, maybe kind of lean toward that, that I'm, I'm not seeing something that I'm just like missing how this plays out. And so I, I love this and I encourage people to read it and again form your own perspective and consider historical instances where we've learned and is there something from that past that we can pull forward? I, I just don't, I, I don't know why he, he would be so optimistic. We've talked about like David Sachs. I get why Sachs has to be so optimistic because he's part of the administration and he has to toe the political line. Like I get that. Yeah, I don't understand the people who seem to be neutral have being so overly optimistic. And maybe what it is, Mike, is that they just gloss over the near term pain. So let's say like they're realistic that three to five years is probably going to suck. Like there's going to be lot and, but in the end, and maybe they're just thinking five to ten year terms. All this new stuff we can't imagine we created. And I don't, don't, I don't disagree with that. Like I do think the innovation is going to be incredible and it's going to create all these jobs and all these businesses. We can't fathom. Like that is absolutely true, but both things can be true at the same time. That 10, 15 years from now, maybe there is this amazing future of abundance and life is amazing for all of us, but three to five years is gonna suck for millions of people. Like I, I think that's the part they just for whatever reason refuse to admit or refuse to like drill into and they just gloss over the shitty part to get to like the future of abundance because it's just a better mind frame to live in. I guess. So. I don't know. Like I, again I would encourage people to read it. I, I think there's just flaws in the argument anytime someone brings this up, like demand for knowledge outputs is not infinitely elastic. Like just because we can create more doesn't mean there's more demand for like attention for content online, for distribution of your product or services. Like just because you can create more doesn't mean people are going to be there to buy more. Especially if they're one of the millions who don't have a job in 12 months because of AI, you know, I just, I don't know, I think there's just a lot of flaws around how people think about the speed with which this is going to happen when they try and connect it to past transformations and general purpose technologies. It's just going to happen so much faster than this, than society can prepare for and the economy can prepare for. So again, I am, I try to be as optimistic as possible and I do believe I wouldn't be doing what I'm doing if I didn't believe there was a possible, like really optimistic outcome. Yeah. But I think we're all fooling ourselves if we think that there is a very, very messy transition to get to the future of abundance.

49:07

Speaker B

Yeah, I couldn't agree more. I have to do a lot more research on the differing perspectives here. But I just think about, like, let's take marketing as a quick example. If there are agents next year or this year that do a lot of this work that I would need humans for, it's like, sure, I may pay an elite tier marketer a top wage to help me out with the final 10% of whatever we need to have done, but there are a lot of marketers in between to get to that point that I didn't hire. And so, and maybe the demand for the intelligence is there. I'm paying an AI company, not the marketer at this stage.

52:44

Speaker A

Yeah. And I will say again, like I'm, I'll just be real about some of the things I've been thinking about at Smartr X. Part of what I've been working on is the organizational structure at different levels. You know, at 20 million, at 50 million, at 100 million, at 10,000 customers, 50,000 customers, 500,000 customers, like what does it look like? And the thing I will say is like, I'm trying to find entry level roles in there and I, I cannot right now find out what those are because what I know we need is human expertise and experience that can work with and build and manage agents and that know what questions to ask of the AI and then what to do with that. And so my challenge becomes when, when the, let's say senior level person, you know, senior level, executive level, whatever, director level and above, when they are able to talk to these things and then say, okay, go do it to your point, Mike, go build the campaign, write the emails, create the landing page, allocate the media, spend, create the ads, buy the ads, go do the things that we used to have entry level people doing. I'm not convinced that the AI agents can't do all of those things in the next one to three years. And so as someone who plans to employ probably a hundred plus people, I'm not clear yet what the roles are for professionals with less than five years experience. And I hate saying that, like I, I'm, I'm truly trying to figure out what will those people do because those are the future leaders of the company. But I, I don't know what the role is for them. And middle management, it's tough. Like I don't know what they're going to do. So again, I just don't know. The tech leaders sit down and think deeply about marketing, sales, customer success, operations, finance. Like I think they think about the technology and that we, it just always works out and I don't believe that's what's going to happen in the near term.

53:16

Speaker B

All right, next up, we have entrepreneur Greg Eisenberg warning that the tech industry is going through a phenomenon he calls, quote, vibe revenue, or income driven by curiosity and novelty rather than genuine product utility. Eisenberg argues that many AI companies are currently masking serious churn problems because there's this initial wave of vibe revenue that mimics the growth curves of true product market fit. So according to his analysis, customers often sign up for, you know, the cool new thing, but then cancel subscriptions within three to six months once that wow moment fades. He notes that founders frequently blame current model limitations for this churn, missing the reality that better models do not automatically create daily user habits or higher switching costs. Eisenberg concludes that sustainable businesses are those used on what he calls, quote, boring days and stressful days, whereas companies relying on novelty will eventually hit a retention wall that could render their valuations and employee equity worthless. So, Paul, that's a bit of commentary on AI startups, but how do you think about this idea of separating genuine AI product market fit from vibe revenue when you're looking at the market?

55:20

Speaker A

Yeah, it's, I thought it was an interesting one and I, again, I was like over the break I was listening to a lot of podcasts and actually like an audiobook. I was reading Blitz Scaling and then I was listening to the Lenny's podcast, which has like great go to market stuff and, and, and I think just contextually for me it's like a spot I've been thinking a lot about because even with our own growth, like growth and sales isn't a challenge for our business. Like the way it's been growing, it's all been inbound to date. You know, it's a pretty Fast scale. But I think about like you can't build a business and then just always keep replacing it with new stuff. Like you have to build like a world class customer experience that drives expansion and retention and like true value. And so just this idea, you know, of the vibe revenue and just kind of almost masking problems with these. I don't know, it's like a, it's a fascinating thing. And again, I think from a go to market perspective, it's what business leaders go to, market leaders really need to be thinking about. It's just where we're at and what we're now, what is now capable of being built and tested and then, you know, but staying true to like the reality of building a sustainable, meaningful, impactful business that actually makes a difference and, and gets people like wanting to be a part of it, wanting to, you know, come back each day, each week. So yeah, so it's a good read.

56:33

Speaker B

All right, next up, a report in the Information is causing ruffling a few feathers at Salesforce because it's reporting that Salesforce executives in certain cases are advocating for reduced reliance on large language models, especially in their product agent force due to reliability concerns. So according to a report from the Information, company leadership is now promoting what they call deterministic automation, which is decisions based on predefined instructions to counter what they describe as the inherent randomness of generative AI. So basically a senior vice president at Salesforce as part of this report noted that the industry quote, had more trust in the LLM a year ago, but that practical deployments of the technology have exposed limitations. So for instance, they're seeing customers often require these like deterministic triggers to ensure that the LLM performs the same way consistently a hundred percent of the time. Which as we know LLMs are not always great at doing. So Salesforce's CTO for Agent Force actually explained that LLMs often begin dropping instructions when given more than eight commands and consequently the company is reintroducing in some cases basic if this, then that logic to lower costs and guarantee these agentic workflows follow exact steps. So Paul, the PR response actually to this article is pretty interesting. On one hand, Salesforce seems to be saying in certain cases they're falling back a deterministic logic. It's not really AI to make sure AI works consistently, but they also claim they're resolving 90% more customer issues with agents and they're really just kind of optimizing where the agent works and where it doesn't. But that is also a Very different message than they've been touting in the past, which is AI agents will transform your business and do all this work for you.

57:57

Speaker A

I think a few things are going on here. One, I think Salesforce is going to learn the bitter lesson which we've talked about on the show before, which is basically the concept that you know, if there's a flaw in the AI now you know it'll get resolved by smarter AI and that you trying to think you can human craft these task specific things or these specific ways for it to behave over time gets obsoleted by just smarter models. I think there's a catch 22 here though for Salesforce, which is they've kind of painted themselves into a corner where they've bet everything on AI that isn't ready fully yet to do what they're envisioning in it and selling it to do. And while the AI might get there in six months, 12 months, 24 months, whatever it is, like there's just a few more things that need to be solved with these models to get to the level of reliability that's expected if you're using it as a CRM, you know, integration, they can't wait around for that. So they're going to go try and hand code a bunch of stuff. I, I don't know. Like I, I, I just, again, like I, I'm, I'm, I'm not like the technological expert, I'm not in these systems building it. But when you zoom out, this is just one of those things that just doesn't pass the like the smell test. Like I, I feel like this is gonna not probably work out the way they're hoping it does by trying to solve a technology that isn't meant to be deterministic and trying to make it deterministic. And I don't know what the answer is. Like I don't, I don't know that there's a better path for them at the moment because of the bet they've made. But Microsoft's, you know, kind of been in a similar boat. Like there's just these tech companies that made these massive bets, went in very early on LLMs and infused them in and they, the tech probably wasn't there yet when they made those bets.

59:47

Speaker B

Yeah, and almost too the backlash of taking this position is not only it might not work, but also admitting that this thing you've hung your hat on may not be as ready for prime time in certain cases as you've promoted. So it's just kind of a double whammy.

1:01:41

Speaker A

Yeah, yeah. And I don't. Again, like I, I mean our only frame of reference is we're, I've mentioned before, like we're HubSpot customers. I've been a, I mean my agency was HubSpot's first agency back in 2007. Like I've been a HubSpot customer for 18 years, so I'm far more familiar with their platform than Salesforce. But I feel like HubSpot, you know, almost like, it's almost like HubSpot was the apple of like the CRM world. They didn't go all in quite the way that Salesforce did make the massive deep integration bet right away. I feel like they've been far, you know, slower in terms of their integration, maybe more calculated in how they did it, where it didn't. It's very additive, I would say, to the HubSpot experience right now. Not like I feel like they screwed it up or went too far with the LLMs and routes doing things in our platform. And again, you're the, you and our team use HubSpot more directly than I do every day. But my sense is it's, it's a very complimentary layer of features that you sort of like, yeah, it's okay like if it's not perfect, like it's just summarizing this thing or it's helping us do this script where it's not like the core thing but we're, we're also not using it to write all the sales emails and do all the outward facing things. A lot of what we use HubSpot's AI for is more internal purposes where it's okay if there's like some hallucinations and things like that. So I don't know, might just be our way of using it. Not comparing the platforms perfectly, but it's just my sense of it.

1:01:55

Speaker B

So next up, Nvidia has executed a landmark deal entering a $20 billion agreement with AI chip startup GR. That's G R O Q. We've talked about them before. It is not GROK G O R K, which is Elon Musk's AI company or AI chatbot rather. So this deal is kind of one of these like Aqua Hire acquisition things, but it's structured as a non exclusive licensing agreement rather than a traditional acquisition. Under these terms, Nvidia gains access to grok's intellectual property and hires its founder Jonathan Ross, along with other senior leadership. Now Grok, as we talked about in the past, specializes in something called language processing units which are designed specifically for Inference. So the process of running AI models that differs from the training heavy focus of standard GPUs or the chips that Nvidia often sells. Now Nvidia CEO Jensen Huang stated the company intends to integrate these low latency processors into its products to serve real time workloads. Now Grok, the company will remain an independent entity focused on its cloud business. Their former CFO is taking over as CEO. And the information reports this deal structure allows Nvidia to onboard key talent and technology while avoiding the regulatory scrutiny associated with often these kinds of acquisition. So Paul, I'm curious, you know, longtime Nvidia watcher, we've talked about GROK before. What does this mean for Nvidia's business moving forward?

1:03:25

Speaker A

This was the one that happened on Christmas Eve. Oh man, I got to read this like 11 o' clock at night. Yeah, so I mean GROK was kind of, you know, it, it definitely captured the attention of Wall Street. I would say there was a couple of instances in 2025, especially early in the year where I think there was some questions about like Nvidia's dominance and could they maintain it as more of the usage of AI moved to inference versus training. So Nvidia stock, you know, has skyrocketed to historic levels largely because they enable the training of these models where GROK sort of specializes more in the use of the models on devices like you and I when we go in and use ChatGPT. And so there was some questions and you know, I think Jensen had to address it numerous times in different interviews last year and differentiate it but overall, I mean I this seems like a brilliant move. It, you know, this Aqua Hire kind of approach that everyone is taking and you know, I think that it is a prelude to many more acquisitions to come for Nvidia. I feel like they're going to be very aggressive. They have based on a quick search they're about 60 billion in cash and equivalents by late 2025. So that's cash on hand, cash reserves. I don't think they're going to be shy about making investments. They're obviously, you know, printing money right now in Nvidia. So I don't know, I would think they're going to be pretty active in, in the mergers acquisition space or Aqui hire space at least.

1:04:54

Speaker B

Our next topic also about an acquisition. So Meta has reached an agreement to acquire the AI startup Mamis for more than $2 billion. This is a Singapore based company which was founded by Chinese entrepreneurs and specializes in autonomous AI agents capable of performing Complex tasks like producing detailed research reports and building custom websites with minimal human input. So this acquisition represents a bit of a pivot for Meta as it's trying to cement its position against Microsoft and Google in the market for AI productivity tools. It's also one of the first major instances of a US tech giant purchasing a startup with deep roots in the Asian AI ecosystem. So Manus gained quite a bit of attention and traction earlier this year, crossing a hundred million in annual recurring revenue in December, just eight months after its launch. So under the terms of the deal, Manus co founder Xiaohong will join Meta and report directly to its coo. And this obviously follows a broader recruiting blitz by Zuckerberg, which has included the recent purchase of a 49% stake in Scale AI. And Metis says it intends to scale Manus's agents and service to more than two more business customers, while integrating the technology across its social media products. So, Paul, what does acquiring Manus tell you about where Meta is headed with AI?

1:06:21

Speaker A

They're going to keep spending billions of dollars to try and get back in the game on AI. I think, like, you know, in 2025 started, Meta was, was hot. They were like, you know, dominating on the open source model side. They had a lot of momentum and I feel like we'll talk about it in a minute, but Llama 4 was sort of a dud and, and Zuckerberg had to go into scramble mode and try and Aqua hire directly hire talent to get them back on track. And I feel like they're still scrambling and I'm not sure the path forward other than integration into their existing platforms, but I don't know. I mean, that's a tough one. If you had to force rank the AI labs right now and who's going to make the biggest impact in 2026, Meta would be at the bottom of my list, honestly. Honestly.

1:07:43

Speaker B

Well, you might not be the only one thinking that, because in our next topic, Yann Lecun, who we've talked about all the time and is considered one of the godfathers of modern AI, is kind of speaking out. He just did an interview with the Financial Times. You know, he's currently, as we've covered in the past, stepping down from Meta to launch a new research startup. So in this exclusive interview with the FT, LeCun announced he will serve as the executive chair of Advanced Machine Intelligence Labs, what he calls something called a, quote, NEO lab, focused on fundamental research beyond large language models. So Lecun has become a vocal critic of the industry's obsession with LLMs he says they're a dead end for superintelligence because they lack an understanding of the physical world. Instead, this new venture of his will develop world models that learn from video and spatial data. And that's an approach he believes is necessary to replicate the way biological intelligence learns now. Also, a lot of juicy details in this article about the tension at Meta, where LeCun revealed that Zuckerberg lost confidence in the generative AI team after the Llama 4 model was considered a flop. He also said performance benchmarks were reportedly, quote, unquote, fudged. And he also stated that his integrity as a scientist compelled him to leave an environment that he described as, quote, LLM pilled. So we've tracked LeCun for a while, Paul, and his overall take on AI. What did you take away from this interview and kind of what he revealed here?

1:08:34

Speaker A

Yeah, again, I don't have any internal sources at Meta, but if you go back and listen to the 2025 episodes, everything that we assumed was happening and, and said would happen happened. Like, this is, this is exactly how I anticipated this would end when they made the Alexander Wang Aqua Hire. And so I did not expect him to, to just like, put it all out in this interview. It's a fascinating article to read. You get kind of a sense of his personality as you're reading through this. But yeah, so, I mean, just real quick background, he joined there in 2013 as a bit of a consolation prize because Zuckerberg had tried to acquire DeepMind and Demis did not want to work for Zuckerberg, so they ended up selling to Google instead. So LeCun went there on three conditions. That he didn't have to leave NYU, that he didn't have to move to California, and that everything that his research lab did would be made public that, you know, Zuckerberg held to that up until sort of the chat GPT moment they tried to play the game of like, let's own the open models. And so they. That's where they made their initial traction. Llama two, they put out open weights for all users, meaning people could download and tweak it for free. And LeCun said in the interview, like, this sort of like changed the industry, became the gold standard and open LLMs. I would say the deep seek moment probably impacted Meta more than anybody else because in essence, this Chinese lab showed up and just took ownership of the open source side of things. So that led to the switching of gears. Zuckerberg placed more pressure on the Genai unit to accelerate ad development and deployment, which led to a communication breakdown. LeCun says in the article. He said, we had a lot of new ideas and really cool stuff that they should implement. They were just going for things that were essentially safe and proven. When you do this, you fall behind. And then he mentioned, as you said, Mike, that the Llama 4 model was a dud. The fact that he said they fudged the results is like, I can't believe he said that. Like, yeah, that was the one thing that jumped out to me. I mean, he laid a couple other doozies in there, but the fact that he admitted they cheated was crazy to me. Again, something we all basically knew. But for him, someone to say it was crazy. So that's when he said Zuckerberg lost confidence in that team, kind of sidelined the Gen A organization and then they started poaching everybody. They asked, do you think it's going to work? And he said, the future will say whether that was a good idea or not. Lacun says, deadpanned when asked about Wang. And so again, Alexander reg is what, 28 years old? I think came from Scalia and the 15 billion dollar aqua hire. He called him young and inexperienced. He said he learns fast. He knows what he doesn't know. But there's no experience with research or how to practice research, how you do it, or what would be attractive or repulsive to a researcher. So he's basically saying, Alex is not a researcher. He has no idea what he's doing when it comes to research. And then he said this again. I'm just kind of shocked he said these things. Alex isn't telling me what to do either. You don't tell a researcher what to do. You certainly don't tell a researcher like me what to do. And then said he didn't mince words about why he ultimately decided to leave Meta after more than a decade. Staying became a politically difficult thing, he tells the author. And while Zuckerberg likes lacun's World Model Research, the crowd who were hired for the company's new superintelligence push are completely LML pilled, as you said, Mike. And then this clearly alienated, he said, I'm sure there's a lot of people at Meta, including perhaps Alex, who would like me to not tell the world that LLMs basically are dead end when it it comes to super intelligence. But I'm not going to change my mind because some dude thinks I'm wrong. I'm not wrong. My integrity as a scientist can now not allow me to do this wild like, kudos to the author of that article to get that interview. I mean, just not the stuff you usually hear on the way out.

1:10:04

Speaker B

Feels like Lacuna's been holding those words.

1:14:02

Speaker A

Oh my God, again, all the stuff I assumed, like, I was like, there's no way he's reporting to this guy. And yeah, I just didn't expect to actually see it in print.

1:14:04

Speaker B

All right, next up, we get a report from the information that OpenAI is ramping up its audio AI development in preparation for the launch of a new hardware device, which we've talked about a little bit, that is expected to debut in roughly a year. Now, according to the Information, the device will be largely audio based and perhaps screenless, reflecting a design philosophy championed by former Apple designer Jony I, who has a passion to aim to reduce screen addiction. Internal sources told the information that OpenAI researchers believe current audio models, which are distinct from the primary text models used in ChatGPT, currently lag in accuracy and speed. To address this, the company has unified engineering and research teams to build a new audio architecture. That architecture is slated for release in Q1 of 2020 26. This new model is basically expected to handle interruptions, speak simultaneously with the user, and offer more emotive responses. So this would then power this hardware, which is envisioned as a companion that proactively offers suggestions based on its surroundings. Some form factors under discussion include smart speakers and glasses. Now, Paul, still a lot of details to figure out what this device is going to actually look like. We've talked about before. I can tell you from personal experience, even just using simple products like Whisper Flow or just Voice Mode have like totally transformed how I interact with AI. So it feels like if OpenAI nails this in some way, this is going to be a huge deal.

1:14:14

Speaker A

Yeah. Here's another idea for you on the product side of what it could be. And I don't remember where I first saw this over break, but a pen is the thing that started gaining traction and it's kind of a fascinating concept because there's an interview, we'll put a link to a tweet that shared this interview with Sam from earlier in 2025 where he talked about the fact that he's actually a pen and notepad guy.

1:15:47

Speaker B

Okay.

1:16:13

Speaker A

Everything he does is writing and then he rips the pages out of all these notepads. And so he goes on like five minutes about how writing is thinking and he has to go through this process. And so it's actually kind of a really fascinating form factor that I hadn't previously Considered where the pen would recognize everything you're writing. So you'd be creating a digital version basically as you're writing. He even gets into like the specific kinds of pens he likes, like the style of pen and like the ink and things like that. And so you could tell he's talking very like he's thought deeply about that form factor before. And so imagine everything you write again is also written in a digital form. You can talk to the pen if you'd prefer, you can hang it on a necklace or a clip and it can have a camera that can also then audio record everything around you. Goes with your iPhone, doesn't replace your iPhone. I don't know like it was when I first saw it. I was like, damn, that actually makes a lot of sense. And oh, it was a supply chain thing. Somebody actually kind of had a source within the supply chain that said it was, it was a pen that the, the part that yeah, obviously everybody's under very, very extreme NDAs on this stuff, but doesn't mean supply chain doesn't leak. So yeah, something to watch for. Again, I'm not saying that's what it is, but it's actually the most logical form factor theory I've heard when you think about it.

1:16:14

Speaker B

All right, next up, Richard Socher, the CEO of You.com and former Chief scientist at Salesforce, has released a forecast for 2026 outlining how he thinks AI will reshape labor and science. And he predicts the emergence of what he calls, quote, reward engineering jobs where specialists must precisely define success metrics for AI agents to prevent what he calls, quote, suboptimal reward hacking as models tackle longer term goals. He argues that while the AI wave is as foundational as electricity, many startups will fail due to poor leadership or structure. Conversely, he anticipates the rise of a 10 person unicorn company of 10 person unicorns which achieve billion dollar valuations with minimal staff due to high revenue per employee efficiency. He also forecasts that 2026 will see at least a dozen companies raising billion dollar seed rounds. Now, beyond economics, Socher posits that AI will become to biology what calculus was to physics. A language capable of handling complex messy systems like the microbiome, for instance, that simple equations just cannot describe. And he concludes that while we may see some waves of Luddites rejecting the technology, he will say, he says that automation will create new industries rather than shrinking the total work available. So, Paul, definitely a notable person in the AI space. What did you make of these predictions? Some of these sound pretty familiar to some of the stuff we've talked about.

1:17:41

Speaker A

Yeah, I thought overall they were really good. A few that jumped out to me is new marketing motion market directly to LLMs. As agents make purchasing decisions, companies will need to optimize for ad consumption, not just human eyes. That's one we think a lot about. You know obviously we have our Macon event that was a topic of interest. We have our Marketing AI Industry Council with Google Cloud. This is something that's come up in council meetings. So so that is definitely one is everyone's trying to figure out the go to market when agents are playing a much greater role in information consumption and buying behavior. The continued pushback against agents on the web slows adoption. I thought it was a good one. He said murky human incentive structures create friction advertisements. Economics do not like web agents doing work for users. Hourly paid jobs like lawyers don't have incentives for massive productivity increases until major disruption happens. He also said every knowledge worker becomes a manager of AI agents and then by the end of the year we're going to see the first glimpse of where super intelligence might go. And as we touched on earlier, he said self improvement is a term being thrown around a lot but nobody has actually figured out yet. In its fullest form it'll be the most exciting research area.

1:19:15

Speaker B

Moving forward those will be predictions to watch. I can't say I disagree with a lot of the this.

1:20:27

Speaker A

Yeah, for sure.

1:20:33

Speaker B

All right, our last topic this week. OpenAI has unveiled a broad suite of what they call prompt packs that are designed to transform ChatGPT from a general purpose chatbot into a specialized tool for specific industries. These are basically just curated libraries that have standardized ready to use prompts for different industries and functions. So for instance, this release has tailored collections for roles like sales, HR, role engineering and government. For example, the ChatGPT for product pack includes templates for competitive research and UX design, while the government section features a prompt pack for leaders to assist with fiscal analysis and public messaging. So this system, which is just hosted on their OpenAI Academy, is basically kind of a plug and play resource. You browse by whatever ones jump out at you by your sector, by your industry and then you copy the pre written prompts directly into ChatGPT. So they basically are trying to reduce the trial and error time often required to get useful outputs from ChatGPT. Now Paul, what struck me here is a this is amazing but B how much this underscores this problem we still have, which is three years after ChatGPT comes out, the vast majority of knowledge workers just simply still don't know everything that's possible that they can do with AI or how to get the results they need through prompting. So because that gap hasn't closed, this seems like really good resource.

1:20:35

Speaker A

Yeah, these are great. And there's some of these came out in like the summer and I don't know if they repackaged them and kind of like put them into a single page, but some of you may have like checked these out before. If not, they are great reminders. So if you're already actively using, there's some really cool prompts in there just to like maybe give you some inspiration. These are the kinds of things though, if you're trying to bring your co workers along, it's a super tangible way to do it. Like give them three to five sample prompts that say, hey, here, just put this in. I I customize it with context for you. Like we talk all the time about that need to again change management, like solve the people problem, make it easier for them to get to value. So that first time they try it, they get that wow factor and they want to keep going. So yeah, we love sharing like simple things like this that create immediate value for people. So we'll put the link in the show notes, you can go right to it. And as Mike said, there's like sales, marketing executives, HR it, there's just a bunch of sample prompts and they're really nice. Quick way to get started for people.

1:22:02

Speaker B

All right, Paul, it's a packed week back for 2026. I appreciate you breaking everything down. As a quick reminder to folks again, take take the weekly AI Pulse survey, SmarterX AI Pulse. And also if you haven't left us a review on your podcast platform of choice or followed us there, please do so. So Paul, thanks again.

1:23:05

Speaker A

Yeah, good to be back. Thanks everyone. Hope you had a great break and hope your new year gets off to a great start. We'll be back with you next week with our regular episodes. Thanks for listening to the Artificial intelligence show. Visit SmarterX AI to to continue on your AI learning journey and join more than 100,000 professionals and business leaders who have subscribed to our weekly newsletters, downloaded AI blueprints, attended virtual and in person events, taken online AI courses and earned professional certificates from our AI Academy and engaged in the SmartRx Slack community. Until next time, stay curious and explore AI.

1:23:25