Y Combinator Startup Podcast

Tokenmaxxing: How Top Builders Use AI To Do The Work Of 400 Engineers

41 min
May 8, 202626 days ago
Listen to Episode
Summary

Gary Tan, YC President, discusses how he shipped hundreds of thousands of lines of code in months by leveraging Claude AI as a coding agent, introducing concepts like 'token maxing' and 'thin hardness/fat skills' to maximize AI productivity. He built Gary's List (a political research platform), GStack (a prompt framework), and G-Brain (a RAG system) while running YC full-time, demonstrating a new paradigm of human-AI collaborative software development.

Insights
  • Token maxing—spending more on API calls to get comprehensive, high-quality outputs—is a critical investment for builders, analogous to paying high San Francisco rent for serendipity and opportunity
  • The future of software engineering separates concerns into 'thin hardness' (deterministic code) and 'fat skills' (LLM-driven markdown prompts), with the key skill being knowing which domain each task belongs in
  • AI agents require human agency and taste in the loop; the most productive setup pairs Claude Code (for rapid iteration) with Codex (for deep problem-solving) and human judgment
  • Personal AI and custom prompts are the next frontier; without control over your own tools and data, users remain 'below the API line' of corporate platforms
  • Agentic engineering is entering a 'Homebrew Computer Club' phase where technical founders can build production-grade systems with minimal capital, shifting the economics of software development
Trends
AI-assisted software development moving from copilot (suggestion) to agent (execution) paradigmToken spending becoming a strategic investment metric for builders, not a cost to minimizeOpen-source AI models (OpenClaw, Codex) creating competitive alternatives to closed-API platformsShift from 'lines of code' to 'logical lines of code' and outcome-based productivity metricsPersonal AI infrastructure (local models, custom RAG, private data) as a counterweight to corporate AI platformsMarkdown-first prompt engineering replacing traditional code-first development for knowledge workAgentic QA and testing automation reducing manual testing burden in software developmentMulti-agent orchestration (CEO agent, engineer agent, designer agent) as standard development patternVector databases and hybrid search (RRF) becoming foundational for AI-native applicationsDecentralized builder movement emphasizing control, transparency, and alignment with personal values
Companies
Y Combinator
Gary Tan is President of YC; episode discusses how YC founders can leverage AI tools for rapid development
Anthropic
Claude and Claude Code are the primary AI tools discussed throughout the episode for agentic development
OpenAI
Codex model mentioned as alternative to Claude for deep problem-solving in agentic systems
Twitter/X
Gary's List uses X API for research; Grok API mentioned for deep research on X platform
Perplexity
Perplexity API used for deep research capabilities in agentic systems
Microsoft
Playwright testing framework mentioned as alternative to Claude's browser automation tools
Initialized Capital
Brett Gibson, Gary's co-founder on Posthaven, now runs Initialized Capital
Posterous
Gary's first YC startup (2008) that was acquired by Twitter for ~$20 million
Posthaven
Rebuilt version of Posterous after Twitter acquisition; rebuilt again in 2024 with AI
CaseText
Jake Heller's legal tech company; inspired Gary's approach to agentic research and retrieval
Airbnb
Brian Chesky's 10-star experience framework used as meta-prompt for product design in GStack
PostgreSQL
Postgres with PG Vector used as database foundation for G-Brain RAG system
People
Gary Tan
Guest discussing his return to coding using Claude AI, shipping 400x more code while running YC
Brett Gibson
Co-founder with Gary on Posthaven; now runs Initialized Capital venture fund
Jake Heller
Created CaseText; inspired Gary's agentic research approach discussed in Light Cone episode
Brian Chesky
10-star experience framework referenced as meta-prompt design pattern for product thinking
Peter Koeman
Created OpenClaw; partner with Gary; inspired 'thin hardness/fat skills' philosophy
Andrej Karpathy
His post on knowledge LLM wikis inspired Gary's approach to building G-Brain RAG system
Steve Jobs
Referenced as historical parallel to current 'Homebrew Computer Club' moment in AI development
Steve Wozniak
Referenced as historical parallel to current 'Homebrew Computer Club' moment in AI development
Boris Churny
Mentioned as inspiration for Gary's realization that builders can leverage AI agents effectively
Quotes
"Using OpenClaw these days is like driving a Ferrari and it's exhilarating. It's insane. But then it's also like a Ferrari in that you better be a mechanic. It's a Ferrari that will break down on the side of the road when you most need it."
Gary TanOpening and closing remarks
"Will you have control over your own tools or will your tools have control over you? That's the defining question."
Gary TanFinal segment
"Token maxing is going to be one of those things for founders that we have to teach them. It's not immediately obvious, but this is actually like rent—you should spend as much as you can to get the most utility out of it."
Gary TanMid-episode discussion
"I can buy millions of years of consciousness, of machine consciousness. Now I can be a time billionaire. It's not my own time. It's the time of a machine doing work for me."
Gary TanLate episode reflection
"The difficulty in agentic engineering today is when people try to do things that should be in markdown in code and it fails because code is brittle. Code doesn't understand what you want or who you are."
Gary TanThin hardness/fat skills discussion
Full Transcript
I think that's like the defining question. Like, will you have control over your own tools or will your tools have control over you? Using OpenClaw these days is like driving a Ferrari and it's like exhilarating. It's insane. Like you get to do things, like it figures things out you would never think a machine could figure out and it does it so quickly. But then it's also like a Ferrari and that you better be a mechanic. Like it's a Ferrari that will break down on the side of the road when you most need it and you need to get out with your wrench and pop the hood and fix it. You're going to have to fix it yourself. And so this is a very exciting time in computer science and technology. Welcome back to a special episode of The Light Cone. In this episode, we're going to talk about how Gary Tan got back to building. If you follow us on Twitter, you'll know that after a multi-year hiatus to become an investor, Gary Tan is back to being a builder. And in the last couple months, he's shipped hundreds of thousands of lines of code and built popular open source projects that have gone from nothing to more than 100,000 stars on GitHub. And he did all of this while having a very demanding job running YC full time. A lot of people on the internet don't even think that this is possible and are somewhat like in disbelief, but it actually happened. We know because we were here to see the whole thing. And so today we're going to talk about how he did it. Well, I'm relatively shocked myself. I'm amazed as well. It was 13 years of not coding. And then suddenly, boom, I'm doing about 400x the amount of work that I was that year. The last time I was even sort of like two thirds of the time writing code. Maybe to start things off, how will we go back to the project that started it all off, which was Gary's List? Oh, yeah. And just like talk about a few months ago, how you powered up Cloud Code and like started to get back to coding. And it was right after one of the Likon episodes, right? Oh, yeah, definitely. I realized that I wanted to bring together all the people who believed what I believed, particularly for California. And so I started a 501C4, and now it's a C3 and a PAC, which is sort of what a lot of political groups do. It's a very common way to bring people together. Everyone focuses on the money, but we're trying to bring together smart people. You know, what I learned in the years of working in San Francisco politics is that bringing together people is so powerful. And that's what a mass social movement is. And I said, okay, well, why don't I just make a website where we start doing that? And it would just start with, why don't I start writing about the issues that I'm worried about? It's like I want children in school, you know, people watching this from all around the world might find it very, very strange, like I find it strange, that it was not possible and still very, very hard for a seventh grader or eighth grader in middle school in San Francisco public schools to be able to take algebra. And that was a math education thing. If I didn't get to do that when I was in public schools in the East Bay of the Bay Area, there's no way I would have studied engineering at Stanford. I never would have written code. I never would have been able to do any of these things. So it was close to my heart, and I realized, hey, it's time to write code. And I ended up building Posturus, my first YC startup from 2008. What was Posturus for people who don't remember it? Yeah, Posturus was Dead Simple Blogs by email. Well, it grew to be a top 200 website on the internet and then Twitter ended up buying it for about $20 million. So that was sort of like my first bag, really. I actually built it again as posthaven when Twitter bought it for the amazing people that we had hired and they shut down the startup. It would have cost a couple million dollars to buy it back from Twitter and at the time I had no money in the world. So the next best thing was why don't I write it again? And then in January of this year, I ended up writing it a third time. Only, you know, the first time it took about, you know, $4 million and, you know, six or seven people and about a year and a half. And then the second time it, you know, took about, I don't know, $100,000 and two people, me and my co-founder Brett Gibson, who now runs Initialized, and maybe like three months or so. And then in this case, it took about $200, which was my Cloud Code Max account, and probably five days. Full featured blog platform, does everything you want. And then on top of that, like full rag, full agentic retrieval, like be able to sort of go out and read all of the Internet, like every tweet I've ever done, recursive crawl, deep research of any topic. The algebra thing is just one of a whole lot of different issues that we really, really care about. And to be able to go ingest the internet, you know, see all the arguments for and against, and then to craft incredibly detailed reports on the back end about what are all the quotables. I think people who are big followers of The Light Cone might remember one of our first episodes about agentic systems with Jake Heller, actually. So Jake created case text and he described exactly what I ended up building for basically journalistic long form articles about any sort of issue or piece of news that was happening. And so, you know, anyone can go to garyslist.org today and, you know, we do about two or three relatively, you know, researched, all fully sourced articles about what's going on in California and San Francisco and L.A. And how do we build a better government? This is the thing I feel like people missed about Gary's Little Don't Fully Get is that it's like the classic thing we've been talking about here, which is like software was you build software to let people use it. is like you build a blogging platform and people like write blogs and maybe like they'd start their own sub stacks eventually or they write articles but gary's list is both blogging platform but it actually does the work of a high quality investigative journalist it's not just something that a journalist uses to publish their articles yeah i mean basically the for the equivalent of like five or ten dollars of opus calls i mean i would estimate that it does the work of like you know, a real human being that would have to like go painstaking through dozens of articles, read entire books about certain subjects, annotate them. I mean, going back to the case text example, like the thing that Jake taught me was that you need to think about what a human would do with the context given. Like what would it retrieve? Like, does it go to the library? What kind of book would it look for? What does it search on for search, you know, on the web? I mean, the great thing now is like, you don't have to just do that. Like you can get perplexities API and you can do deep research there. You have X's API, you can do deep research there. You know, Grok's API, if you need to like do research on X using the Grok API is actually very, very good. And you can just grab all of the context. This is sort of going back to the philosophy of boil the ocean, which is one of my essays. It's like, particularly when building agentic software now, you don't have to settle for what we did when we were humans writing the code. And that goes for research as well. What if you absolutely boil the ocean? What is the total completionist? If you were a human, this would take you about a month to do this research. You can just zap the rocks harder. You pay more money and you might be token maxing, but you should token max. Like basically, if there is incremental work that makes something more complete, more awesome, more in the case of this type of writing, like we want it to be more representative of reality. Like, you know, we don't just settle for one source when we can get 20 sources and we can cross reference them. We can figure out like, well, these 13 sources say this and seven sources disagree with that. And then, you know, you want to feed all of that context into like your core prompt. And then you can basically make a better decision than what you would like just, you know, a human being clicking on a link, reading a headline, and that's all you understand. And I think if you token max, like that's actually the coolest thing you can do now. And it's not just in, you know, generating articles. It's not, you know, it's clearly in writing code, right? I think now it's going to permeate every part of society. Like every thing that we would call knowledge work could be token maxed. And I don't think that it means that we're going to get rid of people. I think it means that people need to still supply the agency. Like, I need this. Like, I'm the one who's sitting here caring about algebra. Like, I want kids like me who couldn't afford private school. You know, San Francisco is the one city in the world that has the highest rate of private school attendance, probably in the entire country, actually. And that's not OK. Like, you shouldn't have to be rich to have a good education. And, you know, I don't know why that's controversial. And so for me, it's like this, you know, mass sort of shift in technology was happening. And then I had a need and a want and a desire. And it was a burning desire. Like, it hurts me and pains me to think about 10, 12, 13-year-old kids who don't know algebra and, like, could have. But some bureaucrat or, you know, some virtue signaling person in power says, like, actually, I don't want that kid who wants to learn algebra to learn it. So I think in this process of basically solving your own pain and need from the young Gary and building Gary's List, you sort of discover a lot of patterns on token maxing and this new way of building that led you to the next project, which was GStack. Like I actually did not plan to make G-Stack. All I did was like, I realized that I was doing the same things over and over again And then I got sick of typing the same thing So I went into my Apple notes I typed in all the things that I found myself writing over and over again into Cloud Code And it was pretty simple stuff It's like, here's the plan review. One of the things I started doing is I really love asking Cloud to make ASCII art diagrams. One of the things I discovered is sometimes Cloud would just get confused and like write bugs or not be complete. But once I started saying, actually, before you start your work, make an ASCII diagram of all the data flows, all the inputs and outputs. What are the user flows? What are the error messages? And you can see this. It's like data flow, state machines, dependency graphs, processing pipelines, decision trees. Once it did that, it loaded all of the context in and then it just did the work more completely. Like it boiled the ocean better. And it broke down into a bunch of different sections. Like here's architecture review, code quality, test. I mean, one of the things I learned building Gary's List was that when I was writing the code myself, I would always do the minimum amount of testing because it's just like not very fun. I knew I needed to have it, but I'm here to write, you know, fun new code. I did not like to write tests. And then honestly, like I hit all the things that everyone else hits when they start vibe coding, which is like, this is slop. It's not working that well. Like it works fine for the 80% case, but if any users actually touch it, it starts falling over. And then that's when I realized, oh, I can get to 100% test coverage. I've since learned that 100% is probably too much. Like hitting 80% to 90% is usually the best practice at this point. But yeah, this is basically the first version of plan-eng-review. dash review. I know everyone knows the office hour skill, which is, you know, what people can use and I still use when I'm trying to make a brand new product or a brand new feature. It simulates what we do when we're working with a company. It's like, how do you know that people want this? You know, who's it for? What does it do? And what's the impact, right? But this is like the proto skill. Like this is, I didn't even know skills existed. And I posted this and it went viral, Like, you know, 200,000 people saw that. And then I made another version of it that was a much more expansive version. I called it the mega plan. And then I ended up renaming it to the CEO plan. We've probably talked about meta prompting before. I use meta prompting here. I took the other review plan that we had. And then I said, okay, well, let's do a version of this. But like, imagine Brian Chesky sitting with you, right? Like Brian Chesky has this great line about what is a 10 star experience. So and, you know, the point of it is everyone thinks about hotels in terms of like three. This is a three star experience. It's a four star experience. And he like goes, you know, through the list like five stars. It's like everyone, you know, yeah, cool. Like he's like, what's a six star and what's a seven star and what's an eight star? And like he goes all through that entire list. And that's one of my favorite like product and design exercises to go through like as a mental exercise. And then the cool thing is like you can do that every single time now. And so that's what this is. You know, this prompt basically tries to figure out what is the platonic ideal of what this is. These are sort of like the three, the two things that are pretty awesome. One is what is the 10x check? What is more ambitious and delivers 10x more value for only 2x the effort, right? And so for whatever reason, coming out of latent space, this helps the model like really visualize. So I'm plan CEO skill. I actually really enjoy because I'm an ADHD CEO and I love potential, like pure potential. And so this is like the one, like I can't believe this is just literally two little sentences, but like this unlocks an incredible amount. And so that's how GStack started actually not as, you know, I didn't want it to be anything other than like, well, I just need to make some skills. And I had heard that people were making like skill repos. But then the third thing I did was I started using these two skills so much that my conductor instance was getting very backed up. So this is how I use conductor. This is actually my real setup. Okay, so this is your like daily workflow. this is how you've been shipping hundreds of thousands of lines of code a month. It's all in here. Yeah, that's right. So I dropped like 13 PRs in the last 48 hours. And then, you know, you just queue them up. Like anytime I come up with a new idea, I come in and here it is. You know, I love using the CEO skill. I love using the eng skill to like really make it super well tested. I did that all in plan mode. And then I'd click approve here. And then, you know, Claude would go and do all the stuff. And then I did that so much that I ended up having like 15 different features that were all queued up waiting for me to manually test it. Like it passed it, you know, it passed end to end testing, it passed integration, it passed unit tests. But like at the end of the day, I still need to, you know, for Gary's list, it's like pop open the Rails server and like, you know, load that user and like make it into that configuration for that particular user and like manually just make sure it works. And I got sick of doing that. And I was trying to use Claude Encode MCP. And it was very, very slow, two to three seconds for every turn. And I was like, this is not usable for QA. But I had heard that Microsoft had released Playwright, which is sort of an alternative testing framework. In retrospect, it's like, actually, there was like agent, there were like agent harness and like all these other like tools that I could have used. But the upside and downside of Cloud Code is it's so easy to just start something that I just popped open. Like I literally went in here and this is probably what I did. It's like, I'm so sick of using Cloud. Cloud in Chrome MCP, it's too slow. Let's go ahead and wrap Microsoft's Playwright. Can we do that? And then I just pressed enter. And then, you know, one of the things that emerged with GStack is that like, this is how I create new features now. Of course, you know, what it's going to do now is like, hey, dude, you already did that, which is hilarious. You know, I have bug fixes right next to giant features. And then the way GStack works, there's a CEO, there's a designer, there's actually a developer experience person in there. There's a number of design tools. And then plan eng is the last one. And then I actually usually run slash codex. And I recently added a slash Claude in Codex. So one of the cool things that I actually learned from YC alums, I came to an event and brain totally frazzled, but went to one of our batch events and we were just shooting the shit about what's going on with Claude Code versus Codex. And at the time I was a total Claude Code only guy. And I realized, oh, a lot of people actually prefer codex? Why is that? And I discovered that Claude code is ideal for the ADHD CEO. But once in a while, Claude code will just BS a bunch of stuff. Claude models are very, very good, but they are not the smartest, it turns out. And so a lot of people explained to me that if you have a problem that's much crazier, you need the 200 IQ nearly nonverbal CTO. So you can just call in a friend and then that's what like slash codex is. It's a G-Stack skill that takes whatever your plan is or if you're out of plan mode and you already implemented it, it'll take your repo. And it'll run codex in a command line prompt with the prompt that says find all the problems and all the bugs. And it reports it back to Cloud Code. And then you and Cloud Code can work through that feedback. And then I have since added, if you use Codex as your main coding agent, you can actually go and type slash Claude and have Claude come and be the CEO briefly if you want as well. The cool thing about G-Stack is when I run it through this program, I start with office hours, CEO review, I do design if there's UI, if I know a developer needs to use it, which is practically all of G-Stack and G-Brain stuff, I run the developer review. then I do eng review and then codex. Once that plan is done, I've worked through all of the issues. The G-Stack relies very heavily on ask user question. So because, you know, and that's, that to me is like really important. That's where the human, you know, vibe coder, operator, agentic engineer needs to supply their understanding of what's going on. What are we building? There's not really a substitute to that. It would surprise me very much if someone really, truly did manage to make a thing that could just make software without the human in the loop. It's a controversial take, I think, but I never want to be entirely out of the loop. I just want the machine to do the stuff that I don't want to do. And so basically QA is a good example. And I mean, that's hilarious. Coming back to the demo, it's like I typed something into the modern version of GStack and it's like, dude, what are you doing? We already built that. We have Browse is a long-lived HTTP daemon with 70 commands as a CLI. And then QA is just browse. But in the prompt for QA, it says, look in your context. What did we do on this branch? If there's UI or any mutation of data, go and use the browser to test that thing, which is cool. It's like having a black box browser. It blew my mind when it first worked. It's like mini AGI is already here. You know, I realize this is not true AGI. True, true AGI would be like, I'm not even here. And actually, that's fine. In this respect, like as a builder, you know, selfishly, I hope that we never have to stop. I hope that the machines never figure it out because that would be really cool. Like then, you know, humans are really important and like engineers who know how to do this, who have taste in design and product feedback and, you know, the real customer in mind. Like we're going to be like we basically have wings for as long as we do. YC Startup School is back. We're hand selecting the most promising builders in the world and flying them out to San Francisco for July 25th and 26th to discuss the cutting edge of tech Apply now for a spot Okay back to the video I think you crystallized a lot of these thinking in this post on X about thin hardness and fat skills. Oh, yes. Which actually encompasses all of this philosophy on how to token max. Yeah, I mean, some of it came out of being trolled on the internet relentlessly about markdown and like, you know, I'm just like peddling a set of markdown. And it's like, you know, I guess my lived experience at this point is that Markdown is actually code. It's just like this compiled in a different way. But like you can get the computer to do really astonishing things. Like, I mean, even this, it's like, could we have imagined that I would be talking to something that has replaced Visual Studio for like, I don't use Visual Studio at all. Like there's no reason to like when I can talk to my agent and my agent can do this. The name actually came from our partner, Pete Koeman. We have had to build an internal agent, and we call that the harness, over and over again. And then at some point, using Cloud Code all day, we realized, why should we rewrite a version of that over and over again? Like, you know, we should just use the things that are really awesome as, you know, harnesses. Like a harness is the core loop that takes the user input, gives it to the LLM, runs what the LLM does. Like it can do tool calls and things like that. I mean, why would we build that? Like what we should be spending all our time doing is thinking about what markdown should there be. And the way to think about markdown is if you were an event planner and throwing a wedding and you were trying to write down a checklist of how to throw a wedding again. Like what would you write in plain English to teach the next person who had to do it what to do? All of that should be in the markdown. Whereas all the things that should be deterministic, I mean, or is a real action. Like a wedding planner might have to call like 20 venues, right? But you wouldn't use markdown for that. Like you would make a call to Twilio, for instance, right? There's like sort of all of the difficulty in energetic engineering today is when people try to do things that should be in markdown in code and it fails because code is brittle. It doesn't understand special cases. It actually, you know, code literally doesn't understand what you want or who you are. It is like, you know, executing deterministic zeros and ones in a Turing complete loop. Right. Like it doesn't know. But then now we have LLMs that have latent space and they know who you are and it knows what your motivations are and it can handle generic cases. And then, you know, a lot of the magic right now as an engineer is like figuring out, OK, how much of it is over here in LLM land and how much of it is over there in code land? And then, you know, if you combine that with the other thing I learned, which is like get to 80 to 90 percent tests, like if it's not tested and you're just throwing users in there, like it's slop, you know, 10x worse than like human written code because like you just have no idea what's going to happen. And so that's like one of the things that people have to do. It's like, all right, not only do you need to figure out what's going on in latent space and deterministic space, you also have to make sure that like it's, you know, individually tested and then the integration is tested. And then going back to boil the ocean, like the machine doesn't care. It'll just do it. It's amazing. Like just zap the rocks more and you can get to 90 percent test coverage. And then you can have a system that, you know, is not quite perfect. Like, you know, Open Claw right now, there are lots of like failure cases, but it's 95% there. You know, it's I feel like using Open Claw these days is like driving a Ferrari and it's like exhilarating. It's insane. Like you get to do things like it figures things out. You would never think a machine could figure out and it does it so quickly. But then it's also like a Ferrari and that you better be a mechanic. Like it's a Ferrari that will break down on the side of the road when you most need it. And you need to get out with your wrench and pop the hood and like fix it. You know, you're going to have to fix it yourself. And so this is a very exciting time in computer science and technology because it's like this is Homebrew Computer Club. You know, the moment when the Apple One came out, like the Apple One created by Steve Jobs and Steve Wozniak was a breadboard inside, like literally a wooden case hammered together with like nails and duct tape, you know. and if you wanted a personal computer that's what you had to do and that's where we're at right now like you have relatively you know smart technical and you know people who had to study computer science have to spend like two or three hours and like maybe like five hundred or a thousand dollars in both tokens and cloud to actually get something like that running but like once you get it it's like we're sort of in the kit car ferrari phase it's like then you can drive and you can go anywhere and you want to shout to the hills like, hey, I got a Ferrari. Even the part about fixing yourself, I feel people, it's just like one of those things until you've like pushed through, you just don't quite get. If I really zoom out, it's almost like things have moved so quickly. Like if you think way back, just having Stack Overflow as a website that you could consult when you got stuck on a programming problem felt like amazing. And then it's like a chatGPT launches, like, oh, now I've got this like interactive thing that's way better than Stack Overflow. But you're still sort of doing the same thing. You're like asking questions and you're copying and pasting code and you're running the code and seeing what happens and copying and pasting it back and then you sort of with clawed code you sort of push through and you realize that you don't need to do the copy and pasting anymore it just like actually like executes and runs the code and even open claw i found out when i set it up yeah it's annoying because it can like effectively brick itself and it does a bunch of annoying things but if you actually have like clawed code like sort of fix it yeah if i just have called code running it will just like fix it and it's clearly not the way things will be long term but there's this mentality shift of it doesn't actually matter if it's brittle and requires fixing, because you can actually just have another agent sat there fixing it all the time. I feel like this evolution, I was completely Claude Code pilled, and still am, but probably only 50% or 60% of my time building product or agentic engineering is in Claude Code now. At some point, basically... Almost half of it is through OpenClaude now. Yeah, which is very interesting. I mean, then again, I'm also spending most of my time working on G-Brain itself. So G-Brain came about because I met, you know, obviously we had Peter on the show. And then I finally got around to it. It was like one weekend I said, I got to check this out. Like what's going on with OpenClaw? Let's get it going. And this was about the time Karpathy wrote his text post about knowledge LLM wikis. And so I was like, okay, well, I have a repo full of Markdown. all my, you know, I should put all my context into that markdown. And then at some point I realized, oh shoot, it's just using grep. And grep is not that good. Like it's, you know, wasting context, it's loading a lot more into context than it needs to. And then I sort of fell into a rabbit hole. I just went into conductor, click quick start. And then I had GStack built into conductor already. And, you know, basically this was how I started. I, you know, it was actually much more interesting than that. So I didn't start off from nothing. One of the things I've learned as you write a larger and larger corpus of code is you have it loaded in your brain. You're like, oh, well, in order to build an agentic newsroom for Gary's list, I actually had to learn about vector embedding and hybrid RRF and chunking. When you're in there trying to make it work, you're just very applied. It's like, I have an output that I want. I want the article to look like this. It needs to be of this quality. It needs to have these citations. Like you start building up your, you know, your tests and integration tests. And like you end up with like a product that's like battle tested from like the output that you want. And so I sort of put two and two together. And I, you know, and this is something that, you know, anyone can do. Actually, it's like this. This is why I think we're entering the golden age of open source. I could just open, you know, this project in Conductor. And then the first thing I write is like, you know, go look at, you know, tilde slash git slash Gary's list. Like, look at how we do chunking, embedding, you know, hybrid RRF, rag, like all of this, and then just like extract it. And then I want to use Postgres with PG vector. And like, I want a, you know, full rag system for my open claw. And then sort of like one thing led to another. It's like, then I have, you know, 10 windows and G-Brain and I'm just like at it. What's cool about open claw, I mean, maybe this is a good example. This is actually my open claw. I did go ahead and ask, it's, you know, how did I actually get into it? January 23rd. Also all your emails. I had a tweet that was like, Claude code this week has awakened my 25-year-old self, the one that checked Red Bulls and stayed up till Don coding. We're so back. the builder identity resurfaces yeah you know i'm basically back to you know sleeping four hours and you know coding 20 hours a day you know this is also when i started getting myself into trouble like talking about lines of code i still believe this by the way yeah this might be like a good quick aside to talk about like this this idea of like lines of code being important measure has been like controversial on the internet there's obviously the counter argument like oh lines of code doesn't like measure developer productivity, but well, it doesn't. Right. But it also does. So it also kind of does, right? Yeah. Like it does. It's clearly, and you know, what's interesting is you can actually, um, there's well-published Git repos out there that you can run to, uh, strip away and like standardize what is actual logical lines of code. And so I actually did go ahead and do that. Um, you know, and I got into trouble for saying like, Oh, I'm coding at like 100x the rate that I was in 2013. And then after I did the logical lines of code stripped down, it actually went up. It actually went up. So it turns out that I was actually doing 400x the amount of code But obviously I wasn writing it I was directing you know 15 agents at a time to do so And then by the numbers like it was not that it did like, knock down my lines of code from Cloud Code a little bit. But the surprising thing to me was that it knocked down the amount of lines of code that I was writing in 2013 by like 70%. And so I think that that's sort of the mismatch here. Like people get very upset because it's easy to like pad the lines of code if you're a human writing code. Whereas like, unless you direct clod code to literally like pad the lines of code, it doesn't necessarily do that. Like it'll maybe build the wrong thing. Like you might not steer it very well. It might not do the right thing, But like, it's not trying to optimize for lines of code the way a human working a job would, right? Which is, you know, that's just life. And then I guess the really surprising thing is if you look at the literature about software engineering going back to like 2000, 1990, I mean, it's pretty clear that the average number of lines of code that a professional software engineer that's like tested and production ready, it's not like 100 lines of code. It's like 50. It's like 30. like a day yeah a day right like for me it was like 14 but i was like part-time i don't know it's uh so that's where the 400x actually came from you know the other thing i know is like i should have said that instead of just trolling people more on the line so i yeah if i trolled you on the internet i'm very sorry for that like there you know there is a deeper understanding of this and i did end up releasing a blog post about it that um explains this quite a bit more i mean and i think it's not a little bit significant. It's very significant for people who are technical, because it actually raises the bar on what you're capable of doing. All the people who are attacking me about lines of code, they particularly are the people who are most likely to get wings if you let it rip and token max. This is sort of like the classic problem. It's like if you have taste and you understand technology, you are particularly the people who would benefit the most from getting this all someone has to do is you know believe right so stop fighting and just open cloud code and try it you know i think another thing that's potentially going on is just like the experience is very dramatically depending on like the the models and the harnesses um like certainly something i've noticed is any sort of like semi-complicated programming task i try and do through my open claw agent just like kind of fails um like it's exactly the same model and sort of like opus 4.7 as clawed code but it just like like anything above like a simple script i just find like it's not like that great at so i'll go back into like clawed code and then it was sort of a moment for me where i realized oh like this is how it used to feel like this is how like even six months ago it used to feel like oh like you try and like these things yeah these things aren't quite there yet and then clawed code with like opus 4.5 was like oh like it's actually like here it's about to recur like right now people sort of are feeling like open claw or hermes is like not quite there or it's like a lot of work and then i guarantee you like this time next year like everyone's going to be saying what you heard here first which is like every single person on the planet will have their own personal ai we could either live in a world where we have our own AI, where we have our own data, our own integrations, like we see what's happening, we write our own prompts, and we have control over what we see. Or it's corporate controlled. It's something, you know, you go to a host, it's kind of like your Facebook feed. And like, you don't know what the, you know, who wrote that algorithm? And who does it benefit? And like, what business model is behind it? Like, nobody knows. The most powerful idea that, like, was a gift was the personal computer revolution. And we're about to go through exactly that same shift with personal AI. And it's going to be a choice. People are going to have to figure out, am I willing to write my own prompts? And I think I wish Pete Koeman were here. That's one of the things we learned from him too. It's like, unless you have your own prompts and you can write it for yourself, you are below the API line for some PM or developer that is not you, who will not understand you, will not understand your needs, will not understand what you uniquely care about. And I think that's the defining question. Will you have control over your own tools or will your tools have control over you? And I think this is one of the disconnects that the public has, I think, is a lot of these capabilities, you have to be on the latest and greatest models. And it's actually quite expensive to use them and burn all the tokens for now. It's coming down. But I think maybe people are just trying like Sonnet or the free model or having the basic Claude Pro subscription only. And part of it is maybe we have to address that this new way of really getting all this almost ASI, AGI moment for building is you have to be burning lots of tokens, the whole token maxing paradigm. It actually reminds me of rent, San Francisco rent. One of the things that I feel like we always have to do with YC founders is that it's like a general thing. It's like, oh, I don't want to move to San Francisco because it's so expensive to live there. It's so expensive to not live there. Yeah, exactly. That's the whole point, right? Early on in a YC batch, I'm used to a founder of being like like this like this apartment is like thousands of dollars a month in rent like seems ridiculous like should i like pay it or not it's like no you should absolutely pay and if anything you should pay more to not just be in san francisco but being like the dog patch and just like being like neighborhoods where you create the serendipity like token maxing is going to be one of those things for founders that we sort of have to teach them where it's not immediately obvious that you shouldn't this is actually like rent like this is one of the things where you should like spend as much as you can to like get the like most utility out of it versus treating it like the office desk or something like sure you can economize on that or you don't need like a super expensive like couch but like when it comes to like actually using the models and your token spend you should probably be like pushing pretty hard on that yeah one of the key maxims for yc is you know how do you find good startup ideas live in the future and build what's missing right And so this is a profound version of that where all you have to do is commit your brain to look at, you know, spending $500 in a single day on tokens and say, actually, like, you know, as long as I'm building something that's actually of great value to me, you know, and I'm building the right thing, I'm going to do that. Gary, I have a weird question. Do you think that in some ways, the fact that you tried to build all of this while also being the CEO of Y Combinator actually helped you? Because like your time is so scarce, you have to like try to figure out how to write hundreds of thousands of lines of code, which is like spare minutes in between meetings. unlike a full-time software engineer that could just take the time to open the website and click around it, test it. Those minutes were insanely scarce for you. And so you were constantly pushing yourself to figure out how to automate everything. Yeah. I envy time billionaires. Sometimes I look at my kids and it's like, these kids are time billionaires right now, man. You could just do things. We run across people at startup school all the time and it's like, you're a time billionaire right now. Like, this is incredible. Like you could just do anything, like learn about anything. This is so great. So yeah, you know, personally, like, I think my philosophy is I am in a crazy rush. In my brain, I'm like, probably live 10 billion lifetimes, live in this body right now. And I need every single moment to count. And then if you can token max, it's like, I mean, you can buy millions of years of consciousness, of machine consciousness. Now I can be a time billionaire. It's not my own time. It's the time of a machine, doing work for me and the human entities that I care about, working on the causes that I care about. I care about YC. I care about builders being able to build. Even in a lot of our internal meetings last year, remember in our off sites, we would talk about like, how do we teach the next generation how to use these tools? And so, you know, I'd like to, I wish that I could say like, that was all a part of the grand plan. And that's how it started. It's not like, but, you know, subconsciously, I actually think it was like, I think subconsciously from doing light cone and like talking about this stuff, like sitting side by side with Boris Churny right here was a very powerful moment for me. me because I realized like he's he started saying things that like I could do myself. It's like he said, our team doesn't write a single line of code. I'm like, oh, actually, like I can do that. And like the people who are watching right now, it's like you and I are not different. Right. We're the same. Like we started in the same place. I don't think of myself as like, you know, in the sky yet, even though people seem to talk like I am, you know, like I'm just a person trying to do a thing. And if I sit next to Boris, I'm like, you know, this guy is one of the best engineers I've ever met. But also, like, if I just open a prompt, we have the same prompt, we have the same MacBook Pro. And, you know, there's nothing that stands between like me or you or any of us from like drawing on millions of years, potentially of like tokens to like serve humanity. Well, Gary, I think that was a beautiful quote that should be retweetable. It shows... Gotta get it on X right away. You could have infinite time by borrowing the time from the machines. Yeah, what a time to be alive. That's a beautiful thought to end on. Thanks, Gary, for showing us the future. Thanks, guys. Thanks, Gary. All right, thanks for watching, and we'll see you on the next episode of The Light Cone. you