How I AI

How Coinbase scaled AI to 1,000+ engineers | Chintan Turakhia

59 min
Mar 2, 2026about 2 months ago
Listen to Episode
Summary

Chintan Turakhia, Senior Director of Engineering at Coinbase, shares how the company scaled AI adoption across 1,000+ engineers through hands-on leadership, practical tooling, and cultural shifts. He demonstrates concrete techniques for driving adoption, building internal AI agents, and compressing feedback-to-feature cycles using tools like Cursor, Linear, and custom Slack bots.

Insights
  • Successful AI adoption at scale requires a single high-conviction leader who codes hands-on, not just mandates from above—engineers respond to demonstrated results, not decrees
  • Reducing coordination overhead and eliminating meetings is a direct outcome of AI-driven velocity; leaders should measure time-from-ticket-to-user rather than lines-of-code metrics
  • Building internal AI agents is feasible for organizations with security/compliance constraints; Slack and Linear serve as friction-reducing platforms that make AI workflows viral within teams
  • Gamification and social proof (e.g., 'Cursor Wins' channel, PR speedruns) drive adoption faster than top-down rollouts; peer visibility of wins creates organic momentum
  • AI enables leaders to reclaim hands-on coding time and perform previously tedious analytical work (cohort analysis, playbook generation) in minutes, freeing capacity for strategic impact
Trends
Internal AI agent development becoming standard practice for enterprises with data/security constraintsShift from measuring engineering productivity via lines-of-code to time-to-user and cycle-time metricsSlack and Linear emerging as critical infrastructure for AI workflow integration and adoption visibilityEngineering leaders returning to hands-on coding as AI tools reduce coordination overhead and enable faster iterationFeedback-to-feature compression as competitive advantage; real-time user feedback loops replacing traditional sprint planningGamification and social channels (e.g., wins/losses tracking) driving organic AI tool adoption over mandatesPR speedruns and time-boxed shipping events breaking traditional code review and approval processesAI-powered cohort analysis and behavioral analytics enabling data-driven engineering team managementCustom Slack bots and MCPs replacing separate tool ecosystems to reduce friction and increase adoptionSuper Builder role emerging as new career path for AI-focused engineers driving organizational transformation
Companies
Coinbase
Chintan Turakhia is Senior Director of Engineering; company scaled AI adoption across 1,000+ engineers
Cursor
AI code editor used extensively at Coinbase; Turakhia demonstrated Cursor analytics and agent mode workflows
Linear
Issue tracking platform integrated with Coinbase's internal AI agents for ticket creation and context management
GitHub
Infrastructure broke during PR speedrun event; mentioned as prior adoption attempt with GitHub Copilot
Slack
Primary platform for AI workflow integration; CloudBot and feedback channels drive viral adoption within org
OpenAI
Mentioned as user of WorkOS for enterprise features; implied as model provider for AI tools
Perplexity
Mentioned as user of WorkOS for enterprise features and secure authentication
Microsoft
Referenced in context of Copilot adoption challenges and long-term AI rollout struggles
Datadog
MCP integration for CloudBot agent to access monitoring and debugging context
Sentry
MCP integration for CloudBot agent to access error tracking and debugging context
Amplitude
MCP integration for CloudBot agent; mentioned as example of org building custom agents
Atlassian
Jira and Confluence integration with Rovo AI; mentioned as platform for knowledge and workflow management
Stripe
Referenced as analogy for WorkOS positioning ('Stripe for enterprise features')
People
Chintan Turakhia
Senior Director of Engineering at Coinbase; demonstrated AI adoption strategies and custom agent development
Claire Bowe
Host of How I AI podcast; product leader and AI obsessive conducting the interview
Quotes
"It's not only possible, it's adapt or die. It's just been such a huge superpower for the team."
Chintan TurakhiaEarly in episode
"The worst thing any engineer could do is just be like, I decree you must use AI. Come on, no one's going to listen to you."
Chintan TurakhiaMid-episode
"You have to show, not tell. And so I did."
Chintan TurakhiaLeadership approach discussion
"My biggest thing is how do I make this damn thing stick? Because there's something here."
Chintan TurakhiaAdoption strategy
"No one should expect all this information is going to be perfect. Your job as a leader is to lead, right? And to make change and impact."
Chintan TurakhiaOn AI-generated analysis
Full Transcript
People are skeptical that large, established, highly technical, highly capable engineering organizations can deploy AI at scale and get any effect. But I think you've proven it's possible. It's not only possible, it's adapt or die. It's just been such a huge superpower for the team. How many engineers are we talking about here? A thousand plus. So we're not messing around here. The company tried to adopt other AI tools, and we saw this uptick in adoption. People opened it up, checked the box, did kind of like a hello world thing, but it didn't stick. My biggest thing is how do I make this damn thing stick? Because there's something here. I do think that it's really important when you're doing this organizational transformation that you have a single person with incredible conviction at the leadership level who is also hands on the metal. Show the engineers, not just tell. And the worst thing any engine leader could do is just be like, I decree you must use AI. Come on, no one's going to listen to you. Welcome back to How I AI. I'm Claire Bowe, product leader and AI obsessive here on a mission to help you build better with these new tools. Today we have Chinthan Tarakia, Senior Director of Engineering at Coinbase, and he's going to show us, yes, it is possible to drive AI adoption and higher velocity in an engineering organization of thousands of engineers. He's also going to show us the new expectations for engineering managers and engineering leaders, which is less meetings and more code. Let's get to it. This episode is brought to you by WorkOS. AI has already changed how we work. Tools are helping teams write better code, analyze customer data, and even handle support tickets automatically. But there's a catch. These tools only work well when they have deep access to company systems. Your co-pilot needs to see your entire code base. Your chatbot needs to search across internal docs. And for enterprise buyers, that raises serious security concerns. That's why these apps face intense IT scrutiny from day one. To pass, they need secure authentication, access controls, audit logs, the whole suite of enterprise features. Building all that from scratch? It's a massive lift. That's where WorkOS comes in. WorkOS gives you drop-in APIs for enterprise features so your app can become enterprise-ready and scale up market faster. Think of it like Stripe for enterprise features. OpenAI, Perplexity, and Cursor are already using WorkOS to move faster and meet enterprise demands. Join them and hundreds of other industry leaders at workos.com. Start building today. Chintin, thank you so much for joining. What I love about what we're going to talk about today is we've spent so much time talking about the individual vibe coder or the non-technical person becoming a software engineer. And still people are skeptical that large, established, highly technical, highly capable engineering organizations can deploy AI at scale and get any effect. There's still so much skepticism. But I think you've proven it's possible. And you're hopefully going to show us the way. I think it's not only possible, it's, you know, adapt or die. Like, it's just been such a huge superpower for the team. And we've gotten so much efficiency out of it. And there's just like ways to approach it. I think I was reading a tweet yesterday, just about a very, very long story at Microsoft or someone like pulling Copilot in to their organization. And it was just like just a fun tweet of just like, yep, we're going to make graph go up into the right. But like the actual adoption wasn't good. And so like I've been spending the last year just absolutely obsessing about it. And you can do it. People can do it. So how can you do it? Because, you know, how many engineers are we talking about here? A thousand plus. Yeah. So we're not we're not messing around here. This is a real team working on real products who know what they're doing, who have built great software. And so where did you start? Either culturally, from a product perspective, from a tools perspective? So I think a lot of it actually just started around this time last year. We had some changes to align the product I'm responsible for. And a big part of that was effectively rewriting the entire product from scratch, turning it from a self-custody wallet to actually a social consumer app that just happens to use crypto. and you know we're using react native um but we made a lot of decisions for a self-custody wallet but to become a consumer app you gotta like rethink everything that was one two we needed to do it in like six to nine months so we were going head to head with like the big social players out there that have multi-thousand person teams uh that have a 10-year head start and we We were really trying to just do something big and new and crazy, like absolutely just crazy. And a big part of this is like, how do we rewrite the app so that it is the best possible app out there, like consumer grade, and do it in this insane timeline? And the team is cracked. They're amazing. But like, you know, we became a smaller team as a result of some of these changes. And so I started just looking at like ways to accelerate. And, you know, like, I don't know, my team knows me well. And if you know me, like I obsess about efficiency. And I think that's like so critical to like make teams accelerate their velocity. But in ways that makes sense for tool and using the tool. So around this time, I think Cursor had come out with their sort of initial release. It was around like November of last year. We all tried it, right? 2024 and it kind of sucked and it's not like i love cursor i love cursor uh the models weren't there just the models weren't there like the models couldn't even you know really write a unit test right well and you know you're an engineer um and you understand like once once an engineer tries a tool and and they're like ah this is not so good like it's very quickly and very easy to write it off, right? It happens. And so we kind of went through this trough of sorrow of just like, okay, goddammit, AI tools are not here. The models aren't ready. What are we going to do? And for even a year prior to this event, the company tried to adopt other AI tools like GitHub Copilot. And we saw this uptick in adoption. Like people opened it up, checked the box, did kind of like a hello world thing, but it didn't stick. Right. And like my my biggest thing is how do I make this damn thing stick? Right. Because there's something here. Right. And my mental model was just always. The models will the foundational LLMs will always get better. And it's like going to the gym. You need to go and build your reps and try it. And that's okay. And the cost of doing it is like nothing. It's just a little bit of wasted time. We're not worried about compute right now because it's so early. And so like from basically January all the way to like March or April of 2025, I just changed the mindset and the mentality. I was like in cursor every single day, every single hour of the day. And I was like, how do I make this work? Right? Like, you know, it was great because I was writing code again. It was great because, you know, it was unlocking all these like use cases like we were doing interviews, like interviewing candidates and just like I don't want to necessarily write up all the notes. Right. That takes a long time. But intuitively, I like I know I've assessed. Right. So I would use it for like tactical day to day paperwork kind of things to accelerate me. But also from like a coding perspective, we just pick up bugs and be like, hey, let's try this. Right. What's going to happen? What can I learn? what are the tips and tricks to like show the engineers, not just tell. And the worst thing any engineer could do is just be like, I decree you must use AI. Like, come on, no one's going to listen to you. I have to empathize with this because I also running a large, like multi-hundred person engineering organization, you know, was experiencing even early versions of these tools and had such innate conviction that it would, of course, transform how we did work. Like, that was very obvious to me. I don't know it's obvious because of experience or obvious because it was just obvious. But then, you know, you just had these experiences as leaders, especially in the, you know, maybe 12 months ago. One engineer tries it, doesn't work. It's not just that engineer throws it away. It's everybody else says, well, I think, you know, I trust their opinion. And if they say it's not going to work, it's not going to work for me. And I do think that it's really important when you're doing this organizational transformation that you have a single person with incredible conviction at the leadership level who is also hands on the metal. because until you can say, well, I understand it didn't work for that, but it worked for these three things, or I actually figured out how to make it work for that because we tried A, B, and C. I think it's just the only way. You cannot be in philosophy. You cannot be in, you know, someday in the future, you figure it out. You have to actually get back to it. And then I think like bonus points, so many of us in engineering leadership have like been pushed away from, from coding. I know. I was so happy to get back in it. And I'm like, I just want to code again. Give me some joy. Give me some time. And so I think that's the benefit as well. And you have to show, not tell. And so I did. And I think what I learned very quickly is like, okay, there's something here. There's a there, right? And then we just started picking off one or two use cases. And the best way to get to an engineer is just give them the tools so they stop doing the shit work and so that they can build the stuff they love right right and so like we would just like pick off unit tests we'd pick off like linting all these like little things that just like paper cut and suck the soul out of you as a builder but the engineers and you know like the team just wants to move faster the team wants to build better things and so we started leaning into like cursor rules for some of these things even the simplest thing i remember like i think remember my aha moment which was like popping in some bug report working through it and then i didn't think about it i just did it i was like just create a draft pr here's the ticket here's kind of the pr delete and you know here's the pr description i want and it just did it and i was like i never need to remember get status get rebased not like why is anyone doing this anymore Like, like, what are we doing? And it took a funny thing is, it took some convincing of me to the team. Like, guys, just type create draft a PR, like create a draft PR and it'll be done for you. And like, like, well, you know, I kind of have my workflow. It's like, cool, cool, cool, cool. I get your workflow. You can modify it. You can use cursor rules. It's OK. Like no one's getting bonus points for memorizing get commands. Exactly. Exactly. And and so like we chipped away. And we put in a bunch of rules, like cursor rules. And that helps so much. And then I was sensing, I was like, okay, I have enough folks on the team that are like, yep, this is unlocking stuff. And they would post in the team channel, like, look what we had. Literally, a channel called Cursor Wins. And everyone was just posting in the channel, like, I just did 20 unit tests and then went and had a coffee. This was great. I love it. And so people started seeing it in action. And then we hit this like point. I was like, okay, how do I speed run now the whole team? There's a little bit of conviction here. So we just, and I remember this, like, I think I had landed. I was going to the East Coast. I landed for my flight, got into an Uber, hopped on like an entire team, all hands, like speed run. We call it, it was like basically cursor speed run. and I was in the Uber using cursor, putting up a PR. And the goal of the speed run was every single person would just pick up the most trivial thing. It could be like copy change, a bug, whatever, and just put up the PR. And we ended up, I think in 15 minutes, I think 100 people had joined. In 15 minutes, we ended up putting up like 70 PRs. And we broke GitHub too, which was cool because we learned like our infrastructure needed improvement. So I want to pause real quick Because again, How I AI, a little bit about tactical techniques. And you've used a couple that I have used, which is like one high conviction leader with hands on the metal that just says like, we just got to do this. Access to tools. Focus on toil. I think it's very important. You called out linting. You called out tests. Another one I would call out is like design debt where, you know, front end engineers or designers have just lived with parts of the app they hate. Yes. That is another really great one. And then a shared Slack channel. And one riff I would make on your cursor wins channel is we made ours wins and losses. And so we were very clear, just post what you did and when it worked and when it doesn't. Because when it didn't, people would be like, oh, yeah, but you could try XYZ or I have a cursor rule for you or whatever. But what I haven't heard that I want people to just perk their ears on and pay attention to is this idea of a PR speedrun, which is like do a time down time everybody boot up whatever tool and just speed run some fixes because how much conviction does an org have to get going from look i've been there like the the doldrums of like quarterly planning and this will be in four months and blah blah blah blah to just like we just got 70 prs that we've been sitting on out out the door in in 30 minutes i just that It has to be such a transformational moment for an eng team. You know, there was a success rate on those on merging those PRs. And like, it was just like, shit, this is possible. They're like everyone's eyes lit up. And it was really sort of a death to status updates long live building moment Yeah And this is the other thing I want to call out because I think you all have a really special culture there But so often we in product engineering design orgs get like really wrapped around the axle on like the rules of engagement. Like, well, I'm not allowed to build it unless the product manager says it's important. Or like, I can't really make that decision about what color that button is because design has it weighed in. And like, I do think these moments where you just break all the rules and you're like, guess what? Remember, you can just ship code you can just you can just ship code like put ai aside ai maybe enables it and makes it like a much less costly you know um expense but like just doing that is so powerful for velocity and for i also think for quality like people just take more radical ownership of things um so i'm gonna 100 steal this you should i mean i want everyone to steal it like you know i i really like the way you just put it right this is a moment where we should be breaking the rules because ai is breaking the rules for us and if we don't adapt to how like we can use it we're toast right and and we is like a very collective like whoever's not adapting is going to fall behind kind of thing right and what all of this like ends up unlocking is is like the reduction in coordination overhead so like one thing i've been obsessing about a lot It's like, okay, cool. Great. Good job on the speed run. Yes, we got a lot of stuff done. We started then seeing those wins. More and more people adopted. Brian then, you know, we were sharing some information with Brian, like how adoption is going. And then we just did a company-wide speed run. And at that moment, like there was like 800 engineers on the call and we ended up pushing up for like three, four hundred PRs in 30 minutes. And yes, again, we broke GitHub and that's fine. That's good. Like this is pressure testing. We should be designing ourselves to break the rules. Right. But the thing I've been obsessing about is like, how do you how do you measure any of this? like in terms of output right there's there's this like tension where okay the more ai we use well does that count as a replacement for people and like i'm in the camp of absolutely not ai is an accelerant right ai is an accelerant because there will always be more work to like to do right and so the way i think about it at least for for my team and what i'm pushing across the board is really like time from ticket to when the change lands to the user. Like that actually encompasses every single piece you need, right? And today, like even if you go from like ticket backlogs and stuff like that, like there's, oh, do I, should I, like you said, should I prioritize this? Is this important? Let me ask my PM or let me ask the program product manager, project manager, whatever. And now the whole team, like fast forward from back then to now, we just see someone give us feedback. And literally within like seconds, we're like, we built this internal bot. I'm excited to show you. And within seconds, like the PR is being authored, right? An agent picks it up. And within seconds, that feedback is like acted on. And so we crunch the time to action. the time then from ticket to the the pr being ready for review then the review time like all my devs complain review times take too long we found some solutions actually i think we were doing average of like 150 hours like was a cycle time for a pr review because there's so much we reduced it by 10x down to like 15 hours or so roughly and then the last piece is like from that merge how do you do like that OTA update and you squeeze that whole cycle again and then the team is like just literally unlocked with sheer velocity yeah that's it and then you get stuff in front of customers yes and then you have the velocity of like actual market ideas yes and you get that feedback and like the we're obsessing also about how fast can we take like in real life feedback yeah and then actually just fix it right then and there I think I think there is another aha moment, I was on a call with a user of our product, right? And they're like, hey, it'd be cool if you changed X, Y, and Z. And literally, while I was on the call, I just put up a PR and pushed it. And they're like, before the call ended, it was 30 minutes. I was like, just reload the app. It's fixed. Okay. Before we put this into an hour of two end product leaders being like, just ship really fast. We'll go into the merits of reducing PR cycle time, all that fun stuff. Let's actually show a couple of things you built, because I think the kind of meta commentary on you can do this in engineering organizations, there are steps to it. There are measures you can take, I think are things that everyone can learn from, but you also have been building. So let's talk about how you used actually cursor to drive how you drove this into the organization and understand adoption of AI. Yeah, for sure. I think a lot of it just comes like from honest curiosity and figuring out where the bottlenecks are, like why aren't folks adopting, how are people using it, et cetera, et cetera. I want to show you like, I think the kind of crazy thing I'm about to walk you through is like, I just got this harebrained idea. Cursor has like great analytics, right? And so you go to the admin panel, you look at the analytics and, you know, awesomely, they let you download it into csv i was like what if i just use cursor to figure out what my team is doing in terms of using cursor but not in just like from a vanity metric point of view of like lines of code committed by ai i think that's like kind of misleading actually digging more into um how they're using cursor and how do we sort of like replicate power users so let's see uh we have some data. It's in this file here. And it's just like a standard CSV from cursor that you can download from their site, like your admin panel. And then there's also here a bunch of different sort of fields. So like accepted lines, chat lines, chat lines deleted, various like data elements. But one thing like I just sort of started with, I want to understand the usage of cursor, right? And I already know we have like light users all the way to power users. And one of the things I really wanted to figure out was like, what are the natural clusters of usage? Can you find them across the team? What is the best way to cohort them? Right. And I'm just going to pick up the standard analytics file here. maybe pop in another one here. And then I love Opus High. I also love Plan Mode because it gives you a chance to like see what it's thinking through. So we can let this cook and see what it comes back with. And what I want to call out here for engineering managers or engineering leaders is this is the kind of quantitative analysis that we would all have loved to be able to do across a bunch of engineering metrics at some point, right? Like how often do we get asked by the board or our boss, like what's velocity, what cycle time, which of our engineers are super, you know, like are really on the far edge of the curve in terms of efficiency. How are our junior engineers ramping into the repo? All that kind of stuff. And that kind of analysis is actually really onerous and hard to get at because of the structure of the data and the nature of the analysis. And so what I love about just LLMs in general, and in particular using something like Cursor, is you can get to really nuanced cohorting analysis on human behavior and human analytics as a manager in a way that I think has been really challenging to do before. Yeah, I totally agree. And like the beautiful thing is now with MCPs, with data accessibility, like I think of tools like cursor as just my daily operating system if i have a question it doesn't matter if it's technical or not i just go into cursor and ask it um and so it's like super super powerful that way okay so it's asking me a little bit about like what outputs do i want i do want to enrich csv um just it makes it easier i do want a static dashboard just for fun like i'm not really trying to create a brand new dashboard right now but my main goal here is just honestly honestly like fine natural cohorts, right? And so it's going to kind of try to do light, moderate, active power, super user. It's going to look at line suggested, so volume, sophistication, agent mode, model preference, acceptance rate, and breadth. What features are they using? I'll spit out, you know, a CSV dashboard, likely generate a Python script too that I can reuse. So I'm just going to kick off build mode. While that's cooking, I do want to just maybe bop over to like, it's going to create all this stuff in Python, create the scripts for me. Awesome. But we can look here at some of the information, right? So like, this is all sort of random made up data. It's like sample data. But what it did was in a previous run, it looked at all the data, generated the Python script, which is great, super simple. And it sort of just did some like high level status metrics, like AI code percentage, again, on all this made up data, AI lines per week, composer lines. This is when you're using the agent mode in cursor or tab lines, right? When you're hitting tab. One of my team members actually got the cool cursor tab award, which is great. Great. And so it sort of breaks all this down. And then what it really segmented around was like agent heavy users, which is folks who really lean into agent usage. There's also tab heavy users. This is like a different cohort. They just lean into tab usage and they maybe want really just a bit more control and maybe haven't gotten yet used to like how to let go with an agent. You have balance users that try both and then you have sort of like maybe cursor curious or maybe not cursor pilled or you know LLM pilled right now. And so I generated this whole script. It's great. And now let me show you sort of a bit more analysis I want to do here. So let's do this. Run the analysis on, I have a sample user set and generate the HTML as well. And let's, we're actually like, this is sort of the output of the analysis script that was generated in Python, which is already cooking in parallel. Got it. So what you've done here is you've taken some raw data from cursor. You've asked one kind of agent to do a cohort-based analysis and generate a enriched CSV essentially with some data. And then you're kicking off another agent to actually do the analysis on that and generate sort of an HTML view of it so you can visualize the data. That's right. That's right. What it did was the Python script that was generated, right, it found these natural cohorts, these natural cohorts of super user, regular user, power user, light inactive. Again, this is just honestly sample data, but based on like real information, real schema, real cursor data fields. And it came up with like 70% are an agent heavy in the sample data, 20% are minimal, 4% are balanced. We have some room to improve here on the sample, right? Like not enough people are using it um and so it does a bit of a breakdown which i kind of like you know kind of a recap of metrics yeah we have a lot of lines of code in this data we have 520 power users again made up names but like this person is crushing it i want to know what this made up person gabriel diaz is doing right awesome thing here it generated a little visual dashboard nothing fancy something just really simple to look at, right? Total lines, composer lines, tap completion, a little bit of breakdown, some structuring on the tiers and usage, right? But what I really kind of want to understand is like, what is Gabriel Diaz doing, right? This made up user who's just like crushing it. Yep. How about based on the data, generate guidance for each user cohort, what they should do to advance and graduate to super user. i'm looking for explicit guidance effectively like i want to turn this into some type of playbook right so let's let this cook and then in parallel what i also want to do is i like visuals and there's something intuitive here where like as we look at the data itself right we we know that the like the path to this super user over here it's it's not like you go inactive to light to regular to power to super we know it's not linear like that right right there may be like forks from light to straight to power user regular user seems to be like balanced on the tiering but what i want to know is like what are the special things these folks are doing and how do i sort of shift the curve right and so i'm also gonna throw another question in parallel like create a mermaid diagram uh for all the different sort of paths a user can take from light to power and it's i'm assuming it's not linear and let's just see what this cooks up to okay this is really working hard really opus or five yeah opus is opus is really working hard on this but um yeah let's let's see where it goes well you know it's really interesting i'll give you a a shorter hack on this one so i think what this is generating is like an html playbook that you could share out that has things. I will tell you what I would do in this use case, and I've done this a couple of times with customer QBRs, is I say, write a Slack post that I can put in my engineering channel on a couple of these stats and how we can get people to move from A to B and it write me like a short little Slack post So I love this idea of going from something like a CSV to a really deep analysis to an HTML like visualization to like three bullet points I can send in Slack And as a manager, each one of those steps would have taken just forever to do. And now you can get them all done in Cursor. Yeah, you know, that's like kind of the awesome thing is the power of something like a workflow markdown file is huge. It's absolutely huge. And it's exactly like the thing you're describing here. Meet Rovo, your AI teammate, connecting knowledge, people, and workflows so teams can work smarter and move faster. It helps people find answers, make decisions, and automate work securely and with context through search, chat, agents, and studio. Rovo runs on the teamwork graph, Atlassian's intelligent layer that unifies data across your first and third-party apps so no knowledge gets left behind. And you always get personalized AI insights from day one. And the best news, it's already built into Jira, Confluence, and Jira Service Management paid subscriptions, so the power of Rovo is already at your fingertips. Know the feeling when AI turns from tool to teammate? If you Rovo, you know. Discover Rovo, AI that knows your business, powered by Atlassian. Get started at rovo.com. That's R-O-V as in victory, O.com. Let's see. Let's see what it came up with, right? And like, you know, the thing is like, no one should expect all this information is going to be perfect. Like if anyone is thinking, oh, wow, what is going to be my job as a leader if Kurser can do all of this? I was like, well, your job as a leader is to lead, right? And to make change and impact. And this accelerates. So inactive users, like, yeah, kind of true. You haven't installed, you haven't really used AI features yet. The hardest part is getting started. So I kind of like this. It gives like just some very simple prompts. Try the agent mode for your next task. Something very, very simple, something like, wait, try a tab completion flow. I kind of feel like the LLM really wanted to just turn this into a game, like a little quest or something. Yeah, it's gamified a little bit. Yeah, it is a bit gamified and it's kind of fun. All right, so this is cool. It's kind of given me like, this would be my Slack post TLDR. 16x more AI line super users versus other users. Let me zoom in just a bit more. More agent requests for super users. I love this. Stop typing, start shipping. It's dark mode, so the engineers will just love it. Yes, right? It's kind of perfect. and then you installed cursor but you haven't used ai yet we talked about this that's cool light mode okay i you know this like resonates stop saying fix this bug actually like talk to it like you would maybe a junior engineer right um cursor just did release bug bots i love bug bot yeah that's awesome i love i love bug bot agent isn't for hard stuff it's for everything these are like motivational quotes now but i think like we should just make posters for and put them up on the wall um write unit tests actually read the comments okay cool now power users you're good to be great think bigger and tab harder okay if cursor is listening i think this is like going to be your new merch line guys i need a hat that says tab harder yes okay so just to just to recap again we're doing we're doing a free free product work for cursor here we we took you know your ultimate problem was like how do i drive up adoption of these tools and you're like of course i'm gonna use the tool to understand adoption and then figure out ways to drive drive adoption we did analysis we created a visualization of the um the data itself you identified cohorts and power users which would have been very tedious to do if you were going to do manually yeah and then you created a hosted playbook as well as a series of motivational statements which we can either give to our friends at cursor for free or trademark right now and make a little money. Agent everything, tab without thinking, bug bot always on, iterate prompts. Love it. And this, you know, again, what I think is fun, let me talk about what I think is fun about this. One, everybody who has been in engineering leadership knows this is the kind of stuff you get asked to put in a board meeting. You get asked by your boss, like what percentage of our engineers are using cursor? Do we have power users? Are we actually getting value. And we're talking about an AI use case right now. But again, across management, there are actually measurable things you can do about the performance and efficiency of your team. Yes. And I think it's been so impossible to get before. Two, it would be no fun if you didn't get to do it with code, which you get to do with code. Actually, that is the thing. You can solve problems with just code now, right? You can just do things i i you know you're so right like i i think this i underappreciated exactly what you're saying right now and and i just want to repeat it because normally you would be asked this and then you would have to go pull an icy to do that and like what what yeah come on like no you can just do things right now and and again it's like not that i i think people underappreciate the velocity creation of a fun task. Yeah. Which is like at the end of the day, like this is silly, but also the like little fun bits of it. You're like, great. I want to go to the next level. Cause I got like a little dopamine hit from this dark mode playbook. That's kind of funny. And I think people underappreciate like that iteration speed that can just come with like a fast feedback loop when you're building something and the fast feedback loop, when you're building something that has high quality against it, which like something designed like this does so much more fun to look at than a Google Doc or a spreadsheet or a dashboard. So we did it. We did it. Again, you and I are twin stars, I think, here. And so we probably go all day on the things that we find fun. But let's go to a second use case that I think people are going to see. And let's see how fast we can do this use case, which is you were talking about the speed of feedback to feature. and you you said some fighting words out there you're like we're really compressing the time from feedback to feature so how does that actually work those were those were some fighting words um and you know i think you know this right you want you want to build this for your users right and you want to create the best damn product out there as fast as possible and the way to like make that cycle work really well is genuinely how fast you can move on feedback okay but i want to start from how does like feedback even normally come in right so you you know normal like teams and culturally like you'll have dog fooding or bug bash sessions right you'll get on a meet or get in a room keep using the product blah blah blah all that jazz and then someone has to like collect the bugs in a Google Doc and then take those bugs in a Google Doc and put them into a ticket system. Right. OK. And then there's a whole discussion around. Is this important? Is this not important? OK. Should we pick it up in this sprint? Should we wait for another sprint? And by that time, your user has turned out. They're like, you guys didn't fix this. I kind of hate it. Moving on. Right. Everyone's attention is like so, so, so short. And right now, like the whole team we're all preparing for a big launch. And we wanted to get together and do this thing called a surge. And this is where we like just bring the team together. And we do very, very long days using all this AI and just shipping like massive amounts of code. And fun fact, like during these surges, we end up shipping like more than three to four X more PR volume in the same time. But the other thing we wanted to do was bring people into the office and we set up this thing called like a feedback cafe. So we'd invite externals, internals, etc. And we'd dog food with them and we'd show them the app. And like, here's just like a couple seconds of, you know, what it looks like. We're just standing there. Collecting information, doing all this like live dog fooding. And the hard part though is especially in real life. How do you actually capture that information? Because it's voice, it's video. How do you translate it into a system? Okay, so I just spent like half a weekend and built a tool to capture feedback live. Let's just pick something. I'm going to pick a new thing. How I AI. Testing with Claire. Awesome. So let's do that. It's going to create a little session. Perfect. Very simple. And we have two modes. You can use this on your mobile phone. That's what the team did when they were in real life. But for this, I'm just going to capture some audio. And let's see. Maybe I can just hear from you a fun little bug or something of a product that you think you want to fix. So we're going to start capturing audio. There is a AI chatbot that I use where my account, when switched to business account, forces me to clear all my chats. And I think we should fix that bug so that I can access my existing chats. We're going to start capturing audio. Okay, cool. We captured it. It's basically taking the audio. I did a system prompt, sends it to an LLM. and then what we do is the prompt is basically saying go and identify the bugs yep right and then i'll create it i'm going to do one while it's processing right now i'm using the app i'm on the trade tab and i'm clicking the from field and i'm typing in numbers but the numbers are not showing up so that's not letting me make a trade so i think in our first example the audio is a little hard to capture just because it's going through the system but let's look at the second example it calls it out really clearly on trade tab typing into from field does not display enter numbers user cannot initiate a trade cool really really clean yep i hit create linear ticket it even gives like a suggested title the user journey i care about for this is trade boom i create the ticket itself awesome i pop over the ticket is all here the file is there linear is a incredible tool is doing some triaging but the thing i want to now hop over to is we're going to just create the pr so we have this tool we built in-house we call it claude bot it's actually like using all sorts of underlying models uh it's not something that is specific to to claude um so claude bot create pr I know the repo for this is wallet mobile. And here's the ticket. Oh, that's not the ticket. The ticket is boom here. Great. Cool. So I just went from a bug report to a ticket. To a PR. To the PR is cooking. Okay. So I have to pause because if you are new to how I, AI, you have not seen my signature move when I really love something, which is this. And I was doing this because I was just thinking about this little micro app that you have on the left side, which is, you know, live user feedback, totally unstructured, right? Video or audio, run a little baby LLM on it, get not only a summary of the issue, but a good recommendation on how you might fix it. very quick beep boop to linear. We love our friends at linear. I think it's a great platform for agents. And then a little custom agent in your Slack that can read those linear tickets and just execute on them. And again, so traumatized by the past, maybe, which is like this process would have been, you know, somebody manually summarizing what came out of a research session, some document being written, somebody actually making explicit decisions about what to include and not include. I think that's something that people don't appreciate. Yeah. Like no filter anymore. You don't get that like, well, you know, if I make this five pages long, no one's going to read it. So I'm really going to focus on the top 10 things. It's like, let's capture everything and then just burn through it. And then I have to ask you, why did you all build your own little bot to do this? What was the advantage of building the bot? so this this is like in-house um and we built it you know it all started um around like middle of this year i created this like i was just obsessing so much about it yeah and i was like how do i how do i create better tooling for the team for the company so everyone can be accelerated so i invented actually like i put a call out on twitter i invented this role called super builder and the single job single most important job of a super builder is to create more super builder So we hired our first super builder and we talked about some ideas. And one of the biggest things, because most of our company uses Slack, we're all in Slack. And Slack, you know, I'm a strong believer. It's just a bunch of humans pretending to be systems, right? And the cost of writing something in Slack is zero, but the cost of answering something in Slack is enormous. And most of it is noise, right? And so one of the things was just like how do we bring the workflows that we are also used to And how do we like sort of capture that and then add AI on top of it So we had like various reasons We know like lots of companies have background agents cursor et cetera et cetera We just have like different sort of security requirements right now that we just couldn launch with And that fine So we built this in-house and we have these like feedback channels, right? Hey, there's a bug here. There's a bug here. And so now all we just do is like CloudBot go and do something with that. Or if someone is like, hey, we just got out of this meeting. Here's a summarized transcript. We're like, awesome. At linear agent, go break this down into tickets. and then just like you know you know the look you you showed like right like everyone is just doing that emoji of like the head exploding right because then now we have like 20 tickets and then we do fun things like this which is just go like bonkers where we just fire off tons and tons of calls right to just and so we built this plan mode so this bot has a create pr which i'm it's cooking um it has a and also the cool thing about create pr is when it's done it will respond back it will show you a link to like the cursor branch using cursors deep link and when then the one-off build is ready it will show the qr code so you can just scan and start playing with the fix right there's a plan mode which is very much like cursor's plan mode it just comes up with like a plan and then we also have um explain as well where it's like, oh, I want to debug something. So like, why is Chintan's app not working right now? Chintan.base.es as an example, right? And it has like all the skills, all the MCPs. And so the thing I realized is context is the most important thing. So the place where we capture all of our context is linear. and then this agent that we built, we added skills and MCPs. So if we can capture context through linear, then we can trigger the agent using all the context from linear and then it goes off into all the MCPs like Datadog, Sentry, Amplitude, our internal Snowflake databases, etc. And it has the ability to pull context from the rest of the company and it can work across multiple code bases and then boom, like it's a super builder. This is awesome. And so before we move on, I think what I want to call it here are a couple of things that I hope people didn't miss. One is right now, if I can give people career advice, you want to be like the top three most AI-pilled people in your engineering organization. I'm sorry, I just have to say it. Like, you know, whenever I pulled an engineering leader aside or someone aside who's like maybe a little AI skeptical, and I said, like, I want you to lead this. I wasn't doing it. Yes, of course I want to do it because I think it has high impact on the company. But I felt like I was doing people a career favor by giving them this role. And so if you can find companies that are hiring super builders that will put you in the role of driving AI across an organization where you can learn these skills, I tell you it is an incredible benefit to your overall career. And I don't think people appreciate how much that is pretty still rare right now. So if you can find it, I would just beeline directly to it. I think the other thing, and we've seen this a couple times, we saw this Amplitude actually did it. building your own agents is not impossible for organizations. And so if you do have security, compliance, data access restrictions, you can't use cloud agents, you can't use these things, it is not impossible to build these things yourself. And there are lots of like really great SDKs out there too that you can use to do so. And then, you know, three, like I do think some of these platforms, linear and Slack, are just friction reducers to access to AI. And so So if you are thinking about driving AI adoption in your organization, like figure out how you can get the right platforms in place that can unlock access to agents. Because if you ask somebody to open or learn a new tool, it's just going to create too much friction to move forward. I think there's like one super important thing. Like this is a channel where we call CloudBot Playground. And I'm scrolling through fast just to show you like how much people are using. This was one night. I was up at like 1 a.m. just pushing this we got like 200 bugs right from this tool i showed you and i just kicked them all off in like one solid go just to get things cooking and like it was great uh let's see if a plan came out here yeah so like there's a there's a plan that comes it actually creates the plan in the linear ticket yep this the trick here why slack is because slack is how things go viral within your company. Totally. If you have pulled out the magic into some separate tool that others can't see, it doesn't happen. And so by getting things into Slack, people just like, holy shit, this is possible. Let's go. And it's like, it's really cool. I completely agree. Okay. So we have just seen about everything I wanted to see from the engineering side. But before we get out of here, I want you to spend just a couple minutes on a personal use case. Okay, let's go. I think the one that resonates probably for everyone is getting, if you have kids, getting the school emails that it's like, oh, here are 50 events that are about to land. Here are the dates. I've just started taking a picture of it and then throw it into chat. GPD can say, create the calendar invites. 100%. Right. It's like, it's the dumbest thing, but oh my God. And then the shared calendar dance happens. And it's like, it's so great. Another thing though, like I love food and wine. I really do. And like, I've done like sommelier training, et cetera, et cetera. And I, and I realized like, you know, I went to New York recently with one of my buddies. He's, he's learning about AI, but he's like, what are some of the real use cases that would resonate with me? And I was like, well, like one of the biggest sort of anxieties people have is when they go to a restaurant, they're handed the wine menu. Right. And they're like, what do I pick? What if I pick the wrong thing so uh with my friend in new york we went to some uh like champagne tasting and so like i just took notes there's like this whole notebook right i just did this like an hour ago and i was like oh here's a great producer single star means like yeah it's good and then here's another one oh see i wrote amazing by like this is someone i've actually never tried before but i loved loved their champagne. It was just super yummy. Here's another one, right? Effectively, then I just like popped this right in. And I said, here are a bunch of champagnes that I tasted. Figure out from my notes, like, what are my taste preferences? Really simple. Because, you know, like when I did like Somali classes, the biggest thing that it teaches you is the vocabulary to describe the stuff you like right and then so i just took the images it figured out the producers and this is actually like spot on the fun thing i did with my friend while i was in new york was like we were just he was he's he actually is the real life version of chat gpt and it's it's what inspired me to do this which is he's always trying to figure out my taste preferences and so you know this is like my strongest signal i i love like these wines that have very little sugar that are like really rip growing acidic. I love some aging. I love growers, right? Grower champagne, not like the big houses that are like very sweet. It even went into like a certain subcategory of like, you know, the chalky style, this specific producer that I wrote amazing buy for. And it also called out something I learned in real life, which is like, I do like Pinot Meunier, but only like with this sort of characteristic, right? Kind of crazy. All right, fine. And so then it came up with like a little bit of like a champagne profile. Cool. And if I'm buying stuff, you know, here's what I would buy. All of that's fine. Okay. Like why on earth would anyone do this? Right? Like people must be listening and be like, okay, maybe just drink a little less champagne, dude. But like the fun thing is, let's say you took, you went to a restaurant, right? And I just did this for this like example here and you just like dropped in took a picture of uh the wine menu right and it's like a big old menu some of them are like size of a dictionary some of them are simple but like you don't want to make a choice especially you just want to be with like talking to the company that's in front of you not like staring at the wine bible you drop it in and boom what it actually comes out with. And I think the prompt I asked is, what would I like from this list? What are good values? And it kind of just went through this really fast based on my preferences. And it's right. I would love this. I have had it. And it's great. And it's fun. It shares the price. Absolute no-brainer. Another example. Another example. And then it kind of gets into a bit more detail categorically. Like, look, if you want to value one and just like want a bunch of bottles, go for this. Like everyone's going to love it. If you want something a bit like more splurgy, try these. Right. And very much like it kind of talks about what why you'll like it. What I love the most always says this is the stuff just to stay away from. Right. And, you know, if it's a big night, then just go get these six bottles and call it a day. And so like that's the fun thing here for me. so what i have to call out for folks is we've actually seen not this particular use case but this flow before which is like how you reverse engineer your own taste so we saw hillary at whoop show how to reverse engineer her own taste on slides um we saw i forget somebody else reverse engineered photographic styles um ravi uh reverse engineered photographic styles and said like here's a photo, like tell me, explain to me how to describe this. But you are the first person that has reverse engineered their own taste in wines. And I love this. And now you can pick yummy stuff to get for it. You know what? Six bottle cart. I'm going out with you next time. I know. We'll celebrate AI adoption or something like that. This has been so great. I have one, two lightning round questions for you. We'll keep them very short and then we'll get you out of here. My first one is if you look back two years ago to now at work, how are you spending your time differently? How has all this changed how you personally spend your time? My calendar is empty, almost empty. The reason why is because the coordination overhead of like, hey, let's prioritize this. Let's change this. Let's change the roadmap. No, you just do things. That's one. Two, I'm writing way more code. The team knows if they're contributions fall below mine. We got to help on the AI. But look, I'm also jumping in. The team is doing incredibly hard work. I am spending way more time in the code base, fixing bugs, trying things, coming up with technical approaches. I am not a replacement for the insane amount of talented, cracked engineers on my team. But I'm able to move things forward much faster and cut through the bullshit. If AI has done anything for us, canceling meetings would be the gift that I want. Okay, my last question is, when AI is not listening to you, when it gives you a really dumb playbook for your engineers, what is your prompting technique? It depends on how many times I've tried to convince it. But generally, it's like, okay, one, you're clearly not listening to me. This is what I said. Two, yeah, I know I'm absolutely right, but like, stop being stupid. I need your help. And three, I like the nuclear option is I threaten it. And I say, Claude, if I'm using like Claude Opus for 4.5 high, like, okay, I'm going to stop using you, Claude. I'm going to switch to Gemini. And then it gets a chip together. I love it. I don't know what that says about either parenting or management style, but I think it is effective. Well, this has been great. Where can we find you, your team, and how can we be helpful? Yes. So I'm on Twitter, at Chinthan Therakia. We are building the base app. It used to be known as Coinbase Wallet. And I think by the time that when this episode airs, it will be live to the general public. use it it is a consumer social app that happens to use crypto and it's enabling creators to earn and be valued um and we're excited to launch it and we think it's like a real big paradigm shift in crypto consumer apps so give us a feedback give it a shot post uh see the magic happen and we are hiring to uh cracked front-end back-end design engineers ml engineers super builders i have two super builders happy to bring in a third one but like it is it is really really fun uh to work here on this team and and it it'll be it'll be awesome so come join us well thanks for joining us thank you this um this was such a great way to cap off the week thanks so much for watching if you enjoyed this show please like and subscribe here on youtube or even better leave us a comment with your thoughts you can also find this podcast on apple podcasts Spotify or your favorite podcast app. Please consider leaving us a rating and review which will help others find the show. You can see all our episodes and learn more about the show at howiaipod.com. See you next time.