How I AI

The exact AI playbook (using MCPs, custom GPTs, Granola) that saved ElevenLabs $100k+ and helps them ship daily | Luke Harries (Head of Growth)

44 min
Jun 2, 202511 months ago
Listen to Episode
Summary

Luke Harries, Head of Growth at ElevenLabs, demonstrates how AI automation saved his company over $140,000 annually by replacing expensive SaaS tools and agencies. He shows practical workflows for generating case studies using Granola and custom GPTs, building internal translation systems with Cursor, and connecting WhatsApp to Claude via MCPs.

Insights
  • AI-powered workflows can replace expensive SaaS tools and agencies when quality is poor and costs are high
  • Custom GPTs with detailed prompts and examples create consistent brand voice across all marketing content
  • Marketers coding with tools like Cursor can build 90% of solutions before handing to engineering teams
  • MCPs enable centralized AI interfaces that reduce context switching and increase productivity
  • Everything should be treated as a launch opportunity with systematic content creation and distribution
Trends
Build vs buy decisions shifting toward internal AI solutions for cost and quality controlRise of 'vibe marketing' - AI-powered content creation matching brand voice at scaleMarketers becoming technical builders using AI coding tools like CursorModel Context Protocols enabling unified AI interfaces across multiple toolsVoice modalities unlocking new product experiences and international expansionTransition from rigid workflow automation to flexible AI agent-based systemsLocal-first AI implementations for data privacy and cost controlAI translation quality surpassing human agencies at fraction of the cost
Topics
AI workflow automationCustom GPT developmentCase study generationContent localizationModel Context Protocols (MCPs)AI-powered translationMarketing automationVoice AI applicationsSaaS cost optimizationAI coding toolsBrand voice consistencyLaunch process systematizationWhatsApp integrationInternational expansionAI agent orchestration
Companies
ElevenLabs
Luke's company that saved $140k+ using AI workflows for marketing and localization
Granola
Transcription and note-taking tool used for automated case study generation
OpenAI
Provider of ChatGPT and custom GPTs used for content creation workflows
Anthropic
Creator of Claude and Model Context Protocol (MCP) framework
Cursor
AI coding tool used by marketers to build internal solutions
LaunchDarkly
Claire's company used as example customer for case study demonstration
Zapier
Workflow automation tool mentioned for systematic case study collection
Chess.com
Example of voice AI implementation with Professor Wolf for interactive tutoring
Palantir
Referenced as example of serious, research-led tone of voice in brand guidelines
SpaceX
Referenced as example of serious, research-led tone of voice in brand guidelines
People
Luke Harries
Head of Growth at ElevenLabs demonstrating AI workflows and cost-saving implementations
Claire Voe
Host and Product Leader serving as example customer for case study workflow demo
Quotes
"This saved us $40,000 a year for the tool, so immediately canceled it over $100,000 in agency costs."
Luke Harries
"When you're editing as much as possible, try and edit the underlying prompt rather than the actual output."
Luke Harries
"I think a personal AI assistant really only needs your WhatsApp your calendar and your email. And then it knows everything about you."
Luke Harries
"Everything's a launch. Everything's a launch."
Luke Harries
Full Transcript
2 Speakers
Speaker A

When you're editing as much as possible, try and edit the underlying prompt rather than the actual output.

0:00

Speaker B

I like that you have here the URA and then gives a very specific identity and job to be done at the top of this. And then you have very specific instructions where you say you must do A, B C D and it's quite particular.

0:05

Speaker A

This saved us $40,000 a year for the tool, so immediately canceled it over $100,000 in agency costs. And I think the highlight though is just not having to deal with more SaaS, vendors, more agencies constantly trying to get other sold

0:21

Speaker B

welcome back to How I AI. I'm Claire Voe, Product Leader and AI obsessive, here on a mission to help you build better with these new tools. Today we have a great conversation with Luke Harries, Head of growth at ElevenLabs. Luke shows us how to make everything launch by making everything automated with AI. He shows us his secret flows for generating case studies and tweets on the fly, how he saved his company tens of thousands of dollars by yes, being a marketer that coded in cursor and explains what an MCP is and how he hooked it up to WhatsApp. Let's get to it. This episode is brought to you by Orcus, the company behind open source Conductor, the platform powering complex workflows and process orchestration for modern enterprise apps and agentic workflows. Legacy business process automation tools are breaking down siloed low code platforms, outdated process management systems and disconnected API management tools weren't built for today's event driven AI powered cloud native world. Orcus changes that. With Orcus Conductor you get a modern orchestration layer that scales with high reliability, supports both visual and code first development and brings human AI and systems together in real time. It's not just about tasks, it's about orchestrating everything. APIs, microservices, data pipelines, human in the loop actions and even autonomous agents. So build, test and debug complex workflows with ease, add human approvals, automate backend processes and orchestrate agentic workflows at enterprise scale, all while maintaining enterprise grade security compliance and observability. Whether you're modernizing legacy systems or scaling next gen AI driven apps, Orcas helps you go from idea to production fast. Orcas orchestrate the future of work, learn more and start building at Orcas IO that's O R K E S I O hey Luke, thanks for joining.

0:38

Speaker A

Thanks for having me.

2:43

Speaker B

In 2025. We've talked a lot about vibe coding, cursor this and V0 that. But we have not talked about enough, I think about vibe marketing. So what do you think the future of an AI CMO is in the next couple of years?

2:44

Speaker A

There's all these tools like lovable and Cursor and the rate of software production is going to go exponential, but it's not going to matter if no one's actually using your tool. And so what's important is actually getting the product into market, getting people to know about your new features. At 11 Labs we have this launch process. So basically every new feature we do, every new model, we run it through this massive checklist which is like, okay, we need to first work out what are the value props. Then we need to work out what's the core messaging. Then we need to work out who's it for. Then we need to turn that into the blog post, the X post. And it takes a lot of time, these massive launch processes. And so the thing I'm really excited for, the AI CMO is being able to go from every single new feature or new product and translating that into your entire launch process. Making the assets, making the videos, making the images, but then also going beyond the launch. So what are those then Evergreen channels that you'll be testing? And so let's say 11 labs, we launched the best speech to text model. Okay, we need to be running Google Ads for that. So then it will spin up, understand all the various keywords, spin up the Google Ads, it will optimize the landing pages. So I think this entire thing is going to change massively. And we're already using a few of these different workflows and I'm excited to talk you through them.

3:02

Speaker B

Yeah, and I think one of the best ways that companies can market is actually just telling great customer stories. And I know you have a workflow for getting case studies out. You're lucky enough to have probably tons of customers that love you. So can you walk us through how you use AI to make case study writing really easy?

4:23

Speaker A

With case studies, you sign a great customer and you now want to be able to tell the story about how they actually use your product. What we're going to do live, Claire, is I'm going to have you write a case study for 11 labs. I know you've used elevenlabs. What we're going to be using is the Tools Granola, which is a fantastic transcription tool. It's a note taking tool and we're going to use ChatGPT actually, but the custom GPT which will then translate that into an excellent blog post. And as a bonus, we write a tweet in a, in our company voice as well.

4:43

Speaker B

Okay. I'm excited and I am a happy 11 Labs customer and you didn't pay me to say that, so I did not.

5:16

Speaker A

And so what I'm going to do is I'm going to open up granola and I'm going to do a case study interview with Claire. I'm going to just ask you two quick questions and then we'll use the transcript for this. So, Claire, fantastic to meet you. And I would love to understand how you use 11 labs in your work.

5:24

Speaker B

Yes. So at my company, we have to produce a lot of customer facing live events. It's the way that we connect with developers and customers in our community. And those live events can be either in person events or they can be recorded streamed events. And we put a lot of preparation into the messaging and the way we present our products at those events.

5:43

Speaker A

So.

6:05

Speaker B

So ahead of our user conference that's coming up in a couple weeks, we actually build a script for what our product keynote will look like. And it's me, it's our CEO, it's our SVP of product, it's engineers demoing, it's a Q and A with customers. And we like to run through and rehearse those keynotes. And the rehearsals are very expensive from a time perspective. I just named 10 people that have to be in the room. We have to walk through the script, we have to record it and then listen to it later. And we're really trying to nail a very specific set of timing. You know, we only have 30 minutes or so to get all this content in. And if you are participating in the dry run, it's actually really hard to listen for. Is this good? As a third party observer of this keynote, so one of the things that I do with 11 labs that I find super useful is actually build prototypes of keynotes. So I load in the script into the, I think it's called the studio flow, and I give everybody in the company and our customers different accents. My boss has this lovely British accent, so I give him a British accent. I give myself a different accent. I pick voices for everybody in the keynote and then I actually generate a prototype, like an audio prototype of the keynote and send it around for people to listen to for two things. One, timing to make sure do we have enough content? Do we have too little content from a timing perspective? And then two, does it narratively flow and sound natural and is easy to understand and listen to because it's a virtual event and I found that that little flow, which I guess for the how I AI listeners is also a little, a mini how I AI built in, built into the flow has been really useful for us to make sure we get high quality events going.

6:06

Speaker A

Fantastic. And so what I do is, you know, basically chat through in more detail and I try and get out some concrete facts for use cases. Claire's already said she's using the Studio product and maybe you would even go as far as using voice cloning of your different customers. So we may include that in the case study and maybe you could give me a metric. So how has this driven ROI for your company? How has this doubled the revenue of customers launched?

7:51

Speaker B

Yeah, so I, I think it saves us significant amount of time internally. So there's definitely hours and hours saved in terms of iterating on something like a keynote. And then if we make it high quality, then our customers hear more about our great products and then of course we get to sell more.

8:18

Speaker A

Yeah, fantastic. So, and maybe even you, you said statistic like it saves you five hours of meeting prep time.

8:34

Speaker B

Yes, it saves me 10 hours of meeting prep time. Let's make this one a good case study for you.

8:43

Speaker A

Great. Then I click Stop Transcript and generate the notes.

8:49

Speaker B

All this time Granola is recording what we're saying and analyzing it on the back end. Now what we're getting is this auto summary based on all the stuff that I just said.

8:54

Speaker A

Yeah, Granola is super smart. I was actually speaking with one of the founders. It's pretty cool. It actually would take Claire's email use that to enrich it to work out your job title. And so it pulls in all this extra context to make these fantastic summaries and transcripts, whether it's for case studies or meeting notes. And what we then do is I've created a custom GPT which we use throughout the company and it's the ElevenLabs copy editor. And I'll just edit the GPT so you can see here what the prompt is.

9:05

Speaker B

And for those listening, a GPT is a chatgpt, sort of like customized chatbot that has content and instructions in it.

9:41

Speaker A

Yeah, you can think of it as a, as a very easy way to share a prompt. And so this is the real prompt we use. And what I've done is I fed in our tone of voice guide. And so I've said how you're an expert editor, you're writing assistant specializing the 11 Labs communication style. You must enforce American English spelling. Even though it breaks my heart, you're A serious research led tone of voice similar to Palantir SpaceX. So it goes into quite a lot of detail. Talks about preferences for types of words and then it includes some example blog posts that we've done. It includes some example tweets that we've done. So different case studies and tweets that we're very happy with. What I'm then able to do is use this GPT. I paste in the Granola summary. So I'll say create a case study of how Claire uses ElevenLabs for LaunchDarkly. And then I go, here's the summary of the call. And I find Granola normally gives the best summary as well as pulls in this extra context. And then for bonus points, I then actually copied the raw transcript from Granola as well in case it wants to grab any points. So here's the raw transcript.

9:50

Speaker B

Can I tell you what an amateur granola user I am? Which is I did not know how to get to that transcript. So little tip for Claire here.

11:11

Speaker A

Yeah. And then so I'll also just say for the Studio product, and then I hit send and I find pretty much the first time it gets something that's usable. Here we go. It's now writing that out how LaunchDarkly uses ElevenLabs Studio to prototype product keynotes. And one of the key prompts that I've given to the GPT is to make the headers like skimmable summaries of the article. So we've got cutting prep time in half, live events, prototyping keynotes. And because I've done the raw transcript as well, it pulls that one out too. I often do like one or two iterations actually just in the asking. So maybe it's including certain hyperlinks to other SEO articles, maybe it's got certain product details wrong if you want to include pricing. And then we put that live on our blog. The other thing then is if you think about, I love to treat everything I can as launches. So if you think about your case study as launched, like first of all you have to write it, but then the distribution really matters. And so what I've also done in this GPT is tell it what a great tweet looks like. Even the aesthetics of a gray tweet, like, okay, come up with a hook line and then do a few like either bullet points or a short paragraph, then do an image. And so if I go write it as a write a tweet thread for this, it will then rewrite that into a tweet summary and it writes these handy brackets around what should the assets actually be. This is the bit where I don't think we have a full AI CMO just yet, which is I'm really excited about the image generation models, the new GPT models, because then I think you will actually be able to do end to end launches and case studies by pulling this in.

11:21

Speaker B

Yeah. So this tweet generation chat right now not only writes the content, but actually puts these placeholders of what media you would need to make an effective tweet. So a screenshot or, or something like that. And so here you've gone from, I don't know, we spent three minutes where I blabbed a little bit about a use case to a very polished case study, a tweet. And then I'm presuming you're going to do three or four other things off this one asset pretty quickly.

13:11

Speaker A

Yeah. So then you would do the LinkedIn post, you maybe would write it in the style of you could have a GPT in the style of your founder's tone of voice. So then you basically paste it into that and he's got the asset too. And then the way I'd always zoom out and think about this is how do you actually put this into a workflow? So I think the best growth systems you can do these one off efforts but things get busy, your time gets taken up and really how do you build it into a system and so concretely set up a zapier such that each time you get a close one deal in your salesforce it sends them an email with your calendly link and you just get booked. Fantastic. Different customers, maybe it's a month into their contract where all you have to do is rock up, have a nice chat with them. You can even get GPT to summarize or pre prepare what the different bullet points and topics you should cover, chuck it through your granola and then chatgpt flow and you'll be turning out five case studies a month in no time.

13:43

Speaker B

This is a great flow because I often find things like case studies or little marketing assets are easy to make, but you have to remember to do them. And if they take time, you know, you get put in a meeting or you have to pass it to somebody else and you just sort of forget and you slow down the next steps and then you produce less assets. So I think it not only makes it easier to produce the assets, but it makes sure that that engine keeps going because you as a human are not responsible for that, that next step.

14:44

Speaker A

And A common theme I think we touch on in one of the next examples as well is like when you're editing as much as possible, try and edit the underlying prompt rather than the actual output. And so if you're like, ah, it always does headings which don't, you know, maybe they're not particularly strong or I like more numbers or I like more concrete stats, make sure to incorporate that back into the underlying prompt.

15:15

Speaker B

And on that point, can we go back to the GPT just for a quick minute? Because I'd like to call out some things that I think you do pretty well here in the prompting that I think folks can learn from. Okay, so from a prompt perspective, very commonly everybody starts with the ura. And so I like that you have here the URA and then gives a very specific identity and job to be done at the top of this. Basically making sure that copy that comes out of 11 labs matches, matches the strategy or matches the tone of voice and the brand. And then you have very specific instructions where you say you must do A, B, C, D. And it's quite, quite particular, which I think is nice. Some folks I know love very general prompting, but I find that if you have a point of view of what your tone of voice should be, this sort of like very precise formatting prompting is, is very important. And then you've broken down those instructions by types of content generated. So you have instructions for tweets, instructions for blog posts, and then the last thing at the end, which I also think people underutilize as good examples. And I have a question, do you use any bad examples in here? Is it all good examples?

15:40

Speaker A

I mean they were comically bad when I was coming up. We do actually use. For the next workflow, I'll show you, we do actually use bad examples for that for translation. But a bad blog post is clearly a bad blog post. And actually if I was one extra thing I found sometimes I think to draw back the learning from the granola team, give it as much context as possible. So if I was to extend this further, I think I would give it a lot of information around what's the core messaging for each different product that we want to get across and really nail. And then it knows when I'm doing the different interviews. Oh, the Studio product, we really want to emphasize how you can do multi dialogue complex speech. Then it would draw that out too.

17:00

Speaker B

Yeah. The other thing that I think people worry about is that like AI on top of AI on top of AI becomes very lossy. And I like the Idea that you use the granola summary, but then you also use the raw transcript. So then you have both sort of the high level summary as well as some raw context. And because these contact windows are so big, the chat can make sense of it.

17:48

Speaker A

And without doing the raw transcript, you wouldn't get any of the lovely quotes as well from what the customer exactly says.

18:12

Speaker B

I didn't even think of that. Okay, well, this is. I'm going to steal this workflow. This is so great and so fast. And I love your philosophy of everything is a launch. So that's a really good way to think about things. This episode is brought to you by Retool. There's a huge gap between impressive AI demos and AI apps that deliver real value inside your business. While most AI solutions can only generate text, Retool lets you build apps that take meaningful action by connecting directly to your business systems and data. With Retool, developers combine the power of code with the speed of visual building to create AI powered tools that solve real problems. No more writing endless integration code or building UIs from scratch. The results speak volumes. The University of Texas Medical Branch increased diagnostic capacity tenfold. Amazon's Genai team uses Retool to make complex AI accessible to enterprise customers. And Ramp saved $8 million while boosting efficiency by 20%. That's why over 10,000 companies from startups to Fortune 5 hundreds trust Retool as their AI app layer. Retool. Because AI should do more than talk. It should work. So I know you had another use case where you were using a external tool or some sort of tool and you actually just built a solution that saved the company quite a bit of money.

18:17

Speaker A

Yeah, so this one was for 11 labs. We're in a whole bunch of different countries and it's very important to us that we localize all our content. And so we want our homepage to be in Hindi, in Spanish, in German, in Polish, in Japanese. And I set out about this process of how do you go about localizing the website? And I spoke to loads of the top experts and apparently what you're meant to do, you set up a very expensive localization tool. So the one we chose, I won't name the name, but it was $40,000 a year and it quickly went up or they kept on trying to push it up. So you're now paying $40,000 a year for this tool. And then the tool, you then need lots of humans inside to actually do all the translation work. So then you're getting agencies which you're paying about $100,000 for. And we set up this flow. There was actually quite a lot of engineering work to connect it to our cms, to connect it to our code base. And I was like, okay, fantastic, I've done all that, but the AI translation is terrible. And then we found out that agencies and the humans were terrible because you're constantly playing this cost game of trying to minimize the cost so you can't get anyone who's any good. And meanwhile, we have AI, which is like utterly taking off. And I had this situation where my team kept on sending me screenshots back from ChatGPT, being like, oh no, this one's better instead. And I'm like, well, if we're just using ChatGPT for the reference of what's better, why don't we just use ChatGPT for the whole thing? And so I've got this Figma board where I've kind of laid out in a bit more detail what we started with and what we went to. But basically we ripped out this entire tool. All of the agencies wrote a very small server where all it does is take. The string has a prompt per language explaining what the tone of voice is for that language, and the context sends it back, whether that's into GitHub or Payload. And this saved us $40,000 a year for the tool, so immediately canceled it. Over $100,000 in agency costs. And previously we were waiting days to get the translations back, whereas this is now instant and if anything is very sensitive, like, say, our pricing page, we just have one of our team, so we're already a decently large team, one of our team, just to do a quick sense check. And if anything's wrong, again, we update the prompt rather than the source code as much as we can to make it make it better.

19:51

Speaker B

So you just replaced this tool. And what I think is so interesting is I have this debate internally as somebody who provides SaaS software, you provide software. And I think one of the existential threats in the SaaS industry is the cost of building going to zero. And I talked to so many people and they say teams will never build this internal. Like, why would you? But then what I think you're showing is it can actually be quite cost effective and improve quality to think about building these tools yourselves. And did you. I mean, who built this? Was it the engineering team? Like, who actually, this.

22:27

Speaker A

I did the first 90% in one day and then I got one of the engineers to help you. So I literally built it all in cursor. I was actually ill. So I was meant to be skiing. I was lying in bed and I was having to deal with the fact that we had just gone through three different agencies who didn't meet our quality bar for translation. And I was like I cannot be bothered to get a fourth agency. I mean I'm just going to rewrite it all. So I did the bulk and then one of the fantastic engineers on our team, he helped get into production. I think the highlight though is just not having to deal with more SaaS vendors, more agencies constantly trying to get upsold. And my broader, the broader question of like, is all SaaS dead? I think no, but I think human in the loop SaaS, like if your job is about putting low skilled workers in some sort of flow, which translation is, I think that's very risky because just the AI and like at the moment we still do need a little bit of like every week or two we have someone just give a quick scan, is it all great? And they do make little tweaks, but give it 2 years time. I would much rather bet on AI costs getting cheaper and the quality going up rather than paying for more agencies.

23:05

Speaker B

So maybe I have three, three takeaways here. One is you really should reconsider looking at build versus buy on some things, especially if you're not satisfied with the quality of the buy. It's worth worth the investment. So I think that's, that's thing one. Thing two is look out, your marketers are going to hop into cursor and get it 90% done and then hand it to you engineers. So you might as well do it yourself.

24:22

Speaker A

They'll be making more work.

24:49

Speaker B

Yeah, they'll be making more. And three, do you know how many software products I have built out of the frustration that I'm supposed to be on vacation but I am actually sick. That is like the perfect, the perfect time to get something new done. So winter season is a highly productive season for me because I'm always sick.

24:50

Speaker A

You need to get them in the multi year deals not ending in a holiday.

25:11

Speaker B

I have to ask, do you feel like this is. Should we, should we. SaaS eats its own tail. Would you ever productionize this and sell this to others? Is it very custom?

25:14

Speaker A

No, I think we're gonna open source it. So there's a couple of, there's a couple of things that I'll just show you. If enough people in your YouTube comments say to open source it, we're open source it. But just to add this basically summarizes what it was, is you have all your code in GitHub, you have your strings, and then you push it into the Sass tool. And the biggest issue truthfully was they didn't allow you to edit the prompt of the AI and so there was just no way the translation would be any good. Understand your brand language, understand the glossary. And they quoted us about six months before they were going to ship the ability to edit prompts. And I think it's because the whole business model is based on no, get these humans in it. And so instead what we shipped was this whereby it's just a GitHub action which understands runs every time you change the keys in your translation dictionary, sends it to an LLM with a prompt per LLM and saves it back. And that works way better. And then the same with our cms, we just built into it a translate button which again, just sends it in. And it was so nice just having one source of truth. All the text is either in your, well, two, I guess, either in your CMS for the blogs or it's in the code base for your core pages. Also, another shout out. This is not a paid advertisement cursor. But we wrote this cursor rule which does all the left. One of the things with the engineers was like it's quite annoying to have to extract all your strings. And so we wrote this one cursor rule where you just say translate the strings and it grabs them all into this en JSON file, nicely wraps it handles server side or client side rendering. So that was pretty fun.

25:22

Speaker B

So I'm making the first request. Everybody in the comments as well, I would like this open source because I would 100% use this, use this flow. And you know, I think people get scared when you hear, you know, human, the loop is out. But I do think there's this opportunity for folks to operate at a, at a higher level of their craft. And so, you know, this is not the fun part of translating is taking string A into string B. Then you can start doing things like does this match localized style? Is this appropriate? Is this how we want to talk to our customers in this region? And so I do think there's this ability for humans to then add a layer of quality and use of their intellect and skills that is higher level than this.

27:12

Speaker A

Yeah. And the really cool. So we spoke through the cursor rule. The GitHub action is here, which is basically instantly on each push. It generates that. But exactly as you said, we just have this prompt file per per language which talks through again, our brand Guidelines, translation style, keywords. And the cool thing is you can define that and like you can really take the care and the nuance exactly how you want to represent your brand and then you'll be pretty confident that that's then scaled up across any content of empressing out in future and before. Truthfully, we weren't planning on translating, you know, every blog page, but now like you actually can do and it's a much better experience for your users.

27:59

Speaker B

Have you tested putting these language specific prompts in the language itself?

28:43

Speaker A

So we explicitly decided not to because I wouldn't have then been otherwise able to vet that. We were consistent, but I did. I'm not sure if you saw that tweet. I did, but I did say that recently of basically someone asked a doc system a question in Arabic and it replied well, because the docs are in English, I'll reply in English. It's like no, you. You do definitely want it to reply in the language the language you want.

28:49

Speaker B

I love this and I love that you built built most of this. And is the maintenance cost very high?

29:18

Speaker A

There's none. There's none. The. Well, it hasn't broken so far and it's just because it's just a GitHub action which is updating the JSON strings. So great. Yeah.

29:23

Speaker B

Well this is super useful and a good self customer story. You saved yourself $40,000 so you know, give yourself a case study. Okay. And then we saved. I'm not saying the best one for last, but I love this last example. One, because you get to explain to the audience what an MCP is for those that are still confused by the concept. And two, I think you built a really fun one. So you want to Talk about your WhatsApp MCP, what an MCP is?

29:39

Speaker A

It's a model context protocol and so it's a protocol written by Anthropic which enables anyone to expose tools to AI agents. And so the example and why I built the WhatsApp one was we all get tons of messages, we're all in tons of different WhatsApp groups and it's really hard to stay on top. But currently if you ask a tool like Claude, it has no idea about any of your WhatsApp messages. It can't help you out. And so what? Again, actually that same weekend Natozil I did one, I did the Saturday was ripped out the ripped out our translation software and then the Sunday was okay, can I actually connect WhatsApp to my AI system using an NCP and it's part of My broader thesis of basically like, I think a personal AI assistant really only needs your WhatsApp your calendar and your email. And then it knows everything about you. And they can even organize tasks, send emails, you know, organize dinners, dinners with your, with your friends. So that's why I built it. And there's also some like cool use cases for work as well, which we can jump into. So let me show you. Cool. So this is the WhatsApp MCP repo. And how it works is it has two main parts. So it has a bridge which it pretends you know WhatsApp web. When you sign in, you scan that barcode. That's exactly how it works. So it actually pretends to be WhatsApp web. And it uses fantastic library called what's meow to do this. So when you run it in your terminal, you scan your barcode and the first thing it does is it downloads all your messages onto a local computer, saves it in a SQLite database. And what that means is you can then keep querying it as much as you want with an AI and you have no risk of, or very low risk of being banned because it's only downloaded it once. So yeah, to be fair, this is unofficial stuff. And then the other bits, the WhatsApp MCP server, which basically gives the ability for your AI to query this SQLite database as well as like sending messages, sending voice notes. So if we jump over to claude

30:07

Speaker B

and this is the desktop CLAUDE instance.

32:30

Speaker A

Yeah, desktop claude. They've also now chipped the ability for you to run it from Claude AI if you host it too. And what I've got is I've got this MTP called WhatsApp here and I can just type into the chat, so what are some recent messages WhatsApp I received? And what that will then do is it will use the tools that the WhatsApp MTP exposes and then summarize and use back that information. And here you can see it's talking about ElevenLabs new features. And so Eleven Labs is launching conversational AI agent or a new speech to text offering, as well as a few tests like how are you? And hello world and a few examples of like why you may actually want to use this is I'm in a whole bunch of different WhatsApp groups. And what you can do is you can use it to summarize oh, what over the last week were people actually talking about? And often now some of the best way to keep up with trends or the best thoughts on new tools. They're all in these WhatsApp groups with hundreds of messages per day. And so if you're looking for a way to get an edge on Twitter or LinkedIn, you can say summarize the thoughts on 11Labs and the messages that will then search the messages that you've received on WhatsApp, which is talking about it. And then you could take something like this, chuck it into your GPT, which is ready trained on your tone of voice and generate a tweet thread for you.

32:33

Speaker B

Everything's a launch.

34:19

Speaker A

Everything's a launch. Also to plug it. If you're interested in how we run launches, the other blog post which goes through all the different sets that we use as well.

34:20

Speaker B

Yeah, we'll link to that in the show notes. Just recapping this for folks that are still have their mind blown. So you built this MCP which you've open sourced, which again is unofficial but friendly, implemented in a nice way. You download this code, run it locally in your cli, it does a one time pull of your data. So if you want it updated, do you just run and refresh that again or does it pull. How does it.

34:31

Speaker A

When you start it, it will pull all the recent messages between the last time it ran and now and then whilst it's running it will also receive any new messages.

34:59

Speaker B

Got it.

35:08

Speaker A

So actually I can also. And I can use it to. So send a message to. And then I'll do a phone number and then potentially if my phone's not on silent. Yeah, so that just came through saying hello and then you can also use it. The cool thing about MCPS as well is you can connect to lots of MCPs at once. And so I have an 11 labs MCP installed so I can use it to. You could generate a voice roundup. So you generate text to speech of this? Yes. So you can stitch these all together.

35:08

Speaker B

Okay. So you have this mcp, it pulls down your stuff, it gets regular updates, it's all local, so none of this stuff is going to the cloud. And then you've connected to that server or that server through your local desktop Claude. Or if you hosted it, you could do it on the web version. And then now just in this chat box you have access to all these different tools. And one of my hypotheses with AI is tabs start to go away. If you're like me, you have 500 tabs open everything along the bottom of your desktop dock and you're switching context. And this sort of centralized chat interface that can access all these tools just makes you much more Efficient and also allows you to get really creative about how you stitch these tools together.

35:49

Speaker A

Yeah. And so, and so for this one, for example, we just generated this text and then you can say send this to the phone number.

36:37

Speaker B

And behind the scenes, I mean for folks that aren't seeing this, it really is basically using natural language to select from a list of tools which hits a list of publicly available APIs that have been appropriately authenticated. And so for anybody who kind of knows what an API is, but maybe isn't an engineer, but wants to be able to say to a system, do this thing for, for me and use sort of the exposed endpoints, this might be a more accessible framework.

36:47

Speaker A

Yeah, the overall thesis with tools and agents is with, we spoke about that granola flow earlier, which was when something happens. So when a deal is closed, then send out, you know, use Zapier to send out a calendly, the person books in. Then you generate, you do the call, you take the transcript, you put into your GPT. And that's all very static and very rigid. But let's say hypothetically I actually want to do a roundup of five leading startups which are all doing this. Well, suddenly my workflow is completely broken. If you had perfectly scripted that all out in a tool like Zapier or N8N that's actually now not usable and you'd spend a ton of time resetting it up. And so the really cool thing about these chat based MTP tools is it can be much more, you know, it's trying lots of different ways to like, okay, how do I actually send this audio message? You know, on the spot we were like, okay, generate this, now send this. And the AIs as the models get smarter and smarter are able to deal with these like higher level abstraction tasks. And so a genuine one you could do is why not have your AI actually phone up Claire and have the conversation for you about that. So if you want, we could actually try that now I'm not sure if it will work.

37:20

Speaker B

Why not? Let's try.

38:48

Speaker A

So we can do that. So create a conversational AI that can do case study interviews. This will then use 11 labs to actually create an AI agent that you can speak to about case studies. First of all, it's going to list the agents.

38:50

Speaker B

I love this because you're using AI to create more AI. You're really just replicating agents on agents on agents. Then what this would do is create a specialized agent on the spot for a specific use case that then you could use to give me A little call and get a case study done.

39:10

Speaker A

Great. Yes, exactly. And so you can actually see that here it went through and created a prompt, so it's got the first message. Hi, I'm your case study interview coach. But if you go to our Twitter, you can see Louis doing this workflow where he then phones up and orders a pizza using an AI. And you just say, I would like to create a pizza ordering AI agent. But yeah, hopefully this gives the listeners a glimpse into like you can on the spot come up with these agents which I think will be more and more abstract, which can then do these tasks for you using the tools as they go. So that's the promise of mcp and transparently, it's still very early and I think most of these are tools, our toys, but I do think it's, it's going that way.

39:35

Speaker B

As a parent of kids who really like pepperoni pizza, I'm very worried about the ability to spin up a pizza ordering agent in my house because we will end up with, with a lot of pizza. Okay, Luke, this has been incredible. You made a case study with me using Granola AI and your magical GPT. We eliminated $40,000 of spend by coding in cursor as a, as a marketer. And then you built a way to use Claude or an AI to query your source of both personal information and industry News, which is WhatsApp, and do a bunch of really interesting stuff there. So this has been very eye opening to me. I've learned so many things from what you've shown me, including that everything is launched. I'm just going to keep that in my mind. But let's wrap with a couple lightning round questions. The first one is, you know, we've talked a lot about coding and text based flows. Even in what you showed, a lot of it is coding and text based flows. But I think what's so interesting about your point of view is you're starting to bring the idea of voices, input and output into, into the, the industry and into how people build products, you know, very quickly. What do you think? Kind of like voice modalities unlock maybe for product managers to think about in terms of what they're building.

40:20

Speaker A

I think there's two broad types of things they unlock, one of which is new experiences for customers which just wouldn't have otherwise been possible. One great example is if you're in education, suddenly you can make something which is way more engaging. So Chess.com shipped an app with Professor Wolf which enables you to get like turn by turn guidance on Your different chess. And so you can imagine a world where every, whether you're learning a language, you're playing chess, you're learning maths, that everyone can have this like interactive tutor. And that's kind of. Yeah, these new experiences. The other type, which I think is really exciting if you're a product manager is you probably have a lot of back office functions. And so particularly if you're an internal pm, if you look at all the places that you currently have people doing manning phones so often this would maybe be like say you're doing research collection, you're a scaled marketplace and you have large numbers of people collecting data or you're doing customer support. Well, maybe actually currently you're not able to expand internationally because your team only speaks English. Well as now you could spin up an entire team of customer support agents who are fluent in French and in Spanish and in German. So I think that's really exciting too.

41:40

Speaker B

Yeah, I love the, I love the international angle to this. It's something that I haven't heard very many people speak to. Okay, Luke, this has been so great. Where can we find you and what can we do for you?

43:08

Speaker A

Thanks so much for having me on the podcast. It's been a lot of fun. You can find me. My website is Harry's co and a couple of blog posts people may enjoy. So I've got the how to launch your products where I literally talk through the checklist that we use for our launches to go from idea to value props to core assets to distribution. I also talk about how to hire your first growth marketer and you can follow me on Twitter. That's LukeHarries and then underscore.

43:20

Speaker B

Great. Well, this has been so fun. Thank you so much.

43:50

Speaker A

Cool. Thanks so much, Claire.

43:53

Speaker B

Thanks so much for watching. If you enjoyed this show, please like and subscribe here on YouTube or even better, leave us a comment with your thoughts. You can also find this podcast on Apple Podcasts, Spotify or your favorite podcast app. Please consider leaving us a rating and review which will help others find the show. You can see all our episodes and learn more about the show@howiaipod.com See you next time.

43:56