ChatGPT Image 2 is INSANE!

32 min

• Apr 22, 2026

Summary

Julian Goldie demonstrates ChatGPT Image 2.0, OpenAI's newly released image generation tool that significantly outperforms competitors like Google Gemini. The episode covers practical applications including image creation, editing, integration with AI agents, and use cases ranging from movie posters to comic strips, with detailed comparisons showing ChatGPT Image 2.0's superior quality, detail, and text rendering capabilities.

Insights

ChatGPT Image 2.0 uses built-in reasoning to carefully plan and execute image generation rather than guessing, resulting in dramatically better detail, text accuracy, and contextual understanding compared to previous versions
The tool's 1,512 ELO score sits roughly 240 points above Google Gemini's (1,270), a gap the host describes as a "250 plus point jump" and a significant generational leap in image generation quality
Integration with AI agents and APIs enables developers to build image generation into custom applications, agents, and workflows, expanding use cases beyond direct user interaction
The quality improvement is particularly noticeable in text rendering, color richness, composition, and ability to understand complex creative briefs without explicit instruction
Accessibility across all ChatGPT tiers (web, iOS, Android) and availability via API democratizes professional-grade image generation for businesses and creators
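
To put the ELO gap quoted above in concrete terms, here is a minimal sketch that converts the two ratings into an expected head-to-head preference rate. It assumes the leaderboard follows the standard Elo logistic formula with a 400-point scale, which the episode does not confirm; the function name is illustrative only.

```python
def elo_expected_score(rating_a: float, rating_b: float, scale: float = 400.0) -> float:
    """Expected score of A against B under the standard Elo logistic model.

    Assumption: the image leaderboard uses the classic 400-point scale.
    """
    return 1.0 / (1.0 + 10.0 ** ((rating_b - rating_a) / scale))


# Ratings as quoted in the episode.
gpt_image_2 = 1512
gemini = 1270

gap = gpt_image_2 - gemini
win_rate = elo_expected_score(gpt_image_2, gemini)

print(f"gap: {gap} points")
print(f"expected preference rate: {win_rate:.1%}")
```

Under that assumption, the 242-point difference means the higher-rated model would be preferred in about 80% of head-to-head comparisons.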

Trends

AI image generation moving from novelty to production-ready tool for professional design, marketing, and content creation workflows
Competitive consolidation around reasoning-based AI models that plan before executing, improving output quality across modalities
Integration of image generation into broader AI agent ecosystems and no-code/low-code platforms enabling non-technical users to build visual content tools
Shift toward multi-modal AI workflows combining image generation, video creation (Veo3), and web design (Codex) in single platforms
ELO scoring and benchmarking becoming standard for evaluating and comparing AI model performance in image generation community
API-first approach enabling developers to embed image generation into custom applications rather than relying on web interfaces
Emphasis on image editing and refinement capabilities alongside generation, allowing iterative creative workflows
Text-in-image rendering becoming critical differentiator as use cases expand to posters, comics, infographics, and branded content

Topics

ChatGPT Image 2.0 capabilities and features
AI image generation quality benchmarking and ELO scoring
Comparative analysis: ChatGPT Image 2.0 vs Google Gemini
Image generation API integration with OpenAI platform
AI agent integration with image generation tools
Prompt engineering for image generation
Image editing and refinement workflows
Use cases: movie posters, comic strips, logos, landing pages
Codex 2.0 integration with image generation
Multi-modal AI workflows combining images and video
Professional design applications of AI image generation
Text rendering in AI-generated images
Image generation for web design and UI mockups
OpenClaw and Hermes agent setup with image APIs
Accessibility and availability across platforms and tiers

Companies

OpenAI

Released ChatGPT Image 2.0, the newly launched image generation tool that is the primary subject of the episode

Google

Google Gemini's image generation tool used as primary comparison point, previously considered the best image generato...

Anthropic

Claude AI model used for prompt engineering and generating optimized prompts for ChatGPT image generation

People

Julian Goldie

Podcast host demonstrating ChatGPT Image 2.0 capabilities and conducting comparative analysis with competitors

Quotes

"ChatGPT Image 2 is by far the most powerful image generator that I've ever used. I was a big fan of Gemini previously, but we can compare these side by side and I'll show you the difference between them because this is so cool."

Julian Goldie•Early in episode

"The old way was like you have these old image tools, you typed a description, the AI guessed what you meant. It produced something that was sometimes close, sometimes wildly off. With the new updates the details are what makes a difference here."

Julian Goldie•Mid-episode

"ChatGPT image two came in at 1,512. So that is a 250 plus point jump on the ELO score. And in terms of ELO, like that gap is massive, right totally different league."

Julian Goldie•Benchmark discussion

"I think like it's one of those apps where you're gonna, you're still gonna be discovering what you can do in five months time, if that makes sense."

Julian Goldie•Late episode reflection

Full Transcript

Today, we are going to be looking at the new ChatGPT Image 2.0 update, which is the most powerful AI image generator ever. And so far, I've tested it. You can see an example over here, and it looks absolutely amazing. Look at that. We created a movie poster for The Last Noodle, and it's just looking amazing. So I'm going to guide you through exactly how it works, how to use it, multiple different ways you can use it, how to connect it to your AI agents, how to generate amazing images, how it works step by step, and why it's so good. Let's get straight into this. And if you're watching live, feel free to ask any questions as we go along.
So you can see an example of the image that we actually generated over here. Now, we can change the aspect ratio straight away and we can go from there. If you're wondering what this is, this is GPT Image 2, and it's available in ChatGPT. It is by far the most powerful image generator that I've ever used. I was a big fan of Gemini previously, but we can compare these side by side and I'll show you the difference between them because this is so cool. Look at the quality and the detail on this. It looks amazing. If you want to see the prompt that we used for this, we actually said, "Generate a hyper-realistic movie poster for a film called The Last Noodle," right? It's a drama where the world's final bowl of ramen is being carried across a wasteland by a retired sumo wrestler in a small cap. And it just nailed it. It just nailed it. Look at that. Even like the tagline and everything like that. It's just amazing. So if you've never tried this before, don't worry, because it just dropped a few hours ago. That's probably why. But we can compare this side by side with what was probably the best image generation before, which is using Gemini, and we can generate these side by side, right?
So we've got ChatGPT over here with Image 2.0, and then we also have Gemini over here, and we'll test them side by side and see which one performs the best. I would suppose that Gemini isn't going to come close, but let's see what happens next. So if you're wondering about this, how to use it, how it works, et cetera, you can see a bunch of examples here. The quality and the detail are super nice in these images, to the point where, how many designers could create something like this? Even just to do the photo shoot alone would take like a full day, right? Whereas you look at these images and it's just absolutely amazing. I'm going to show you some more prompts and how to use it, even just super basic stuff. So for example, here we said, "Create an image of a cat," and it looks super realistic, right? The way that it's generated, it's obviously a 4K image as well, super high quality. You can get it on the API too, which means you can build image apps around it. And I think you can use it inside Codex 2.0. We'll try that later to actually build out some image apps using this. But amazing stuff so far. Really powerful, really cool, and it's looking absolutely awesome.
So Tantrum says, "I just updated it. What's new about it?" If you just look at the quality of the designs, it's way nicer, right? If we actually compare this side by side, this is Gemini, which up till yesterday was the best image generator in the world, and we compare that versus this. Which one looks a bit more realistic? Which one looks more detailed? Even just compare the colors side by side. It's amazing, right? And then even if you look at the headline that it came up with, look at that versus this. This is a way more interesting headline, right? "The world's gone hungry. One bowl remains." And then it's got a little "coming soon" section here. This looks way more interesting. And so it's way, way more powerful for generating images. And also you can get this on the API.
OpenClaw just released an update today, OpenClaw 4.21, which means that you can generate images inside OpenClaw. You can also use Hermes for generating images with GPT Image 2, which is more powerful than ever before. All right. So let's try some more prompts here and we'll see what we get back. If you want to see some other stuff, you can see some example thumbnails that are generated here, and it pretty much nails it first time around, every single time, right? These images are super nice. You can use it not just for generating fun images or whatever, but you can see examples here. And also I like the fact that you can just switch between the images on the left-hand side. It's super easy to manage everything, and you can see how easy it is to create an awesome image in literally a couple of prompts, as you can see right here.
So let's test this out now. We're going to get some more prompts. I've got a full guide over here that we can try, and we'll try this side by side with ChatGPT Images, right? So we've done that one already. Let's try this one. We're going to try and generate a comic with this. So we'll say, "Generate a full eight-panel comic strip about a goldfish who slowly realizes he's been the smartest person in the room the entire time, including the dialogue, emotional beats, and a twist ending." And we'll see what it comes back with there. It's also pretty quick for generating the images too. And again, we can compare this side by side versus Gemini as well. So I'll do the same thing inside a new chat on Gemini, and we can see which one performs the best. Even the way that it generates images now looks quite different. You see what it's doing, what it's working on, and then it has this kind of pulsating animation in the middle. Bear in mind as well, you can always turn these into videos later. You could take the image and plug it into something like Veo3 to generate the video itself.
So let me show you an example of that. If we want to download the image itself, we can download it here. And then we can go back. I want to see which one generates first, actually. But in the meantime, we can go to Gemini. We'll grab the download here, and we'll plug that into videos. And we'll just say, "Create a video based on this," and see what we get back. So this is Gemini's output right here, which is not bad. Pretty basic, pretty nice. Again, Gemini is a lot faster. It's using the fast API to generate that. So it's using Flash, I think, to generate that. And then this is the ChatGPT version, right? So which one looks nicer? Which one's more detailed? For sure, it's this one. Now let's have a look at the actual text inside this. I want to see which one is more interesting. Yeah, for sure, this is way better, right? It just looks a lot cooler. We've got the image generating over here, by the way. That'll take a couple of minutes to take the image that we generated with ChatGPT and turn it into a video. But in the meantime, look at the quality difference between these. Even just the colors feel a lot more rich and realistic when you actually look at them side by side.
And again, you can quickly generate a different aspect ratio for this. This is one thing I really like, the customization. Sometimes when you're using an AI image generator, the problem that you'll find is that it will generate in the wrong aspect ratio, and then you have to manually tell it what to do. Whereas if you select the top right now, you can say, okay, make that square, or make that landscape, or make it a story, or ultra-wide, or whatever. I think 99% of people are probably going to use landscape or vertical, right? But you could easily use this for ads and that sort of thing. It'd be quite easy to generate this. And even the quality of the comic here would have been impressive six months ago. This would have been super impressive on this side, right? But now you look at ChatGPT, and you look at the image quality, and you look at how it's put together, and it's just so much more interesting. Even the way the content is formatted and the text inside it is really professional. What they've done here is pretty amazing.
I want to have a look inside Codex now. Codex is the ChatGPT super app where you can do everything inside there. So what would be quite interesting is if we just check for updates. First of all, we might have to update this to the latest version. Let's see if we need to. And then if you're not familiar with Codex, basically what you can do here is... yeah, there we go. We've got an update there. So what you can do inside Codex is you can design apps, you can design landing pages, you can design websites, et cetera. Now, why is that useful with something like GPT Image 2? Because the quality of the outputs, the quality of the UI and the front-end design, is only going to be better if you're using better image outputs. And if you can generate those directly inside Codex, even better, right? So we'll wait for that to update in the background, and then we'll open it up in a second.
In the meantime, let's try out another test here. So far, as you can see, on both occasions ChatGPT's quality of images is way nicer. Ah, thanks very much, Tantrum. I appreciate that. Thank you so much. Macosa says, "Wow, ChatGPT is coming in hot." "100% ChatGPT, better and easier to read." I would agree. "GPT is better. Looks more real." Yeah, 100% agree with that. "Codex equals Visual Studio Code." Nice. It's the app directly. "What about creating logos?" All right, awesome. Let's do that. So the way that I'm going to do this is, what I've actually done is I've trained Claude on how to prompt ChatGPT over here. And I think this is one of the best ways you can do it. So I gave it some information and documentation on how to prompt ChatGPT Images to get better outputs.
And that's how we got such good outputs from, for example, this movie poster over here. And so what I'm going to do inside Claude now is I'm going to say, "Okay, come up with an amazing prompt for a logo for my agency, Goldie Agency." And let's see what it comes back with here. So basically I'm using Claude Sonnet 4.6. I tell you what, I've been testing out 4.7 recently and I've got to say, I still prefer 4.6. It just seems to get worse and worse, especially if you're using old prompts. Particularly for writing, particularly for humanized writing, I find Sonnet 4.6 is still way better than Opus 4.7. So I'm going to use that, and I've switched to that model over here. And then we're going to take this prompt, and we'll try it on Gemini and we'll try it on ChatGPT at the same time. So by the way, the video couldn't be generated, as you can see. So we'll go over here inside Gemini, and we'll plug this into ChatGPT on a new chat. I would recommend, each time you generate an image, just start a new chat, because unless it's related to the previous image, you don't want to be working with that context, right? Sometimes that can mess up the image that you're trying to create. So over here, we have Gemini. Over here, we have the logo being generated inside ChatGPT, and we'll just see which one performs the best.
Now, whilst we're waiting for that, there's a bunch of new updates, as you can see right here. So basically you can edit existing images as well as creating new images. So you can plug in an image and say, okay, change this, edit that, et cetera. And once you enter your prompt, ChatGPT Images generates the image how you want it to be done. Now, ChatGPT Images 2.0 is available on all tiers, so everyone can use this. So images with thinking, this is interesting. You can use it with thinking mode, which means it adds more reasoning to the image. And by the way, Gemini is finished.
You can add thinking and reasoning, available on Plus, Pro, and Business, right? So if you're using ChatGPT Images, it's available on web, and it's available on the iOS and Android apps as well. And then you can edit images here. We'll come on to that in a second. So if we come back to ChatGPT, I would honestly say at that point this is slightly more elegant, but they're both similar, actually, to be fair, for logo generation. Let's try another image here. I'm going to say, "Create another one that's more interesting plus fun," because there wasn't much difference between both of those, to be fair. So we'll try that.
Codex is not opening up. All right, let's see. Why is Codex not opening? I'm just going to close it and then try reopening it in a second. There we go. All right, let's just make sure we're on the latest version, which I think is up to date. All right, great. So let's just try something here. I'm going to say, "I'm working on a new version of my website. Could you help me generate a better image design?" And then I'm going to take a screenshot of my existing website. We'll paste that in and say, "Can you make a better version of this?" Let's see if we can trigger Image 2.0 to generate here.
All right, so now we have the images here. Honestly, still, I think for logos, because logos are so basic, this one is slightly... do you know what, I wouldn't even say there's a big difference between them. I just prefer the black background, but that's not a big deal. I would say for logos themselves, I don't know, I think they're about even right there. There's not a big difference between them. All right, let's keep going with some more tests. Whilst waiting for Codex to generate, we'll stop that. We'll say, "Can you generate an image mock-up first?" We'll come back to that in a second.
All right, let's try another one now. In the meantime, let's talk about what ChatGPT Image 2 is, right? So basically you can describe a picture in words, and then in seconds, or within minutes, that exact picture shows up on the screen. That's what you can do with ChatGPT Image 2. Now, the thing to note here is that Image 2 is the latest version of OpenAI's image-making tool, and right now, in April 2026, it's the best AI image tool ever released, by a big gap. Now you might be thinking, okay, how far ahead is it really? So let's talk about this. When it comes to image models, the AI community uses something called an ELO score to rank image models, right? It's kind of like a football league score. So higher score equals better model. Now the previous number one, Google Gemini's image tool, that sat at around 1,270. ChatGPT Image 2 came in at 1,512. So that is a 250 plus point jump on the ELO score. And in terms of ELO, like that gap is massive, right? Totally different league right there.
Now if we look, for example, at the quality of this and compare that to, let's have a look here, this one, right? You see how this just doesn't look quite as real, whereas this one is a lot more interesting. It's got a lot more character. Even the way it's sitting is pretty funny, like the way you've got a screw working right here. And then if we have a look as well, the headline is much easier to read. It's got a cool little headline over here. It just looks a lot nicer. This kind of still feels like a stock image, particularly the background as well. And then the way that he's sat, it doesn't look quite as good as this, right? So the details are better too, as you can see.
Let's try another one. Let's just wait. Let's have a look at what we got in Codex now. Ah, here we go. So we got an image mock-up now. So it's generated. It can actually take images of you. So it's taking that original image that we had here, and then this is a new mock-up, right? And that looks a lot better, to be honest. Even just the thumbnail itself looks way better than the original, right?
Now, bear in mind, a human originally designed this thumbnail here, and it doesn't look anywhere near as good as that, right? This looks a lot more professional, a lot more stylistic, etc. I would actually use that on the video, and I think it would look better for sure. And then the rest of the page, there's not a big difference between them, but I think it just looks a bit more clean, a bit more interesting, that sort of thing. So you can generate the images using this method. Let me put ChatGPT over there. And it looks a lot more clean. Now if you want to actually build something with this, then you can go directly into Codex and you can say, "Okay, create this as a landing page for me," and it can actually go off and just create that for you. So you can generate the image of the mock-up, and that's how you can use it to build tools, to build websites, to build landing pages using Codex 2.0. And it would just go off and build that for you. So super nice right there.
Now, also one thing to note here is that Image 2 actually uses reasoning, right? So it has a built-in reasoning system. It reads the prompt carefully, plans what the image should contain, considers details you didn't even mention, and then it will sketch it out, right? The old versions would guess what you want. The old way was like you have these old image tools, you typed a description, the AI guessed what you meant. It produced something that was sometimes close, sometimes wildly off. And then text inside images, quite often it wasn't that good, right? It was hard to read, or sometimes it didn't make sense, or sometimes it wasn't that smart, etc. And it was really the details that dropped off in the old updates. With the new updates, the details are what makes a difference here. So it just looks way cleaner, way nicer, and better resolution as well. So let's try some more prompts here.
So we're just going to let Codex go off and build for us in the meantime. Let's try another prompt. We'll plug that in, and we'll edit that out. I'd also be interested to see, if we start the stopwatch, how long does it take? How long does that actually take to generate an image? We'll have a look here. So we've got the image being generated on the left-hand side, and we've got the timer on the right-hand side. So we'll come back to that in a second. And you can see here, it's sketching out. So it actually thinks about it, and then it creates the first draft, and then from there it goes off. Now, you can also ask questions about it, or you can ask it to change things. You could also add multiple elements to it. So you can actually, for example, upload another image and say, insert this icon or insert this logo, et cetera, inside the image itself. So it's quite easy to edit the image. It's quite easy to add finishing touches, make it better, etc. So that is basically finished now. And how long did that take? That took 43 seconds. You can see it's looking pretty nice, right? So we said, "Generate a hyper-realistic movie poster for a four-hour slow cinema film called Concrete that is literally just about concrete, directed by a fictional legendary art house director, with awards and critical quotes," etc. And if we pull this up, the only thing I don't like is, even on a big screen, it's quite hard to zoom in on. It's pretty cool though. It's funny when you read those quotes.
All right, let's try another one. It's nailed pretty much everything we've given it so far as well, which is pretty impressive. So let's try this one: "Generate a six-panel comic about a medieval knight who discovers Wi-Fi in the forest." So these are the rankings right here, as you can see. The ELO is 1,512 for GPT Image 2 medium, then Nano Banana 2 is 1,271. Grok Imagine is down at 1,170, which is not too far off Nano Banana 2, but it's still not in the same league. And look at the gap between Image 2 and Image 1.5, right? And that was high reasoning. So 1,241 versus 1,512, right? Big difference. Big gap between them. Let's have a look at this one. Yeah, it's pretty nice, isn't it? Even the elements and the way the story is told, how interesting it is. It looks really cool.
Now if we come back to Codex, let's see what it's doing. So there we go. We've got the index.html ready to go. Let's open it up. And there we've got the actual page itself. So it created this whole page, generated with Codex. Honestly, this is an interesting one, because when it actually comes to coding and building out a page in the UI, Claude is superior, right? Especially Claude 4.6. But the image itself generated inside Codex is way nicer, right? It actually didn't do a very good job of implementing this style into this page. Look at the difference. So for building inside Codex, it's not bad, but it's not going to do the same work that you wanted, right? Look at the difference between the image mock-up it generated versus the actual page that is created in reality. It doesn't look as nice. The fonts are not as nice. Even the design, like the title is not centered. Super weird. I don't know why it did that, but you could always go back inside the chat and say, "Okay, make that better." But yeah, for basically taking the image and then implementing it into a live page, I don't think it's that good. It doesn't look that good to me.
So that's it. We did five different tests right there. Looks really cool. Let's really test it now. Let's try and give it some detailed prompts and see how it performs right here. So we're going to take that. This is a bit more detailed, right? So, "Generate a hyper-detailed fantasy world map." See how it performs there. I also like the fact that it just picks up automatically that you want to generate an image. So you don't even need to select images, which was annoying before.
Like sometimes you'd have to select the image option to generate that. Whereas with this, it just does its magic directly. And I think we can generate multiple images at the same time. The other crazy thing about this is I've not hit any limits, and I've been testing this a lot today. I would have expected to hit some sort of usage limit, but so far it's working out pretty well. This one actually failed to pick up that it was an image, so I just want to be 100% honest with you there. So it seems like you can generate multiple images at the same time. I've generated five in one go there, and it doesn't seem to struggle at all. Let's have a look at this map. The quality and the detail is good, right? Look at that. Very detailed, a lot of information on that page, but it seemed to handle it pretty well, really.
I think also what would be fun is you could use this for game generation, right? So for example, here we could go into Codex, create a new chat, plug that in. We could ask it to do this and then actually create the game around it, right? So we could say, "Generate a character selection game," and then it will start working on it. And then we could actually say, "Okay, turn that into the game. Actually create it."
So let's see what we've got here. You can see how we said here, generate a realistic page for reviews, etc. And it generates it pretty nicely. Look at the quality. Again, it's all about the details. The stars, the difference here, etc. It's a LinkedIn profile for Biscuit. It looks pretty nice. The fact that it actually knows what a LinkedIn profile should look like, and then it generates that, is amazing, right? And then it's got "People also viewed" and links to all these other dogs. And look at that featured post here: "Some days are rough. Remember to take a breath, step outside and throw the ball. I'm always here for you." And Biscuit has named himself as an "emotional support specialist and ball retrieval expert." It's so good. It's so good.
All right, let's see what else we've got here. This is a WhatsApp chat between Summer and Autumn where they're leaving voice notes to each other. And then we got this as well, right? It's so cool. I think like it's one of those apps where you're gonna, you're still gonna be discovering what you can do in five months' time, if that makes sense. Now you can also select different parts of this and say, okay, change this, or make it more interesting, or add volcanoes, et cetera. So you can edit different parts of the image that you select. You could always do that before, but I think the outputs that you'll get from it now are going to be way smarter and better than previously.
Let's have a look at what questions we've got here. "Are the generator dependent upon display quality or can you tell?" I think display quality always helps when it comes to design, but you can tell pretty easily. "How hard is it to put a barcode on there?" Pretty easy. I think you could easily add a barcode. Just upload what you want and then set it up. "Sometimes limits the dial back, off-peak." Yeah, I'd agree. But yeah, really cool what you can do with this. The quality of outputs, I've never seen anything like it.
All right, let's try some really visual stuff, some cool stuff that just looks amazing visually. So we're going to try this now. One thing I will say is, when I was testing it out probably about three hours ago, it totally failed on me, right? Whereas now it seems to be really responsive. Sometimes when these new image models drop, they struggle or they limit you, etc. But I've not seen that with this release, especially when I'm using it right now. Now, if we come back to Codex, let's see what we got. Oh, it's actually created it in... that's interesting. So it's quite hard to trigger inside Codex.
If you want it to generate an image for you, I think you specifically have to say, "Create an image for this first," and then go off and do it, right? Now what would be interesting as well is we could take, for example, an image of my book like this, right? And then we could go over to that chat we had on Claude previously, plug this in, and say, "Create some amazing prompts for this," e.g.gc the extra shot. We'll see what it does in a sec. All right, here we go. So we can take just a sort of mock-up of the book and we could say, "Generate a realistic image of someone in a cafe actually reading." So let's see what we get back.
Now we've got the new output here. Oh yeah, so it's added the volcano there. So look at the old version, and then it can insert volcanoes and make it more interesting. So it's pretty cool for editing stuff too. I'm going to close some of these, but wow, they're pretty cool. Interesting. So it can generate painting-style images. This is pretty nice. Yeah, it generates some really cool stuff. This is really cool. Look at the quality of that. It doesn't look that realistic, to be fair, but it looks awesome. It does look awesome. All right. And let's have a look at this image. Nice. All right. So this is someone reading the actual book that we gave it. But to be fair, the design of the book is different to what we actually gave it. So it did fail on that test, really. But at the same time, I don't think I pasted in the original book. Let's have a look here. I'm just going to paste that in again just to double check. So if we take that image, say, "See image attached." I think it actually just created a mock-up for the book out of thin air, which is pretty impressive, to be fair. So now we'll take that and we'll see if we can generate something more realistic here.
In the meantime, you might be wondering, okay, how do you give this to your agents? How can you get your agents to use it?
So here's how I would do it. We're just going to get this running. I'm going to set up OpenClaw with Ollama. Just quickly get it running. Here we go. Then we'll go over here. We'll test this out. Make sure it's working. Yeah. So OpenClaw is there, right? And so let's say, for example, you want to give the OpenAI image API to your AI agent so it can generate images for you. Here's how I would do it. You can set this in your .env file if you're a bit more technical, but I'm just going to do it this way to make it easier. So we're going to go to the OpenAI Playground to get the API key, which we can get over here. Just log in.

Oh, look at that. Yeah, it uses the image perfectly. That's super nice. So basically, it took the image of the book, then turned it into an image of someone holding it in there. So the laptop looks super nice. That's great. So let's try something else out now. Claude is great for prompting it, to be fair.

All right, so we just need to get an API key. By the way, you can use this inside the Playground as well. So you can select GPT image here, go inside the images section once you've logged in. It's available on platform.openai.com, and then you can go from there. But I'm just going to grab an API key and I'll delete this after the live stream. I will copy that and then we'll go over here and we'll say API equals this. And then we need to get the documentation for ChatGPT Image. Just make sure you don't get the old one. You need to get the new one. And with the latest update from OpenClaw, it should be running, right? So you want to make sure you update to the new version. And then from here, we're just going to grab that and we'll say, can you use this for image generation inside OpenClaw, just for generating images. There we go. All right, so you can also access Image 2.0 natively in Hermes Agent. To update, you just go to Hermes update inside the chat and then select your image generation tool with Hermes tools, right?
So that's how you can do it, and then it's going to ask, do you want to set up a proper tool or just do this setup. So I'm just going to go with number one. And then if we want to set this up with Hermes, let's try that now. OpenClaw tells me it's set up, but I have a feeling it's not going to be set up. Let's see. We'll test it out. So then we're going to update Hermes, as you can see here. So that's beginning to run now. And then once that's done, we should be able to use Hermes tools to set up OpenAI inside there as well. So you can see it's just updating right now. So I think pretty much all the AI agents got right onto this in terms of getting it set up. So we've got OpenClaw working on the image over here. We have Hermes updating. We have the documentation for image generation over here.

Now let's talk more about Image 2 and what it means and stuff like that. So number one, the text in images is much better now, which is great. Number two, it has an insane level of detail, right? So it can generate way nicer diagrams. It could do images of posters or newspapers or pixel art grids, et cetera. Even a receipt or something like that, it could create a good image of. It's also good for image editing. So you can actually change the style of an image quickly as well, and that sort of thing. And it can understand context that maybe you didn't explain properly, right? Because it has really good reasoning built in, and that sort of thing.

By the way, if you want the prompts from today and everything else, I'll put that inside the AI Profit Boardroom. Link in the comments and description, or go to AIprofitboardroom.com. So from here, we're now going to go to Hermes tools to get this set up. And we'll just configure an existing tool. So I think it'd be under this section. Ah, yes. All right, cool.
So if we go to Hermes tools, reconfigure, then we go to vision, and then we type in the API key that we want to use. So if we go to API keys over here, we'll copy that, paste it into the terminal here. That should be done. All right. Now if we run Hermes, let's just see if it can generate images now. Looks like OpenClaw is struggling here, but when is OpenClaw not struggling? And then I'm going to get the documentation again and paste it in. Someone says, "In love with Multica." Yeah, it's really powerful. It's a really good tool, especially for using with your AI agents. Such a cool idea. I really like Paperclip as well, that's a good option. And then you can see here that it can access image generation now. So you can get it set up for you and then generate images for you from there. So basically what you want to do is give it the API key and also give it the documentation on ChatGPT Image, and then your agents can go off and create images for you.

So that is basically it in terms of how to use it, how to generate images with it. Some of the images are absolutely amazing, as you've seen today. You can use it inside Codex, you can use it inside your AI agents, you can use it inside ChatGPT directly or inside the Playground as well. You can get an API key for it now too. And it's pretty simple and easy to get it set up.
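For anyone who wants to try the ".env file plus API key" route by hand rather than through an agent, here is a minimal Python sketch of what that could look like. It is an illustration, not the setup shown on the stream: the `OPENAI_API_KEY` variable name, the `load_env` and `build_image_request` helpers, and especially the `gpt-image-2` model name are assumptions, so check the current OpenAI image generation documentation for the real model identifier before running it. The request is only sent when the script is run directly.

```python
import json
import os
import urllib.request

# OpenAI's image generation endpoint.
API_URL = "https://api.openai.com/v1/images/generations"
# ASSUMPTION: placeholder model name, not confirmed in the stream.
MODEL = "gpt-image-2"


def load_env(path=".env"):
    """Parse simple KEY=VALUE lines from a .env file into a dict."""
    values = {}
    if not os.path.exists(path):
        return values
    with open(path, encoding="utf-8") as fh:
        for raw in fh:
            line = raw.strip()
            # Skip blanks, comments, and lines without an assignment.
            if not line or line.startswith("#") or "=" not in line:
                continue
            key, _, value = line.partition("=")
            values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def build_image_request(prompt, api_key, model=MODEL, size="1024x1024"):
    """Build (but do not send) a POST request for one generated image."""
    payload = {"model": model, "prompt": prompt, "size": size}
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


if __name__ == "__main__":
    # Prefer the real environment, fall back to a local .env file.
    key = os.environ.get("OPENAI_API_KEY") or load_env().get("OPENAI_API_KEY")
    if not key:
        raise SystemExit("Set OPENAI_API_KEY in the environment or in .env")
    request = build_image_request("Someone in a cafe reading a book", key)
    with urllib.request.urlopen(request) as response:
        body = json.load(response)
    # Print only the top-level keys; the exact response shape is not assumed.
    print(list(body.keys()))
```

Keeping the key in `.env` rather than pasting it into a chat also means there is less to clean up afterwards, although deleting a key that has been shown on screen is still the safe move.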
I think for 99% of people, they're just going to use it inside ChatGPT. But either way, great tool, really good. Super impressive that OpenAI have come out with something that's useful and better than most of the stuff that's come out from them recently. I wasn't that impressed with Codex. I think Codex generating images or generating pages from it, I don't know, there's something about GPT-5.4 that I don't like for building tools, but that's just me being honest.

And if you want to get the full guide from today, you can get it inside the AI Profit Boardroom. Link in the comments and description, or go to AIprofitboardroom.com. Inside the community you can ask questions, get help and support, and meet other people using ChatGPT and AI agents. Inside the classroom you can get all of my new daily tutorials, like you can see. And if you want to get the full guide from today along with all the prompts and everything else, I'm going to plug that into the AI Profit Boardroom, as you can see right here, inside this section with the full guide and prompts.

And then what I'll also do is I'm going to take the information from this and I'm going to say, "create a prompt based on your knowledge of ChatGPT Image 2 that helps people generate amazing prompts for their images, just like you've done with me", right? And that's the way that I would recommend using it. Claude Sonnet gave me some super detailed prompts, as you can see right here. Super creative as well. Even the "Last Noodle" trailer was really cool. And so if you want to get something like that, that you can plug into Sonnet and then it can generate prompts for you, I'll add that here. So this is a prompt for generating image prompts. And that's the way that I would use it. Because the thing is, if you just write the prompts yourself, they're not going to be as good. And the reason for that is, as humans, we just don't have the same knowledge on prompting AI.
And then also this is trained on all the documentation. It focuses on the details, the style, the lighting, the shots, the background, et cetera. And so what you can do is paste that into Claude like this and then say, okay, generate a prompt for creating an awesome ad of my book, right? And then you insert your book, you just plug that in and you're good to go, right? That's how it would work. So feel free to get that. That's inside the new advanced daily tutorial section right here in the classroom. And inside the calendar, you can also jump on calls with members where we go deep on this sort of stuff. You can share your screen, share your setup, ask questions. You can meet people in your local city who are doing similar things to you as well, inside the map here. So if you're using AI, if you're using it to scale your business, et cetera, there are loads of business owners inside here. And then, yeah, you can check that out. Link in the comments and description, or go to AIprofitboardroom.com.
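The "prompt for generating image prompts" idea can be sketched in a few lines of Python. This is not the actual prompt from the guide, which isn't shown in the stream. It is a hypothetical template built around the aspects mentioned above (details, style, lighting, shots, background), and the `build_meta_prompt` name and the wording of the instructions are assumptions. The output is plain text you would paste into Claude or another chat model along with your image.

```python
# Aspects the stream says a good image prompt should focus on.
ASPECTS = ["details", "style", "lighting", "shots", "background"]


def build_meta_prompt(goal, aspects=ASPECTS):
    """Return text asking a chat model to write one detailed image prompt."""
    bullet_lines = "\n".join(f"- {aspect}" for aspect in aspects)
    return (
        "You are helping write a prompt for an image generation model.\n"
        f"Goal: {goal}\n"
        "Write one detailed image prompt that explicitly covers:\n"
        f"{bullet_lines}\n"
        # Illustrative extra instruction, prompted by the book test where the
        # generated cover differed from the one supplied.
        "If a reference image is attached, keep its design unchanged."
    )


if __name__ == "__main__":
    print(build_meta_prompt("an awesome ad of my book"))
```

Changing the `goal` string is all it takes to reuse the same template for a poster, a comic strip, or a product shot.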