OpenAI just released a new feature called 4o Image Generation. It's built into GPT-4o, letting you generate realistic, detailed images directly inside ChatGPT. It can accurately handle text in images, and you can even use your own images as references.
I'll cover 10 practical use cases in this video and compare it to DALL·E 3.
You can read the full blog post here: https://openai.com/index/introducing-4o-image-generation/
Table of contents (3 segments)
Segment 1 (00:00 - 05:00)
ChatGPT has a brand new way of making images. I've been using it all day. I have 10 examples I want to show you, 10 use cases. This is called 4o image generation. So, it's a native way for ChatGPT to make images. It's no longer using DALL·E. So, I've been using DALL·E for a long time. DALL·E 1, DALL·E 2, DALL·E 3. Those are now removed from ChatGPT. And I want to show you this new way of making images, which is a massive, massive improvement over the old way. Now, this is available on all the paid plans, like the Plus, the Pro, and the Team versions. It is not yet available in the free account though. Okay, let me start showing you some examples and some use cases. The very first use case is making a product mockup right here. So here's the prompt that I used. And I'm not going to read through the prompt, but in it I gave exactly what text I want, exactly where I want the text, the details of the box over here, and all kinds of different ways I want the colors to look. And look at the details of this thing. Here's exactly how I spelled chocolate in the prompt that I gave it. And it followed every single ounce of that prompt perfectly to a T. The next example was creating a mockup for a website, the entire banner. So, I wanted to create one for a resort that is trying to get people to book a vacation. And I gave it all the information here for the menu that I want. I told it I want center text that says this, everything about the website. Okay, look at the details of this thing. And here's the really cool part. Let me go back. I got this in square because I didn't tell it the size. So, I followed that up and asked it to create this in 16:9 format for me, and it just kind of stretched it out. Every time you click on one of these right down here, you can ask it for a revision, or to remove something, or to replace something, and it will go ahead and do that. So, if I just wanted the background and all this removed, I could do that. 
If I wanted to change the text right here, I could go ahead and give it a prompt right down here to do that. Okay, for this next use case, they actually created a very realistic looking photo with a lot of text. Okay, so if you need something with a lot of text, and a couple of the ones I showed you had a little bit of text, in this example they gave a ton of text here and described this whiteboard, and this is the image they got out of that, reflecting the Bay Bridge here in San Francisco. And the text was exactly the way they had it in the prompt. But a lot of times I need to test these out on my own. And here's the version that I got. So the person's in a slightly different spot, but the prompt did not specify that, and it got all the text right here. And you can see the Bay Bridge here in the reflection, and the person taking the picture. Now, let me show you how far this has come. So we had this prompt book here for DALL·E 3. And these are different prompts that we had that generated these images. So I took this exact same prompt from DALL·E 3, which was the image generation platform we had inside of ChatGPT literally two days ago. That's now removed and replaced with this native ChatGPT image generator, and this is the prompt I took. So here, let me zoom in. This is what this exact prompt gave us with DALL·E 3, and this is what we got inside of ChatGPT with 4o image generation. Here's the next one I had. This is a wide photo inside of a grand library. But DALL·E always made things look a little bit unrealistic, right? It didn't look like a photograph. It looked more like an illustration, right? But look at this one. Now, this is so much more photorealistic compared to what we were getting with DALL·E 3. Okay, I tried it with this one from the DALL·E 3 prompt book. This was a close-up photograph of a vibrant butterfly, with more details about the flowers. So this is DALL·E 3 here and this is the new image generator inside of ChatGPT. 
I mean, this is clearly a world apart. Now, DALL·E was not at all good at making any type of portraits or any type of human faces. So this one right here is a close-up photograph of a young athlete. I mean, there are all kinds of issues. It's much more of an illustration than any type of realistic photo. But this was a photography prompt book, and this is the best I was getting out of this one. Okay. So, yeah, this is what we got now. This is what we had before. I mean, it's not really comparable. I'll just show you one more example. Here's a vintage portrait of a person where I said full-length portrait. It didn't quite even follow the prompt that was over here in DALL·E. And here's what we got. It follows the prompt a lot more closely. It looks a lot more authentic as a vintage photo. Okay. Now that I've compared it with DALL·E 3, let me show you some more practical use cases here. I make a lot of YouTube thumbnails, obviously. So I wanted to turn this into a YouTube thumbnail. I said make a YouTube thumbnail of this person. Cut them out of the background. Put a techy and blurry background instead and have him hold a glowing Open
Segment 2 (05:00 - 10:00)
AI logo. Okay. So that's me. That's the prompt, which it followed almost exactly, but I don't look quite like me. I mean, it's in the same world, but that's clearly not me, right? So, then I said, "Well, no, make it exactly the same person." Well, that's a different person. So, it was pretty close, but you could tell it's not quite me yet. So, I think for thumbnail generation, it's not quite there yet, but I tried to turn myself into a wizard. I said, "Well, make me a wizard with a wizard hat and a robe and a magical expression." I wanted to see how it kind of thought about that if I said magical. Now, it's more cartoony looking and it's still not quite me, but it's getting a little bit closer when it looks a little less like a photograph. On this one, I said replace all the logos with the top seven AI company logos. So, now this is a little bit more than just image generation. It has to go find logos, figure out what the top seven are, and it did a pretty good job. It created eight logos, but you can see Anthropic, OpenAI, Google, this is Hugging Face, Midjourney, right? It got a few. I don't know what a couple of these are, actually. But it kept everything else the same. Again, I don't look exactly like myself. It's in the ballpark of myself, but not quite right. Then I said, "Hey, you cropped it." Sometimes it crops the left side. It did not fix it in that case, and it kind of made it a little bit worse. It's pretty close, though, to actually being able to generate entire YouTube thumbnails. The fact that it could cut someone out of the background, generate a new background, find logos, and get the text right, that is a huge leap. Now, the next use case is coming up with infographics, because it's so good at text. I asked ChatGPT to create a prompt for me and I kind of described what I was looking for. I was looking for an infographic to show the evolution of video games. And first ChatGPT gave me this prompt where it put these video games, including some stuff that's coming out later. 
A next-gen cloud console, which is a concept. So some real ones, some concept ones. Now look at the details of this thing. Look at the amount of text that it had to get right. It got almost all of these right. By the way, I think there was a little bit more to this timeline, but to fit it here, it looks like it started from the beginning and took us all the way to this concept next-gen console. But I mean, it even got the shapes of these things. Like, look at the Nintendo right here. Now, the next prompt I tried was to create a meme. So, I asked ChatGPT to make one up for me. It created this prompt for me. And the first time around, it cropped it. So, you couldn't quite get it. I just clicked here and said, "Hey, you cropped this. Give me the text without cropping it." And it actually fixed it for me. It kind of stretched it out so the text fits perfectly, right? And it's ready to post. Literally, I could click here, download it to my computer, and upload it to social from there. Now, the next use case is changing the style. And look how well this did. I gave it this thumbnail. I said, "Turn the person on the left into a cartoon, but keep everything else exactly the same." And look at this. It kept things exactly the same. It kept the background, even kept a little bit of that OpenAI logo here that I had, but it turned me into a cartoon. That's incredible. Now, the next one is for creating graphic mockups. I actually wanted to see if it could mimic something famous like the cover of Time magazine. So, I said create a high-end Time magazine cover featuring a confident individual. And in the prompt, I said "photo to be inserted," and I forgot to insert it. I'll show you the interesting thing that happened. The prompt went on: looking directly with a visionary expression. And then I said, surround that person with the logos of the top AI companies. This is what ChatGPT just chose for me when I was crafting the prompt. And then here is the exact text I wanted. Okay, let me scroll down. 
Okay, look how well it followed that prompt here. It's an exact Time magazine cover. It put the logos here surrounding the individual that I asked for. It put the right text over here. And it put a person right over here, which I didn't actually intend to be me. I was going to put someone else here, but it actually looks a lot like me. So, I don't know how it figured that out, because I did not include any photos. The placeholder just said something would be inserted. It did not do that. But that looks a lot like me. So, I don't know exactly what happened there. And then I was like, "All right, well, let me try. Let me give it this picture and say, put this person in, but have them crossing their arms." Okay, everything looks the same. The logos are surrounding me, but it did not get my face right. I guess it kind of looks like me, but again, different person. It did follow the crossing the arms part of the prompt
Segment 3 (10:00 - 11:00)
and it got the rest of it right, too. There were a couple of limitations that I noticed and wanted to point out. So, DALL·E 3 actually got this one right back in the day. Create a wide photograph of a person standing in front of bright sunlight, using that to kind of create a warm glow around them. Well, every time I tried this, for some reason, the sun goes through the person. So, that's not the way that should look. So, there were a couple of limitations I came across. Very rarely, though, did it do something like this where it wasn't following my prompt. The other limitation was that from time to time it would crop things. And then there's the fact that it just couldn't get my face exactly right, which for my use case of creating YouTube thumbnails is a big enough limitation that it kind of takes that out of the equation. I will still have to do it the old-fashioned way using Photoshop, but it's getting a lot closer now. And if you're trying this out for yourself inside of your ChatGPT account, this is a lot slower than DALL·E used to be. It's slower than any image generation platform I use. I use Recraft, I use Midjourney. Those are a lot faster. So maybe that's just related to the fact that this came out about 24 hours ago and I've been pretty much using it all day. And I'm inside of my Pro account. It's even a little bit slower inside of my Team account. So, you do have to be patient with it. That's pretty much most of the day here. That's about all I was able to generate so far, but hopefully that does speed up. Now, as soon as I get more time with this, I'll update our prompt book and I'll make some updated videos about how you should prompt this new model. Thanks so much for watching. I'll see you in the next video.