OpenAI Just Perfected AI Image Generation (See the Comparison!)
23:40

OpenAI Just Perfected AI Image Generation (See the Comparison!)

The AI Advantage 25.03.2025 66 087 просмотров 1 986 лайков обн. 18.02.2026
Поделиться Telegram VK Бот
Транскрипт Скачать .md
Анализ с AI
Описание видео
Want to save time with AI? Get my FREE newsletter and instant access to 700+ AI use cases updated for 2025: https://myaiadvantage.com/newsletter OpenAI just surprised everyone with this new AI image generation model. Seems like it blows the competition out of the water. Let's test it and talk about it The official OpenAI Blog Post: https://openai.com/index/introducing-4o-image-generation/ Links: https://community.myaiadvantage.com/c/ai-app-ranking/image-tool-rankings Chapters: 0:00 What Just Happened 0:41 First Impressions 1:30 Release Details 2:03 How to Generate Images 3:51 How to Edit Images 5:08 Tips & Tricks 9:36 Comparison 22:45 Closing Thoughts #gpt4oimage #imagegenerator Free AI Resources: 🔑 Get My Free ChatGPT Templates: https://myaiadvantage.com/newsletter 🌟 Receive Tailored AI Prompts + Workflows: https://v82nacfupwr.typeform.com/to/cINgYlm0 👑 Explore Curated AI Tool Rankings: https://community.myaiadvantage.com/c/ai-app-ranking/ 🐦 Twitter: https://x.com/IgorPogany 📸 Instagram: https://www.instagram.com/ai.advantage/ Premium Options: 🎓 Join the AI Advantage Courses + Community: https://myaiadvantage.com/community 🛒 Discover Work Focused Presets in the Shop: https://shop.myaiadvantage.com/

Оглавление (8 сегментов)

  1. 0:00 What Just Happened 131 сл.
  2. 0:41 First Impressions 176 сл.
  3. 1:30 Release Details 107 сл.
  4. 2:03 How to Generate Images 364 сл.
  5. 3:51 How to Edit Images 271 сл.
  6. 5:08 Tips & Tricks 925 сл.
  7. 9:36 Comparison 2617 сл.
  8. 22:45 Closing Thoughts 169 сл.
0:00

What Just Happened

whoa whoa so we just got a spontaneous open ey live stream and they unveiled their new image generation model um this is a capability that is present within chat GPT on all tiers including the free one today and I think a lot of opening eye releases are very Niche and for specific audience and not everybody is going to make use of them this is not one of those I think this is really one of those moments and one of these tools that well it's not just going to be accessible to a wide audience but also useful so that's why today in this first look video I'm going to show you various things first of all we're going to have a look at um the generation and
0:41

First Impressions

how it works just to get you quickly into it then right away I'm going to follow up with some use cases I mean look at this it can do it can fune the model off of one image I just gave it this and boom um it turned me into a firefighter with a simple prompt then we're going to look at more examples at some of the special capabilities here because there is some like doing text like this and then I have a bunch of benchmarking prompts that I already ran and will be comparing it to some of the top models so here in this quick concise video I'll give you my first thoughts on how this performs comparing to all the other models out there like image and free and flux and mid Journey how to use this that's where we'll start and finally I'll leave you with some thoughts on this model and on this release and how it slots in into the AI landscape now let's begin with the
1:30

Release Details

first thing the facts I mean this one is pretty straightforward it's a new release inside of cat GPT that rolled out immediately so you can access it now it's accessible to everybody on the plus pro team accounts and also on the free account so if you just have an account with open AI you can use this thing right now sure there is restrictions and those always change over time so I kind of um Shi away over time I shy away from like mentioning those because they will update them but you can just log into chat gbt and use this now the way
2:03

How to Generate Images

you use this is as simple as it could be you simply say create an image or generate an image off and then you say whatever you want to do I usually do a cat with a hat as you might know and then as you hit enter this will start loading in it is slower than the previous tool that you might be used to which was di but that was at this point one of the worst models on the market and this spoiler is probably the best one in the market okay we we'll talk about the nuances of that in a second here okay there there's a few things to talk about there but as you can see it's getting started and within about I don't know 15 to 30 seconds this will start rendering it also renders from top to bottom as you should see any second here okay so what makes this model special why should you care isn't there a thousand image generation models out there why is this one better why are you making a quick video about this and why are people talking is there going to be a thousand Freds on Twitter tomor tomorrow of ooh 10 uh gamechanging ways uh to use the new open AI image generator oh uh you know made all designers obsolete like you're going to have all of that um oh and as you can see huh it wasn't able to complete the request so maybe it has a lot of traffic anyway where was I was making fun of Twitter Fred people Fred boys right okay so let's get back to the content here so what can it do well it can generate images and not just that it can also edit images so let me show you this quick example here that I did right away generate an image of a cat with a hat it generated this beautiful image in about 15 seconds I mean hyperrealism on point okay but there's many other models who do that what else can you do well one thing that you could do is you could selectively edit things
3:51

How to Edit Images

this is something they didn't talk about so I could do this and say change the eyes to be red right so you can do this send that it's going to update it what else can you do well you can edit the images okay so this is one of the main points of this video it's not just image generation but it's also image editing so this generated an image but now I have an image and I want to keep working with that so I can just follow up by saying change the hat to something a nobleman in the Middle Ages would wear um I'm playing Kingdom Come Deliverance right now for anybody who knows so that's why this is inspired by that anyway here you have a noble cat from the Middle Ages and then I could go on add a shirt with funny text referring to cats so you know I'm using the power of the large language model this is not like interacting with my journey this is not hey I want exactly this text or type of result I'm using GPT 40's capability to come up with a funny text and then this image generation model seamlessly integrates sorry I can't I have plans with my cat perfect it's a great shirt color correction has been done and as you can see it works and this is like the big point they LED with but enthusiasts might back to differ but they're saying like this model can do Flawless image generation but we sort of had that before no and this is where my
5:08

Tips & Tricks

comparison table comes in because yeah if you look at some of these other models like imen or flux Pro or even image journey Now can do short text well long text not so much for long text you might want to go with ideogram sort of hid and Miss but it works but if you do really long text most of these will break well not opening eyes you can do things like this you can do a ticket with multip paragraphs of text immediately and it gets it right every single time not most of the time not sort of it just works as you can see here so that's pretty amazing now this is just one of the capabilities there's a few more I'll quickly name those and then we'll move on to more example more examples so another one is that it can um do work with multiple files this is also something we haven't really seen out of any other model except maybe um Google's competitor here and just here I want to show you I ran the same thing right turn me into a firefighter with an image of me this is what I got out of Google um where's the chat GPT one this is what I got out of chat GPT it's not even close right it's not even this is absolutely terrible and this is absolutely incredible now this would you would get something like this out of flux if you trained in on a few of your images but you wouldn't get something that this closely resembl from one image let me tell you that I've done many fine tunes I have many fine tunes of myself we use them in the thumbnails as we might know this just works so I don't know wins out on that also what it can do transparent backgrounds so remove the background and turn it into a PNG I'm going to be extra precise here so it works here for sure but I believe just by saying remove the background it will actually cut it out and turn it into a PNG file which you could then copy and paste anywhere else and if you're not familiar PNG files are IM image files like jpex with the background cut out okay what else well we have a few benchmarks here at the AI Advantage because as you might know we do a monthly ranking so if I just go into the public area of our community here then this is accessible to everybody for free you can just review this we sort of do a ranking for image video and llm platforms once a month this is the version for March and it doesn't include image free yet because we include things that are like publicly available and for Google Studio we just got it now before it was in like early release anyway the point is this is a ranking of all the tools okay and imag would belong in the S tier so now the tools really to beat here are mid Journey flux and imag free which is this by the way um which also has image editing M journey and flux um only do image generation I mean I suppose you can you have editing tools on top of the stuff but you can't really do things like this you can't upload an image and kind of work with it in multiple steps and expect Photoshop like Behavior that's what I mean by editing okay so how do these models compare well we have a test of a set of test prompts and I ran all of these for this new model so what I want to do now is look at these different test prompts here you have them logo design portrait photography cinematic still aerial photography book cover and comic book it's a total of one two three four five six prompts and I ran all of them through chat GPT that's this these tabs up here so let's have a look together and evaluate it but before we do that I just want to kind of look at the image of the fireman this is still creating to give you a feeling for you know how long it takes and then also this one is still creating so I suppose now it will be a little slower because you know everybody's catching on everybody's using it I ran the test prompts right after the stream but as you can see it's slowly generating here um I'm sure this will work well um before we get into the test prompts and I forgot forget actually one more thing it can do things like this you can give it your brand guidelines your colors fonts you want and it will actually use exactly those colors now this looks funky but I gave it different shades of green and a purple that we use and a gray and it used all of them and also it used The Prompt I told it to so it can you can fine-tune it on one image like this and look at that the firefighter is coming out great you can give it one image and do images of yourself in all kinds of scenarios you can cut out the background you can use it for marketing purposes uh with things like specific colors and fonts and now to round out the video let's look at how it
9:36

Comparison

compares to all the other tools because that's what everybody's curious about well I can already tell you it does super well uh but the specifics matter okay and the specifics are to be determined by this comparison so I'm going to look at mainly these okay maybe there's ideogram 2 over here there's a few more can of a magic studio and stuff but really what I care about is the m Journey column the flux Pro column and then maybe recraft imen and ideogram but sort of these are the S tier tools and also imen these are the tools that I really want to kind of review this against so we're going to start with this logo design prompt okay logo design this is the way the others look and what I'm going to do here is I'm just going to kind of pull this to the bottom so we have a nice view of both okay there you go so for logo design you can see this is what we got out of chat GPT 40 image generation that's what this thing is called I would say it's very simplistic you know it probably is most similar to image in free what image in free did here but even that has more shading so I honestly like on logo design I think recraft is kind of the best here and ideogram is also very interesting like there's a specific style over here can you can see it here um got to make sure that my image doesn't cover it up but this is ideogram I think this is probably the best one or imag free does a really good job at a minimalistic look but this is just simple there's no shading um I would say it's slightly unclean in the lines like there's an extra kind of what is this there's like this if I got this from a designer I wouldn't be happy with this look at that this is not clean so I'm not a big fan of the logo generation here okay just straight up it did the text perfectly though and if you need a quick logo there you go but for that specific use case I would probably prefer another model let's look at our next um next one though which is portrait photography a very important one okay so portrait photography here we have the exact prompt ran through C GPT 40 and this is a fantastic looking image you can tell already I mean you can really how do I Max this out I think this is actually the best view you can really the skin texture and the pore pores are beautiful the suit looks hyper realistic just looks perfect the beard a bit too perfect actually and if I compare it to all of these well hyperrealism sort of was solved with these models right flux moury both doing an excellent job moury always has this slightly more like artistic and artsy look like a creative director created the images like some you know um very tasteful and Avant cards the creative director created them that's sort of the thing that you get with moury um you get these like tasteful artistic images flux is just super good in hyperrealism um so this one is really hard to judge to be honest I think flux Ultra might be slightly worse maybe but not really I don't know honestly this one is super hard to judge recraft is definitely worse yes and imagin well it shows a bit of a different individual but that's always going to change slightly depending you know on how often you prompt but in terms of hyper realism I would say well it's above recraft and it's on par with flux and imag no real difference there it doesn't matter well I guess it matters because this is inside of chat GPT and that's what most people use so that's an upside but other than that in terms of quality I think logo was worse and I think image generation is on par this you know common people who don't spend their um you know where AI is not their hobby wouldn't be able to discern this from a normal image okay so let's move on to the next one which is a romantic um cinematic still right here um shout out theoretically media for originally showing this to Matias in our community Matias using it all the time now and now we're all obset with cinematic still prompts anyway that's a good little prompting trick from fellow YouTuber theoretical media point being we ran this prompt through all the models and you can check out the differences here I have one comment and that is that I ran this prompt twice I had to run it twice because in the first time it refused okay so it didn't want to go along and it told me hey that's I cannot do romantic scenes so you know it's pretty uh it's not super restricted from my experience so far but there's definitely some restrictions also the this is just the first look right cinematic still how does it perform well let's look at moury it does these extremely film-like sequences there's there seems to be you know like the blacks are pretty gray so like there's a very distinctive look to it it's sort of like it has like a filmhorror filter on it or like some vintage lens like this is very stylized that's what I'm trying to say this one is very similar to what we get here and you know honestly how do you even like judge this it's just different flavors of good at this point right like honestly if you look at these two flux 1. 1 Pro and flux 1. 1 Ultra results I mean I guess these look more like Polaroids with a flash but this looks like a movie scene I mean this one is obviously worse so recraft is worse at these again but what about image and free well image and free is excellent maybe the motion is a bit stronger here this looks more like a poster I would say but I I mean just quality wise they're all on par they're all they're all they are all hyper realistic there you go I got it out so maybe one last comparison here to ideogram just looking at these images briefly the quality of these is lower but as you can see imag mid journey and flux all sort of like on the same level with these comparisons it's really hard like you can make up your own mind you can look at this video you can pause it you can zoom in but they're very similar so let's have a look at the next one which is a aerial um image of well Mother Nature so let's have a look and but I can already tell this will be very similar to the others it's on par with flux it's on par with M Journey again takes more of a cinematic approach there's more clouds there's a stronger color grade they really push the saturation in the Blues so again it's just more cinematic and then recraft here just didn't perform well at all and imag sort of gives some something that looks like a photosho background I would say so in this case I think imag is actually worse um this just doesn't look real this looks artificial like honestly like this like what kind of this looks like something like a kid draw and then they turned it into AI image so I'm not a fan of that and ideogram also doesn't work but again it's sort of on par with flux here right in terms of realism so fantastic job though like that being said it's on par with the best in the industry right in these things but again I have to point out flux doesn't allow you to give it one image and get a firefighter image like the one we looked at in this level of quality flux doesn't allow you to cut things out into pngs at this level of cutout flux doesn't allow you to do long text flawlessly right Flex isn't integrated into GPT 40 that you're probably using anyway so those are big advantages so that's so you know the conclusion is probably if you have the choice like just use this if the quality is on par I just really wanted to look at the Quality because that's important two more examples so one of them is over here um a book cover a book illustration with the text the forest okay let's have a look at all the examples we have here and I hope this is large enough I think it should be okay for comparison sake but there you go we have the forest over here and we have the book covers over here we have moury looking really good I think moury does this really well flux on the other hand I don't know this font is kind of corny I don't know I'm not a big fan this is just too basic this is just not good okay but like this poster is incredible right here also this poster those look really sold it very well just like M Journey did just like this did it but I would even say the M Journey ones sort of like the artistic flare again it just has that on top of it so if you need you know if you would want to hire a creative director for the image generation or if you would want somebody who's very creatively involved and want something that stands out I got to say I think mid Journey still takes this home like this direct comparison but um it just doesn't offer some of the extra functions um you know like the long text and pgs and the things we talked about okay what else we got then we have the models that are really good with text recraft here it goes with like a minimalist style in these cases and this is more of a photo realistic one honestly just different flavors of the same thing I don't think one of these is better or worse and then lastly what is this one it's image and free oh yeah this is what I'm interested in it picked a very similar font to this the image is essentially the same honestly this is just a darker version of The Imaging thing like they just look identical so again and all these cases look at that all of them did the text flawlessly because it's short text right and then we have one more example the com comic book that I want to move on to and that's this one so let's have a look beautiful work from uh GPT 40 right here look at this little comic strip I'm sure it could add uh comic book text I don't know super you know I'm recording a video I'm not going to come up with what exactly to say here right now but this is just beautiful work and it creates the entire strip look at that like that is some advanced stuff it doesn't just create the image for you but the entire comic book strip because it well it should be a strip illustration and in the other models it disrespected it look at that these are just images of the woman looking good also the flux ones looking good but none of them really feature the hero in a dramatic way like this maybe these ones do um what is this image and free these ones do there's also text on them um kind of makes me want to increase the size on this and really show you the comparison with imag in here let's do that but yeah let's have a look this is the one on imag yeah the text is all messed up let me tell you like some words are correct like even if you can't read this I can read it on my screen here some of the words let me zoom in more are screwed up look at this Ty I your edits an hey Brio yeah it's just not coherent uto it's just not correct okay so other than that I think this is really close and good but on this one I think GPT 40 clearly takes this home wouldn't you agree that this is just the best looking one out of the bunch and when I say add comic book text um oh it gives me ideas on what text to add so here you can work with the llm so as you can see there's real advantages to this on all the other things it's equally as good or maybe with the logo slightly worse arguably maybe but just the upside of having gb40 at your disposal um and you know it telling you exactly what text it came up with llm you don't have to round trip to chat gb2 generate the text you can just do it all in here and now I suppose it will do this like that is super strong if you add on the capab ability to you know use a llm like this and to use the selective editing tool like this to just change the eyes and maintain everything else right from the first image to be fair it did now um change the background to the brand color I gave it so you know if you wanted to do this cleanly you would just generate the image use the tool and then regenerate this already took in the context of my further conversation but it did the eyes as you can see it well this is still working but you know to be fair it must be overloaded right now as it just launched a few hours ago but this is working it's going to work right you're going to have a PNG that you can download and again these are all the capabilities you will not be able to get um generate the image with the other models because you have all of the tools combined right so really here it's not just about the image generation it's also about the image editing and fact that you have a buil in llm and I think if you combine all of those things and you give it away to everybody on the free tier well then again a master strike by open AI in remove in kind of like out competing the competition here again now this makes me really curious if that's true across all use cases or if it's just on this one quick little example and that's why we're already creating a video which will be comparing various use cases between Google's image and here in Google AI Studio that we pointed out on the channel about two weeks ago when it released the new Su news show and opening eyes new image generator because this is really good at editing but opening eyes image generator is really just not just good at editing but incredible Ed editing It's Kind of Perfect at editing it can do long text which no other model can do and all the other things I pointed out multiple times in this video so there you go
22:45

Closing Thoughts

that's your Roundup and comparison versus all the other models let me know what you think are you going to be using this trying this it's really simple to use right because it's just on a free account you can try it out right away I think this is sort of a well it's a it's like a big part of Photoshop built into chat GPT now that you can just activate within your conversations you can add multiple images you can edit them together you can alter things you can cut things out you can train a model on one image impressive open Ai and good job I kind of outplaying Google here again so that's really everything I have for today subscribe if you want to see the comparison video versus Google tool with a bunch of use cases for this and other than that I'll see you on Friday in the news week use show that's all I got don't forget to like And subscribe bye-bye

Ещё от The AI Advantage

Ctrl+V

Экстракт Знаний в Telegram

Транскрипты, идеи, методички — всё самое полезное из лучших YouTube-каналов.

Подписаться