Top AI Innovations You Can't Miss

10:14

The AI Advantage · 22.03.2024 · 19,813 views · 765 likes · updated 18.02.2026

Video description

This week, I'm coming to you live from Nvidia GTC to cover every important update and new release in the AI world. Join me to learn exactly what is possible with AI today!

Links:
https://www.sievedata.com/functions/sieve/describe
https://stability.ai/news/introducing-stable-video-3d
https://huggingface.co/stabilityai/sv3d
https://arxiv.org/pdf/2403.12008.pdf
https://stableprojectorz.com/
https://app.leonardo.ai/
https://colab.research.google.com/drive/1SoAajN8CBYTl79VyTwxtxncfCWlHlyy9
https://console.anthropic.com/
https://the-decoder.com/openais-gpt-4-5-turbo-leaked-on-search-engines-and-could-launch-in-june/

Prompts:
0:00 What’s New?
0:35 Sieve
1:59 Stable Video 3D
3:19 Prompt Improver
6:13 Leonardo Updates
7:39 OpenAI's Possible Updates
8:19 Nvidia GTC
#ai

Free AI Resources:
🔑 Get My Free ChatGPT Templates: https://myaiadvantage.com/newsletter
🌟 Receive Tailored AI Prompts + Workflows: https://v82nacfupwr.typeform.com/to/cINgYlm0
👑 Explore Curated AI Tool Rankings: https://community.myaiadvantage.com/c/ai-app-ranking/
🐦 Twitter: https://twitter.com/TheAIAdvantage
📸 Instagram: https://www.instagram.com/ai.advantage/

Premium Options:
🎓 Join the AI Advantage Courses + Community: https://myaiadvantage.com/community
🛒 Discover Work Focused Presets in the Shop: https://shop.myaiadvantage.com/

Table of contents (7 segments)

What’s New?

As per usual, this has been another week full of AI releases that you could be putting to work today. We're going to be looking at everything from a new tool that analyzes video based on the visuals all the way to a workflow that upgrades your prompts in a big way. Plus, we'll also be talking about NVIDIA's GTC. Welcome to GTC. As you can see, I'm in a travel setup here. This is not my home studio. I'm in San Jose, California right now for the conference, and most of that conference was about generative AI. So I have some interesting insights that I want to share with you later on in this video. But let's kick things off by looking at the brand new things that came out over the course of the last week.

Sieve

Starting off with Describe by Sieve. Sieve? I'm not exactly sure how to pronounce it, but what this does is very interesting. It takes video as an input and then uses AI to transcribe the content of the video and summarize that. That'd be pretty usual. But it also uses vision to look at the frames, takes the information from that, and includes it in the summary. So you get an audio-visual summary of the video. Obviously it works on the examples here, but let me show you one of mine too. So this is just a random clip from a previous video we created where I show a camera transition where this overshirt disappears. Kind of a tricky subject matter. Let's see if it summarizes it well. I'm just going to upload it and say submit. Alright, then after 24 seconds we have a result here. Look at this. "In this video a man with long hair and a beard is captured in a personal space dedicated to music production." Yeah, close, huh? "Displaying his passion for the craft." There you go. There are further details, but look at this part. "A distinctive tattoo on his arm and his casual attire, including a moment where his shirt seems to disappear, adds a relaxed yet focused atmosphere to the setting." So as you can see, it really picked up all the action, including the little detail of the tattoo back here. Now it does make some assumptions, as a large language model does, with the music production, but I think this is really fantastic. I have to think about how this could be usable, but I haven't seen this before and I wanted to share it with you. I think this is a great way of summarizing clips where you're looking for more than just a transcript summarization but a visual summarization. Very cool app, but now on to the next one.

Stable Video 3D

Okay, next up we're going to briefly dive into the world of 3D. Full disclaimer: this is not my area of expertise. Nevertheless, I want to show you what came out this week, because this model from Stability AI that produces 3D models from text prompts is significant. So they released a new iteration of the Stable Video Diffusion model, but now it doesn't just produce video, it produces 3D models. Now there's a bunch of examples on here. There's no space to try this out quickly, and probably the most important note is that if you want to use this for commercial purposes you will need a Stability AI membership. As per usual, all the links are in the description below. The biggest difference to other text-to-3D models is that usually these other models have used image generation to create various images and then stitched them together. In this case it actually uses the video model. As you've seen with Sora, that can be very powerful for 3D models, because if you have consistency you can just move the camera around and scan the whole thing. If you want to use this or learn more about it, here's the Hugging Face page and here's the paper that came out with the release of it. One more super quick 3D innovation I just wanted to point out: StableProjectorz. This is basically a way to use generative AI to generate textures for your 3D models. This is something I expect will be implemented into all 3D software over time, but as of now you can install this extra application that will help you generate textures for your 3D models. You do need an NVIDIA GPU for this to work though, and that's really all I've got to say about this. Just wanted to point it out. Let's move on to something that more people watching this video will be able to use.

Prompt Improver

Alright, and the next one is really fantastic. This has been released by Anthropic and it's a prompt improver. Now look, this is meant for development, and yeah, we do need to use a Google Colab notebook to make this happen, but just stick with me. If you're not technical, if you don't know code, it's fine. You can use this too, even if you don't intend to use the prompts for development. That's how simple this is. Actually, it's so simple that I'll give you a full tutorial right here, right now. Let's begin. So if you want to improve some of your prompts in a big way, you're going to go to this link that's in the description below and then you're going to copy this to your own Google Drive. For this you need to be logged in with your Google account, which I am, as you can see up here. Save a copy in Drive. Alright, now I have my copy. I can see this by the title, which says "Copy of Metaprompt". Now all I need is one thing. I need my Anthropic API key. This does work with the Opus model, and no, you don't need the subscription. You can even be in Europe. This just works, okay? And the way you're going to get your API key is you go to console.anthropic.com, and here I'm going to go to Get API Keys. I'm going to create a key. I'm going to call it "demo key one". Copy this over and paste it right here in the notebook. Here, between the quotation marks, I'm going to paste the API key. And also, if this is not clear to you, if you've never used the OpenAI API key, it should be pointed out that the way APIs work is that you're going to be paying per request, and in order to do that you do need some credits here. So if you go to Plans and Billing you can add some money. You should be able to get five dollars completely for free just by entering a phone number. You don't need a US number, although it shows one. So just make sure to have a few dollars of credits in here and then we can proceed. And that is it. We're almost ready to improve our prompts.
All you need to do now is go from top to bottom and press the different play buttons. Okay, so I'm going to press this one. We got the green check mark after 14 seconds. Now we're good to go. This took one second, and all I'm going to do is scroll and click on all these little fields like so. Just don't forget to also do this metaprompt one along the way. Press play here, press play on this one, and this one is going to take longer, okay? Because the metaprompt is so long, this is going to need some time to go through the whole thing and to improve your prompt. And after exactly 30 seconds we're pretty much done. All we need to do is click play a few more times. Yeah, it is a few steps, but except for the first one it's pretty straightforward, right? And there you go. That's it. We have the improved prompt here. Now this is set up for development, so it does include variables, but this prompt is amazing. Look what it did with this super basic one-line prompt. It went from "draft an email responding to a customer complaint" all the way to this. Really well crafted. Opus is fantastic at prompt engineering, as I pointed out in the release video. Now what you could do is simply replace this part here and run all the fields from here on down. And there it is. A simple prompt turned into a seriously effective, precise, unambiguous, high-quality prompt. I strongly recommend this workflow. The only downside is that each one of these is going to cost you 5 to 15 cents from your API credits. As a matter of fact, I like this so much that we ran a little workshop on this in the AI Advantage community, where we helped different members in a live event troubleshoot their workflows if anything came up. That's just something the community is really good for, and we're going to keep doing that.
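For readers who want to use the improved prompt outside the notebook, here is a minimal Python sketch of the two practical points from this segment: filling the variables the improved prompt contains, and estimating how far the free credits go at 5 to 15 cents per run. The `{$NAME}` placeholder style, the function names and the sample template are assumptions made for illustration; they are not taken from the notebook shown in the video.

```python
import re


def fill_prompt(template, values):
    """Replace each {$NAME} placeholder (assumed format) with its value."""
    def substitute(match):
        name = match.group(1)
        if name not in values:
            # Fail loudly rather than sending a half-filled prompt.
            raise KeyError(f"no value supplied for {name}")
        return values[name]

    return re.sub(r"\{\$([A-Z_]+)\}", substitute, template)


def runs_per_credit(credit_usd, cost_per_run_usd):
    """How many whole runs a credit balance covers (computed in cents)."""
    return round(credit_usd * 100) // round(cost_per_run_usd * 100)


# Hypothetical template standing in for the generated prompt.
template = "Draft a reply to this customer complaint: {$CUSTOMER_COMPLAINT}"
print(fill_prompt(template, {"CUSTOMER_COMPLAINT": "My order arrived late."}))

# Five dollars of credit at the quoted 5 to 15 cents per run.
print(runs_per_credit(5, 0.15), "to", runs_per_credit(5, 0.05), "runs")
```

Under those assumptions, the five dollars of free credit mentioned in the video would cover roughly 33 to 100 improved prompts.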

Leonardo Updates

All right, next up we have interesting updates from leonardo.ai. As you know, we talked about this before. Fantastic all-in-one solution for AI image generation. They have a lot of different models and tools, and they added some that are really essential in my opinion. First of all, they added a universal upscaler in beta. We don't need to cover this. We've talked about upscalers many times. So that's this result right here. Oh my god. You can basically upscale images with AI, where it adds new pixels and uses a generative engine to upgrade the quality of the image. It's not just fancy sharpening, it's generation. And the new feature in here is this little checkbox. Okay, it's transparency. As somebody who creates videos all the time, this is essential. Generating images with transparency is so amazing. We've seen demos in the previous weeks where you could do it in some Hugging Face Space, but now it's starting to get integrated into apps, which is what makes it usable. Super exciting. So I'm just going to generate this with the free credits that come with every Leonardo account. As you can see, every generation costs 20 credits. I get 150 for free. I just logged in with my Google account, and basically without touching all of these advanced controls, which you can certainly do, and then you could upscale the image too, it just generates me the image I want. Look at that. A cat with a hat with a transparent background. This is a first on the channel where we got this from an app. And what's the point of having transparent images in your videos? Well, I'm just going to say the magical words to the editing team: show them why it's important to have PNGs of a cat with a hat. Is this a cat in a hat? That's why you need this tool, all right.
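To make the point about transparency concrete, here is a small Python sketch of standard alpha compositing, which is what lets an editor place a transparent PNG over video footage. It is a generic illustration, not Leonardo's or any editing tool's actual code, and the function name is made up for the example.

```python
def composite_pixel(fg, bg):
    """Blend one RGBA foreground pixel over one opaque RGB background pixel.

    fg is (r, g, b, a) and bg is (r, g, b), all values in 0..255.
    """
    r, g, b, a = fg
    alpha = a / 255  # 0.0 = fully transparent, 1.0 = fully opaque
    return tuple(
        round(alpha * f + (1 - alpha) * k)
        for f, k in zip((r, g, b), bg)
    )


# An opaque red pixel (part of the cat) hides the blue background...
print(composite_pixel((255, 0, 0, 255), (0, 0, 255)))
# ...while a fully transparent pixel lets the background show through.
print(composite_pixel((255, 0, 0, 0), (0, 0, 255)))
```

Without an alpha channel, every pixel behaves like the first case, so the image's background would cover the footage and have to be masked out by hand.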

OpenAI's Possible Updates

Then, to round this week out, I decided that in the end I want a segment where I look into the future a little bit, and this announcement is so big that I really want to talk about it. Apparently GPT-4.5 leaked. Now this is not 100%, but under the OpenAI domain a crawler caught the GPT-4.5 page, and there are two big updates which are supposed to come in June, which is kind of far away, right? But one is an increased context window. They're doubling it from 128k to 256k, with better retrieval across the whole context. And secondly, the knowledge cutoff is supposed to be updated to June. Now while this does mess up all our hopes for GPT-5, it does align with what Sam Altman said in the most recent Lex Fridman interview. So according to that, GPT-5 is probably not going to come in the next weeks. So that's our preview into the future, and now I want to talk about NVIDIA GTC that happened

Nvidia GTC

this week, because that really brings all of this together. You might have already seen the summary videos talking about the brand new Blackwell GPU that was announced during the keynote by Jensen. That's incredible. It's going to power the future of this industry and of all software. But I just wanted to share my personal impressions of actually being there and going to the talks, being close to the front during the keynote and spending the week with other AI content creators here on YouTube. And the one big takeaway is: it's so damn early. You could see this everywhere. This was a developer conference that was largely focused on enterprise solutions, but it just takes time to build those, right? Enterprise solutions don't pop up from one day to another, and everything we're looking at here is either demos or consumer apps, but it's so early with all of this. You could really see this at the conference. All this new technology coming out, the hardware, the software that is being developed from NVIDIA's side or by other vendors. It just takes one to two years for some of these solutions to be implemented, and it just hasn't been that long since the ChatGPT moment. So you just have to respect the fact that what we're looking at here is early-adopter type of stuff, and the big apps, the implementations like Copilot inside of Excel, I mean, they're just releasing these things, implementing them for the first time, feeling them out. So if we're talking use cases, there's just going to be more and more over time, because it's literally present in every industry and we're just getting started. So yeah, the conference was a blast. Great job, NVIDIA, for bringing together all of these amazing people in one place for multiple days. Fantastic experience, and I've never been more optimistic about the future of this technology. This is moving at a pace that nobody would have predicted, and I'm glad to have you on board for this wild journey where we explore use cases week after week.
That's everything I've got for this week. I hope this was helpful to you. We do this every single Friday, so make sure to subscribe to get a new episode like this every single Friday, and if you want to discover more use cases right now, check out the full playlist with past episodes that include everything that came out over the course of the last few weeks. All right, for me it's time to pack up this mobile setup. I'll see you around.
