Claude Cowork is Taking Over & More AI Use Cases
15:07

Claude Cowork is Taking Over & More AI Use Cases

The AI Advantage 16.01.2026 15 819 просмотров 511 лайков обн. 18.02.2026
Поделиться Telegram VK Бот
Транскрипт Скачать .md
Анализ с AI
Описание видео
Subscribe to stay up to date with AI in 2026! This week, Igor shows off some results of his Claude Cowork testing, the new Scribe v2 transcription model from ElevenLabs, and Midjourney's new Niji 7 model. Plus, he discusses the rising "AI for shopping" trend and OpenAI's new healthcare initiative. All that a more in the video, enjoy! First Claude Cowork Video: https://www.youtube.com/watch?v=BWAr7gTkll8&t=1s Links: 🔑 Free ChatGPT Prompt Templates: https://bit.ly/newsletter-aia 🧑‍💻 Igor Pogany on LinkedIn: https://bit.ly/IgorLinkedIn 🐦Twitter/X: https://bit.ly/AIAonTwitter 📸 Instagram: https://bit.ly/AIAinsta https://claude.ai/login?returnTo=%2Fnew%3F https://claude.com/blog/cowork-research-preview https://x.com/claudeai/status/2010805682434666759?s=20 https://elevenlabs.io/docs/overview/intro https://elevenlabs.io/blog/introducing-scribe-v2 https://elevenlabs.io/docs/overview/models#scribe-v2 https://www.midjourney.com/home https://docs.midjourney.com/hc/en-us/articles/32199405667853-Version https://nijijourney.com/blog/niji-7 https://huggingface.co/spaces/multimodalart/qwen-image-multiple-angles-3d-camera https://apnews.com/article/google-gemini-ai-shopping-checkout-walmart-f1679240ba93d40b90a97348b73039d3 https://blog.google/products/ads-commerce/agentic-commerce-ai-tools-protocol-retailers-platforms/ https://about.ads.microsoft.com/en/blog/post/january-2026/conversations-that-convert-copilot-checkout-and-brand-agents https://blog.google/company-news/inside-google/company-announcements/joint-statement-google-apple/ https://www.youtube.com/watch?v=TVsounscj4U https://openai.com/sv-SE/index/introducing-chatgpt-health/ https://blog.google/innovation-and-ai/technology/ai/veo-3-1-ingredients-to-video/ Chapters: 0:00 What’s New 0:55 Claude Cowork in Practice 7:20 ElevenLabs Scribe V2 9:08 Midjourney Niji V7 10:27 Qwen 3D Camera Control 11:33 Gemini Shopping 12:09 Google UCP 12:40 Microsoft Copilot Checkout 12:50 Gemini-Powered Siri Upgrade Coming 2026 13:29 OpenAI for Healthcare 14:08 Veo 3.1 Updates

Оглавление (11 сегментов)

  1. 0:00 What’s New 225 сл.
  2. 0:55 Claude Cowork in Practice 1433 сл.
  3. 7:20 ElevenLabs Scribe V2 317 сл.
  4. 9:08 Midjourney Niji V7 278 сл.
  5. 10:27 Qwen 3D Camera Control 223 сл.
  6. 11:33 Gemini Shopping 138 сл.
  7. 12:09 Google UCP 102 сл.
  8. 12:40 Microsoft Copilot Checkout 30 сл.
  9. 12:50 Gemini-Powered Siri Upgrade Coming 2026 146 сл.
  10. 13:29 OpenAI for Healthcare 133 сл.
  11. 14:08 Veo 3.1 Updates 189 сл.
0:00

What’s New

Ladies and gentlemen, welcome to another week in AI. We're starting to get all of the innovations that they were withholding for us throughout Christmas, New Year's, and the first week of January. And I'm here to tell you about them and to try them, including more use cases of Claude's co-work product, their agentic system that is the first consumer product from one of the big labs that is actually trying to get things done rather than just assist you in doing things. And then there's a bunch of tools for creative users like new state-of-the-art transcription models, a Chinese tool that allows you to move the camera in a scene and generate new images from that. All of that and so much more in this week's episode of AI News you can use to show that pulls together all the AI innovations. We test them, compare them, I show you the results, and hopefully you walk away with something useful or at the very least you know about all the bleeding edge releases in Generative AI. Somebody pointed out to me that the more time goes on, the more I look like this guy, maybe like this. What is up with tech bros just transforming into this archetype? I don't know. I say we just look at the first story. All right, so
0:55

Claude Cowork in Practice

let's start by talking about cloud co-work. Now, if you want the full summary and my initial tests for it where I go through two different use cases step by step, check out the separate video I uploaded earlier this week. This is a follow-up to that after a few days of usage and I want to talk about the patterns that I found while using it so you can get the most out of it too. So, first up, a summary of the entire product is basically a friendlier clawed coat. That's really the best way to put it. It's a simpler user interface that everybody can use. And a lot of people I've talked to said something along the lines of this is the first time an agentic workflow feels achievable and feels useful. But in many ways, it's just an evolution of what you might already know with Claude or Chat GPT. It is just more proactive and it does more. It does these to-do lists in between. And let me now concretely talk to what works and what doesn't. Well, what works is the stuff that already worked. If you really didn't like the connectors to Gmail and Google Calendar, well, they didn't change. It uses the same connector. So, those won't work in a way that you expect them to just because we have Claude Co-work. Now, if you liked the Claude Chrome extension and you use that, well, now it integrates into here and it's so simple to use that everything you could do there, you will be able to do here. The problem with that is that the Chrome extension was limited in certain ways and that didn't change. Now, okay, I realize that for a lot of people that might be stating some obvious facts, but here's a thing that was not so obvious to me. Andropics Claude had this release called skills where it basically created an entire skill that contained certain instructions or certain guidelines into a markdown file that then you can access at any point in time. And now in this interface, they've become so useful because they're really easy to create. use. They work in conjunction with everything else. Again, this is nothing we didn't have in a while. it just became so much simpler to use. And let me show you a concrete use case that you could do before talking about the things that I tried and I couldn't do. But talking about these skills, if you just tab over to Claude co-work here and then you navigate to the settings, you can go to capabilities here and all the way at the bottom of the capabilities, you see skills, repeatable customizable instructions that Claude can follow in any chat. These work really well with Claude co-work. And here in the settings, we can basically create them. So, one recommendation that I would have is what worked well for me this week, and I've also seen other people do this successfully, is it's really good for content repurposing. Claude Co-work is really, really good for that. But you might want to start off with creating a skill to keep things consistent. So, what I recommend is you just go here to add skill. You say create with Claude. And now I'm going to create a new skill with our AI advantage brand guidelines baked into the skill. So whenever I want to work on content repurposing with Claude Co-work, I don't have to worry about brand consistency. So I happen to have a PDF with our brand guidelines where it outlines fonts and other best practices. And all I'm going to do is simply add those social media guidelines. And I'm going to say I want the AI advantage social media brand guidelines to be within a skill. And then as we're in the skill builder here that we got to through the settings, it will just build this skill for us, which then we can use inside of co-work at any point in time. Okay. As it asks questions, I'll add a little bit of context. As per usual, the more specific you are, the better. All right, so that took a while, but you only have to do this once, and it's really overd delivered. Look, it has voice and tone presets. It has an entire repurposing workflow. I could delete parts of this or just keep it as is, which for the demo I will do. All you have to do here is hit this button, copy to your skills, and then it moves this markdown file with the skills in it into your skills, which now if I tab over to co-work, if I go into my settings here, capabilities, you will see the AI advantage social skill. And then if I do something like turning my claw co-work video into a carousel and I just provide it with the link and then it opens up the window and tries to go into the description reads all the content here and ultimately from what I can tell here it actually fails to fetch the transcript. So I would still need to do that manually by going here saying show transcript toggling timestamp solve copying this entire thing. This way I could work with it. But see these are the imperfections. Like it can't do everything but it can do a lot as it just progresses throughout here fearlessly. Use the skill that I just created and then it creates a Instagram carousel with proper AI advantage branding. And in this case it doesn't even have an image generation API. So it can't create different slides. If you want to get creative and feel comfortable with this type of stuff, you could add a custom connector to a MCP that connects it to a image generation model. But there it is. Here are all the prompts that it wants me to run in an AI image generator. Here's the caption and it even created a little artifact that is um not great. It did its best. It really lacks the ability to create images here. But you know, research preview and all of this will become so much more seamless soon. You won't need these custom setups. But this is what you can do right now. And to round out the segment, I want to share two more things. One of them is I tried a thing where it looked through the entire community we've been building over the past few months. I haven't been very vocal about it here yet. We'll talk about it soon on the channel, but basically we needed to go through hundreds of posts and look for the mentions of a specific name. And because something changed, we wanted to update it. I figured, okay, Claude Co-work could be perfect for this. So, I let it do it, but as per usual, the Chrome extension wasn't perfect. It was pretty good, but sometimes it just got stuck and I needed to reprompt it. And after reprompting it like eight times, it did like a third of the task manually for I think two out of six spaces or something like that. So it just kind of worked. But that's not because claude cowwork is not great. It's because the Chrome extension is not perfect and it kind of struggled within that application. But to abstract away from that and just round out the segment, I would love to say that the thing that this is really good at is batch processing something if you have a lot of repetitive work on something. Whether it's lawyers with 80 different documents they have to read through and summarize and create a new type of report from it or you keep repurposing content from format A to format B. Well then just create a skill and use that together with clot coworker to do that consistently or any other knowledge work that is repetitive and where you have dozens of repetitions. That's where this friendly version of clot code or you could also call it a business process automation agent really shines. It's already useful, but you need to get a little crafty, but it will only get more useful and userfriendly from here, and we'll keep an eye on that. And hey, if you're enjoying this video, make sure to subscribe. We do this every week. And if there's tools that really stand out, we create dedicated videos just on those. Okay
7:20

ElevenLabs Scribe V2

back to the next story. For this next one, we have Scribe V2 from 11 Labs. So, if you're not familiar, this is 11 Labs transcription tool, meaning you give it some audio and it turns it into text. Now, Scribe v1, previous version of this, obviously, was already the state-of-the-art model in transcribing on many dimensions, and they just released Scribe V2, making it even better. This is mostly something for builders, but also for content creators. If you just want flawless transcription for many video files, this is the one to use. Yes, there's an API for developers, but also you can just try it in 11lap studios, which is easy enough for everybody. So, let's just try the demo. It will auto detect the language. So, let's throw some curved balls at it. First of all, okay, that just works. But secondly, let's use some words like chat, chipt, claw, gemini, x ai. It got all of those right. Not bad. What happens if I switch to a canon of English? Okay, it made a slight mistake there. It said cannon instead of a cannon, but that's fine. It's back in English. Okay, switching to Slovak mids sentence was a bit heavy. Maybe let's try one more time. That worked. Okay, impressive. Arguably even better than the previous model just feeling wise. The benchmarks are obviously better. And this thing has every feature that you could imagine. All the audio formats, super quick latency as you saw, and also features like speaker detection and all that. So if you need to turn multiple voice files into text, this is probably the most reliable way and you could just batch process them here and if there's some specific terms, you can add them here because it just has the stuff in a dictionary and a lot of AI terms obviously cuz I got those right.
9:08

Midjourney Niji V7

But yeah, that's Scribe. So next up we have a new Mourney release. It's a iteration on their anime focused model and bear with me because this thing generates visuals that are so impressively beautiful that you should see them. What we basically did is we ran our test prompts that we used to do. Now we do them less cuz everything turns out similarly on all AI image generation models and we ran them on NG7. That's the name of the new anime model they have. And look at some of these results. It just has this like super unique aesthetic where it's more animeesque than anything else. Seriously, look at this image. When I'm creating a presentation or something creative, I think I personally would actually prefer this style to well every other photo realalistic result you will get here by default. I mean to be fair, we're literally prompting for a cinematic still. But because this is a anime focused model, you will get more of that. For portraits, that's not what you want to use it for. Logos, same thing. But if you lean into this anime capability, you might just be pleasantly surprised. And I just wanted to bring it to your attention. Our team member, Hayes, who always loves to explore and create interesting things with these models, shared some initial examples of one of his signature styles. And honestly, this is just one of the most unique looks you can get out of AI midjourney. the newest anime model. Great way to differentiate yourself from all of the other AI generated stuff that most people can identify as such by now.
10:27

Qwen 3D Camera Control

Okay, next up we have a open-source tool that we've seen iterations of in some creative tools, but this is Quen image edit 2511 and it allows you to do 3D camera control to generate new angles of an image and it demos really well. So, let me show you. Okay, I just have this little AI image generation of me playing paddles. Something that I've obsessed over for the past year. And here's the idea. You get to control these camera angles. So, maybe let's do a more top down angle. Generate. That's the same thing. Maybe let's rotate all the way around. Yeah, there it is. Maybe it's just a bit laggy. I mean, that's not perfect. It's not even a paddle rocket anymore. Maybe one more attempt. A slide switch. Okay. Yeah, I think it's just lagging behind a little bit, but you get the idea here. Is it perfect? No. Nowhere close to perfect. But this is a hard challenge with glass and everything. I mean, the anatomy is fine. The net looks almost identical. You really need to start nitpicking to find the differences. Now, when we tested it on a more realistic image, it actually worked really well. So, yeah, just a fun thing to play with, and I thought this was an interesting interface that I
11:33

Gemini Shopping

wanted to show you. Okay. And next up for this week's quick hits, there's a bunch of stories that are actually in the same theme, and that is basically the big companies going after AI plus shopping. For example, we got Gemini Shopping, which is very similar to Chat GBD Shopping, where they partner with different retailers and online stores. And then they turn the Gemini app into kind of a new interface for online shopping. Now, obviously, all of this stuff is problematic because you want unbiased opinions. And when they have these partnerships and they kind of direct you to certain brands over others, well, that's not really a neutral recommendation, is it? We'll see how this develops over time. This is just getting started and there's a lot of competition, which is good for the
12:09

Google UCP

consumer. Then Google also introduced something they call the universal commerce protocol, open source framework that allows AI agents to handle the entire shopping journey. So maybe in the future when you combine that with something like claw co-work, it can kind of figure out what you need based off your email, your calendar, your messages, maybe your smart fridge and whatever and like order new groceries and send messages to people and just organize things and do things. It's going to be really interesting when a lot of these technologies kind of collide. We'll keep an eye on that as it
12:40

Microsoft Copilot Checkout

develops. And then also Microsoft did their own AI commerce feature called this co-pilot checkout. Again, this is aic thing that is aimed at purchases being done through chat conversations.
12:50

Gemini-Powered Siri Upgrade Coming 2026

One big story that I just wanted to point out quickly is Apple will be partnering with Google for their Gemini AI. So you'll see Siri powered by Gemini, which is really what people just want. They just want to talk to an AI assistant and get things done across their phone, across their life. So we're really getting close to it. I know we've been talking about that for the past few years on this channel and all across the internet, but agents are materializing in a whole different way now as opposed to 2 years ago where it was more like an idea and people try to build automation flows that broke all the time and now it's just become intuitive and it's starting to ship in products that are available to not just millions but billions of people. It's going to be an
13:29

OpenAI for Healthcare

interesting year. There's also Open AI for healthcare that was now announced. This is different from OpenAI health that we covered last week. OpenAI for healthcare is their hospital and healthc caref facing product that helps them make better decisions and gives them information whereas chat health is the consumerf facing piece that helps us as consumers make better health decisions. So all of that is happening and some of the early data on that is are actually very promising because the fact remains the standard for AI to match is not perfection. It's the human error rate which is actually quite high even amongst doctors and using AI in the process has already proven to be of benefits to the entire process and we're only going to see more of that in
14:08

Veo 3.1 Updates

healthcare. And then finally there's an update from Google VO3. 1 their flagship video model. It now has things that people are requesting like native 4K upscaling and you can easily integrate ingredients into videos. And just a little pro tip, a lot of marketers are starting to use this product that we covered previously called Flow where you can bring in ingredients really easily and then mix your products with different environments and then turn them into product marketing videos. And all of that just became more powerful with this update. If you already use this, this is a welcome addition to your toolkit. For everybody not using it, this is probably not a reason to start doing that. All right, and that's pretty much everything I have for this week's episode. I hope you found something that was interesting to you. I think this year is going to be acutely practical with all of these agentic apps actually being put into practice. I'll be here covering all of it. And with that being said, my name is Eigor Pagani and I hope you have a wonderful day.

Ещё от The AI Advantage

Ctrl+V

Экстракт Знаний в Telegram

Транскрипты, идеи, методички — всё самое полезное из лучших YouTube-каналов.

Подписаться