DeepSeek V3 - The King is Back…For Free!

5:17

DeepSeek V3 - The King is Back…For Free!

Two Minute Papers 26.03.2025 86 738 просмотров 3 322 лайков

Machine-readable: Markdown · JSON API · Site index

Смотреть на YouTube

Поделиться Telegram VK Бот

Транскрипт Скачать .md

Анализ с AI

Описание видео

❤️ Check out Lambda here and sign up for their GPU Cloud: https://lambdalabs.com/papers Guide for using DeepSeek (R1) on Lambda (can be applied to DeepSeek V3 too, see links below): https://docs.lambdalabs.com/education/large-language-models/deepseek-r1-ollama/?utm_source=two-minute-papers&utm_campaign=relevant-videos&utm_medium=video 📝 DeepSeek V3 (0324) is available here: https://www.deepseek.com/ Try it online (note: they see your data, I prefer private, see below): https://chat.deepseek.com Paper: https://arxiv.org/pdf/2412.19437 How to run locally: https://github.com/deepseek-ai/DeepSeek-V3?tab=readme-ov-file#6-how-to-run-locally Ollama is probably the simplest way to run it - support is likely coming soon too! The previous version is available, keep an eye out for the V3 0324 version: https://ollama.com/library/deepseek-v3 Ollama + 0324 versions might appear here: https://ollama.com/search?q=deepseek%20v3 Sources: https://x.com/itsPaulAi/status/1872320003770618146/photo/1 https://x.com/_akhaliq/status/1904340631549329902 https://x.com/deepanshusharmx/status/1904224760399282587 https://x.com/slow_developer/status/1904209736209137759 https://x.com/Ysqander/status/1904225263568789672 https://x.com/deepanshusharmx/status/1904363892899492141 https://x.com/pandeyparul/status/1904352867433242926 https://x.com/cobaltdigital33/status/1904179361856508044/photo/1 https://x.com/michaelkaoi/status/1904178015833297342 https://x.com/_akhaliq/status/1904344443014029770 https://x.com/beinghamed/status/1904728667277623758 📝 My paper on simulations that look almost like reality is available for free here: https://rdcu.be/cWPfD Or this is the orig. Nature Physics link with clickable citations: https://www.nature.com/articles/s41567-022-01788-5 🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible: Benji Rabhan, B Shang, Christian Ahlin, Gordon Child, John Le, Juan Benet, Kyle Davis, Loyal Alchemist, Lukas Biewald, Michael Tedder, Owen Skarpness, Richard Sundvall, Steef, Taras Bobrovytsky, Thomas Krcmar, Tybie Fitzhugh, Ueli GallizziIf you wish to appear here or pick up other perks, click here: https://www.patreon.com/TwoMinutePapers My research: https://cg.tuwien.ac.at/~zsolnai/ X/Twitter: https://twitter.com/twominutepapers Thumbnail design: Felícia Zsolnai-Fehér - http://felicia.hu

Оглавление (2 сегментов)

Segment 1 (00:00 - 05:00)

Today we are kicking off an absolute AI revolution. Goodness, this is the new version of Deep Seek and I think this might be it. R1 was their thinking AI, but this one is different. It is called V3 and the newest version is Wow, it is truly something else. First, it is not a reasoning AI. Is that a good thing? Well, allow me to demonstrate it. Let's ask the earlier reasoning AI Deep Seek R1. what is the capital of France and it is thinking and still thinking goodness and then of course it gets it right. However, when we ask the new V3 look we get an answer instantly. Yes, it is a touch less intelligent than its reasoning version theoretically but in practice I am not so sure of that. You will see in this video that it might actually be even a bit better. Also in return it is perhaps 50 to 100 times faster than the reasoning R1 depending on the question. That also means that you can run it cheaper. So let that sink in. This can hang with the best closed AI techniques out there. But wait, this is fully open and free for everyone that you can try online right now. The link is in the description. Now note that I myself like downloading models like this and running them myself privately and my choice for that is Lambda. If you want to do that too, the instructions are in the video description. But now our question is okay. These are benchmarks numbers on a paper. Can it actually do all the cool stuff the closed thinking AIs are capable of? Let's have a look. For instance, if it can write up a cool game easily, I would say that wow, that is exactly what we are seeing here. So, what else can it do? Well, look, it can code up this website in one shot. More than 800 lines of code super quickly. Now, let's make it a bit harder. Like writing a soundwave visualizer, another problem. We all did that ourselves in college by hand. And it took at the very least hours of work and programming experience. And here you get it for free nearly instantly. Just pops into existence. That is mind-blowing. Wow. And these are not just some one-off results. It can write up really cool animations. And all you need is just one text prompt to make them come alive. And there are more good news dear fellow scholars. This is two minute papers with Dr. Koa Eeer. Oh yes, it has an MIT license. That is the do whatever you want license. So it is as permissive as it gets. You can even ask for an interactive water molecule simulation where you can play with the temperature and see how the kinetic energy in the system changes accordingly. Now, reasoning AIS like Deepseek R1 can get you better results if you have super complex tasks, but most people don't need it. Most people instead need this one, V3, which is much cheaper and faster. And get this, in some cases, even better, too. Here, it provides a little smoother motion than its rivals. Man, so good. And it is not just some isolated case. There are genuine use cases where it smokes the much more expensive reasoning model. What a time to be alive. And yummy, we have a paper, too. We fellow scholars are paper people, so we love that. Especially that they showcase a needle in a haststack test. What is that? Well, this tells us about the model's precision when recalling information after seeing a lot of data. And it is all green, baby. That is incredible. Even after reading a long document, 128k tokens, let's say that is 250 to 300 pages of text and it recalls the details accurately. Now, note that this model is still huge. So, you will probably need a bit of grunt to run it. However, I am absolutely sure that this is going to kick off a huge AI revolution. This is exactly what we need and I feel that going forward cutting edge AI systems are not going to be closed anymore. They will be free and increasingly more open for all of us. Think of all the good that we can do with these. Absolutely amazing. So, what do you think? What would you fellow scalers use this for? Let me know in the comments below. Here you see me running the full DeepSseek AI model through Lambda GPU Cloud. the full 671 billion parameters running super fast and super reliably. This is insane.

Segment 2 (05:00 - 05:00)

I love it and I use it on a regular basis. Lambda provides you with powerful NVIDIA GPUs to run your own chatbots and experiments and it's the best. Seriously, try it out now at lambdalabs. com/papers or click the link in the description below.

Другие видео автора — Two Minute Papers

Ctrl+V

Экстракт Знаний в Telegram

Экстракты и дистилляты из лучших YouTube-каналов — сразу после публикации.

Подписаться

Лучшие методички за неделю — каждый понедельник