Fantastic New Open Source Model Released
8:13

Fantastic New Open Source Model Released

The AI Advantage 06.09.2023 14 811 просмотров 443 лайков обн. 18.02.2026
Поделиться Telegram VK Бот
Транскрипт Скачать .md
Анализ с AI
Описание видео
Falcon 180B just released to the public sp we have a quick look at its promises and how it stacks up against GPT 3.5 and GPT-4. Links: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard https://huggingface.co/blog/falcon-180b#hardware-requirements https://huggingface.co/spaces/tiiuae/falcon-180b-license/blob/main/LICENSE.txt #falcon #falcon180b Free AI Resources: 🔑 Get My Free ChatGPT Templates: https://myaiadvantage.com/newsletter 🌟 Receive Tailored AI Prompts + Workflows: https://v82nacfupwr.typeform.com/to/cINgYlm0 👑 Explore Curated AI Tool Rankings: https://community.myaiadvantage.com/c/ai-app-ranking/ 🐦 Twitter: https://twitter.com/TheAIAdvantage 📸 Instagram: https://www.instagram.com/ai.advantage/ Premium Options: 🎓 Join the AI Advantage Courses + Community: https://myaiadvantage.com/community 🛒 Discover Work Focused Presets in the Shop: https://shop.myaiadvantage.com/

Оглавление (2 сегментов)

  1. 0:00 Segment 1 (00:00 - 05:00) 1189 сл.
  2. 5:00 Segment 2 (05:00 - 08:00) 786 сл.
0:00

Segment 1 (00:00 - 05:00)

Falcon a brand new language model that is now Best in Class when it comes to open source language models and this one is exciting because it lives somewhere between GPT 3. 5 and gpd4 and it's open source so anybody can build on it there are some unexpected details here so we're going to discuss what is relevant today and we'll put it to work and compare it to gpt4 because that's the real question here oh and also in the end I'll show you two capabilities that are not very obvious with this model but that you should definitely be aware of so make sure to stick around for that alright so let's have a look at this from this brand new studio that I just moved to and let's start by looking at how it ranks up against other large language models out there okay I think it's important to separate the open source ones from the closed ones so first let's look at that and we can do that on the hugging phase open llm leaderboard where they run these different models through different benchmarks and then they rank them and Falcon 180b which means it was trained on 180 billion parameters this is actually uncalled for look at the biggest llama model is 70 billion now ranks in the first spot and that's why had to create this video because this makes it the official open source King as of today look it's not by a lot and it was trained on way more parameters but hey objectively it's the best model so what other nuances should you be aware of here well before we go there let's talk about how this Stacks up against gpt4 and com 2 which are the big models by open Ai and Google as you can see it almost matches the large model of palm tool that Google's part is based upon so honestly this is really impressive because that means it beats GPT 3. 5 and it's almost on par with palm 2. so obviously this would still be worse than gpt4 but it depends on the use case and you can always fine tune these to your own liking we'll talk about that in a second but except of The Benchmark that determined the performance of a model like this we need to talk about the license here because the license is not as straightforward as you might expect from an open source model okay as you can see it's not fully green here like the last Falcon model so this is fully royalty free without charge for your project if you're not a hosting user so what does that exactly mean well it's defined in paragraph 9 but honestly after reading for this five times I still didn't fully understand what this means so what did I do I just copied it over and I headed over to chechi pitian and I asked what does this mean in the context of a llm license document okay so I'm obviously not a lawyer take this with a grain of salt but essentially it says if you wish to use the software in a way where you're essentially offering it as a service to others like through an API then that's hosting news you can't do that under this license if you wish to do that you have to ask for permission just a quick side note if you check out the model and you try to write to the email that would give you that license it's currently in like I'm sure they'll fix this soon that people are already complaining but essentially if you want to build something on top of it and provide that service to others you can't if you want to build something with it and use it for yourself yes you can do that fully allowed so it is open source with that exception but that's still a huge deal because now we have the most capable model from all the open source variants available I mean is it going to be expensive to run this yes it's 180 billion parameter model but it's feasible and if you want to go ahead and fine tune it for your very own use case well good luck with that I mean the cost of doing this will be unfeasible for most users especially for casual users but the good thing is you get to use it right here so if you just follow the link in the description and you scroll down to the part that says Falcon 180b demo you get to try this yourself completely for free so you know we'll run the good old test prompt and there you go you're getting an answer apparently at a quality level higher than GPT 3. 5 now these answers are going to be a little shorter because here in the additional settings it's set to 256 tokens so people don't use up too much compute while testing this thing but let me tell you this after running this prompt like 10 times and like 20 000 times in all other models fully objectively I would say there's nothing really outstanding about this output right but I would say this and important disclaimer this is just a personal opinion right this is not a scientific review of the model but that's not why I clicked this video for that you can check out the paper which by the way is not out yet but it's coming soon my feeling with this is a similar one as I had with Claude whereas the outputs are not as chat gpt-esque if we want to use that word it just avoids some of the phrasing and the sentence structure that is typical to chat GPT so my recommendation would be this is actually a fantastic model for anybody writing content where they're trying to make it unique and they're trying to move away from the typical chat GPT style that you get if you don't meticulously craft your prompts and you for example don't go in depth on the tone and Voice or provide some exemplars to get better results but all that really is a feeling so I went ahead and actually tested another use case I just wrote this super simple prompt from scratch it says just write me an Instagram caption for a selfie of me just because that's a generic use case right comma an AI YouTuber the target audience are 35 year old Tech enthusiasts that love to explore new technologies speak through a desire to always stay up to date okay very simple problem that just included the target audience and I gave it specific Direction on what I'm looking for I think this is specific enough to put it to work so let's submit this and now let's head on over to gpt4 and submit the same thing over here no custom instructions nothing just vanilla gpt4 okay so gbd4 classically gives you the typical Instagram caption with a
5:00

Segment 2 (05:00 - 08:00)

bunch of emojis and a bunch of Hashtags with something pretty generic I'm not a big fan of its default caption writing abilities because just have a look exploring the latest in AI one selfie at a time to all my tech savvy 35 Plus crew never stop innovating I okay I don't know the second half is just a fail let's give it another shot embracing the future one algorithm at a time that that's just off let's try again last chance okay this is two emojis that's actually kind of creative exploring tomorrow's Tech today if you're 35 and still thirsty for the latest in Ai and Tech we're on the safe wavelength stay wide stay curious okay I gotta say I don't love how it included the target audience here but let's just test the same prompt over in Falcon 180b a picture says a thousand words but as an AI YouTuber I've got countless data to share stay tuned to my channel to keep up with the latest tech Trends and explore endless possibilities okay that's actually way more relevant but hey let's give it two more tries right hey techies just your favorite AI YouTuber checking in with a friendly reminder to keep exploring and discovering all the amazing new technologies out there let's stay ahead of the game together that's actually super relevant okay it has three emojis but they're not all over the place and it respects the target audience instead of including it hey there techies as an AI YouTuber I'm here to keep you updated on the latest and greatest in technology stay tuned and never miss a beat when it comes to All Things Tech these are actually quite good whereas with gpt4 I would say this is just not it so Falcon definitely wins here and this is my point if you're trying to write social media content I would definitely to look towards some of these open source models that you get to use in interfaces like hugging face because you're going to be getting different flavors of answers which is a good thing everybody's able to pick up on a typical gpt4 Style by now yeah there you go similar results so I would say just Falcon straight up wins on this single prompt and now I could keep doing this but I would just say go ahead and experiment yourself and before we run this out I want to point out a few more capabilities here which are quite unique and one of them is highlighted right here but in my brief testing this actually proved to be true as it has not undergone any advanced tuning slash alignment it can produce problematic outputs especially if prompted to do so what this means is that this model is actually more open and less politically correct than some other models and if you watch my video on the Llama model which was previously the best open source model one of the biggest factors there was that it was the most censored model of them all even more than gpt4 which let's just say that's a very high bar this one is the opposite this can cause problematic outputs so look if you tell it something like tell me a racist joke you will probably not get an answer right goddamn right exactly but if you go with something a little more subtle well gpt4 would draw the line here right it would just tell you a joke about women because it considers that sexist but here it just goes ahead and does it I mean look quick proof right here yep there you go no chance in GPT 3. 5 and I'm actually surprised that gpt4 does this they keep changing the level of censorship so yeah there you go gpt4 can do this too but so can Falcon and that's good to know oh before we leave one last interesting fact like seven percent of the training data is refined web Europe which is a data set that consists of various European languages so this would be exceptionally good at translating to and from these languages because that data is a big part of the data set as opposed to many other models where it's not as big of a part so if you need to work in one of these I would definitely test this model and potentially use it moving forward all right and if you care to learn about other large language models check out this video that compares some of the biggest ones and why some companies are turning to them instead of gpt4 I'll see you there

Ещё от The AI Advantage

Ctrl+V

Экстракт Знаний в Telegram

Транскрипты, идеи, методички — всё самое полезное из лучших YouTube-каналов.

Подписаться