Meta's NEW INSANE LLaMA GPT : SHOCKS The Entire Industry! (GPT Facebook ANNOUNCED!)

9:24

Meta's NEW INSANE LLaMA GPT : SHOCKS The Entire Industry! (GPT Facebook ANNOUNCED!)

TheAIGRID 13.03.2023 38 299 просмотров 724 лайков

Machine-readable: Markdown · JSON API · Site index

Смотреть на YouTube

Поделиться Telegram VK Бот

Транскрипт Скачать .md

Анализ с AI

Описание видео

OpenAI's ChatGPT, Google's Bard, and Microsoft's Prometheus have some new competition from Meta, the company formerly known as Facebook. They have introduced their own AI, named LLaMA, which stands for Large Language Model Meta AI. https://ai.facebook.com/blog/large-language-model-llama-meta-ai/ Business enquiries: sponorships@theaigrid.com

Оглавление (2 сегментов)

Segment 1 (00:00 - 05:00)

so Facebook just announced llama a foundational 65 billion parameter large language model and this is truly insane the Llama actually stands for large language model meta AI because of course Facebook changed their name to meta in order to represent the metaverse but you can see here from this paragraph that they are actually managing to focus their research on AI Additionally you can also see that this large language model beat gpt3 in some common sense reasoning test with 92 percent fewer parameters meaning that these large language models that we're running are now getting much more efficient and much more streamlined in terms of how they produce the text Generation Now what's also interesting was that Facebook's llama AI model was only intended for government and approved researchers and was leaked onto GitHub which means that many people were actually running this software and running this large language model on their own computers now this research scientist at Facebook provided us with some very interesting tweets on exact actually how this large language model by meta actually works take a look at this you can see today we release llama it outperforms GPT and gpt3 on most benchmarks now what was more interesting was that llama's 65 billion parameter model was actually very competitive with the palm e 540 billion parameter model now what's interesting about this is that this parameter the 540 billion one is by Google which was released a couple of days ago which you all know since I made a video on it and it's actually on par with several less parameters which is going to show just how effective this model is now what's also interesting is some more data that was released you can all see here that this model is actually outperforming cool when it comes to code generation so you all know that Google is definitely something that coders today is a great company to work for now you can also hear that somehow llama and meta have actually managed to outperform Google in this aspect which is truly insane what's also interesting is that people have managed to run this software on their computers now usually these large language models are actually run through servers in the cloud but some users have actually managed to get these large language models to run on their computers now running this on your device locally is pretty cool because it means that the future applications are going to be very interesting and there's still a lot more that we need to get into which I need to show you because this large language model is truly a pivoting point because if people can now run these large language models on their own devices it's going to change the way that this information is going to be distributed and used by users worldwide so this is of course the research page that you can see right here and this was actually released on February the 24th I'm just sure that it slipped under a lot of people's radar because it isn't as usable as chat GPT so one thing that I found really interesting about this was that this large language model that was released not actually meant to be used by many people worldwide you can see right here it says to maintain into was that this large language model was not meant to be used by many people worldwide it was only to be used in specific cases and had restrictive access you can see right here that it says to maintain integrity and to prevent misuse we are releasing our model under a non-commercial license found cost on Research use cases access to the model will be granted on a case-by-case basis to academic researchers those affiliated with organizations in government civil society and Academia and Industry Research Laboratories around the world people interested in applying for Access can find the link to the application in our research paper so essentially this model wasn't truly released as a final version just yet I'm guessing it's still kind of in the research stage and of course they're still trying to improve this large language model as you can see here it also says there is still more research that needs to be done to address the risks of bias toxic comets and hallucinations in large language models so of course as you know something that is very evident in chat GPT as there is huge wave of people who are stating that chat gbt is in Far fact too biased in its comments so as you can see here this is exactly why Elon Musk is reportedly building his own chatbot because apparently it is too biased okay and you can see that he is saying that work AI versus closed Ai and base AI is going to be the one that he is so the reason that chat GPT has been seen as bias is that it won't actually argue for fossil fuels it also won't praise Donald Trump but will praise Joe Biden and although these are maybe controversial topics depending on where you stand this is something that llama is trying to focus on preventing because it's still in the research phase one thing that you do have to understand is that when these large language models are produced they're checks that they generate is of course generative meaning that it's going to be new pieces of work when this is generated what tends to happen is that the companies that generate set text are going to be on the hook for whatever is said by this AI so for example if an AI says something bad about one particular group then of course meta may be liable for what it says and of course that could open many different kinds of potentially

Segment 2 (05:00 - 09:00)

lawsuits and potentially terrible PR exactly what has happened before if you're wondering why they probably have released this in the research phase remember that Twitter actually taught Microsoft AI chatbot to be a racist person in less than a day essentially what happened was here was that Microsoft released their AI very early and I'm not going to show you the exact tweets that they had but essentially they had a Twitter bot okay that was a artificial tweet pretty much anything and this Twitter bot actually tweeted a whole bunch of just harmful content and pretty much racist content on the internet now you can go ahead and find this content yourself but it just goes to show exactly why we need to actually look into what AI is going to be doing and how these parameters are set and if biases are too far left or too far right or where these biases can lead us and if this model can be trained in ways which actually make it do certain things which isn't really helpful to society what's also cool here is that this is an article by Simon Wilson and he says large language models are having their stable diffusionment and of course this was cool because people can now generate images on their own hardware and if you don't know what stable diffusion is it's an image generator that you can use on your computer now of course as you can see we've come a long way from that now essentially this is crazy because of course as you know llama is easy to run on your own Hardware large enough to be useful has equivalent capabilities to gbt3 and it's open source enough that it can be tinkered with which means that essentially what people can do is that they can Tinker in a way that they can actually use it for specific use cases so maybe they want to use it in I guess you could say military applications they could do that maybe if they wanted to use it in the healthcare industry maybe in the be in the education industry there's many different ways that you could actually use this kind of stuff so it's really really good of course there are some risks now what we're seeing right here is we're seeing a gif of the large language model actually running so this is exactly how this large language model is run now I'm not sure if Facebook are going to take down the GitHub link because technically they didn't want this large language model out there and they only wanted it in a licensed state so I'm not really sure how much longer this is going to be public for but I guess it just depends on Facebook stance on this now of course there are some risks with this being on the internet because as you know some people use software for good and some people use it for bad now you can see right here this technology can be used for harm for generating spam automated romance scams trolling and hate speech Just essentially a vast use cases of definitely heaviest ways to use a software which I hope people don't use the software in that way because of course as you know sometimes people do take advantage of software and eventually what will happen is it'll get restricted in a way where people aren't able to use it creatively so essentially uh this article goes over the entire thing which is a really good article and of course it says assuming Facebook don't relax the licensing terms llama will likely end up more of a proof of concept owing that local language models are feasible on consumer Hardware than a New Foundation model that people use going forward so essentially this article pretty much states that although this large language model is actually pretty cool it's only showing the capabilities that we can run models like tpt3 on our computers which goes to show that in the future it's likely that we're going to have a very large language models running on our Hardware locally which means that we're going to have I guess you could say a much more refined large language model that we're going to be using for our own personal gain and it's definitely going to be interesting to see which company manages to develop this and manages to put this out where because I do believe that it's going to be much more effective and much more interesting because as we all know cloud computing is very expensive and that's something that these companies can't continue to afford to pay if you're wondering what kind of examples there were you can see that there are examples here that are quite similar to chat TPT it says write a conversation between the Sun and Pluto and it writes that example right here it also says how do I send a HTTP request in JavaScript here's an example of how to make a get request using the JS HTTP API and of course you can see that it's done and it says the same thing using fetch It also says remove all the HTML tags in a python string and it's definitely very interesting to show that if this can actually run locally on someone's computer it's going to make computers very more effective and very more powerful in the near future and bear in mind that this is still the early days of AI and we're seeing many different companies jump on the race

Другие видео автора — TheAIGRID

Ctrl+V

Экстракт Знаний в Telegram

Экстракты и дистилляты из лучших YouTube-каналов — сразу после публикации.

Подписаться

Лучшие методички за неделю — каждый понедельник