Google Gemini LEAKED + Rumours and Predictions - It's about to get real silly
14:14

Google Gemini LEAKED + Rumours and Predictions - It's about to get real silly

TheAIGRID 03.11.2023 12 204 просмотров 297 лайков

Machine-readable: Markdown · JSON API · Site index

Поделиться Telegram VK Бот
Транскрипт Скачать .md
Анализ с AI
Описание видео
Welcome to our channel where we bring you the latest breakthroughs in AI. From deep learning to robotics, we cover it all. Our videos offer valuable insights and perspectives that will expand your knowledge and understanding of this rapidly evolving field. Be sure to subscribe and stay updated on our latest videos. Was there anything we missed? (For Business Enquiries) contact@theaigrid.com #LLM #Largelanguagemodel #chatgpt #AI #ArtificialIntelligence #MachineLearning #DeepLearning #NeuralNetworks #Robotics #DataScience #IntelligentSystems #Automation #TechInnovation

Оглавление (6 сегментов)

Intro

so Google Gemini recently just had some leaks and the verdict is in there's quite a lot of stuff about Google Gemini that you probably should know so in this video we're going to dive into absolutely everything that exists on the internet all research every piece of data that you should know about Google's Gemini because boy oh boy I think Gemini will shock the world and I do think it will fundamentally change what we know as AI systems on the internet so let's get straight into things so this is the

Gemini Leak

singularity subreddit this is the subreddit where people discuss things pertaining to the technological singularity and any related topics for example artificial intelligence and of course human enhancement now a couple of days ago around 5 days ago someone posted this okay techman 123 Gemini leaked alongside stubs an app builder powered entirely through AI so if you don't know what stubs is essentially an AI product that allows you to generate your own AI generated applications directly from the maker Suite A Google product and you can see here this is the entire product so what happened with this leak is that the Gemini model leaked in maker Suite so it appears to be the platform that employees are using to test the model alongside stubs so this is the image that we do have here and allegedly this is the Gemini model so you can see that multimodal is seen in the maker Suite alongside the image input and you can see right here that it says multimodal and of course if you don't know what multimodal is just essentially more than one input so of course multimodal is or can be text and image text and audio or text and video but video is something that we haven't managed to do yet so you can see right here that you can write your prompt and this is where the leak is somewhat confirmed or unconfirmed you can see right here that the name or the ID is Gemini you can see it right here on the code it says multimodal itm confirmed to be Gemini so it says the multimodal it does not appear for everyone rather it is only rolled out to a few people as Google has stated including Google employees testers and other Affiliated companies then we have stubs an app generator that seems to be powered through Gemini to create fully-fledged apps and publish them in one place though the first leak provides an overview technical details of stubs will be out soon to help you form your own opinion as to what it does in another leak so this is very interesting because as you know Gemini is supposed to be Google's new flagship model that is supposed to Dethrone opening eyes GPT 4 now there's another article which fully breaks down these entire leaks and we're going to go through it because it's rather interesting because there are a few things that we do need to confirm and see now what was interesting was that this article was actually published around 10 days ago it says Gemini is coming to make a s and so are stubs now of course right here you can see that they do talk about stubs and this is essentially the app where you can create working apps directly in a site with just one prompt a revolutionary app that hasn't been mentioned anywhere so I don't know if this has ties to Gemini I don't think it does because if it did although it might we don't know yet it could mean that Google is going to be the first company to create a fully fledged AGI I genuinely wouldn't be surprised if this was the case you can see right here that this person talks about this won't replace app developers but rather be a massive boost to the industry from the looks of it will be like an AI generated figma prototypes and won't create full code but rather working AI generated prototypes so that means that there's going to be prototypes which are simply based on AI generations and I'm guessing that they're training it on a bunch of AI user interface images and a bunch of videos so it could be something that is linked to Gemini then of course you can see right here you can even generate deploy and even publish your stubs there will be a community Gallery where you can publish it for everyone to see and you can also remix your stubs and have your own twist on an idea now of course Gemini it says time to talk about the multimodality makeer suite has a code name Alkali so does Gemini in the image above you can see a sample section and these are specific to alkali makeer Suite so these cherry pick samples showcase what it can do and it seems like it will do a lot so it says text recognition object recognition captioning and understanding the image so you can see right here this is essentially part of the image now I don't think this is how Gemini is going to be released maybe Gemini is powering certain parts of it so that they can test out the capabilities in house on certain users rather than just keep it private but that is just my best guess that they just decided to roll it out to a few people so you can also see here that it says it also has an output type you can allow it to include images whether this will be capable of generating images or not is unknown it's also possible and more likely that it will include links to external images the description says include images and not generate images now Google funnily enough did actually recently release some image generation software which was actually pretty

Google Image Generation

decent and it wasn't really covered by many people now essentially I think this is what is going to power Gemini's model and if you're wondering which video I'm looking at here this is a video by Matt vid Pro aai I would recommend you watch this video because it goes into detail as to how good this model is now remember this is still Google's experimental model so it's likely that they're going to fine-tune this model continually before a big release if you're wondering how do you access this model like Matt did well you're going actually have to wait because although you may want access to this model to be able to generate very high quality images like this unfortunately if you do try to search for this software and try to use it you're going to be met with this right here it says currently search Labs is only available to a limited number of people in the United States India and Japan and it's only available in English Hindi and Japanese So currently as someone who's in the United Kingdom it doesn't seem like we're going to be getting access to this any time soon so if there is a way that you have found to access this please do leave a comment below because currently I haven't seen any way to be able to do that but as we were stating the results from this do show a promising increase in terms of the complete capability and they do seem to be on part with DAR 3

Gemini Integrations

then of course we can see that it says in some articles you may see Gemini described as an addition to B I'm here to tell you nuh-uh it's an integration it will be in vertex Ai and the available to developers through maker suite and then can be placed anywhere so then it shows you prototype with generative Ai and essentially you're going to be able to use Gemini allegedly in terms of chat prompt free form prompt like chat gpt's system interface and then other ways where you can build an app with it and then you can have the data analysis and of course as we continue to go down here we've seen this screenshot before but this article continues to go ahead and talk about the functional app prototypes then it goes on to state that deepmind SL jetway is multimodal prompt creation that can take in images and seemingly output multimodal content including HTML content now I do have to take this with a grain of salt because I haven't really seen this article published anywhere other than medium and besides some Whispers on Twitter we never really know what the final version is going to be like although sometimes insiders do to give us that piece of information that we may have been looking for I do believe that even if Gemini is going to be powering some of these things in make a suite I do believe that Gemini's release is likely going to be at a conference or some kind of Google event because Google is trying to make a statement with their large language model and doesn't just want to release it haphazardly because they know that there is true competition with GPT for an opening I as they've already surpassed them and you have to understand that Gemini is going to be no joke there's tons of information as to why this is largely going to be better than GPT 4 which means that the release of this product is likely going to be separate to B I think B is probably going to be a standalone version whereas Gemini is soup up version that's going to be completely multimodal and potentially even have agents so there's two points with as to why

Why Im EXCITED

Google's Gemini makes me really EXC excited because they're doing things that are really Innovative you see the Google deepmind team the team that are working on Gemini are going to use an Innovative technique okay and this Innovative technique is basically very similar to Alpha go system known for mastering the complex game go so if you don't know what that is I'm going to play a clip from that now go is the world's oldest continuously played board game it is one of the simplest and also most abstract beating a professional player at go is a longstanding challenge of artificial intelligence everything we've ever tried in AI just Falls over when you try the game of Go the number of possible configurations of the board is more than the number of atoms in the universe Alpha go found a way to learn how to play go so far Alpha go has beaten every challenge we've given it but we won't know its true strength until we play somebody who is at the top of the world likely to and basically what you're going to be able to do with Gemini the reason it's going to be truly incredible is that it's going to take advantage of that tree search capability I'm not sure if this is a default in GPT 4 GPT 4's abilities have been up and down with even some research papers stating that GPT 4 has gotten worse over time now of course with this ability to perform tree search like capabilities we know that Gemini should be vastly smarter than anything prior now what's also interesting was that Sunda Pai also hinted at Future capabilities like memory and planning that could enable tasks requiring reasoning so this is definitely something that is more interesting than the text and images because something like memory and planning shows us that they are looking for the long-term Vision with this large language model you see planning was always something that when we talked about AI was seem to be very dangerous but now that they're seemingly incorporating this willingly into Gemini means that they're pushing the boundaries on what they believe is risky

Additional Details

then of course a few additional details were obtained from Demis aabis he said that Gemini will be a series of models that will be made available in various sizes and capabilities and it's going to utilize memory and fast checking against sources like Google search and improved reinforcement so I think this might be Google's Edge because being able to fact check against Google not like with chat GPT or browse with chat GPT which just isn't really great I think this is going to give it an extra ability to where it's connected to the internet and has that improved reinforcement is going to give it an edge over GPT 4 because the data is going to be up to date additionally Google's tactic of releasing much smaller models which are going to be able to be ran on your phone and offline which they talked about at the conference earlier this year means that Gemini could be one of those models that you install to your phone you don't need any internet and you simply use it as you would use chat TBT but you just don't need internet for it so that would definitely be a first what's also interesting was that they said the early results of Google Gemini are promising they stated that Google's AI may go on to employ retrieval methods to Output entire blocks of information rather than a word by word generation to improve factual consistency this would be really interesting because as you know large language models output text word by word but if Google Gemini manages to Output complete blocks of text instantly that would be something different than what we've seen before and I would like to see how this works and if it is more factual or less factual CU I do believe that a block of text being output instantly could result in more or less hallucinations but it will be something that we will have to play with now overall I do think Gemini is going to be better than GPT 4 because Google literally has to make it better than GPT 4 or else people are just going to lose faith in Google's ability to put out a product that is on par with what open AI have because right now they're leading the race and everybody is waiting for their product next and if it isn't up to the hype I think people are just going to give the crown to open Ai and just wish Google a goodbye so I think the fact that Gemini is going to be multimodal utilizing new Frameworks in terms of planning and the ability to solve problems and even potentially the ability to use certain applications and generate apps instantly I think it's got a very good shot at being one of the large language models out there or AI systems that leads us towards the first stages of AGI now I know that is definitely a crazy statement but I wouldn't be surprised if Google Gemini surprises everyone as for a release date we aren't really sure many people have speculated that it might be late 2023 but I would predict early 2024 just because of safe testing and how much catching up Google have to really do in addition to that I do think that Google would rather release a tool that's better later than to release it early like they did with bod and risk getting caught with their pants down once again

Другие видео автора — TheAIGRID

Ctrl+V

Экстракт Знаний в Telegram

Экстракты и дистилляты из лучших YouTube-каналов — сразу после публикации.

Подписаться

Дайджест Экстрактов

Лучшие методички за неделю — каждый понедельник