# Google Deepmind CEO Shocks Everyone With NEW Statements On GEMINI

## Metadata

- **Channel:** TheAIGRID
- **YouTube:** https://www.youtube.com/watch?v=oIxw_rYvAuc
- **Date:** 09.08.2023
- **Duration:** 9:56
- **Views:** 18,774

## Description

Welcome to our channel where we bring you the latest breakthroughs in AI. From deep learning to robotics, we cover it all. Our videos offer valuable insights and perspectives that will expand your knowledge and understanding of this rapidly evolving field. Be sure to subscribe and stay updated on our latest videos.

Was there anything we missed?

(For Business Enquiries)  contact@theaigrid.com

#LLM #Largelanguagemodel #chatgpt
#AI
#ArtificialIntelligence
#MachineLearning
#DeepLearning
#NeuralNetworks
#Robotics
#DataScience
#IntelligentSystems
#Automation
#TechInnovation

## Contents

### [0:00](https://www.youtube.com/watch?v=oIxw_rYvAuc) intelligence landscape.

There have been some absolutely wild developments in the artificial intelligence landscape. Google DeepMind CEO Demis Hassabis has stated that the new AI they are working on, called Project Gemini, is going to eclipse ChatGPT, or in other words be much better than the current model that everyone loves, which is GPT-4. The company's vision for Gemini is ambitious: merging the strengths of their renowned game-playing AI with the expansive language understanding of models like ChatGPT. This unique blend seeks to establish a new standard in the AI world, combining deep knowledge with advanced strategic thinking. The incorporation of AlphaGo's techniques is especially noteworthy. AlphaGo's historic victory over a Go world champion in 2016 showcased the prowess of DeepMind's reinforcement learning methods. In the realm of strategic games like Go, where there are more potential board configurations than atoms in the universe, the software's ability to predict and plan was

### [1:02](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=62s) and plan was groundbreaking

groundbreaking. By integrating these techniques with a large-scale text model, Gemini could potentially bridge the gap between raw information processing and nuanced

### [1:11](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=71s) information processing

decision making. Take a look at this clip of AlphaGo, where the extraordinary capabilities of AlphaGo are demonstrated.

"World's oldest continuously played board game. It is one of the simplest and also most abstract. Beating a professional player at Go is a long-standing challenge of artificial intelligence. Everything we've ever tried in AI just falls over when you try the game of Go. The number of possible configurations of the board is more than the number of atoms in the universe. AlphaGo found a way to learn how to play Go. So far, AlphaGo has beaten every challenge we've given it, but we won't know its true strength until we play somebody who is at the top of the world, like Lee Sedol."

Now, this is something quite similar to research from earlier this year called Tree of Thoughts. Tree of Thoughts was basically where they got GPT-4 to list out all possible answers to a question, then rank them in terms of which is likely to be most correct. It improved the reasoning by 400 percent. However, if Gemini uses tree search, it will be remarkably interesting. You see, tree search is quite similar but a little bit different. Tree search looks at many moves. Think of tree search like exploring a big tree where each branch is a possible move in the game. The more it looks, the bigger and more detailed

### [2:44](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=164s) detailed the tree becomes.

the tree becomes. Balancing act: tree search has a cool

### [2:48](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=168s) Act Tree search

strategy. Instead of just looking at every possible move, it balances between trying out new moves (exploration) and

### [2:55](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=175s) new moves, exploration

focusing on moves that seem really good based on what it knows (exploitation).
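
The exploration/exploitation balance described above can be illustrated with the standard UCB1 score that is commonly used in Monte Carlo tree search. This is a minimal sketch, not AlphaGo's actual selection rule (AlphaGo uses a variant that also weights moves by a neural-network prior). The function names, the `stats` layout, and the constant `c` are assumptions made for illustration only.

```python
import math


def ucb1_score(total_value, visits, parent_visits, c=1.4):
    """Score one move: average result so far plus an exploration bonus.

    Illustrative helper (hypothetical name). `c` controls how strongly
    rarely tried moves are favoured.
    """
    if visits == 0:
        # A move that was never tried is always explored first.
        return math.inf
    exploitation = total_value / visits  # how good the move looked so far
    exploration = c * math.sqrt(math.log(parent_visits) / visits)
    return exploitation + exploration


def select_move(stats, c=1.4):
    """Pick the move with the highest UCB1 score.

    `stats` maps a move name to (total_value, visits); this layout is an
    assumption for the sketch.
    """
    parent_visits = sum(visits for _, visits in stats.values())
    return max(
        stats,
        key=lambda move: ucb1_score(stats[move][0], stats[move][1], parent_visits, c),
    )


# Move "a" has the better average, but "b" has barely been tried.
stats = {"a": (6.0, 10), "b": (1.0, 2)}
print(select_move(stats))         # exploration bonus included
print(select_move(stats, c=0.0))  # pure exploitation, no bonus
```

With the bonus switched off (`c=0.0`) the search only repeats what already looks best; with it switched on, rarely visited moves get a second look, which is the balancing act the video describes.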

### [3:02](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=182s) Exploitation

AI boost with neural networks: AlphaGo gives the tree search method a boost. It uses artificial brain-like systems called neural networks to help choose moves and assess the game situation. This

### [3:14](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=194s) and assess the game situation.

makes AlphaGo smarter than other Go

### [3:17](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=197s) smarter than other go programs

programs. Starting point and exploration: AlphaGo

### [3:21](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=201s) and exploration.

begins by looking at the current game

### [3:24](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=204s) at the current game situation

situation and then explores different moves from there. It keeps exploring until it finds a move it hasn't fully checked out yet. Refining choices:

### [3:33](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=213s) Refining Choices

as AlphaGo keeps looking at the tree and exploring, it gets a better and better idea of the game and what moves might be best. Hassabis has also hinted at other innovative features in Gemini that have yet to be disclosed. The promise of

### [3:48](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=228s) interesting innovations

interesting innovations, combined with the noted multimodal capabilities,

### [3:51](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=231s) multi-modal capabilities

implies a breadth of application beyond what current models offer. The emphasis on multimodality suggests that Gemini might be capable of understanding and generating content across various media types, be it text, image, sound, or even potentially video. Moreover, the model's planned proficiency in integrating with tools and APIs means it could be a game changer in automation and system-to-system communication.

Now, when we do discuss multimodal capabilities in AI, it's important to note that most AIs currently aren't multimodal, although with the release of GPT-4 they did tease image functionality in this clip here:

"Let's actually try this one as well. What's funny about this image? Oh, it's already been submitted. So once again we can verify this, making the right API calls. Squirrels do typically eat nuts; we don't expect them to use a camera or act like a human. So I think that's a pretty good explanation of why that image is funny."

It still is yet to have a worldwide release, although some users can access it in Bing Chat. Most AIs that are currently available exist as narrow AI, or more easily understood as AI with one

### [5:02](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=302s) with one specific purpose.

specific purpose: for example, ChatGPT for generating text, using ElevenLabs for a

### [5:05](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=305s) GPT for generating text

voice-over or audio generation, and using Midjourney for images. But what if we were able to combine it all into one AI? Well, it's not like it hasn't been done before; that's what Microsoft's earlier project aimed to do. Microsoft Jarvis is an innovative multimodal AI-powered platform that can connect and collaborate with multiple artificial intelligence models to

### [5:31](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=331s) to deliver a final result.

deliver a final result. Named after Iron Man's personal AI assistant, Jarvis aims to bring together

### [5:36](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=336s) Jarvis aims to bring together

the power of the open-source community

### [5:39](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=339s) community and chat GPT.

and ChatGPT. The platform is hosted on Hugging Face

### [5:43](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=343s) hosted on hugging face

and it is connected to as many as 20 different models, including T5-base, Stable Diffusion 1.5, BERT, Facebook's BART-large-CNN, Intel's DPT-Large, and more. The standout feature of Jarvis is the idea behind it, which can be condensed to the definition "language as an interface". By using language as a general interface and putting the LLM (large language model) in the brain position, it is possible for

### [6:11](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=371s) in the brain position.

many different specialized AI models to

### [6:14](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=374s) for many different specialized

work together. This allows Jarvis to handle various tasks such as pose detection, image

### [6:22](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=382s) detection, image generation

generation, image classification, image captioning, and text-to-speech by calling

### [6:26](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=386s) captioning and text to speech

on the appropriate models. Jarvis works similarly to how OpenAI demonstrated GPT-4's multimodal capabilities with

### [6:33](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=393s) four's multimodal capabilities

text and images, but it takes it one step further by integrating various open-source LLMs for images, videos, audio, and more. It can also connect to the internet and access files, allowing users to enter

### [6:46](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=406s) and access files

a URL from a website and ask questions about it. So with this in mind, it seems like Google DeepMind could utilize this framework in order to create Gemini.
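
The "language as an interface" idea described above for Jarvis, where a language model sits in the brain position and hands work to specialized models, can be sketched roughly as follows. This is a toy illustration under stated assumptions, not Microsoft's implementation: the specialist functions are placeholders that return strings instead of calling real models, the planner is a simple keyword matcher standing in for the LLM, and every name here (`REGISTRY`, `plan_tasks`, `run`) is hypothetical.

```python
from typing import Callable, Dict, List


# Placeholder "specialist models". In a real system each would call an
# actual model; here they only return a tagged string.
def caption_image(payload: str) -> str:
    return f"[caption for {payload}]"


def text_to_speech(payload: str) -> str:
    return f"[audio for '{payload}']"


def generate_image(payload: str) -> str:
    return f"[image of {payload}]"


# Hypothetical registry mapping a task name to the specialist that handles it.
REGISTRY: Dict[str, Callable[[str], str]] = {
    "image-captioning": caption_image,
    "text-to-speech": text_to_speech,
    "image-generation": generate_image,
}


def plan_tasks(request: str) -> List[str]:
    """Toy planner standing in for the LLM 'brain'.

    It maps phrases in the request to task names by keyword matching.
    A real controller would ask a language model to produce this plan.
    """
    keywords = {
        "describe": "image-captioning",
        "read aloud": "text-to-speech",
        "draw": "image-generation",
    }
    text = request.lower()
    return [task for phrase, task in keywords.items() if phrase in text]


def run(request: str, payload: str) -> List[str]:
    """Plan the request, then dispatch each task to its specialist."""
    return [REGISTRY[task](payload) for task in plan_tasks(request)]


print(run("Draw this and read aloud the result", "a squirrel"))
```

The point of the design is that the request and the plan are both plain language or simple task names, so new specialists can be added to the registry without changing the controller.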

### [6:57](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=417s) in order to create Gemini

That would certainly be much more powerful than GPT-4. Now, one thing that they did say which concerned me was this right here: they said that they are aiming to give the system new capabilities such as planning or the ability to solve problems. Now, that sounds good in theory, but one of those things is much more dangerous than the others, and that's planning. You see, giving an AI the ability to plan long-term goals can be dangerous due to the risks of existential catastrophe and

### [7:28](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=448s) of existential catastrophe

incomplete goal specifications.

### [7:31](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=451s) Specifications

Misalignment of objectives: if the AI's long-term goals are not perfectly aligned with human values and interests, it could pursue actions that are harmful or contrary to our well-being. An AI with poorly defined or misaligned objectives might prioritize its own

### [7:48](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=468s) objectives might prioritize

self-preservation or optimization at the

### [7:51](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=471s) or optimization

expense of humans. Unforeseen consequences: long-term

### [7:55](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=475s) Unforeseen consequences.

planning involves considering complex

### [7:59](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=479s) considering complex scenarios

scenarios and making predictions about the future. AI systems may not fully comprehend the consequences of their actions, leading to unforeseen and potentially negative outcomes. The AI

### [8:09](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=489s) negative outcomes.

could pursue its goals without understanding the broader implications, leading to unintended harm.

### [8:15](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=495s) leading to unintended harm.

Lack of adaptability: long-term planning

### [8:18](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=498s) Lack of adaptability.

implies setting specific objectives that

### [8:21](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=501s) setting specific objectives

the AI will work towards over extended

### [8:25](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=505s) extended periods.

periods. However, the world is constantly changing, and unforeseen events may alter the relevance or desirability of those goals. An inflexible AI could continue to pursue outdated or harmful objectives

### [8:38](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=518s) or harmful objectives.

despite changing circumstances. Resource optimization at all costs: an AI

### [8:42](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=522s) resource optimization

with long-term planning capabilities may

### [8:48](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=528s) may prioritize resource

prioritize resource optimization to achieve its objectives, which could lead to excessive consumption of resources, monopolization, or unethical practices.

### [8:55](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=535s) monopolization or

Superintelligence risk: an AI with the

### [9:02](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=542s) with the capacity for long

capacity for long-term planning could potentially become superintelligent, surpassing human intelligence and

### [9:08](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=548s) and understanding

understanding. Such an AI may be difficult to control or predict, increasing the risks associated with its actions. Lastly, the projected availability of Gemini in

### [9:18](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=558s) availability of Gemini

different sizes and capabilities indicates DeepMind's intent to cater to various user needs, from lightweight models suitable for

### [9:26](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=566s) from lightweight models

mobile applications to behemoths

### [9:30](https://www.youtube.com/watch?v=oIxw_rYvAuc&t=570s) crunching data in the cloud.

crunching data in the cloud. The flexibility of Gemini could redefine AI accessibility. In wrapping up, Google DeepMind's Gemini represents a bold step forward in AI's evolution. With a rich blend of strategies from gameplay and language understanding, bolstered by undisclosed innovations, Gemini stands to reshape the landscape of artificial intelligence and its applications in the modern world.

---
*Source: https://ekstraktznaniy.ru/video/14747*