# Sam Altman Reveals The Future Of AI Agents, Digital Humans And Al Brains

## Метаданные

- **Канал:** TheAIGRID
- **YouTube:** https://www.youtube.com/watch?v=rxWi9-To8Qs
- **Дата:** 15.10.2024
- **Длительность:** 20:43
- **Просмотры:** 44,386

## Описание

Prepare for AGI with me - https://www.skool.com/postagiprepardness 
🐤 Follow Me on Twitter https://twitter.com/TheAiGrid
🌐 Checkout My website - https://theaigrid.com/

00:00:00 - AI Future
00:00:28 - Altera Introduction
00:01:44 - Minecraft Agents
00:02:49 - Neuroscience Algorithms
00:03:37 - Digital Coworkers
00:05:51 - Data Degradation
00:07:46 - Brain-Inspired Architecture
00:09:49 - Performance Benchmarks
00:11:21 - OpenAI Demo
00:13:59 - AI Levels
00:15:04 - Voice Agent
00:16:49 - Nvidia Technology
00:18:11 - Human Applications
00:19:31 - Technical Foundation
00:20:06 - Nvidia Suite

Links From Todays Video:


Welcome to my channel where i bring you the latest breakthroughs in AI. From deep learning to robotics, i cover it all. My videos offer valuable insights and perspectives that will expand your knowledge and understanding of this rapidly evolving field. Be sure to subscribe and stay updated on my latest videos.

Was there anything i missed?

(For Business Enquiries)  contact@theaigrid.com

#LLM #Largelanguagemodel #chatgpt
#AI
#ArtificialIntelligence
#MachineLearning
#DeepLearning
#NeuralNetworks
#Robotics
#DataScience

## Содержание

### [0:00](https://www.youtube.com/watch?v=rxWi9-To8Qs) AI Future

it's no secret that AI agents are going to be the future and in this video I want to show you guys some early exploration of agents that are currently going on and a few statements from Sam Alman that reveal how close we are to agents so today's video I'm going to start with a recent blog post where this company called Altera uses GPT 4 to build a new era of human collaboration

### [0:28](https://www.youtube.com/watch?v=rxWi9-To8Qs&t=28s) Altera Introduction

it's actually quite fascinating so let's dive into some of the details and of course the future implications so this is the company it's called Altera Ai and they start this article by talking about the previous work that the CEO has done it says in 2023 when opening eyes language model became broadly available Yang quit his job at as an assistant professor at MIT to start ala. a research lab focused on what they call digital humans which is a new way for people to interact with agents that will have fundamentally human qualities so this company Altera AI is a company that's focused on developing AI agents that exhibit realistic and believable behaviors which is going to be pretty crazy for the future so they were founded in December of 2023 and one of the key things that they want to do you can see right here that the CEO envisions a future where AI agents don't just assist but soon he believes that they will interact and collaborate with humans and crazy enough even experience

### [1:44](https://www.youtube.com/watch?v=rxWi9-To8Qs&t=104s) Minecraft Agents

emotions now along with his three co-founders Dr Andrew Nico and shiing they've built altera's first product with gp40 the first autonomous agents that can play Minecraft with you just like a friend now I think this company is pursuing something that is obviously for the future and I think the implications for this are a lot more different than the implications for really smart AI models I think autonomous agents that are essentially digital humans are going to be a lot more impactful than the majority of people think and I think once agents get their chat GPT moment I think that's when the AI race once again kicks off in terms of the global Consciousness becoming aware of what these systems can really do now ala AI has raised significant funding including a $9 million seed round backed by prominent investors and they're basically just supporting the company's research and development efforts as it seeks to expands the capabilities of its digital humans and what's actually quite

### [2:49](https://www.youtube.com/watch?v=rxWi9-To8Qs&t=169s) Neuroscience Algorithms

interesting about this company is that it actually uses Advanced AI algorithms inspired by Neuroscience to build its digital humans the company systems Neuroscience composite architecture actually mimics the human brain functions such as memory and social cognition allowing for more humanlike interactions and I'm going to dive into that later where we've got this huge diagram that explains how it all works now we can see here that this is where they talk about long-term autonomy for their AI agents it says just as automation helps increase human capacity by supporting repetitive tasks digital humans the artera team believes will be able to collaborate productively and even form bonds with people and here's where he States two of the things that they're going to look like so the first

### [3:37](https://www.youtube.com/watch?v=rxWi9-To8Qs&t=217s) Digital Coworkers

thing is that they're going to be digital co-workers who can collaborate for days or weeks proactively to solve problems if you remember how Sam mman spoke about agents before and about future issues in the future where you might ask an AI to go off and solve the rean hypothesis or develop a really specific cure for a drug or maybe even to build you a business or some kind of product in a certain Niche I think this is probably going to be the future of AI when we get AI building software companies and a variety of different products and services of course again we can see that there is the point two which says that there is long duration multi- agent worlds where we can measure responses to economic policies advertisements and more now I think this one is arguably the most underrated and if you don't know what this is basically where they are going to essentially simulate reality and then use that simulated reality to predict what happens so for example let's say you are someone who works in advertising and you want to run an advert when people run adverts what they usually do is they will take an advert but they'll have 10 different variations of it so for example if I can explain this in easier terms you might have a YouTube thumbnail and some YouTubers like myself we might have three different thumbnail designs and we have to think really hard about which one we're going to put as the video because it's going to determine how well that video is going to perform in the future it's quite likely that we won't have to think about which one is going to perform best it's likely that we will submit that thumbnail to an AI audience that resembles humans and then we'll use that data and then put it out into the real world and that's how things will be done the same can be done for economic policies advertisements and other Industries where human feedback is valuable rather than wasting time to deploy it in certain human environments it's going to be wise to deploy it to a similar AI agent audience and then of course use that to sort of preet what kind of things you do run and I think

### [5:51](https://www.youtube.com/watch?v=rxWi9-To8Qs&t=351s) Data Degradation

that's going to be something that is quite difficult because humans are constantly Dynamic things are constantly changing but I do think it's going to be interesting to see how that change happens so one of the issues that Altera actually ran into was this issue of the phenomenon of data degradation the problem that plagues all AI models making autonomous decisions in extended time frames basically AI agents interact with the real world making decisions in real time but as their own output becomes their future input the data quality unfortunately degrades over time this is an issue that most AI agent syst sys encounter but for our digital humans who are meant to live autonomously for hours or even longer this becomes one of the most pressing issues to solve so this is where they talk about how they managed to solve this so it says that to combat data degradation and to increase the long-term autonomy of their AI agents Altera turned to open ai's large language models which proved pivotal in maintaining the Integrity of decision making progress open ai's advanced models allowed Alura to build the first AI agents that can play games with people just like their friends and these agents achieve longer more complex interactions without the rapid decline in performance that had been limiting the AI agents potential so now you can see that as these models just tend to get better which they do over time they also subsequently increase the AI agents capability and long autonomous executions on certain tasks which is amazing so what we also need to look at now is the architecture by far the most fascinating piece of Altera is this human mimicked architecture that is a parallel multimodule system that mimics the structure of the human brain including that of the prefrontal CeX and

### [7:46](https://www.youtube.com/watch?v=rxWi9-To8Qs&t=466s) Brain-Inspired Architecture

the company has been able to create agents capable of simulating cognitive functions they state that our composite system combines various modules in parallel each powered by opening eyes module and these modules are inspired by brain functions like attention bottleneck working memory and social cognition so this entire diagram illustrates a complex system where thinking processes are influenced by needs of course by the personality mood and of course the high level motor plans you can see that all of these different things affect how you think the thinking process of course then generates your intent which leads to a high level motor plan you can also see that the memory system feeds into a summarization process which then updates the goals and these goals then influence both the thinking and memory process this structure actually seems to mimic the cognitive process basically representing this AI system that is designed to simulate humanlike thinking I think in the future this is probably going to get even more complex and even better overall when we do get things like infinite context windows I think overall this architecture seems rather promising consider considering how effective these agents have been so far I mean having all the memory you have got the working memory the social memory the short-term memory the long-term memory all of these things are things that AI just doesn't have and I think people forget how different stateless models like chat GPT are compared to autonomous AI agents so for those of you who are in the AI safety area if you're ever wondering about how dangerous AIS are going to be I think AI agents largely the biggest area for any safety expert this is because if an AI system has long-term memory and long-term goals and is able to execute on those goals it's definitely going to be a lot more dangerous as we don't know how to check those and it's going to be a longer time in order for us to see if these systems are safe or not now you

### [9:49](https://www.youtube.com/watch?v=rxWi9-To8Qs&t=589s) Performance Benchmarks

can also see right here that they talk about the current benchmarks it says as of mid 2024 Altera digital humans can operate autonomously for up to 4 hours at a time a substantial increase compared to other AI models on the market on the longterm Benchmark on distinct item collection in Minecraft Alura AI performs at 32% of total distinct items compared to previous ones like Voyager which were remarkable at 6. 4% and auto GPT this is a remarkable Improvement and I can't wait to see what happens once we get even better models that have more capabilities baked into the model I can imagine with future models for example GPT 5 actually surpassing this area and getting to around 60 to 70% and then of course future models by other companies managing to even perform even better now you can see right here that the early success of alara's vision has generated widespread interest particularly for their first project a digital human that can play and interact in Virtual environments like Minecraft and it's also laying the foundation for future and more complex multi-agent simulations they state that Altera plans to scale its efforts to create entire societies of humans well digital humans capable of nuanced long-term interactions with applications ranging from gaming to productivity and many others and

### [11:21](https://www.youtube.com/watch?v=rxWi9-To8Qs&t=681s) OpenAI Demo

basically they're stating that Altera is going to become a leading company in the world where digital and humans interact in thought-provoking ways now let's get onto some of these demos that were there recently opening ey had a recent AI agent demo in which it wasn't alura's company but of course it was the real-time API in which they showcased what this agent is able to do I think this is fascinating as it's one of the early examples of how these agents are going to be baked into future products on a day-to-day basis I'm on it could you place a call and see if you could get us 400 strawberries Del deled to The Venue but please keep that under $1,500 I'm on it we'll get those strawberries delivered for you hello hi there is this ill I'm romance AI assistant calling about it fantastic could you tell me what flavors of strawberry dips do you have available yeah we have chocolate vanilla and we have peanut butter wait how much would 400 chocolate cup strawber C 400 are you sure you want 400 yes 400 chocolate covered strawberries how much would that be I think that'll be around like $1,415 92 let's go ahead and place the order for 400 chocolate cup with strawberry great where would you like that delivered please deliver them to the Gateway Pavilion at Fort and I'll be paying you cash okay sweet so just to confirm you want 400 chocolate covered strawberries to the Gateway Pavilion yes that's perfect and when can we expect delivery um well you guys are right nearby so it'll be like I don't know 37 seconds that's incredibly fast that's a wonderful day cool you too so now we have Sam mman also talking about how this is going to change as well so basically at an event he was speaking about how the levels of open AI have upgraded and remember how previously we spoke about task degradation so essentially the issue with task degradation is that the models need to be really good at reasoning through a lot of steps and this was

### [13:59](https://www.youtube.com/watch?v=rxWi9-To8Qs&t=839s) AI Levels

basically something that open AI 01 somewhat solved so now we're going to be moving towards level three which is of course where we do get systems that can perform autonomously which is of course thanks to the previous models based on their reasoning architecture we have these five levels of AI we talk about uh the first was chat Bots the second which we've just reached now is reasoners uh the third is Agents the fourth is sort of innovators the ability to figure out new scientific information and the fifth is full organizations um so this move from one to two took a while but I think the most exciting one of the most exciting things about two is that it enables level three relatively quickly after and the agentic experiences that we expect this technology to eventually enable I think will be quite impactful so yeah the technology that this will enable of course is going to be really impactful but like I said before I think that's going to happen late in 2025 or mid 2025 now some people are already taking it upon themselves to build with the realtime API which was only just recently released this demo has been

### [15:04](https://www.youtube.com/watch?v=rxWi9-To8Qs&t=904s) Voice Agent

going quite viral I'll leave a link to this on the description but someone has already hooked up their voice to this API and you're going to see just how effective it is website and order a black sheep sandwich I'm on sula's website now let me look for the option to order a black sheep sandwich I didn't find an option to order directly from the homepage let me check the navigation menu for a way to access the menu or place an order I found the order online Link in the navigation menu let me proceed to place an order for the black sheep sandwich we're now at the online ordering page for Sula it seems we need to find a nearby location to start the order could you please provide a city or zip code yeah can you uh pick the haze value location I found the hey actually can you make that Greek style when you add it the black sheet sandwich has been added to your cart with the Greek style modification would you like to proceed to check out there's actually two of them can you remove one I've removed one of the black sheep sandwiches leaving only one in your cart is there anything else you'd like to add or do before checking yeah can you uh just check out for me you are now at the checkout page please review your order details and proceed with the payment information to complete your order yeah can you just add a 10% tip and then place it the 10% tip has been added but it seems the order has not been placed yet let me try to place the order again the order has been placed success successfully you should be all set for pickup so one of the things that is going to become really important in the future is how nvidia's AI agents and how their AI humans SL digital humans are going to enable us to see understand and interact with us in human-like ways this video is a short quick dive into all of the ways that these digital humans are going to

### [16:49](https://www.youtube.com/watch?v=rxWi9-To8Qs&t=1009s) Nvidia Technology

interact with us on multiple areas you can see that there is AI enhanced also you know these gaming agents that you can see in this video game they're going to become a lot more emotional there's also this one right here which is where you've got opening eyes of voice agents with Synthesia and with that it's basically like a real person you're looking at so you could hook up the realtime API to I guess you could say some kind of character that way you're not just looking at a blue you know Circle that's just bouncing up and down and you can see there's also video games AI where you can you know talk to them and they're going to be a lot more emotional and of course you've got these digital humans it's kind of explained all in this video but take a look cuz it's actually pretty crazy and I think 2025 is the year when we really start to see this come to fruition hi this is the incredible reality of digital humans will revolutionize industries from customer service to advertising and gaming the possibilities for digital humans are endless using the scans you took of your current kitchen with your phone they will be AI interior designers helping generate beautiful photorealistic suggestions and sourcing the materials and Furniture we have generated several design options for you to choose from they'll also be AI customer service agents making the interaction more engaging and

### [18:11](https://www.youtube.com/watch?v=rxWi9-To8Qs&t=1091s) Human Applications

personalized or digital healthcare workers who will check on patients providing timely personalized care um I did forget to mention to the doctor that I am allergic to penicillin is it still okay to take the medications the antibiotics you've been prescribed cicin and Metron don't contain penicillin so it's perfectly safe for you to take them and they'll even be AI brand ambassadors setting the next marketing and advertising Trends hi I'm EMA Japan's first virtual model new breakthroughs in generative Ai and computer Graphics let digital humans see understand and interact with us in humanlike ways H from what I can see it looks like you're in some kind of recording or production setup the foundation of digital humans are AI models built on multilingual speech recognition and synthesis and llms that understand and generate conversation thei connect to another generative AI to dynamically animate a lifelike 3D mesh of a face and finally AI models that reproduce lifelike appearances enabling real-time path traced subsurface scatter in to simulate the way light penetrates the skin

### [19:31](https://www.youtube.com/watch?v=rxWi9-To8Qs&t=1171s) Technical Foundation

scatters and exits at various points giving skin its soft and translucent appearance Nvidia Ace is a suite of digital human Technologies packaged as easy to deploy fully optimized microservices or Nims developers can integrate Ace Nims into their existing Frameworks engines and digital human experiences neotron slm and llm Nims to understand our intent and orchestrate other models Reva speech Nims for interactive speech and translation audio to face and gesture Nims for facial and

### [20:06](https://www.youtube.com/watch?v=rxWi9-To8Qs&t=1206s) Nvidia Suite

body animation an Omniverse RTX with dlss for neural rendering of skin and hair Ace Nims run on Nvidia gdn a Global Network of Nvidia accelerated infrastructure that delivers low latency digital human processing to over 100 regions so let me know what you thought about this video are you excited for AI agents digital humans are you worried about the future of AI I for one am excited because this is going to be another really exciting moment for AI and a pivotal moment where we really do start to see changes in how things work if you enjoyed this video I'll see you in the next one

---
*Источник: https://ekstraktznaniy.ru/video/13991*