Googles New SELF IMPROVING AI 'Robocat' Takes Everyone By SURPRISE! (Now ANNOUNCED!)
11:50

Googles New SELF IMPROVING AI 'Robocat' Takes Everyone By SURPRISE! (Now ANNOUNCED!)

TheAIGRID 25.06.2023 32 249 просмотров 393 лайков

Machine-readable: Markdown · JSON API · Site index

Поделиться Telegram VK Бот
Транскрипт Скачать .md
Анализ с AI
Описание видео
Googles New 'Robocat' Takes Everyone By SURPRISE! (Now ANNOUNCED!) Robocat - https://www.deepmind.com/blog/robocat-a-self-improving-robotic-agent Welcome to our channel where we bring you the latest breakthroughs in AI. From deep learning to robotics, we cover it all. Our videos offer valuable insights and perspectives that will expand your knowledge and understanding of this rapidly evolving field. Be sure to subscribe and stay updated on our latest videos. Was there anything we missed? (For Business Enquiries) contact@theaigrid.com #LLM #Largelanguagemodel #chatgpt #AI #ArtificialIntelligence #MachineLearning #DeepLearning #NeuralNetworks #Robotics #DataScience #IntelligentSystems #Automation #TechInnovation

Оглавление (3 сегментов)

Segment 1 (00:00 - 05:00)

so Google's specialist AI division Google deepmind has once again raised the bar in terms of Robotics they've introduced something called Robocat a self-improving robotic agent and like most times people have missed the mark as to why this is truly revolutionary and it's no surprise because deepmind if you aren't really sure as to what they've done before this is the same company slash research team that has come up with many different instruments before that have truly changed our world they're responsible for this project called Alpha fold which is a program that can accurately predict 3D models of protein structures and is accelerating research in nearly every field of biology they are also the team that crafted alphago which is the first computer program to defeat a professional human go player and the first to defeat a girl world champion and is arguably the strongest gold player in his and they also created a wavenet which creates more natural sounding speech for products used by Main millions of people around the world so the reason I gave you the backstory of that team is because you need to understand that this team is if not the best one of the best when it comes up to introducing breakthroughs in a world of artificial intelligence and Robocat is no different so now a quick disclaimer before this video does delve into some more detail deepmind didn't actually create the robot what they've actually done is created the system that powers the robot so this what you're going to be looking at is essentially an artificial intelligence program and I know some people might get confused as to what the robot actually is the robot is a sawyer robot which is a high Precision robot which is used in many different factories so the reason that this new Robocat is honestly so impressive is because of what it's able to accomplish so the blog post starts out by explaining that robots are particularly coming newly integrated into our lives but there is a large problem in robotics which many people have started to realize and it's something that we are also familiar with when looking at large language models and that is the problem slash bottleneck of training data so in order to have a robot perform something effectively it needs to essentially be trained on a set of data that it can then understand exactly what to do now oftentimes what we do is we'll have a robot understand or watch a million or hundreds of thousands of different examples before being deployed and before the robot slash artificial intelligence manages to perfect the said task now what Deep Mind have done is they've actually managed to get this Robocat to self-generate new training data to improve its technique and significantly cut down the time to make itself understand what specific tasks to do and for the specific number that Deep Mind talk about they say that Robocat learns much faster than other state-of-the-art models and it can pick up a new task with as few as a hundred demonstrations because it draws from a large and diverse data set and this capability is going to help accelerate robotics research and is going to reduce the need for human supervised training and pretty much it's going to step us a giant leap in the general purpose robot slash AGI era which we are quickly embarking now another thing that I did find truly interesting about deepminds Robocat was the fact that this robot was essentially programmed to not only work across one specific hand but to pretty much work on any hand that there was which is definitely quite scary because it's quite like those movies where you get an AI system that sort of infiltrates a system and understands immediately how to use everything even though it's never seen it before and that's what we did see with robocap the Deep Mind also did publish these specific examples of where these robots were tasked with a gold image and then obviously asked to recreate said image now these robots did perform the task exceptionally well and what deepmind also showcased was that even when Shona tasks that they hadn't seen before in training and even applied to a robot which hadn't been seen in training before the robot slash artificial intelligence program was able to do this very effectively and this is why this is such a large breakthrough and there's honestly a lot that this paper does cover which we'll explore late there's also an area in which we see the robot slash artificial intelligence program react to disturbances in the environment and this is something that we did see in earlier papers from Google before but this was still nice to see even on a artificial intelligence program which is still in relatively early stages which means that the these robots are going to

Segment 2 (05:00 - 10:00)

be very effective at real world scenarios because as you know the real world isn't just a test facility where we have a few objects that are always going to be things that happen that don't go according to plan and it's important for these robots to be able to quickly and robustly adapt to these scenarios which is what we see it demonstrated here now the most interesting thing for me when doing research on Robocat was the fact that Google has their own multi-modal model called gato which is essentially Spanish for cap also pronounced a gatter now essentially why this is so incredible is because we know that Google's Deep Mind is currently working on a product called Gemini now the Gemini artificial intelligence program as you may know is set to rival open ai's gpt4 slash gpt5 and they are working very hard on this gato multimodo model which can process language images and actions in both simulated and physical environments seems to be already very effective and we all know that multimodal models are going to be the models of the future so the reason Robocat can actually improve itself autonomously is because they manage to combine gato's architecture with a large training data set of sequences of images and actions of various robot arms solving hundreds of different tasks so what we have here is robocat's training cycle which was boosted by its ability to autonomously generate additional training data so step one was to collect a hundred to a thousand demonstrations of a new task or robot using a robotic arm controlled by a human then it was to fine tune rubber cat on this new task slash arm creating a specialized spin-off agent then the spin-off agent practices on this new task slash arm on an average of 10 000 times generating more training data and then incorporate the demonstrational data and self-generated data into robocat's existing training data set then you can train a new version of Robocat on the new training data now what I did find interesting is that Robocat training data differs from the traditional approach you see when you're training a robot essentially you're going to be using five common forms of training data which is number one sensor data which the robot arms are often used various different sensors to perceive the environment this sensor data can include information from the cameras depth sensors Force sensors joint encoders or any other relevant sensors and also we have state information which is the state of the robot arm such as joint angles joint velocities and effector position or orientation can be recorded as training data then of course we have demonstration data which involves a human operator or expert performing the task using the robot arm then we have reinforcement signals and reinforcement learning algorithms require additional feedback in the form of reward or reinforcement signals and these essentially can indicate the success or failure of a particular action or state guiding the learning process and simulation data which is of course what we see here where these are synthetic generated simulated versions where you simply have sensor readings that are simulated robot configurations which are all simulated in an environment like nvidia's Isaac Sim but what made this very good was we had two additional versions of training data which was of course videos and this was able to be done by their multimodal feature so I'm guessing it just simply had the video of it and was able to learn via that then of course we had the self-generated training data which is of course another method of training your data as well so I do find that this is all very groundbreaking stuff like we stated earlier robocat's diverse training data was so vast that it essentially was able to train itself on arms that it hadn't seen before and able to adapt to these different controllable inputs so on the left you can see a robotic arm that Robocat learned to control and then you of course you can see a video of Robocat using this new armor to pick up the gears the paper further goes on to say that after a thousand only just a thousand human-controlled observations collected in hours Robocat could direct this new arm with enough grip to pick up the gear successfully 86 of the time with the same level of demonstrations it could adapt to solve tasks that combined precision and understanding such as removing the correct fruit from a bowl and solving a shape matching puzzle which are necessary for more complex control and remember about the key to

Segment 3 (10:00 - 11:00)

why this robot is such a large statement from Google's Deep Mind is the fact that this robot is able to continuously self-improve and this is all down to its virtuous cycle of training the more new task it learns the better it gets at learning additional new tasks so the initial version of Robocat was successful just 36 of the time on previously unseen tasks after learning from 500 demonstrations per task but the latest Robocat which had trained on a greater diversity of tasks more than doubled this success rate on the same exact tasks so the ability to independently learn skills and rapidly self-improve especially when applied to different robotic devices will help pave the way to a new generation of more helpful robotic agents so to conclude Robocat demonstrates the ability to generalize new tasks and robots in both zero shot and through adaptation using a relatively small number of examples and remember zero shot is simply where you haven't seen the task whatsoever and this adaptability and generalization make Robocat highly versatile and efficient and learning new skills and Robocat can generate its own training data for subsequent iterations creating a unique self-improvement Loop and this iterative learning process contributes to its autonomous Improvement and enhances its capabilities over time so overall robocat's combination of visual goal conditioning adaptability generalization and self-improvement sets it apart from previous AI robotic systems making it a groundbreaking development in the field and I do think that the future applications of deepminds gato multimodal multitask multi-embodiment generous policy can have significant impacts on further applications

Другие видео автора — TheAIGRID

Ctrl+V

Экстракт Знаний в Telegram

Экстракты и дистилляты из лучших YouTube-каналов — сразу после публикации.

Подписаться

Дайджест Экстрактов

Лучшие методички за неделю — каждый понедельник