OpenAIs New Model Stuns Even DOCTORS!

12:52

OpenAIs New Model Stuns Even DOCTORS!

TheAIGRID 26.12.2024 25 986 просмотров 782 лайков

Machine-readable: Markdown · JSON API · Site index

Смотреть на YouTube

Поделиться Telegram VK Бот

Транскрипт Скачать .md

Анализ с AI

Описание видео

Join my AI Academy - https://www.skool.com/postagiprepardness 🐤 Follow Me on Twitter https://twitter.com/TheAiGrid 🌐 Checkout My website - https://theaigrid.com/ 0:00 Introduction to AI in Medicine 0:33 Testing AI with Real Medical Cases 1:02 Key Findings: AI vs. GPT-4 1:36 Case Studies: AI Diagnoses 2:51 Comparing Diagnostic Systems 3:53 The Power of AI in Diagnosis 4:13 AI in Medical Management Reasoning 5:12 Superiority of "Oh One Preview" AI 5:57 Landmark Diagnostic Cases 7:22 AI and Critical Diagnoses 8:29 AI Planning Medical Tests 10:01 Future Impact of AI in Medicine 10:46 Expert Opinions on AI 11:07 The Future of AI and Humans in Medicine 11:54 The Possibility of AI Doctors 12:29 The Role of Context in AI Diagnosis 12:44 Closing Thought Links From Todays Video: https://arxiv.org/pdf/2412.10849 Welcome to my channel where i bring you the latest breakthroughs in AI. From deep learning to robotics, i cover it all. My videos offer valuable insights and perspectives that will expand your knowledge and understanding of this rapidly evolving field. Be sure to subscribe and stay updated on my latest videos. Was there anything i missed? (For Business Enquiries) contact@theaigrid.com Music Used LEMMiNO - Cipher https://www.youtube.com/watch?v=b0q5PR1xpA0 CC BY-SA 4.0 LEMMiNO - Encounters https://www.youtube.com/watch?v=xdwWCl_5x2s #LLM #Largelanguagemodel #chatgpt #AI #ArtificialIntelligence #MachineLearning #DeepLearning #NeuralNetworks #Robotics #DataScience

Оглавление (17 сегментов)

Introduction to AI in Medicine

So today we're diving into a fascinating study that could change about how we think about AI in medicine researchers tested one of the newest AI systems called 01 preview against both human doctors and previous models like GPT 4 just to see how good AI has become at medical diagnosis and decision-making now this wasn't a simple test the researchers put the AI through four or five different intense challenges ranging from diagnosing complex medical cases that have stumped doctors to suggesting treatment plans to identifying critical conditions that absolutely can't be missed they used

Testing AI with Real Medical Cases

real medical cases from the prestigious New England Journal of Medicine these are the kind of complex cases that even experienced doctors find challenging and what makes this study particularly interesting is that they didn't just use multiple choice questions instead they tested the abilities of the AI to think and reason like a doctor would in real world scenarios because they wanted to see if the AI could handle complex multi-step thinking that doctors use every day when treating patients and in

Key Findings: AI vs. GPT-4

this video I'll break down exactly what they tested what they found and what this could mean for the future of healthcare now one of the things that they found in this research was the fact that this AI model the open ai1 model was actually really impressive in comparison to GPT 4 they actually showcase around three different cases where GPT 4 cannot solve a complex case it can't diagnose it and it manages to get it completely wrong whereas 01 gets the diagnosis completely right so

Case Studies: AI Diagnoses

case one they had some really complex disease GPT 4 got it completely wrong with a bond score of zero then 01 preview managed to get it completely right and identified the exact condition case number two there was another complex task which GPT 4 completely missed and listed common conditions instead then 01 preview completely nailed it and got the rare condition completely right then we had case three there was an actual condition and GPT 4 was close which you know managed to get a bond score of three and listed some correct information but incorrect conditions whereas 01 preview got it exactly right again and in this what's particularly interesting is that the bond score shows how close each AI got zero is completely wrong five is exactly right and these were actually really tough cases so these were like medical Mysteries and GPT 4 tended to guess more common conditions but 01 preview was able to identify rare and complex conditions pretty accurately and this just basically shows us that with each Improvement of AI and of course with this new series of models whilst yes you might use this AI on a day-to-day basis when we are tackling complex scenarios like this is where these thinking models really do shine now there was

Comparing Diagnostic Systems

also this image right here and this image shows a comparison of how well different diagnostic systems both Ai and human perform at correctly diagnosing medical conditions using cases from the New England Journal of Medicine and this is from 2012 to 20 so now the types of systems showns in the blue colors are of course the modern AI systems and the light blue is where you have the older diagnostic systems that required doctors to manually input symptoms and of course in the brown bar at the bottom that is where you can see the human clinicians performance now overall what we can see here is that there is of course a Stark impr Improvement when we look at the 01 preview compared to GPT 4 then when we look at these older AI systems we can see that they're not as good and of course we can see compared to the clinician there is a large increase in terms of the percentage correct diagnosis from here you can see it's around 30% whereas with these llms it's around 60 to above 75% which is rather surprising and this really goes to show

The Power of AI in Diagnosis

us just how powerful these AI systems are I know a lot of people give these generative AI system Flack because oh they're just regurgitating stuff but when you apply them to medical use cases you can see that these tools are remarkably powerful for diagnosing different diseases or diagnosing different things in a variety of different scenarios processing complex

AI in Medical Management Reasoning

bits of medical information and arriving at correct diagnosis is the kind of thing that AI is exactly designed for or should I say uniquely designed for now we can see here figure five comparison of GPT 4 01 preview and Physicians for management and diagnostic reasoning and we can see here that this image shows how well different groups performed when managing medical cases called gry matters management cases comparing scores between 01 preview by itself which scores are remarkable 85 to 90% GPT 4 AI scoring around 40 to 50% and human Physicians using a GPT 4 as a tool scoring around 40 to 50% and then of course human Physicians using standard traditional medical resources scoring are whopping 30 to 40% so this is rather fascinating once again the scores ranging from 0 to 100 show us that 01 preview clearly outperformed all other options by a large margin and this is

Superiority of "Oh One Preview" AI

fascinating because this performed significantly better than both GPT 4 and the human Physicians interestingly there wasn't much difference alone between GPT 4 and the Physicians using GPT 4 but this visualization powerfully demonstrates how much more capable 01 preview is at Medical Management reasoning compared to both earlier AI systems and human Physicians even when those Physicians have access to AI or traditional resource now in addition to this I do want to caveat this by saying this is 01 preview this isn't even the full 01 nor is it even 03 which was recently released by opening ey/ demode and we know that model is even smarter so imagine what kinds of results that would get if this preview model is getting around 80 to 90% we can also see

Landmark Diagnostic Cases

this in terms of the landar Mark diagnostic cases and these cases are basically the greatest medical Mysteries that have been solved they're like famous cases that have become teaching Classics in medicine kind of like the greatest hits of medical diagnosis now these are real patient cases from the past that were particularly challenging or groundbreaking they helped doctors learn something new about a disease or condition and they often changed how doctors approach diagnosing similar problems now what makes these landmark cases is that they're usually complex cases that weren't obvious to solve they often involved unusual combinations of symptoms and the final diagnosis was essentially surprising or taught doctors something new and they become standard teaching tools in medical schools now when they managed to test these AI systems on this we can see once again that 01 preview manages to get a extremely high score on the leftand side and we can see that gp4 only also manages interestingly to outperform Physicians with gp4 and Physicians with gp4 does perform better than Physicians and resources now interestingly here we can see that the AI didn't manage to supersede humans that much because there were several cases where humans managed to get this stuff but we can see here that the AI is definitely really effective when it does come to these Landmark diagnostic cases I mean whether or not you could say that this is a training data thing I still think that this is remarkably impressive considering the Physicians are seeming better off with these AR tools rather than without them now this graph right

AI and Critical Diagnoses

here shows how often different groups caught the most critical diagnosis and this is what they call cannot miss diagnoses these are the diagnosis conditions that if they are missed they could be life-threatening for patients so we have four different categories so we got the residents in pink which are junior doctors in training we've got the attending physicians in green which are experienced fully qualified doctors then we've got gp4 in blue the previous AI model and 01 preview in purple the newest AI model now what the graph shows is a scale that goes from 0 to 1 or 0% to 100% And the boxes show where the majority of the scores were and the black lines show the full range of different scores and of course the dots show the individual results now all groups perform similarly around a 50% to 100% rate but we can see once again that 01 preview was more slightly consistent and residents showed more variation in performance experienced doctors performed about as well as these AI systems and this was rather fascinating because once again we see that AI manages to perform really well in these scenarios now let me break down this

AI Planning Medical Tests

table which shows how 01 preview planned medical tests compared to what actually happened in the case if we take a look at this first case you can see you know there was a certain plan which the doctors actually planned and then interestingly the 01 preview managed to suggest another plan which was actually very similar to exactly what these doctors suggested so you can see here in this case it managed to get a two score which is a completely correct score when it comes to planning certain things in terms of the range of tests that you would conduct when you're trying to figure out what kind of diagnosis that you would have now there were some things here that were rather interesting it was impressive that the AI didn't just suggest random tests it laid out a comprehensive stepbystep plan that included backup plans and Alternatives it explained why each test was needed and it matched what expert doctors actually did in real life and this was rather fascinating because there are complex steps that go into doing this and it's important to understand that all of those reasoning steps have to be completed successfully for the AI to get the right answer now there were certain areas where the AI was wrong there were two other scenarios where the AI got half the answer right and then the other one got completely incorrect but I think the most fascinating thing about this is that this is an AI system which isn't just purely medically based like it isn't fine-tuned on medical issues but remarkably we can see that when we're looking at these diagnosis we're seeing these suggested plans we're seeing that it's able to sometimes get the right suggested plan and the right steps to

Future Impact of AI in Medicine

take which is rather impressive and we can only imagine what's going to happen in the next 5 years the kinds of models that we're going to be get and just how accurate they are in terms of diagnosing conditions and of course suggesting plans of course I would say though that I hope humans don't become too reliant on this because of course with hallucinations you wouldn't want to have you know a tired dentist that is overworked or a tired doctor atire clinician or physician that just uses what the AI says and then next thing you know a UC ination manages to mess up a person so of course I do think that humans will always have a role to play when it comes to diagnosing individuals we could also see here that this individual said that I had A1 analyze a very specific immune disease for my friend who happens to be one of the top scientists in the field

Expert Opinions on AI

and After High said the results his response was oh my God I just read it this is breathtaking this is insanely good so we can see also that the qualitative results from individuals using this at the top of their field does seem to be one that proves that these models are also rather fascinating so with that being said what do you guys think is the future of AI and humans when it comes to the medical industry I

The Future of AI and Humans in Medicine

think it's really fascinating that we're now starting to explore this in further detail I do think that with rules and regulations it's going to be pretty hard to actually get these models out into a real sort of practice but I do think we're going to start to see more and more cases where doctors may have missed certain things but users taking it into their own hands to consult with a model like 01 or even 03 and get remarkable results that doctors simply would have missed this is something that I've discussed before that literally millions of Americans die each year because doctors manag to make mistakes we will make mistakes we're humans but the only problem is that in the medical industry sometimes there are situations that are simply life or death and those mistakes do cost lies so maybe having an AI System review every single decision made maybe we could catch those rare conditions or diseases that we otherwise would have missed and then of course

The Possibility of AI Doctors

having humans check over and run the necessary test to ensure that what the AI suggests Ed is potentially factual with that being said would you be open to having an AI doctor I personally think that with the next 15 to 20 years we're certainly going to have maybe some pods or something where you prick your finger you get an instant blood test you get an AI doctor that tells you everything wrong in your body you get instant diagnosis you get an AI that reasons over all of your personal data Maybe it knows everything you've done everything you've seen eaten and it's able to condu probably the most effective plan for you because it understands your emotional state your physical state your water levels how much you've been drinking and it can probably suggest the

The Role of Context in AI Diagnosis

most accurate thing context is of course key and I find that the more context you give these models and of course your doctors the better they become and if we look at how AI is going to be integrated into our lives I wouldn't be surprised if we're going to be sharing that AI data with our doctors very soon a very

Closing Thought

interesting world for those of you who are trying to live for other with that being said if you enjoyed this video I would like to see you in the next one

Другие видео автора — TheAIGRID

Ctrl+V

Экстракт Знаний в Telegram

Экстракты и дистилляты из лучших YouTube-каналов — сразу после публикации.

Подписаться

Лучшие методички за неделю — каждый понедельник