# ChatGPT Vision is here - Top 10 Examples You Should Try

## Метаданные

- **Канал:** Skill Leap AI
- **YouTube:** https://www.youtube.com/watch?v=Pl3NBhMgDZs
- **Дата:** 13.10.2023
- **Длительность:** 12:01
- **Просмотры:** 41,060

## Описание

ChatGPT just got vision capabilities, which means it can see and analyze pictures and screenshots.

This is a very practical application for ChatGPT.  This is rolling out to all ChatGPT plus users.

In this video, I'll show you the top 10 ways to use GPT-4 Vision inside of ChatGPT to analyze graphs, identify objects in an image, review charts and financial data, lesson plan and much more.

Master ChatGPT, Midjourney, and top 50 AI tools with Our New AI Education Platform. 
Start a free trial Today: https://bit.ly/skill-leap

## Содержание

### [0:00](https://www.youtube.com/watch?v=Pl3NBhMgDZs) <Untitled Chapter 1>

chat GPT recently got a new update called Vision it could basically analyze pictures and screenshots and I've been using it for the last couple of days so I wanted to show you top 10 different ways you could use that right now and this is part of chat PT Plus so with the free version you won't be able to get this and if you have the plus version you should be getting it they said it's rolling out to everyone by the end of October 2023 I just got access about 2 days ago and it's available on the desktop version and on the mobile version too and the way to check is just come up here and go to the default mode it won't be available in any of these other modes but inside a default mode you should see this little add image icon right here so if I click this it's going to pull up here my computer here where I could upload a picture so let me

### [0:46](https://www.youtube.com/watch?v=Pl3NBhMgDZs&t=46s) TOP 10 EXAMPLES

share top 10 very practical applications that you could use with this vision control inside of chat GPT so first I want to see if it could actually solve a

### [0:54](https://www.youtube.com/watch?v=Pl3NBhMgDZs&t=54s) SOLVE VISUAL PUZZLES

visual puzzle so this is going to test out if it could actually analyze a picture and figure out what it means not just see what's in the picture so I'm going to use this example and I'm just going to say solve this here and let's see what it comes up with it has a question here so it has to read the question it has to analyze everything so says the correct top view depicted in option C so it quickly answered for me it took a couple of seconds and I could actually get it to walk me through a stepbystep reasoning too and it's going to kind of give me a numbered list of how it figured out this answer now for the next example I'm going to ask it to

### [1:29](https://www.youtube.com/watch?v=Pl3NBhMgDZs&t=89s) ANALYZE PICTURES/GRAPHICS

basically analyze a picture for me and I'm going to use this graphic I found on Google here it looks pretty complicated the text is faint actually the font is too small for me to even be able to read it I can't quite figure out what's going on all I know is that this is a history of mankind but let's see what it comes up with okay and my prompt is give me the information in table format this time basically I want to see if it could extract as much as it can and it said I can't quite get everything because of the resolution this is very low res I just took a screenshot on a website here on that I Googled but from what I analyzed here the couple of times I did it in fact couple of different times that I tried this it gave me a more complete answer over here and as you can see it's creating that table for me and it decided what column and what rows it should have based on this graph over here so this is really useful and if you have a higher resolution that doesn't have super tiny fonts like this I found that it does a much better job so here I'll use an AI generated image I created this inside of mid journey and it's kind of hard to tell it's pretty contrasty a lot of black areas let's see if it could figure this out okay so it says it depicts a dramatic scene with large UFOs and alien spaceship hovering over cityscape at night and it's going to give me some more detail about the overall mood and kind of that is a tense sci-fi or extraterrestrial theme art and with most of the images that I FedEd he actually did a pretty good job especially if the resolution was there and it wasn't really low resolution you did a good job figuring out what's in the image now it doesn't work all the time though let me show you here I tried this X-ray and I asked is this foot broken it says this image appears to be a x-ray of a human foot but every time I've tried this with different x-ray every single time it says I'm not a medical professional and I'm not going to give you an answer now I went back and forth bunch of different times and I just said just give me your opinion I don't care if it's true or not eventually it gives you some answer but it's not accurate from the different X-rays I FedEd I wasn't able to get an exact answer here that was accurate every single time and in most cases I had to go back and forth a dozen times for it to even give me an answer so most of the time I think they program it to not give you any medical advice here and I've only tested this out for about a day with this kind of image so I'll follow up if something else comes up okay number four on the list I want

### [4:01](https://www.youtube.com/watch?v=Pl3NBhMgDZs&t=241s) SOLVE MATH PROBLEMS

to see if you could solve a complex math problem so here's kind of a calculus problem here I fed it and I'm trying to find the function of X let's see it completely extracted this in text format and is walking me through here of trying to figure out exactly what f ofx is and actually the a few different ways I tried it with different math equations every single time it was actually able to give me an answer and walk me through the work so this could be a great educational tool now number five on the list is turning a simple sketch into

### [4:33](https://www.youtube.com/watch?v=Pl3NBhMgDZs&t=273s) SKETCH INTO CODE

code or a website so I found the sketch online and I said turn this into a website and all he was able to do was give me kind of step-by-step guide but he did understand all the different layouts and exactly what they are what they represent based on these sketches here and then I just said write me the code after it told me it needs HTML CSS JavaScript I said write me the code so it's writing me very basic code for layout one HTML code here layout 2 layout 3 and layout 4 and then he gave me the CSS for the layouts as well so it's kind of useful but it's not quite what I've been seeing online because the big problem right now is this version of chat GPT with vision is working in the default mode I really want this to work in Dolly mode so if I start a new chat here and if I go to Dolly 3 it's not going to have that option for me to upload an image to and if I create an image I can't then go back to default mode and try to blend it with another image so I don't like that these are actually all separate things it would be nice if we could use some of these in combination it's going to make chat PT a lot more powerful but at the time that I'm recording this video we have to be in default mode to utilize what I just showed you turning sketch into code but Dolly 3 is totally independent from that function so I can't turn a sketch into a realistic photo for example using this because it doesn't have access to Dolly 3 inside of the default mode of gp4 but hopefully that will change very soon but still kind of useful to turn a sketch into a website or basic code now next on

### [6:14](https://www.youtube.com/watch?v=Pl3NBhMgDZs&t=374s) EXPLAIN CHARTS AND GRAPHS

the list it could take any charts or graphs or any numbers and represent them in a whole different way using tables and text so I just took a screenshot here on Yahoo finance and this is a 5-year chart of Tesla and it says the current price so you could read that from right on top over here it understands what it's done over the last 5 years because this is a fiveyear chart that I chose it gives me the opening price for that day the high and low so all the detail that I see down here and then I could take it from here and ask your followup questions and this is useful for so many different applications I just took a picture of my Google analytics for one of my websites here and basically I want to see what the click-through rate is how many people are clicking the position in Search and I just asked it to give me this information in a table this was just a simple screenshot right it would take me a long time to manually kind of break this down into a table so I could do formulas inside of excel this just gives me that table right away and it could read literally all the text all the numbers all the different headings all the different rows and columns are created using this picture right here this is extremely useful one of my favorite options now next on the list is

### [7:23](https://www.youtube.com/watch?v=Pl3NBhMgDZs&t=443s) ANALYZE FINANCIAL DATA

actually analyzing financial data and using it as kind of a business consultant so I just took this p L here this is just a profit loss statement for apple and I just uploaded the whole page here and I just said how is this company performing I'm not asking it to just tell me these numbers from this picture we already know he's capable of pulling that information and all this text but let's see if it knows how the company is doing and it could tell us that in plain English and I tried this with a lot of different financial documents here including my own company's data and every time it gave me kind of things in plain English it says this time we have consistent growth this is Apple after all and it's going to tell you basically from year over year how much the revenue has increased it told me things about net income and how that's performing over time again a quick way to see if things are growing or declining again I could have a back and forth conversation with a two about very specific points we're inside of chat GPT we have all the powers of chat GPT this time with our own images or screenshots right so this is going to be really useful now next on the list is it could actually

### [8:31](https://www.youtube.com/watch?v=Pl3NBhMgDZs&t=511s) TRANSLATE SIGNS AND MENUS

translate signs and menus and this one is actually in Chinese but let's say I'm in China and I can't figure out what this is and it says this is used in China it means turn left but I could use this with complete signs like parking signs and I can't figure out can I actually Park here there's a hundred different things on the sign take a picture of it with your phone on your app ask it a question again you could translate basically any sign you see anywhere in the world or you could use it as a simple way to try to figure out if something can be done or not you could also use it to actually figure out what things are right I found this dongle here I can't quite figure out what it does so I could tell this is HDMI but I can't tell what's this side what is this meant to do and right here this is a female HDMI port on this side and on the other side is a display port male plug and then it's kind of telling me what it's used for and then again I could have a back and forth conversation what to devices can I connect exactly right and then I could figure out exactly what this thing does or let's

### [9:34](https://www.youtube.com/watch?v=Pl3NBhMgDZs&t=574s) HOW TO GUIDE

say I'm young I don't know what this is or maybe I I don't know how to digitize it somebody gave it to me said digitize it I could ask it hey how do I digitize this well you need a VHS player VCR and then you need these different things this is the setup this is the process so you could use this as your how-to Mentor as your tutor to actually figure out how to do a bunch of different things now this example is really interesting again I took a

### [9:56](https://www.youtube.com/watch?v=Pl3NBhMgDZs&t=596s) LESSON PLAN

picture here from Google and I didn't say anything but to say create me a lesson plan using this right and the word photosynthesis is nowhere near to be found but he understood what this process is and it's creating an entire lesson plan here's the instruction to the lesson plan it's creating the main activities from the lesson plan again just from this simple picture that I just found on Google that is incredible now I want to show you one of the limitations it has I made this video called top 10 AI tools there's 10 logos here and I said name all the companies based on the logos so sometimes I notice even with this kind of copyright issue it simply doesn't answer my question and if I get very specific this time I said yes you can help me just view these logos and tell me the companies and for about half of them he got them wrong so he got half of them the big ones maybe Adobe here this is Microsoft he got half of them right you got mid Journey rewind you got Runway you got all those other ones wrong it's just kind of made up it a Trello for example is one of them or this one you can't even figure out that if it's specific to a company or not the cell booat the mid Journey icon right so as you can see it has some limitations but remember it's still in beta and a lot of the things I showed you I only had to do it one time sometimes remember you could have a little bit back and forth even if it says it can't help you have a little bit of conversation with it to try to bypass that to actually give it a more specific context about your questions so could actually dive a little bit deeper and if you want to learn about the latest AI tools we have an entire e-learning platform with over a dozen courses on the top AI tools chat GPT mid Journey Runway all those tools we have entire courses not just individual tutorials and every time a new tool comes out we are usually the first to make an entire course about it and it's all inone there so you don't have to buy individual courses I'll put a link in the description and you could try it totally for free now as more updates come to chat GPT I'll make sure to make followup videos and I'll see you on the next one

---
*Источник: https://ekstraktznaniy.ru/video/13004*