5 Unique AI Projects (beginner to intermediate) | Python, LangChain, RAG, OpenAI, ChatGPT, ChatBot
27:36

5 Unique AI Projects (beginner to intermediate) | Python, LangChain, RAG, OpenAI, ChatGPT, ChatBot

Tina Huang 16.08.2024 77 449 просмотров 2 469 лайков обн. 18.02.2026
Поделиться Telegram VK Бот
Транскрипт Скачать .md
Анализ с AI
Описание видео
Check out this free resource to help you use AI to be more productive: https://clickhubspot.com/7kn In this video I walk through 5 unique AI projects from beginner to intermediate to advanced. We'll be using python as the coding language and use technologies such as LangChain, RAG, GPT4o, DALL-E, Whisper, Claude, and other open source models. Resources & Code: https://bit.ly/3WLXMXv ✉️ NEWSLETTER: https://tinahuang.substack.com/ It's about learning, coding, and generally how to get your sh*t together c: 🤖 AI Lunch & Learn series: https://www.lonelyoctopus.com/email-signup It's a FREE weekly 1hr livestream about AI & tech topics eg. how to build a GPT, how to build AI products, jobs in the era of AI etc. 🐙 Lonely Octopus: https://www.lonelyoctopus.com/ Check it out if you're interested in learning AI & data skill, then applying them to real freelance projects! 🤝 Business Inquiries: https://tally.so/r/mRDV99 🖱️Links mentioned in video ======================== 🔗Affiliates ======================== My SQL for data science interviews course (10 full interviews): https://365datascience.com/learn-sql-for-data-science-interviews/ 365 Data Science: https://365datascience.pxf.io/WD0za3 (link for 57% discount for their complete data science training) Check out StrataScratch for data science interview prep: https://stratascratch.com/?via=tina 🎥 My filming setup ======================== 📷 camera: https://amzn.to/3LHbi7N 🎤 mic: https://amzn.to/3LqoFJb 🔭 tripod: https://amzn.to/3DkjGHe 💡 lights: https://amzn.to/3LmOhqk ⏰Timestamps ======================== 00:00 intro 📲Socials ======================== instagram: https://www.instagram.com/hellotinah/ linkedin: https://www.linkedin.com/in/tinaw-h/ discord: https://discord.gg/5mMAtprshX 🎥Other videos you might be interested in ======================== How I consistently study with a full time job: https://www.youtube.com/watch?v=INymz5VwLmk How I would learn to code (if I could start over): https://www.youtube.com/watch?v=MHPGeQD8TvI&t=84s 🐈‍⬛🐈‍⬛About me ======================== Hi, my name is Tina and I'm an ex-Meta data scientist turned internet person! 📧Contact ======================== youtube: youtube comments are by far the best way to get a response from me! linkedin: https://www.linkedin.com/in/tinaw-h/ email for business inquiries only: hellotinah@gmail.com ======================== Some links are affiliate links and I may receive a small portion of sales price at no cost to you. I really appreciate your support in helping improve this channel! :)

Оглавление (1 сегментов)

  1. 0:00 intro 5455 сл.
0:00

intro

hi everyone in this video I'm going to show you guys five AI projects that you can get started working on immediately for each project there will be three levels the first one is just going to be prompt engineering and some no code tools it's actually pretty crazy how much you can do with no code tools now even like build simple applications level two is going to be intermediate will involve some code and level three we're going to be building full-on applications as well as implementing techniques like rack we'll be using lots of different models like your open air models CLA as well as open source models like llama different apis and tools like Lang chain by the way in case you don't know me hello my name is Tina I used to be a data scientist at meta now I create content and I run a program called Lonely octopus where we teach people Ai and data skills to work on real freelance projects also in case you're wondering the reason why I have this on is because I [ __ ] ran into a pole and fractured my hand all right without further Ado let's get started a portion of this video is sponsored by HubSpot AI personal finance chap bot recently I got really obsessed with this anime called 886 I watched the entire anime in like 2 days then I had to buy all the light novels and figurines all the posters like everything anyways amazing anime it will make you cry go check it out if you're interested in anime but I realized that my spending habit is out of control and I really need to come up with a new financial plan and like a new budget and things so I'm in the market for a personal finance AI tool like a chatbot that can help me get my finances back on track basic skills you will need is prompt engineering and you can use it on chat GPT or a platform like GPT for all or you can also use open source models additional skills for level two and three are python AMA some sort of front-end dashboard like plotly and if you're feeling extra Challenge and you want to build your own custom web app Lang chain mostly for rag which is retrieval augmented generation and flask level one is just prompt engineering so first you need to go to your bank account and download these statements within the period of time that you're interested in for the prompt I like to Loosely follow the 5 w framework which is who what when where why so for the who here you are a personal finance analyst and adviser with the personality of my guy from Naruto someone that is very positive and is perceptive of the feelings and goals of other people all right this time you'll be picked I just know it you ready Lee I need some positive vibes when confronting my spending habits for the what it's your goals to help me better understand my spending habits to achieve my financial goals of buying a house by the end of 2025 so is kind of also the why which is buying a house and it's also when um by the end of 2025 and for the where I'm currently at is I am missing $200,000 but I'm struggling to come up with a plan that I can stick to I have attached my financial statements so I'm also going to attach my financial statements all right dynamic entry youthful determination let's dive into your transactions like a leaf ninja so motivational analysis your financial Journeys just like riger's training consistency and precision are key we need to analyze where your hard ear cash is flowing and find areas where we can tighten the belt to save for that dream home by the end of 2025 so steps to take is to examine the income sources scree expenses creating a budget and tracking progress makes sense um so the analysis is income and expense overview so looking at the income and the expenses and then categorize the expenses Essentials and non-essentials and identify saving potentials uh cut down non-essential spending and optimize essential spending let's first summarize your total income and total expenses so as well as the ending balance by the way if the numbers are looking really big that's because this is also including like business income and business expenses as well so yes at least our balance is positive okay so then it goes on to categorize the analysis for some reason it categorized everything as non-essential so we're going to do is say that not all of it is non-essential maybe we can start by roughly categorizing in things and then deciding what is essential and non-essential list out the categories okay so key expense descriptions American exppress payments uh door Dash rent utilities grocery store subscriptions medical loan payments and transport so for the American Express payments what I will do is I will go to my aex and then download all everything from there as well to Mac um what is here from my Bank of America statements so this works fine but maybe you don't want open AI to be collecting your financial data this is where you can consider using something like GPT for all you can use it similarly to chat GPT but you can download different types of models including open source models like llama 3. 1 add in your Finance statements here and chat with it level two you're going to have some code first you want to download AMA which will allow you to programmatically download open source models like llama 3 and allow you to use locally on consumer Hardware it's kind of similar to GPT for all except you're able to directly access these models through code now you can do things like Loop to the statements do some summarizations and group things by categories okay so far you might be like yeah but this is basically what we were doing level one well yes that is correct but by doing it programmatically the first benefit is that we have a lot more control you can do things like clean up the data a little bit you can um combine different statements together easily when you're looping through it you can also specify like what are the transaction types that you want to do secondly and more importantly it allows you to take that data and then create something else out of it for example a dashboard level three is making your own custom web app with your own custom rag which is retrieval augmented generation is a technique for enhancing the accuracy of AI models it allows a model to access a specific data source so when it's talking with you it'll be referencing back to the information that's stored as opposed to just trying to come up with the answer based upon the original data it was trained on platforms I chat GPT and GPT for all actually have that built-in already so that's why you're able to upload your statements and then interact with it based upon the statements there but since we want to implement our own custom web app which by the way you might want to do if you're actually trying to build a consumer grade product like a personal AI Finance bot that you can actually sell you need to implement your own rack Lang chain is great it offers you a suite of different Tools in order to Implement rag then you can use something like flask to put everything together and make your custom web app something I'm sure you've heard me talk about before if you've ever watched me before is how to leverage AI tools like cha gbt to help with both your professional and personal tasks I found it especially relevant to increasing my productivity something that's a crucial skill in keeping up with the future HubSpot has a whole bundle of resources designed to help you understand and make the most out of chat gbt it will make you super productive it's super well laid out and gives you ideas on a lot of ways you can incorporate chap GPT for example using chbt to consolidate research materials like different articles tutorials all types of resources you can directly ask it questions instead of having to flip through all of these different resources yourself or keeping up to date with industry news and updates they also have this nice flowchart on when it's a purp to use chat GPT to solve a problem or streamline your workflow my favorite part is that it gives you over a 100 prompts to get started with chat gbt which is great since prompts is so important to using L to the fullest I highly recommend that you download it using this link over here also linked in description thank you so much HubSpot for providing free resources to help us Leverage The Power of AI and for sponsoring this portion of the video now back to the video your personal AI assistant with video so it's already very cool that you can use something like Chach BT um to talk to it and ask it questions through text you can also use audio functions and you can even send it images but sometimes you just really want to communicate using video maybe you're on a first date and you have poor social cues and you're not really sure what your date is feeling like what's the right thing to say so maybe you can take a video of her or him and be like what are they feeling right now uh what should I say how's everything going not creepy actually that's pretty [ __ ] creepy never mind okay other use case say like you're wanting to learn ASL like American Sign Language and you want to be able to like show what it is that you're doing and see it if you're doing it correctly and speaking of learning languages if you are an elder person like me and you are trying to communicate with the youths these days the Gen Alphas um you may find that you have no idea what the [ __ ] they're talking about mad lit on God no cap I have no [ __ ] idea what's going on here so wouldn't it be nice if I could take this and give it to Ai and be like can you explain to me what is happening here so basic skills is prompt engineering additional skills include python the open AI API using gp4 o whisper and text to speech and if you're feeling extra fancy shiny which is a package for building interactive web apps and you can host your assistant on internet so level one is just prompt engineering at the time of this video um open AI did an announcement like a couple months back saying that they're going to actually release this functionality this video functionality where you should be able to communicate via video in real time well can you take a guess at what I might be doing based on what I'm showing you here H from what I can see it looks like you're in some kind of recording or production setup however they announced like two weeks ago that they're actually going to delay this functionality and we're not sure like when it is that we're actually going to get it so at this time you can't directly upload a video um into GPT 40 what is being shown on the poster in this video I can't directly view video content but I can help it yes you can analyze it frame by frame there you go I've extracted and displayed several frames from the video if you can identify the poster from any of these frames or if you need a closer look at any specific frame please let me know last frame no here's a closer look at the describe the Poster's contents briefly the poster in the last frame of the video seems to be challenging to analyze for text content using OCR likely due to resolution or boo honestly as you can see not great it kind of drops different frames and it doesn't really know what's happening that well and other um AI models even worse most of them don't even allow you to upload videos so we can build something similar by ourselves and by the way if by the time you watch this video open AI has already released that model with the video you may still want to consider building this product because you might not want open AI to collect your video data as well or if you actually want to build your own application so level two is how we're going to Implement processing video information and the accompanying audio to access the open a API so you can't directly just put in the video to the API you first need to take the video and just extract the audio part using whisper one and then you convert that to text which we will send as text input to GPT 40 then you want to take the video part and feed it in separately as frames of images then you use open AI text speech in order to take the response and then speak it to the user to put everything together you also need to set a system prompt which is the prompt that kind of governs the entire AI bottel um so it's able to make sure to split the video into different components and then put it all together I will leave the system prompt in the description below if you want to check it out all right level three so now I have something that functions it's basically a recording of a video which then you're passing it along to open ai's apis and then getting it back as audio however it's kind of ugly because it's just like you know on this python notebook so that's why level three is when we make it pretty and actually make it a proper application that's also when you use shiny is a web application framework so you're able to have this web app where you can take a video and record yourself um and then afterwards it would be able to speak directly to you can also easily put in different UI components and deploy it yay now you can directly interact with your AI assistant using video what does this painting Say the video showcases a sign that emphasizes the key components of success it lists hard work persistence late nights rejections s sacrifices discipline and criticism at the top highlighting these as essential elements then it also includes doubts failure and risks further down showing that these challenges are part of the journey towards success too it's a powerful visual reminder that success isn't just about the positive aspects but also about overcoming difficulties and setbacks personal AI travel app the top thing on my bucket list right now is to drive from the west coast of North America all the way to the east coast and I recently just took my driver's license test um and I should be getting my license soon I'm honestly so excited uh especially because recently I've been watching a lot of like Journey anime for example Fen let me know in the comments if you've watched Fen she's like an a thousand-year-old elf and her companions travel across this like fantasy landscape there's also Kino's Journey where Kino just with her gun and her talking motorcycle called Hermes travels across the world learning about different people different Customs the beauty and also the bad parts there's also Somali and the forest beard okay I will stop talking anyways I'm just so excited to take this road trip and I'm starting to plan for it now which is why I want to build a personalized AI travel app so that I can plan an itenary that has all the places I want to go to has the right budget and the best restaurants and try to make it to things like concerts and festivals as well basic skills for this project just prompt engineering you can use chat gbt GPT for all your favorite AI chat interface additional skills include make. com which is a no code platform that allows you to automate different flows using different types of plugins I was honestly really impressed by this platform considering that it's no code and I would say you can build like maybe 80% of it just by using this tool but of course if you actually want to make a really good product and potentially even commercial product you will need python the open AI API were your choice of model some sort of front-end dashboard like streamlit and other types of apis in order to incorporate different sources of data level one prompt engineering roughly using our 5w framework again who at when where why you are a personal travel agent with the goal of Designing a custom travel plan from West Coast Vancouver to the east coast I want to start Vancouver Canada because that's where my car is going to be licensed okay I couldn't not add this part um I've been really into anime like frein Kino's journey and Somali and the forest Spirits so I like to replicate some of their travels but on earth like deserts caves beaches big libraries ask me additional questions until you have a good understanding of my interest traveling preferences timelines budget Etc to suggest a detailed plan for me here we go we answer all these questions and voila a draft itinary week one Vancouver to the Rockies including Stanley Park local seafood cozy cab Whistler hiking in the summer or skiing the winter Art Museum Jasper Rockies to the Prairies hiking to B National Park a Lakeside Cabin um a vibrant art scene and a quaint bed across the Prairies sasron sasron uh Manitoba and all the way North finally ending in Journey's End to the east coast wrap the journey to St John's where Halifax you can ask to change the format of dier ask it to elaborate on certain things and then just interchange things in and out too level two so this is great we have this itinery here within chat GPT and we can look at it but it'll be nice if I actually could have a way to document when the dates are and like more details about these specific events that is where make. com comes in you can take these itenerary items translate them into calendar events that you can put onto your Google calendar and then also put these iten events on notion so this way you can add more detailed information for all of these items and have them all nicely organized on notion so you can look through them and edit them easily that's pretty cool right honestly really impressed for a no code tool but to make it even better wouldn't it be great if you can visually see this and as you're traveling be able to track your journey that is why for level two we're going to get into some code you can use a Jupiter or collab notebook and use your model if you're choosing if you want to use open AI you can use open AI apis uh open source models you can use AMA to download the models that we talked about earlier and then connect that to streamlet which is a really nice A little python module that allows you to create a dashboard so this is sort of like the foundation and in this way you can start showcasing stuff onto the dashboard but what really makes it magical is level three this is where you can start incorporating other sources of data for example if you want to track your journey you can use the Google Maps API and be able to label where all of these things are and be able to click into them you can also incorporate apis like the weather so you're able to know like what the weather is like wherever it is that you're going to be staying information about certain events or festivals uh you can even incorporate where the restaurants are which ones are the best restaurants the ratings of the restaurants in different places yeah the world is your oyster you can incorporate all the information that you care about and use that information to plan out uh where it is that you want to go what's the order of doing things and specific places that you want to visit all of that can be showcased visually on this dashboard like a truly custom AI travel app AI music composer if you watch Short from content you might have heard this audio that went viral recently how the hell you spell did you know that this is actually AI generated I was pretty surprised when I first found out and after playing around with some of the AI music generators I was like damn that is pretty good as someone who's not particularly musically inclined I honestly can't tell a difference between pop songs these days and these AI generated tracks so for this project we're going to make an AI music composer so I was thinking why not play around more generating AI tracks and potentially even like putting them together into playlists this could actually be really useful like for example um I do the study with me live streams and I like to play lowii music when I'm studying but I'm pretty Limited in terms of playlist choices because most things are copyrighted so if I use AI generated soundtracks I could make myself a playlist that is not copyrighted and I can customize it to however it is that I want that is why so in this project we're going to make playlists that are completely AI generated that can be used for free for live streams background music or just like music to vibe to when you're studying basic skills super simple we're just going to use AI generation platforms like udio and sunno oh and you also need decent taste in music additional skills you know the drill pyth python the udio API and the open AI API to use GPD 40 text to speeech and Dolly for image generation and your choice of frent for example streamlet level one we're just going to play around with these AI generation platforms soono and udio consider best in Industry they're honestly pretty similar I think udio is slightly better and also I like udio more because they do have an API which we'll be using for level two um but yes level one just try around use different proms try out the different features see how you can put in your own lyrics generate different lyrics set the mood for different things see what are the controls that you can have sh underneath your bed in the shadows okay level two so okay technically you can download all of the tracks and then put them together to make a playlist yourself like using some sort of sound mixing thing but you can also do it programmatically by using udo's API this here is their API they list out all the things that you can do it's honestly pretty simplistic uh but the one that we're interested in particular is being able to generate links in order to share the musics you can use Python to access their API and take the link of the music that you generated and pratically put them together into a playlist so I'm not really a music person so you probably have to tweak things around to make it actually like sound good and flow well and things like that you can do it programmatically like with these libraries or I guess you can just import it into your normal DJ software and put it together definitely a lot less tedious than downloading everything manually and putting them all together like 50 songs so ideally speaking it would be so cool if we can also programmatically create music like tweak different uh text and like different parameters and then just try out a bunch of different ones and see which one works the best right like maybe Loop through all the different settings and find the right combinations unfortunately the udio API does not allow you to do that and I couldn't find one that does if you find something like that please do leing a comments I would love to check it out but at least we have a playlist now yay now we can go to level three to take it to the next level so we have this playlist now and you can tweak it and you can make it relatively good but what would make it even more cool is if we can generate a company imagery based upon the lyrics and the emotions that the music invokes something like that you can even put it on YouTube as a playlist like similar to the playlist that is currently going on in the background here like you have some music that's playing you guys can't hear the music right now because it's muted uh but there is some music that is playing and you can have different imagery show up based upon what is happening in the lyrics and you can change the style to be whatever it is that you want it can be like anime style it can be realistic it can be Loi style whatever you want to do this you take what you did from level two and then you put it through whisper from open AI in order to convert it into text then you can take that text and put into GPT 40 in order to get the imagery and the mood of the different songs then you can ask the model to create a prompt for do Le to generate images related to the mood and imagery of the songs there you go finally you just need to write a script that puts together the audio and the visual so that you have a full playlist final project workbooks for books book workbooks I got inspiration for this project when I was rereading one of my favorite personal finance and business books called the millionaire fast lane highly recommend if you're interested in personal finances becoming financially free all those things anyways this is a really dense book and there's just so much good information and there's so many exercises that the author MJ tells you to do throughout the book as well so in my copy I have these little scribbles where I write down the answer to some of these and like some thoughts and stuff but I thought to myself wouldn't it be nice if there was like a workbook that actually compiled together all the different questions and exercises that he had so I can just fill them out there instead of you know trying to figure out where it is I scribbl random things so then I thought oh maybe there is and I looked and I googled mire fast lane workbook and I found one on Amazon that was selling for $1. 99 unfortunately it only had one star so I figured it would probably wasn't really good um but that kind of gave me the idea like AI would be really good at generating these workbooks for people you can definitely use it for personal use I don't know about commercial use not a lawyer not lawyer advice not Financial advice I don't know so in this project we're going to make AI generated workbooks how I would approach this level one just prompt engineering so first you got to find a pdf version of the book you're looking for um or like you know images whatever it is some sort of version you might be able to find some of the PDFs online or if you just have a version of the book you could like take pictures of it I don't know but technically you can find a lot of these PDFs online assuming that you found the PDF in a fashion that is legal all you have to do is feed it to your favorite large language model and create a prompt in order to summarize the book and ask it to extract the questions that are being asked in the book and put it together into a workbook that you can then download as a PDF in this particular case this book is over 300 pages so it's a relatively midlink book and I couldn't manage to feed it into chat GPT using GPT 40 entirely so I had to actually like split it up into different chunks and then feed it in and as to summarize create a workbook and then put it all together again it's a little annoying but it does work fine but if you want to be able to just do it in one go and not have to like go through that process you can also check out other models that have larger context windows for reference the standard like GPT 40 has 128,000 contact window and they probably CLA on other hand does have more at 200,000 but I believe the model that has the biggest context window is actually Gemini by Google it says that it has up to 1 million tokens or if you can find any open source models that has bigger context windows that can do this please link in comments I would love to check it out because this is such a useful use case for me level two let's do this programmatically so the strategy we talked about earlier I think it's fine for most use cases um but for some reason if you want to program automatically like Mass produce a lot of these different workbooks for different books level two is for you to do this we're actually going to use another no code loow code tool called bubble is a full stack no code app builder that connects with a lot of different apis so it's actually pretty similar to make. com but it focuses on building web apps not automations so yeah you can use bubble to connect to the open AI API put in the prompt that we talked about earlier then take that data and use it to populate the web app bubble's pretty cool because not only can you do the backend stuff like connecting to open AI um it also has really cool UI features that you can just drag and drop to make the front end look really nice as well you can even have signups and logins now finally level three so in this level you want to go beyond what bubble is capable of maybe you want more flexibility in the way that things look maybe you want to be able to edit the workbooks programmatically as well that is why you're going to use the API from pdf. Co pdf. C is a low code and rest API that allows you to do stuff like converting PDF editing extracting and automating it so after getting the workbooks from open AI you can directly edit the PDF so that you can make it look like the way that you want and you can actually reformat it as HTML templates you can make them even to be editable and save the responses in the work buts um in a database for more details and resources check in the description all right that is all I have for you guys today thank you so much for watching I hope you enjoyed these five different projects and link your GitHub links if you end up doing any of them I would love to check them out also put other project ideas that you have too I will see you guys in the next video or live stream

Ещё от Tina Huang

Ctrl+V

Экстракт Знаний в Telegram

Транскрипты, идеи, методички — всё самое полезное из лучших YouTube-каналов.

Подписаться