Programming with LLM Agents in 2025

1:02:10

Programming with LLM Agents in 2025

sentdex 16.02.2025 115 045 просмотров 3 890 лайков

Machine-readable: Markdown · JSON API · Site index

Смотреть на YouTube

Поделиться Telegram VK Бот

Транскрипт Скачать .md

Анализ с AI

Описание видео

Some tips and tricks for using modern LLM agents for building stuff. I am using openhands here, but you're free to take some of my advice from here and apply it to just about any of the web-based UIs or other agents...etc. OpenHands github: https://github.com/All-Hands-AI/OpenHands Neural Networks from Scratch book: https://nnfs.io Channel membership: https://www.youtube.com/channel/UCfzlCWGWYyIQ0aLC5w48gBQ/join Discord: https://discord.gg/sentdex Reddit: https://www.reddit.com/r/sentdex/ Support the content: https://pythonprogramming.net/support-donate/ Twitter: https://twitter.com/sentdex Instagram: https://instagram.com/sentdex Facebook: https://www.facebook.com/pythonprogramming.net/ Twitch: https://www.twitch.tv/sentdex

Оглавление (13 сегментов)

Segment 1 (00:00 - 05:00)

what is going on everybody Welcome to a video on programming in 2025 using um large language models llms and uh agents possibly so I see a lot of kind of critiques and questions about agents and llms and coding and is it like all hype and then like some people love these things and then some people act like um they're just like a giant waste of time or they don't understand like how are people using this or they just building like really basic apps only or like what's the deal there and I think there's two groups of people uh really three groups one group of people just gets it uh doesn't get it and then the other group doesn't want to get it and I can't help the latter group but the uh the second group I can definitely help at least maybe a little bit so uh what I'm going to use today is this thing called open hands and this is kind of an a agentic layer and I will admit the term agent just sounds fancy but for the most part it is just kind of like a almost a for Loop some if statements and some fancy prompting and boom you have an agent but done in the right way uh it can make your life much easier and um there's also cursor the reason I'm not going to use cursor here in this video is you have to pay for cursor it's like 20 bucks a month and I am a little annoyed at everything is like 20 bucks a month um but for the most part uh all these things are like massive amplifications of my actual output so I do think it's worth it but just because I don't really want to keep showing everything that's super expensive uh or I just want to show that you don't have to pay money for like any of this stuff you can even use open hands with closed Source or I'm sorry with an open source open weights model it doesn't have to be with a closed Source model um but you can also do a lot of things I'm going to be talking about today with uh something like Claud or even chat GPT the actual UI um as you can see here I even used uh Chad GPT because I was really struggling with um what are my various options for um actually loading uh that deep seek model I started off thinking I was going to load fp8 on a CPU uh so that that's where I came from I like never run models on the uh on the CPU so anyways um okay so anyways you can follow along with a lot of the tactics that I'm going to use today are similar across everything you don't even have to use an agent but if you are going I do like Open Hands I do like cursor has their little uh what is it composer agent or whatever you can use that too you just have to it I think there's like a free trial and then you have to pay for it so anyways uh keep that in mind I do think this is pretty cool you can install it essentially with two commands run this and then run this and again that's what I'm going to use but again also the hopefully um I haven't done many videos lately because I feel like there's like nothing that is uh um nothing is sticking these days like within a week or two everything that you just talked about is like outdated so that's probably going to happen today but uh but I these principles are principles that I've been using with coding with llms now for um at least a little bit so uh anyways so I'm running that second command or actually did I already no no yeah so I'm running that second command and I'll just open it in a browser and then boom you're in open hands now I think if this a first time you're loading it you do have to set up uh an API key um I'm using anthropic I already uploaded um my key but you can use basically all of these um and then I'm not actually sure uh if like I'm pretty sure you can load your own um like you can set up something with like uh ll. CPP set up your own little private uh API with that and then you could run it off that so you don't have to pay a penny uh if you don't want to but I like CLA onic 3. 5 right now as my coding model although that varies from time to time um I swear that model gets really dumb I think it's just they change the number of concurrence uh per GPU uh that's my best guess is why it gets crappy but uh so anyways to start you can upload a zip and the zip works I wish you could upload single files for some reason that does not work so usually my opening statement at least to this agent is one moment I will upload some well actually I'm just going to upload a file so I just open with that um and again every agent is going to be a little different so there's some little quirks that are just super specific to what I'm doing right here but um not always so and also let's just move I'm going to move my little head up here um as for the ultra patriotic background um I am changing where I'm putting the camera in this space mostly because actually the old direction is filled with a bunch of toddler stuff so I like this backr a little better uh

Segment 2 (05:00 - 10:00)

and actually these are all like family heirloom things the swords on the top and bottom are from uh the US uh from the Korean War those are from the uh Captain EXO and then uh that uh I don't know if Katana I don't think Katana is the right word it's a sword I don't think it's a katana I forget I don't know what the word is um but that's from it's like a captured uh Korean sword uh and then the bayonet is from the Civil War so slightly different era there but anyways and then a commission flag anyways um contining on uh let's see um upload so we need to upload our files so for upload and then sometimes download at least for me open hands used to the download always worked and then lately the download does not work so like whenever you're all done you can usually click this button and then download all the files but for some reason that has not been successful for me lately but no matter what you do first of all there's a million ways like you're using just keep in mind you're using an agent you can do whatever you want don't let yourself be like contained in like a little box so like in this case like you could just open vs code in the browser and then now this yes I trust myself uh you can upload and download files right here and what if for example you don't want to download all those like individual files you come over to the agent and you tell it zip up the files and then you come over here and you just download that one single zip okay so start thinking in those terms um now for me I am going to just upload uh Shakespeare text here and so I'm just going to slap that in here now no matter what problem I do here there's going to be people who are like oh you just solved like that again you just started from nothing uh you just worked on like a simple R& D project agents don't work for a large code base that's all a lie it works for large code bases I'm not going to show you a giant large code base I also this will be the third time I'm recording this video hopefully I can actually get it condensed into one video and the other ones were like three plus hours of recording just simply because that's how long a big project takes and really if I was doing it by hand it would take me weeks like weeks to do big projects so anyways uh this is I'm going to try to keep this really simple um and still show all the my tips and tricks I suppose for working with agents but also hopefully be able to show you why these things are so powerful so okay with that in mind let's go so we have shakespeare. text and now one of the actual benefits is quick R& D so for example imagine uh you don't like language models like we've been on this Transformer stick for so long like wouldn't it be nice if there was something else uh besides Transformer models for either language models but then also in my opinion uh before we reach like ASI we are not going to be uh doing it with chatbots I just don't think so and like the base core uh thing the base core element is not going to be tokens right there certainly not word tokens so um what is that word let's just scroll down a little bit here uh this is Trump's America I suppose but anyways um what is that word we I do need to know I need to know do I have to like re redo this video or not uh what is Shakespeare's usage of the word mean um so I don't think yeah what's the uh add anyway um we just got to figure out if uh it says Claud says it's unrelated we're good guys okay so um so I don't think that like even think about like humans human intelligence is it the result of language or communication right and I think language is like a um an emergence of communication and intelligence as opposed to being like The Source okay so I don't think AI in the future like ASI type stuff it's not going to be word uh from uh like tokens right okay so anyway so what if we try other things okay that's let's imagine that's the uh the pretext of our new project here now that we're 10 minutes into this video that I really don't want to be three hours so uh so we've uploaded our shakespeare. text we're going to use this as training data and let's say we want to use like an evolutionary algorithm to train on shakespeare.

Segment 3 (10:00 - 15:00)

shakespeare. now to begin we could we can do quite a few things so first of all on shakespeare. text we could say hey um I want to eventually train a language model on the Shak spear. text file I uploaded I would like to have the input be an encoded uh let's say 15 characters and the output be an encoded three characters I am not positive on the uh encoding but I think it would be cool to uh make that a like bit style of encoding like make each like uh make each character the bit representation of that character okay something like that hey bro uh something like that would be really cool right like a language model that's actually like built from bits like that would be uh darn near a Holy Grail I think um there's all kinds of problems that are going to likely arise from doing this bit and coding and I promise we are not going to solve it in this video uh this is a very complex problem um but I just kind of want to show some of my usual things of like working with these um agents and large language models um so one of the things is you would never approach the agent and ask it to do the whole thing in one prompt right that's way too big you would not approach that problem now maybe in the future when we actually have ASI we could ask you know hey this is what I want make it happen and maybe that will work but today uh you can't do that just like regular programmers can't you can't do it that way you have to break it down into little sub problems and then solve the sub problem one step at a time so um so I'm just informing it hey this is what I I'm headed towards right um and I think I want to encode it this way so maybe so I think the first step we should do is take that uh text file and create some training data from it in this way let's create a python file that first loads this uh text file and encodes the characters to bits and then creates the input 15 characters to Output three oops to put putut putput to three characters but uh encoded into bits can you do that okay so hopefully uh we'll start off there sometimes like this agent will like write the code but it won't make a file I think it's G to make a file here we'll see what happens and we want to do uh three characters instead of one character because I think a lot of times we'll end up on like spaces and stuff like that also um I think if we do only one character the model will very especially with like an evolutionary or RL based kind of algorithm but many algorithms would learn to predict the most common letter and then probably get stuck like in that kind of a like a local minimum basically and we don't want that so we're waiting on this thing to hurry up and code so yeah it calls it pre-process that's good I mean that's what it should be called um and we're just kind of waiting for it so one of the other cool things um about agents is you know some people say that it takes you it takes much longer to produce code than if you just like wrote it yourself and sometimes that's true like sometimes there's like a really easy solution and like you think that you can get the AI to do it for you but it would just be quicker if you just did it um oh that's cool so it saved at as a numpy so remove uh so how many training samples from the first thousand characters um I don't know if I want that because it's it is pro is likely a sliding window but I'm not really sure um if I like it being only a thousand let me just see if the down this historically well historically this has worked but lately it has not worked so I just want to see if it works it is not going to work today I don't know why that is oh no I forgot this is like when this breaks you have to like refresh it's like super annoying um but anyways I'll we'll discuss that momentarily um so here's the next tip and trick that I like to do so it's done this already and so as you can see we already have uh we have that text file we've got these numpy files

Segment 4 (15:00 - 20:00)

and then we got the python file and as you continue to increase the size of this code base you will wind up with more and more issues along the way so uh what I like to do is could you also create a and in this case we'll call it a readme. md file that uh summarizes our current goals and objectives the things we've done so far the files in the code code base uh actually we'll call the files in the workspace and a bit of a to-do um beyond that please try to keep this file up to date so the reason why I want to do that is as I discuss as we talk in like this little chat box here uh the context gets longer and longer and eventually it gets a little unruly I in general I find the correct number of essentially steps to take now it depends on the context per step but in general after by the time I reach like 200 steps I find it's best to go ahead and just reset uh reset everything and so in order to do that you need to at least have kind of like this running sort of summary of what's going on what you're working on and all that to like come back from so you'll either download all the files if that actually works um again historically that has worked and just for the record of this so like right now as I record the version 0. 23 I feel like the version updates like once a week or something like that it's pretty frequent so um hopefully at some point that'll be fixed uh so we'll see but and honestly it's been a lot of fun using this one because it updates like so fast and there's like little things that hopefully we'll get to see some of that uh in this recording um that I'll kind of point out but like little behaviors that you see come through that clearly like the prompting has changed and you're like oh that's so cool so anyway so now we have this read me file so again we can come over here and we can kind of just see what it's done so far so it just populates this with some useful information that if we want to start over the context the first thing you tell the agent when you start a new chat is please check the readme. md file to understand what we're doing and right before you reset you'll tell it you'll just make sure that it up has updated the readme. md you should double check it and you can make any modifications that you want or whatever but I find that to be a really great way to uh keep context down keep everything kind of flowing pretty good but then yeah not have this like giant because like sometimes like that giant context actually makes the model worse so whether or not you care about paying for those tokens it's also just like trying to keep it focused so anyways coming back to uh where we're working okay so we got the read me file very good and we created some samples now we want to make uh there's all kinds of stuff that you could do you could you I like evolutionary algorithms I just think they're really cool um but in the case of binary uh it could be an evolutionary algorithm RNN right it could be all kinds of things it something new is coming for sure it won't always be Transformers um but so anyways like okay let's say you want to try an evolutionary based algorithm okay so I'd like to start on this data what do you think um and then also let's check this pre-process so what I will say is um I think that we probably would want to have more than a thousand samples most likely but then also one of the things I want to point out is a lot of times when I'm working with these agents I you'll I'll have the agent will have produced like thousands of lines of code for me and I won't have even seen it like I won't even look at it uh that's how that's where things are lately and I just think that's pretty crazy uh so like I get a little bit of benefit having been a programmer in a past life that like I know it's possible I know it's not possible like when I'm asking the model here to uh use an evolutionary based algorithm I could tell it hey use the neat library because that's a really good library to get things going um okay this is just a sample I I'm not sure what it's doing right now uh is it actually running something what are you doing so it's pre-processed but then it's not actually oh so it is building something I just don't see it for some reason yet here's that file so I get a benefit because I know like what is possible or what exists but again I do think that we're kind of at the point now where we could um you could probably be you could probably program without being a like being a programmer at all like you could ask an agent like this to do things and just report the results um okay so coming back here um an evolutionary is an interesting

Segment 5 (20:00 - 25:00)

approach blah blah update the read me look at that being a good little agent um input 120 bits hidden layer 64 so I do think that we probably need two minimum two hidden layers uh I think that has been very well documented that you have to have at least two hidden layers so we'll go ahead and add a hidden I think we'll start with two hidden layers minimum genetic operations blah blah mutation flips bits with a given probability comines two parent selection I do think we want some sort of mutation and evolution that adds layers adds nodes removes nodes adds connections all that kind of stuff Fitness is slightly above monism blah blah architectural changes blah blah okay so again I do benefit from having been like a programmer working on these things um literally writing the book on neural networks you kind of need a second hidden layer so let's go ahead and um let's make the uh default starting point have two hidden layers um well how many was it 64 that's just that's well 64 is fine we'll keep 6 the problem is like you don't really even need probably 64 uh two hidden layers um two hidden layers uh 32 nodes we'll just start there uh to start um during evolution we should allow for mutation to add nodes uh add nodes connections or remove as well as add new layers adding new layers should be relatively rare but possible I'm trying to think if there's anything else I want to change so uh coming over here as well uh let's see so I mean we did we improved general fitness we did 50 Generations we don't have very much data um and then it's also important to recognize the average fitness I think until average fitness goes up that means no improvement really I'm not sure about 50% I guess 50% because it's a zero or a one I guess that's why how that's doing the calculation um and then the actual predictions I mean most of these are like special characters um I'm going to call that Emer so I'm ready to raise trillion on hey I just created a you know people always say that language models um can only produce what they were trained on like they only know Boom here's some evidence I got a model right here it's emerging on a new language 7 trillion uh okay so uh one of the other benefits of these agents is like I'm doing this and talking to you again I'm not really writing code I'm writing English and that's about it in reality like when I'm working I'm espe even if I just was trying to solve this one problem I would have four of these up at any given time five maybe and uh and if I'm not working on just this one problem maybe I'm working on multiple problems I'm still uh I'm going to have many things up all at once so even if the agent itself is barely faster than you are if you're just on that one agent the difference is while this agent's doing this I could be off instructing a different agent to do something else at a minimum this thing it speeds me up I feel like even one agent I'm usually about 10x my speed but I can use five agents at a time so I'm like 50x I mean it's like it's crazy the the uh amount of Leverage that you get when you're using something like um agents is it's absolutely staggering um okay so I'll modify the evolutionary model blah blah Dynamic architecture mutation types 80 chance hey it improved to 50% but I think a I'm not sure Best Fitness really matters I feel like average fitness has to you have to raise up the average fitness otherwise you probably haven't training increase so what is the population size possible improvements we could try a fitness function okay so I actually I mean it was a joke earlier but I actually don't think you want to get rid of this you might want to it might actually make it better but I don't want to do that it's more fun not to do that uh waight different bit positions differently I mean that is the other thing so one of the common things with uh with so like with 8bit representation I want to say the 128 comes from the

Segment 6 (25:00 - 30:00)

fact that you have even though it's represented in 8 Bits the zeroth bit is always going to be a zero or at least almost always I think there might be some scenarios some comment below if there's what scenarios that the zeroeth bit would be a one but you can almost guarantee success which is why it's unfortunate that we're still average fitness 50% like we should do a little better because you could always predict that zeroth every eight bit so like here zero uh 1 2 3 4 five 6 7 three four five six seven zero like it should always be a zero uh so it's shocking it hasn't quite figured that one out yet but that's okay um okay wait different okay first off since we're rep uh I don't even I don't know if I feel like that could that would like confuse the agent potentially so I don't even know if I want to touch that uh consider sequence level patterns I don't think I don't know if I want to do that feels too tokeny like architecture start with larger layers I'm not sure why would you need to start with larger layers if it's 8 bit representation but you do have okay fine we'll start okay fine uh let's go with larger layers 64 each to start remember though we can always add more neurons I'd rather start small and grow rapidly then start giant okay um what is our current uh increase the PO what is the population size I don't even know I have no idea it's not my code uh let's see if we 100 I feel like a hundred's a pretty good starting population but uh increase the population size to whatever you think is best yeah I don't really know I feel like 100 is actually pretty good because I also think the population size I don't know in this code if the population size will ever grow one moment please while I check something on my phone H okay um 64 that's F uh 200 population Dynamic bit flip I think bit flip is a potential issue that starts to suggest to me that maybe our activation functions are Dynamic I hate that it does this too but there there's something else we can do for that as well so as it runs oh it's running right now yes Dynamic evolutionary model um wait where is the activation where is our activation function wow I can't even find it add node remove node add layer am I blind I'll just ask the model um what are the activation functions for these layers started here improved to here yeah it's probably not and I bet the average was still yeah too close to 50% waiting bits by position I don't even know what MSB or LSB stands for I don't want to Val should we could probably improve immediately if we just made that change uh add Skip connections yes add attention with uh let's try to add some attention it is all you need damn it we're just going to wind up creating a Transformer probably but okay um Evolution strategy uh add speciation like in neat Island model I don't know what island model

Segment 7 (30:00 - 35:00)

let's also improve oh man I don't the the calculation is probably um adjusted from this point but um uh also um let's try to keep the pi files in the workspace somewhat clean uh please um put previous uh evolutionary scripts that we're no longer using into a folder called pref but hopefully it'll do like multiple things here yeah so there is no activation function so that's your problem it's really yeah an exor um exactly with new activation functions binary sigmoid binary tan I think a sigmoid would be good because then we could act we could we would likely see much more training binary and binary okay that's fine I don't mind a tan and sigo uh output layer I'm predicting all ones oh my god oh there you go so you do have scenarios where you could actually start with a z or with a one instead of a zero let okay scale the weights to prevent saturation add weight initialization scaling add a biased term to help with activation improve the fitness function to consider partial matches uh sure let's do all of those probably oh look at him he's still R usually it forgets to do what you asked it uh with respect to the read me file so I'm pretty impressed that he's remembering to do that all right let's see if this actually model did you put anything in there so wait is it putting D hold on no just evolutionary Transformer so why is this I don't know why these two unless it's using it doesn't look like it is okay this might be uh the new this might be an example I think it's actually probably still running let's see hard to say this is all probably just the spyware around my computer running uh let's see so let's check in on the current status yeah okay so this that's what I thought was would happen so this was I think new as in 0 22 or 0 23 but it used to be the case that if you had any command that would take longer than two minutes you just like couldn't run it in open hands because it would just time out it would just say oh this is taking too long and just give up uh whereas now it just it like just sends it leaves it running and then just sends this empty command and it should give you a result but it's possible that maybe this is not printing as it's going um so we'll see if it get let's interrupt the current process to add some progress indicator yeah it literally read my mind so again we're sitting here this is taking a while I'm looking at the timer on the recorder 34 minutes and what again I just want to say while it's doing this I would be in a different window doing something else entirely like I'm not going to sit here and keep staring at this like and there's so many things that are very common so for example let's check in on

Segment 8 (35:00 - 40:00)

the current status there's no information oh right you have to basically all your scripts should have tons of debugging happening in the console and if not in the console it should output to like a Json file something like that so if it takes a really long time like in the case of like um Benchmark scripts for language models for example it might take a while to run and get the inference for those benchmarks and then if you want to perform like analysis or you want to like work on the logic of extracting answers from those uh the inference rather than rerunning over and over just output to a Json and then you can iterate over that Json right um but again these are all uh this is all stuff that you should have been doing when you were programming as well so you you're like we're still in an era where you still kind of have to be the engineer and think about like how to break things down um and like a big problem break that down into smaller problem and chunks that you can kind of solve and use things like debugging and all that and for more challenging things generally if you can come up with like a test case where you would know if it worked if it passes these tests or you'd be very confident at least you can come up with those tests and then tell it this is the test you would save it to like a file even and say hey this is the test case just keep iterating until you can solve these test cases um and eventually it will either you know expend a ton of money or solve the test cases uh and you'll be good so like there's lots of like little ways that you still have to like be a programmer to some extent or an engineer to some extent but everything is just so much easier nowadays um and then let me see here I will say like at least Claude if you're seeing like I apologize or if you see like the oh now I see what the problem is if you start seeing that stuff a lot just you got to clean up start over cuz it'll be Stu going a loop forever uh as soon as you start seeing messages like that like definitely start the context over all that it's a lost cause at that point okay so now the fitness evaluation is taking quite long so for example there is a big how much what was my uh memory up to last time I don't remember H let's first see how things are progressing even if it's slow uh before we go m making changes uh I think we'd like to see at least a few Generations I mean we're definitely Uh something's happening because we are using massive amounts of memory um not quite sure what or why but uh we can address that at a later date I'm just I just really want to see number go up like how hard is that to show me number go up let's go ahead and break that then use numis vectorization plus let's use some parallelization I have 64 total cores let's Reserve 12 for me so use 52 to process yeah you start seeing stuff like this you should be on red alert at least with Claude ah anything like that no no you got to be careful whoa look at it go oh for a split second it must have hit another error probably uh that was awesome looking for a moment ah I see the problem ah no I know what to do ah I see ah there you go as soon as you if you have too many awz happening all at once things are not looking good we'll see though we'll see it's a red flag though watch closely dang look at them go uh did I or did I not say 52 I think it's violating my request it's probably going to screw up my video okay we are uh check the timer on the video 57 minutes 57 and A2 minutes and I have done no coding we have reached the point of basically here we got a Best Fitness up to uh 76% um reached that in generation 47 I think we only did 50 um and actually it really probably was it did 50 but it starts at zero so really it only had two more Generations so it sounds like it was continuing to improve as time went on started to predict valid asy characters um final uh predictions

Segment 9 (40:00 - 45:00)

show learning of character patterns I'm not really positive that I would necessarily agree with that but that's what it says um areas for improvement uh more sophisticated fit Fitness function maybe it seems like things are getting better but it if we're already at 76 and these are the predictions that we're getting uh my guess is um uh potentially uh this is more like a local Minima type scenario um but we'll get there um okay so probably the last thing I would do here is show you guys um well two things one again I don't know what step we're at but I don't think it we're near enough to reset uh yeah we're only at step 85 but again if you did need to reset context for any reason because this download files is not working you literally just ask the model hey could you please zip this entire workspace into a workspace do zip file for me thanks uh and it will do that it probably doesn't have zip installed but it will just install zip and put it in um a workspace um and then from here you can do all kinds of stuff and like stuff that I would just never personally do um like you just would never take the time to really make it really nice so for example is like visualizations dashboards stuff like that like as this trains uh one of the other things that you could ask uh is hey instead of like a tensor board necessarily that just shows the stuff that tensor board could show you could tell it like hey show me the best fitness show me a best fitness graph over time as it trains like create like some sort of uh dashboard for this um and then also things like show me some examples like every generation so show me some of the best uh final prediction examples stuff like that um and you can kind of keep going with that but anyway for doing stuff like that for like dashboards and all that uh it's really powerful so anyways um this is like a perfect stopping point it was an hour in again I did almost no coding a lot of that I was just like looking at something else doing something else um or talking to you on camera or like explaining something but really the actual input required of me was minimal to get to this point now again we did not solve we didn't make a new breakthrough in uh new language models or anything like that or we didn't make ASI today but the point is you can use this to do like if I'm not recording a video again I'm doing like five of these at any one time to try to solve certain different problems um so especially if you're an R& D or something like that it is much faster but it doesn't have to be R& D it can literally be like I use this with my work for benchmarking uh large language models it makes it so much easier to like quickly Implement a benchmark like any Benchmark there's always like all these like brand new benchmarks everyone does them in different ways everything has its own little quirks and features and every model is a little different and it's just kind of a nightmare implementing models on benchmarks for example but doing it with language models it's like so easy it's so fast like when you're using something like an agent so anyways I definitely encourage you everybody to try these things out try out Open Hands use the cursor agent even just using the chat you know the web UI um all these things are extremely powerful um and if you're not using them you're missing out if you're refusing to use them uh you're going to be replaced prop probably so anyways uh that's all for now uh if you have questions comments concerns suggestions something else I should look at uh let me know in the comments below if you're interested in learning about how normal networks work like at a deep level uh check out the noral networks from scratchbook at ns. otherwise I will see you guys anywhere from a few days to a couple of years uh till next time okay bonus points um I did ask it to do something and then I left and um I think it actually maybe worked so uh basically I just asked it to train uh save like best fitness and like all the fitnesses to like a Json and then we could display it in a live uh dashboard as it trains and then maybe view progress over time or something so it looks like it actually did that and then I just asked it hey will you zip everything up into a worksspace do zip because I was having a problem like doing the download all files thing so it did that made me a little zip file um okay so coming over here uh we will extract come into the works space and um what were the two files I already forgot we want to run a training dashboard um for the live visualization and then oops uh python training dashboard um okay we'll see uh and then finally H what was the other thing parallel evolutionary transformer.

Segment 10 (45:00 - 50:00)

Pi pull that up and actually that needs to be uh python parallel evolutionary transformer. Pi cool um oh it's uh it's acting like it's a local hold on let me fix that okay so in these two files this is what we want to do we want toplace replace workspace with like nothing now we could ask the model like you could pass his error to the model but I don't have time for this so I'm going to do that save come over here and then basically yeah do the same thing uh just get rid of all wait did it not there we go okay save so then coming back over to here one more time okay so things are training things are laggy as heck and then we'll come over here I don't really see anything yet Island one oh my gosh it's murdering hold on okay I just can't help myself I have to go um but uh this is just way too addicting first of all what the heck this is not what I was expecting for a dashboard um but this is like the coolest implementation of a dashboard that I could ever have expected I would never have even asked for something like this it does appear that the trend is going up um and then yeah what I want for the fitness history is to be both uh Best Fitness but also the average fitness I think that would be cool to track as time goes on um uh which okay that's easy um and then uh okay that's fine so hopefully it'll do all this we'll ask it to zip it one more time and then we'll run it and then that will be uh the end I just can't help myself this is the coolest visualization I would never have made this visualization this is just so cool uh I just I have to see this um very good uh okay great work space. zip to contain all of this awesome what is this about oh that's just historical okay so then we'll come over here and I'm going to grab this okay training dashboard run that and then run this and finally we will also pull up an H top make sure it's doing what we said to do please uh oh the dashboard's already running okay so run that oh this is this must be the uh average yeah it says so oh my gosh is this not just the cutest thing in the freaking world oh my God and it appears to be honoring our request that's cool oh my God I'm nerding out over this is so cool oh my God this is the best dashboard I've ever seen in my life oh my goodness this is so cool I I can't get over this dashboard that is so cool so probably I would do maybe more generations to see if this keeps going up but uh I'm having way too much fun that is just so cool uh let's see uh parallel evolutionary Transformer somewhere in here we got to be able to pick wait where did it okay num cores that's fine um my dog is like licking the floor making nasty noises for us uh where is this uh what was it uh 50 Generations 50 maybe it's just Islands oh generation here we go uh can we do that so Island size 50 Generations 50 let's do um let's do 200 save that and then I wonder can I just restart I have no idea this is not my

Segment 11 (50:00 - 55:00)

code it looks like we can dude I can't get over this chart this is the coolest um thing I wonder if it were going to ruin the chart though because in the past it looked like if this is 15 out of 200 I wonder what happens to our chart we're going to find out come on buddy let's go outside dude I love this is just so fun this is so fun I would never have made a dashboard look like this here let me uh let's see here move this over a little bit and then maybe do like that fit my face in there could also just get rid of it yeah isn't that awesome oh my goodness I love this I just love this okay so uh I am kind of curious what's going to happen because we're going to run out of space here and I'm guessing this will not handle for that but I mean this is we are off to the this was this is like the this is way better than I thought it was ever going to be on this video I'm so excited for this kind of a visualization that is so cool I hope you guys are enjoying this visualization as much as I am look at covering over this oh my goodness this is just like the coolest way dude I need to learn more about how is it doing that like I need all my visualizations to be in the console from now on that is so cool is it moving it is moving oh man oh my God I just had no idea this was even an option see this is a perfect example of like yeah in some respects it's useful to be a programmer because you know like what's possible what's not possible but then in some respects it's like man I didn't even know this was possible I had no idea this is so cool just I'm dying it's all with curses oh my goodness this is so this is mind blowing to me I wish we had the full history because it looks better because like now the gains are slowing down um but dang dude that's so cool that is so cool I hope you guys are loving this as much as I am I to be honest I don't even care if you don't uh that is super cool um it is predicting slightly better like it's getting closer and closer we do we have a lot of work to do still but like I said we were never going to solve this that um but this is our this is way better than I thought it was ever going to be so that is super cool um okay now for real I am going to cut it here I could spend oh my God I has spent hours at this I love this visualization that is so cool uh wow um yeah so anyways I'm going to stop here uh questions comments concerns suggestions whatever uh feel free to leave them below maybe I will have an update on this because I definitely have to keep trucking along and I love this dashboard I I'm going to like make everything I think I had been wanting to look into curses for some time now I just never had the opportunity um and like again this is a perfect example I will I would never have made this not in a million years but the joy that has been brought to me by my uh agent making this for me is immense uh wow okay yeah that's all for now I will see you guys in I don't know another you know a few days few months couple years who knows uh anyway till next time use some agents I just couldn't help myself I'm back with some more bonus content um I need to know two things uh first of all rather than having the sliding window can we make it such that it has um all of the history so we can see it more like long large scale and then also I must know uh can we make this like a line now I know that this is just curses like it's not a graphing library but I really need to see lines like I need to know is this doable so um so that's what I'm going to do I'm first I'm just going to say um looking really good uh could we uh change from a scatter plot to a line graph for both of these metrics by chance I think that would look better also

Segment 12 (55:00 - 60:00)

could we um add values to the x axis x- axis uh generation number uh and then finally what was the other thing um finally could we show um all the steps rather than oops rather than just the um a sort of sliding window so let's code for 200 generations and when we get to 200 I want to be able to see from the beginning to end all in the chart makes sense all right let's see if this can do it um uh I will be super excited if so I uh this has this has uh changed so much for me I don't know if we can label the x-axis that would be crazy if we can um this is just so cool I'm a real sucker for stuff like this um I'm like just making like colors in the UI of like some sort of console stuff like that just tickles me but this is something else especially when it had like a legend and everything like come on man that is crazy I mean it's just crazy we'll see if it makes these modifications I feel like the line graph is probably somewhat difficult like coating this is probably easy but making it be like lines I feel like would be pretty hard without I don't even know I mean I know you could do it but you would almost need like some sort of graphing package or something I don't know but even this is like really cool I'm just I'm really uh I'm tripping I'm going to need it to like always do this for me from like now on I don't want uh web- based UI anymore I need this um also I apologize my eyes are dry as heck we're having some crazy weather right now and it's getting very dry we have like this massive cold front coming in um okay so um cool so I think it's done great let's go ahead and zip all that up please into workpace uh that zip very good very good five oh okay I think okay that makes sense okay that'll make sense very good um I me let's just see if it works uh oops what happened here oh okay anyway uh blah all right wish us luck everybody okay I'm not sure what I'm seeing here I mean that's not terrible I was really hoping to like have slashes be used like to notch up use a slash I think that would have been pretty cool and then I'm not sure what's happening down here because it like was modifying that that's kind of odd um maybe we'll go back to the scatters I'm not really sure it's just so cool I just can't stop myself from uh tinkering with this and I don't know it's just it's what makes it so much fun is like it gets you straight to the fun of programming and again I think it probably varies by who you know who and why what you know why did you learn to program do you like to program or do you just like to build stuff and like I just want uh this just makes it really fun to do so because now okay like I just I'm trying to answer some questions you know so like this helps me to first of all see okay I can actually see like okay there's a little bit more of a trend going on here um and now I'm starting to wonder I mean this is running really fast as a parallel process now too and so like if we check uh htop real quick um yeah we got plenty of cores to dedicate to the cause so we could go longer we could add a much even bigger population we could add way more exploration all that so I think that's probably what I would likely do next um please don't expect uh more bonus content though because I really I have to stop at some point I just have to but I mean look at this curve I mean dang it just it really uh you know it's begging to like continue onward honestly I got to do more than 200

Segment 13 (60:00 - 62:00)

apparently oh man this is so addicting uh anyway yeah agents are awesome yeah that's just so cool I think we probably need to put the um uh the legend like down in the bottom right you know like that's probably a better spot for it or like outside the graph a little bit or something uh or even like up here like in the in this little title but uh overall uh yeah that's pretty cool there's a very clear Trend here I would keep going I would look possibly even further into what is Fitness and all that make sure that's actually still logical uh we probably want more than four species I don't know if that's like the island concept or what so I'd probably want to look more into that but again instead of like looking at the code I mean directly you would just ask the agent um you know hey what's going on here why is it only four species oh it's because the okay let's have more and like let's have more uh mutation like you would keep going from here um but I mean right now we're actually on a pretty interesting track like these uh predictions aren't exactly uh perfect but if I recall right we did not reward for ask not or yeah like straight up characters like e j e all that like the model is just learning that that's probably more likely what we're looking for and even that that's cool um I didn't think that this would get this far to be honest with you so uh yeah that's awesome that is just so cool and I love this curses oh man that just that really did that like made my day so okay I really should stop it here uh otherwise this will be yet again another like 5 10 hour video so anyways that's all for now bye for

Другие видео автора — sentdex

Ctrl+V

Экстракт Знаний в Telegram

Экстракты и дистилляты из лучших YouTube-каналов — сразу после публикации.

Подписаться

Лучшие методички за неделю — каждый понедельник