Run LLMs on Kubernetes with LLMKube
Machine-readable: Markdown · JSON API · Site index
Описание видео
Follow the DevOps roadmap👉🏽 https://www.instagram.com/marceldempers
My DevOps Roadmap 👉🏽 https://marceldempers.dev
Patreon 👉🏽https://patreon.com/marceldempers
Checkout the source code below 👇🏽 and follow along 🤓
Also if you want to support the channel further, become a member 😎
https://marceldempers.dev/join
Checkout "That DevOps Community" too
https://marceldempers.dev/community
Source Code 🧐
--------------------------------------------------------------
https://github.com/marcel-dempers/docker-development-youtube-series
Like and Subscribe for more :)
Follow me on socials!
Instagram | https://www.instagram.com/marceldempers
X | https://x.com/marceldempers
GitHub | https://github.com/marcel-dempers
LinkedIn | https://www.linkedin.com/in/marceldempers
Music:
Track: souKo - souKo - Parallel | is licensed under a Creative Commons Attribution licence (https://creativecommons.org/licenses/by/3.0/)
Listen: https://soundcloud.com/soukomusic/parallel
Timestamps:
00:00 Intro
00:03 llama.cpp
01:12 what is llm-kube
01:42 define models as YAML
02:19 Creating a k8s cluster
02:41 The Documentation
03:56 Installing llm-kube
04:23 Check the installation
04:45 The new CRDs
05:39 The Model
06:48 The InferenceService
07:58 Under the hood
09:33 Testing\Using our Model
10:07 OpenAI endpoint (OpenCode)
11:10 The Source Code
11:35 Outro