When to choose CPU vs GPU: Databricks AI Runtime Explained

1:48

When to choose CPU vs GPU: Databricks AI Runtime Explained

Databricks 01.06.2026 2 331 просмотров 51 лайков

Machine-readable: Markdown · JSON API · Site index

Смотреть на YouTube

Поделиться Telegram VK Бот

Транскрипт Скачать .md

Анализ с AI

Описание видео

The conversation around GPUs has shifted this year. It used to be about training models from scratch. Now it is about token economics. Here is the simple mental model: → CPU for the data work. ETL, feature engineering, SQL, classic ML. → GPU for deep learning. Fine-tuning LLMs, computer vision, recommenders, neural networks. Calling a frontier proprietary model on every request adds up fast at production scale. A lot of teams are realizing they can fine-tune a strong open-weights model like Kimi K2 or Qwen on their own data, run it on GPU, and get a system that is cheaper per token and often better at their specific task. That is where on-demand GPUs start to matter. You pick your accelerator, A10 or H100, attach it to your notebook, and fine-tune the open model that fits your workload.

Оглавление (1 сегментов)

Segment 1 (00:00 - 01:00)

Databricks introduced AI runtime with serverless Nvidia GPUs. But the bigger question, when should you use CPU and when GPU? Choosing the wrong compute can make your workflow slower, more expensive, and unnecessarily complex. The answer comes down to the workload. Let me explain. CPUs are still great for the data work, ETL, feature engineering, SQL, data preprocessing, and classical machine learning. That's the work most teams need before they even get to model training. GPUs are different. They are built for deep learning workloads, fine-tuning LLMs, training computer vision models, recommenders, and neural networks. So, in a typical AI workflow, you might prepare your data on CPU, then train your model on GPU, and deploy through model serving. What Databricks AI runtime changes is how easy that GPU step becomes. Because setting up a GPU environment has traditionally been painful. You have to provision infrastructure, match code versions, manage dependencies, configure libraries, and sometimes figure out distributed training. AI runtime removes a lot of that setup. Now, you can attach your Databricks notebook, choose your accelerator, and start training with serverless Nvidia GPUs. You can choose from a single A10, a single H100, or eight H100s for distributed training. And the workflow stays inside Databricks. So, you can prepare data on CPU, train on GPU, track experiments with MLflow, and schedule production jobs with Lake Flow, all governed through Unity Catalog. This built for AI and ML teams that want to train custom models without spending weeks becoming infrastructure engineers. Resources are in the descriptions. Save this for your next AI/ML project.

Другие видео автора — Databricks

Ctrl+V

Экстракт Знаний в Telegram

Экстракты и дистилляты из лучших YouTube-каналов — сразу после публикации.

Подписаться

Лучшие методички за неделю — каждый понедельник