What is interpretability?
Описание видео
A surprising fact about modern large language models is that nobody really knows how they work internally. At Anthropic, the Interpretability team strives to change that — to understand these models to better plan for a future of safe AI.
Find out more: https://www.anthropic.com/research