Войти
  • 265549Просмотров
  • 3 месяца назадОпубликованоComputerphile

Sleeper Agents in Large Language Models - Computerphile

It's an older paper, but it checks out. Rob Miles discusses the problem of 'Sleeper Agents' - where LLMs could have hidden traits we don't know about until it's too late. Computerphile is supported by Jane Street. Learn more about them (and exciting career opportunities) at: This video was filmed and edited by Sean Riley. Computerphile is a sister project to Brady Haran's Numberphile. More at