Войти
  • 383Просмотров
  • 2 месяца назадОпубликованоDevConf

llm-d: Kubernetes Native Distributed Inferencing - DevConf.US 2025

Speaker(s): Robert Shaw llm-d is a well-lit path for anyone to serve LLMs at scale, for any model across a diverse and comprehensive set of hardware accelerators. Come learn more about how llm-d enables distributed inference at scale! --- Full schedule, including slides and other resources: