Speaker(s): Robert Shaw llm-d is a well-lit path for anyone to serve LLMs at scale, for any model across a diverse and comprehensive set of hardware accelerators. Come learn more about how llm-d enables distributed inference at scale! --- Full schedule, including slides and other resources:











