Войти
  • 846Просмотров
  • 3 месяца назадОпубликованоAmazon Web Services

How to Build High-Performance Kernels on AWS Trainium | Amazon Web Services

This session provides a detailed introduction to the Neuron Kernel Interface (NKI) for AWS Trainium. We review the role of the compiler, kernel libraries, and the NKI language, followed by a technical exploration of NeuronCore hardware. Topics include SBUF, execution engines, and data movement across the architecture. Visual demonstrations connect low-level APIs to hardware execution. The session concludes with a live example showing a forward pass of FlashAttention implemented as a custom kernel. Subscribe to AWS: Sign up for AWS: AWS free tier: Explore more: Contact AWS: Next steps: Explore on AWS in Analyst Research: Discover, deploy, and manage software that runs on AWS: Join the AWS Partner Network: Learn more on how Amazon builds and operates software: Do you have technical AWS questions? Ask the community of experts on AWS re:Post: Why AWS? Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud. Millions of customers—including the fastest-growing startups, largest enterprises, and leading government agencies—use AWS to be more agile, lower costs, and innovate faster. #AWS #AI #GenerativeAI #AmazonWebServices #CloudComputing