Skip to content

Amazon Unveils Next-Gen AI Chips: Trainium2 and Graviton4

  • AWS Trainium2 offers 4x better performance and 2x improved energy efficiency, accelerating AI model training significantly.
  • Graviton4, designed for inferencing, boasts enhanced performance, more cores, and improved memory bandwidth.
  • Amazon's custom chips address GPU shortages, positioning AWS as a key player in AI infrastructure innovation.

Amazon introduced AWS Trainium2, boasting 4x better performance and 2x improved energy efficiency over its predecessor. This custom chip, available in EC Trn2 instances, scales up to 100,000 chips in AWS’ EC2 UltraCluster.

A cluster of 100,000 Trainium2 chips accelerates AI model training, enabling the creation of large language models with 300 billion parameters in weeks.

Revolutionizing inferencing with Graviton4

Amazon's Graviton4, designed for inferencing, outperforms its predecessor with up to 30% better compute performance, 50% more cores, and 75% more memory bandwidth. All physical hardware interfaces are now encrypted, enhancing security for AI training workloads and data.

Graviton4 is set to power Amazon EC2 R8g instances, available in preview, with general availability expected in the coming months.

Strategic move amid GPU shortage and rising AI demand

Amazon's latest chip offerings signify a strategic shift, providing alternatives to address the ongoing GPU shortage. Trainium2 and Graviton4 aim to cater to the increasing demand for generative AI, offering enhanced performance, energy efficiency, and security. These custom chips position Amazon as a key player in shaping the future of AI infrastructure.

AWS continues innovation amidst industry challenges

In the face of GPU scarcity predicted until 2025, Amazon's commitment to innovation is evident. The Trainium2 and Graviton4 chips showcase AWS's dedication to advancing AI capabilities meeting customer needs for faster, cost-effective, and secure AI model development and deployment.

The chips align with Amazon's vision to transform cloud infrastructure and lead in AI technology.

Edited by Shruti Thapa