UHG
Search
Close this search box.

AWS Unveils Graviton4, Trainium2 for Faster, Affordable AI Model Building

AWS claims to have built over 2 million Graviton processors, with approximately 50,000 customers using them

Share

AWS Re:invent Adam VP

At re:Invent in Las Vegas, Amazon Web Services (AWS) announced two new AI chips –AWS Graviton4 , AWS Trainium2. The new chips aim to provide advancements in price performance and energy efficiency for a wide range of customer workloads, including machine learning training and generative AI applications. 

Graviton4 offers up to 30% better compute performance, 50% more cores, and 75% more memory bandwidth than Graviton3. Trainium2 delivers up to 4x faster training than its first generation, with deployment capability in EC2 UltraClusters of up to 100,000 chips.

(Source: Business Wire)

David Brown, VP of Compute and Networking at AWS said that Graviton4 marks the fourth generation they have delivered in just five years, and is the most powerful and energy-efficient chip ever built. “Silicon underpins every customer workload, making it a critical area of innovation for AWS,” he added. 

AWS chief Adam Selipsky said that it has more than 50K customers for Graviton, and its other cloud providers are still just talking about making them, and are yet to deliver first server processors. At Ignite 2023, Microsoft recently launched Azure Maia 100 AI Accelerator, its first in-house custom AI system on a chip. 

Some of its customers leveraging AWS chips include Anthropic, Databricks, Datadog, Epic, Honeycomb, SAP and others. Naveen Rao, VP of generative AI at Databricks said that AWS Trainium gave them the scale and high performance needed to train our Mosaic MPT models, and at a low cost. 

“AWS Graviton4 instances are the fastest EC2 instances we’ve ever tested, and they are delivering outstanding performance across our most competitive and latency-sensitive workloads,” said Roman Visintine, lead cloud engineer at Epic Games. 

Juergen Mueller, CTO of SAP SE said that as part of the migration process of SAP HANA Cloud to AWS Graviton-based Amazon EC2 instances, we have already seen up to 35% better price performance for analytical workloads.

Graviron4-powered R8g instances are available today in preview, with general availability planned in the coming months. Check out here. Trainium2  is said to be available in Amazon Ec2 Trn2 instances Check it out here. 

📣 Want to advertise in AIM? Book here

Picture of Tasmia Ansari

Tasmia Ansari

Tasmia is a tech journalist at AIM, looking to bring a fresh perspective to emerging technologies and trends in data science, analytics, and artificial intelligence.
Related Posts
19th - 23rd Aug 2024
Generative AI Crash Course for Non-Techies
Upcoming Large format Conference
Sep 25-27, 2024 | 📍 Bangalore, India
Download the easiest way to
stay informed

Subscribe to The Belamy: Our Weekly Newsletter

Biggest AI stories, delivered to your inbox every week.

Flagship Events

Rising 2024 | DE&I in Tech Summit
April 4 and 5, 2024 | 📍 Hilton Convention Center, Manyata Tech Park, Bangalore
Data Engineering Summit 2024
May 30 and 31, 2024 | 📍 Bangalore, India
MachineCon USA 2024
26 July 2024 | 583 Park Avenue, New York
MachineCon GCC Summit 2024
June 28 2024 | 📍Bangalore, India
Cypher USA 2024
Nov 21-22 2024 | 📍Santa Clara Convention Center, California, USA
Cypher India 2024
September 25-27, 2024 | 📍Bangalore, India
discord-icon
AI Forum for India
Our Discord Community for AI Ecosystem, In collaboration with NVIDIA.