UHG
Search
Close this search box.

Google Rolls Out Gemma 2, Leaves Llama 3 Behind

The 27B model can perform inference on a single NVIDIA H100 Tensor Core GPU or TPU host, reducing deployment costs.

Share

Cognitive Lab Introduces Tokenizer Arena for Devanagari Text

Google DeepMind announced the release of Gemma 2, an advanced version of its open models, available in 9 billion (9B) and 27 billion (27B) parameter sizes.

The model is accessible on Google AI Studio, Kaggle, Hugging Face Models, and soon on Vertex AI Model Garden. Researchers can apply for the Gemma 2 Academic Research Program for Google Cloud credits, with applications open until August 9.

Gemma 2 offers significant improvements over its predecessor, including competitive performance to larger proprietary models and optimised cost efficiency. The 27B model can perform inference on a single NVIDIA H100 Tensor Core GPU or TPU host, reducing deployment costs.

The new models integrate easily with major AI frameworks like Hugging Face Transformers, JAX, PyTorch, and TensorFlow via Keras 3.0. Developers can deploy Gemma 2 on various hardware setups, from cloud-based environments to local CPUs and GPUs.

https://twitter.com/reach_vb/status/1806343018640781675

Gemma 2 is available under a commercially-friendly license, encouraging innovation and commercialization. Google Cloud customers will be able to deploy and manage Gemma 2 on Vertex AI starting next month. Additionally, Google provides the Gemma Cookbook, offering practical examples for building and fine-tuning applications with Gemma 2.

Google emphasises responsible AI development with Gemma 2, incorporating robust safety processes, pre-training data filtering, and rigorous testing against bias and risk metrics. The LLM Comparator tool and text watermarking technology, SynthID, are part of these efforts.

The initial release of Gemma resulted in over 10 million downloads. Gemma 2 aims to support even more ambitious projects, with future plans to release a 2.6B parameter model to balance accessibility and performance.

📣 Want to advertise in AIM? Book here

Picture of Siddharth Jindal

Siddharth Jindal

Siddharth is a media graduate who loves to explore tech through journalism and putting forward ideas worth pondering about in the era of artificial intelligence.
Related Posts
19th - 23rd Aug 2024
Generative AI Crash Course for Non-Techies
Upcoming Large format Conference
Sep 25-27, 2024 | 📍 Bangalore, India
Download the easiest way to
stay informed

Subscribe to The Belamy: Our Weekly Newsletter

Biggest AI stories, delivered to your inbox every week.

Flagship Events

Rising 2024 | DE&I in Tech Summit
April 4 and 5, 2024 | 📍 Hilton Convention Center, Manyata Tech Park, Bangalore
Data Engineering Summit 2024
May 30 and 31, 2024 | 📍 Bangalore, India
MachineCon USA 2024
26 July 2024 | 583 Park Avenue, New York
MachineCon GCC Summit 2024
June 28 2024 | 📍Bangalore, India
Cypher USA 2024
Nov 21-22 2024 | 📍Santa Clara Convention Center, California, USA
Cypher India 2024
September 25-27, 2024 | 📍Bangalore, India
discord-icon
AI Forum for India
Our Discord Community for AI Ecosystem, In collaboration with NVIDIA.