UHG
Search
Close this search box.

French AI Lab Kyutai Releases OpenAI GPT-4o Killer ‘Moshi’

Built on the Helium 7B model, Moshi integrates text and audio training, optimised for CUDA, Metal, and CPU backends with support for 4-bit and 8-bit quantization.

Share

Kyutai, a French non-profit AI research laboratory, has introduced Moshi, a real-time native multimodal foundational AI model. This open-source project features voice-enabled AI assistant offering capabilities that rival OpenAI’s GPT-4o and Google Astra. 

Moshi, developed by a team of just eight researchers in six months, can understand and express 70 different emotions and styles, speak with various accents, and handle two audio streams simultaneously, allowing it to listen and talk at the same time.

Built on the Helium 7B model, Moshi integrates text and audio training, optimised for CUDA, Metal, and CPU backends with support for 4-bit and 8-bit quantization.

Key features of Moshi include:

  1. Real-time interaction with end-to-end latency of 200 milliseconds
  2. Ability to run on consumer-grade hardware, including MacBooks
  3. Support for multiple backends (CUDA, Metal, CPU)
  4. Watermarking to detect AI-generated audio (in progress)

Kyutai chief Patrick Pérez said that the Moshi has the potential to revolutionize human-machine communication, saying, “Moshi thinks while it talks”.

Kyutai plans to release the full model, including the inference codebase, the 7B model, the audio codec, and the optimised stack. 

Founded in November 2023 with €300 million in backing from investors including French billionaire Xavier Niel, the startup aims to contribute to open research in AI and foster ecosystem development. 

The lab’s approach challenges major AI companies like OpenAI, which have faced criticism for delaying releases due to safety concerns. Notably, OpenAI has been withholding the release of its video generation model Sora, as well as the Voice Engine and voice mode features of GPT-4o.

Moshi contributes to France’s increasing influence in the AI sector, alongside other French-origin projects such as Hugging Face and Mistral.

📣 Want to advertise in AIM? Book here

Picture of Siddharth Jindal

Siddharth Jindal

Siddharth is a media graduate who loves to explore tech through journalism and putting forward ideas worth pondering about in the era of artificial intelligence.
Related Posts
19th - 23rd Aug 2024
Generative AI Crash Course for Non-Techies
Upcoming Large format Conference
Sep 25-27, 2024 | 📍 Bangalore, India
Download the easiest way to
stay informed

Subscribe to The Belamy: Our Weekly Newsletter

Biggest AI stories, delivered to your inbox every week.

Flagship Events

Rising 2024 | DE&I in Tech Summit
April 4 and 5, 2024 | 📍 Hilton Convention Center, Manyata Tech Park, Bangalore
Data Engineering Summit 2024
May 30 and 31, 2024 | 📍 Bangalore, India
MachineCon USA 2024
26 July 2024 | 583 Park Avenue, New York
MachineCon GCC Summit 2024
June 28 2024 | 📍Bangalore, India
Cypher USA 2024
Nov 21-22 2024 | 📍Santa Clara Convention Center, California, USA
Cypher India 2024
September 25-27, 2024 | 📍Bangalore, India
discord-icon
AI Forum for India
Our Discord Community for AI Ecosystem, In collaboration with NVIDIA.