UHG
Search
Close this search box.

TII Releases Falcon 2 AI Model, Outperforms Meta’s Llama 3

Falcon 2 is a new AI model series that outperforms Meta's Llama 3 and includes the innovative vision-to-language model, Falcon 2 11B VLM.

Share

Why Falcon Sucks

Illustration by Nikhil Kumar

Abu Dhabi-based The Technology Innovation Institute (TII) has launched Falcon 2, a series of AI models that includes Falcon 2 11B and Falcon 2 11B VLM. These models surpass existing benchmarks. 

Falcon 2 11B is a text-based model, while Falcon 2 11B VLM is a vision-to-language model that can convert visual inputs into textual outputs. Both models are multilingual and open-source, providing developers worldwide with unrestricted access.

Falcon 2 11B has been verified to outperform Meta’s Llama 3, which has 8 billion parameters, and performs on par with Google’s Gemma 7B model, according to evaluations by Hugging Face Leaderboard

These models are designed to run efficiently on a single GPU, making them scalable and easy to integrate into various infrastructures, from high-end servers to personal laptops.

In a move to further enhance Falcon 2’s capabilities, TII plans to incorporate ‘Mixture of Experts’ (MoE). This advanced machine learning technique involves combining specialised smaller networks to improve performance by delivering more accurate and faster decision-making.

The models are released under the TII Falcon License 2.0, a permissive Apache 2.0-based software license promoting responsible AI use. More information is available at FalconLLM.TII.ae.

H.E. Faisal Al Bannai, secretary general of ATRC and Strategic Research and Advanced Technology Affairs Advisor to the UAE President, commented on the launch, saying, “While Falcon 2 11B has demonstrated outstanding performance, we reaffirm our commitment to the open-source movement and the Falcon Foundation.”

The first Falcon model, released in 2022, established TII’s commitment to open-source AI, focusing on large language models (LLMs). It was trained on 1 trillion tokens and featured 7 billion parameters. This model builds on this foundation, enhancing capabilities. 

📣 Want to advertise in AIM? Book here

Picture of K L Krithika

K L Krithika

K L Krithika is a tech journalist at AIM. Apart from writing tech news, she enjoys reading sci-fi and pondering the impossible technologies, trying not to confuse it with reality.
Related Posts
19th - 23rd Aug 2024
Generative AI Crash Course for Non-Techies
Upcoming Large format Conference
Sep 25-27, 2024 | 📍 Bangalore, India
Download the easiest way to
stay informed

Subscribe to The Belamy: Our Weekly Newsletter

Biggest AI stories, delivered to your inbox every week.

Flagship Events

Rising 2024 | DE&I in Tech Summit
April 4 and 5, 2024 | 📍 Hilton Convention Center, Manyata Tech Park, Bangalore
Data Engineering Summit 2024
May 30 and 31, 2024 | 📍 Bangalore, India
MachineCon USA 2024
26 July 2024 | 583 Park Avenue, New York
MachineCon GCC Summit 2024
June 28 2024 | 📍Bangalore, India
Cypher USA 2024
Nov 21-22 2024 | 📍Santa Clara Convention Center, California, USA
Cypher India 2024
September 25-27, 2024 | 📍Bangalore, India
discord-icon
AI Forum for India
Our Discord Community for AI Ecosystem, In collaboration with NVIDIA.