Skip to content
GCC AI Research

Meet Jais, The World’s Most Advanced Arabic LLM - G42

Inception · · Significant research

Summary

G42's Core42 has released Jais, a collection of Arabic large language models, including a 13B parameter version. Jais-13B is trained on a 395B token dataset containing Arabic and English text. According to the blog post, Jais-13B achieves state-of-the-art results on Arabic NLP benchmarks. Why it matters: This release establishes a new benchmark for Arabic language AI, potentially enabling more sophisticated and culturally relevant applications.

Keywords

Jais · G42 · Core42 · Arabic LLM · Language model

Get the weekly digest

Top AI stories from the GCC region, every week.

Related

Meet Jais, The World’s Most Advanced Arabic LLM - G42

Inception ·

G42's Core42 has released Jais, a new Arabic large language model. Jais includes 13 billion parameters and was trained on a dataset of 126B tokens, including 43B Arabic tokens. According to the developers, Jais achieves state-of-the-art results on Arabic benchmarks and competitive performance on English benchmarks. Why it matters: Jais represents a significant step forward for Arabic NLP, providing a powerful new tool for researchers and developers in the region.

Meet “Jais”, The World’s Most Advanced Arabic Large Language Model Open Sourced by G42’s Inception

MBZUAI ·

G42's Inception has open-sourced Jais, a 13-billion parameter Arabic large language model (LLM). Jais was trained on a 395-billion-token Arabic and English dataset and outperforms existing Arabic models. The model is a collaboration between Inception, MBZUAI, and Cerebras Systems, and was trained on the Condor Galaxy supercomputer. Why it matters: This release establishes a new standard for Arabic language AI, providing over 400 million Arabic speakers access to generative AI and fostering innovation in the region.

Inception, Cerebras and MBZUAI Release Jais 2 – the next generation of the world’s leading Arabic open-weight LLM

MBZUAI ·

Inception, Cerebras, and MBZUAI have released Jais 2, a 70 billion parameter open-weight Arabic LLM. Jais 2 is trained on an Arabic-first dataset and features a redesigned architecture for stronger reasoning and fluency across Arabic dialects and English. It integrates a safety-first framework and demonstrates capabilities in understanding Arabic poetry, culture, and social media tone. Why it matters: Jais 2 addresses the historical underrepresentation of Arabic in AI by providing a culturally and linguistically faithful model, potentially accelerating innovation across the region.