Skip to content
GCC AI Research

Jais and Jais-chat: Arabic-Centric Foundation and Instruction-Tuned Open Generative Large Language Models

arXiv · · Significant research

Summary

MBZUAI has released Jais and Jais-chat, two new open generative large language models (LLMs) with a focus on Arabic. The 13 billion parameter models are based on the GPT-3 architecture and pretrained on Arabic, English, and code. Evaluation shows state-of-the-art Arabic knowledge and reasoning, with competitive English performance.

Keywords

Jais · Jais-chat · MBZUAI · LLM · Arabic

Get the weekly digest

Top AI stories from the GCC region, every week.

Related

Meet “Jais”, The World’s Most Advanced Arabic Large Language Model Open Sourced by G42’s Inception

MBZUAI ·

G42's Inception has open-sourced Jais, a 13-billion parameter Arabic large language model (LLM). Jais was trained on a 395-billion-token Arabic and English dataset and outperforms existing Arabic models. The model is a collaboration between Inception, MBZUAI, and Cerebras Systems, and was trained on the Condor Galaxy supercomputer. Why it matters: This release establishes a new standard for Arabic language AI, providing over 400 million Arabic speakers access to generative AI and fostering innovation in the region.

Meet Jais, The World’s Most Advanced Arabic LLM - G42

Inception ·

G42's Core42 has released Jais, a new Arabic large language model. Jais includes 13 billion parameters and was trained on a dataset of 126B tokens, including 43B Arabic tokens. According to the developers, Jais achieves state-of-the-art results on Arabic benchmarks and competitive performance on English benchmarks. Why it matters: Jais represents a significant step forward for Arabic NLP, providing a powerful new tool for researchers and developers in the region.

Meet Jais, The World’s Most Advanced Arabic LLM - G42

Inception ·

G42's Core42 has released Jais, a collection of Arabic large language models, including a 13B parameter version. Jais-13B is trained on a 395B token dataset containing Arabic and English text. According to the blog post, Jais-13B achieves state-of-the-art results on Arabic NLP benchmarks. Why it matters: This release establishes a new benchmark for Arabic language AI, potentially enabling more sophisticated and culturally relevant applications.

Inception, Cerebras and MBZUAI Release Jais 2 – the next generation of the world’s leading Arabic open-weight LLM

MBZUAI ·

Inception, Cerebras, and MBZUAI have released Jais 2, a 70 billion parameter open-weight Arabic LLM. Jais 2 is trained on an Arabic-first dataset and features a redesigned architecture for stronger reasoning and fluency across Arabic dialects and English. It integrates a safety-first framework and demonstrates capabilities in understanding Arabic poetry, culture, and social media tone. Why it matters: Jais 2 addresses the historical underrepresentation of Arabic in AI by providing a culturally and linguistically faithful model, potentially accelerating innovation across the region.