Arabic Stable LM: Adapting Stable LM 2 1.6B to Arabic

arXiv · December 5, 2024 · Significant research

Summary

The paper introduces Arabic Stable LM, a 1.6B parameter Arabic-centric language model, in both base and chat versions. The Arabic Stable LM 1.6B chat model achieves strong results on several benchmarks, outperforming models with up to 8x more parameters. The study also demonstrates the benefit of incorporating synthetic instruction tuning data through a large synthetic dialogue dataset. Why it matters: This work makes Arabic LLMs more accessible by reducing the parameter size while maintaining strong performance, facilitating deployment in resource-constrained environments.

Keywords

Arabic NLP · LLM · Stable LM · instruction tuning · synthetic data

Read original article →

Get the weekly digest

Top AI stories from the GCC region, every week.

ALLaM: Large Language Models for Arabic and English

arXiv · Jul 22

The paper introduces ALLaM, a series of large language models for Arabic and English, designed to support Arabic Language Technologies. The models are trained with language alignment and knowledge transfer in mind, using a decoder-only architecture. ALLaM achieves state-of-the-art results on Arabic benchmarks like MMLU Arabic and Arabic Exams. Why it matters: This work advances Arabic NLP by providing high-performing LLMs and demonstrating effective techniques for cross-lingual transfer learning and alignment with human preferences.

Large Language Models and Arabic Content: A Review

arXiv · May 12

This study reviews the use of large language models (LLMs) for Arabic language processing, focusing on pre-trained models and their applications. It highlights the challenges in Arabic NLP due to the language's complexity and the relative scarcity of resources. The review also discusses how techniques like fine-tuning and prompt engineering enhance model performance on Arabic benchmarks. Why it matters: This overview helps consolidate research directions and benchmarks in Arabic NLP, guiding future development of LLMs tailored for the Arabic language and its diverse dialects.

Arabic Stable LM: Adapting Stable LM 2 1.6B to Arabic

Summary

Keywords

Related

ALLaM: Large Language Models for Arabic and English

Large Language Models and Arabic Content: A Review