Search

Results for "LLaMA-2"

Llama 2: a global release of local importance

MBZUAI · Invalid Date

MBZUAI is a global partner in Meta's release of Llama 2, joining organizations like IBM, AWS, Microsoft, and NVIDIA. MBZUAI will provide early feedback and help build the software as a global community. MBZUAI is working on large language models, developing a sustainable LLM named Vicuna, and strengthening infrastructure for LLM-chat evaluation. Why it matters: MBZUAI's involvement promises to bring about a new generation of UAE-born AI advancements built around the Llama 2 ecosystem and fact-checking capabilities.

ALLaM: Large Language Models for Arabic and English

arXiv · Jul 22

The paper introduces ALLaM, a series of large language models for Arabic and English, designed to support Arabic Language Technologies. The models are trained with language alignment and knowledge transfer in mind, using a decoder-only architecture. ALLaM achieves state-of-the-art results on Arabic benchmarks like MMLU Arabic and Arabic Exams. Why it matters: This work advances Arabic NLP by providing high-performing LLMs and demonstrating effective techniques for cross-lingual transfer learning and alignment with human preferences.

BiMediX2: Bio-Medical EXpert LMM for Diverse Medical Modalities

arXiv · Dec 10

MBZUAI releases BiMediX2, a bilingual (Arabic-English) Bio-Medical Large Multimodal Model, along with the BiMed-V dataset (1.6M samples) and BiMed-MBench evaluation benchmark. BiMediX2 supports multi-turn conversation in Arabic and English and handles diverse medical imaging modalities. The model achieves state-of-the-art results on medical LLM and LMM benchmarks, outperforming existing methods and GPT-4 in specific evaluations.

Falcon 2: UAE’s Technology Innovation Institute Releases New AI Model Series, Outperforming Meta’s New Llama 3

TII · Mar 17

The Technology Innovation Institute (TII) in Abu Dhabi has launched Falcon 2, a new series of large language models including the Falcon 2 11B and Falcon 2 11B VLM. The Falcon 2 11B outperforms Meta’s Llama 3 (8B) and performs on par with Google’s Gemma 7B, as verified by Hugging Face. Falcon 2 11B VLM is TII's first multimodal model with vision-to-language capabilities and is open-source, making it accessible to developers. Why it matters: This release strengthens the UAE's position in AI research and development, providing open-source models that can be deployed on smaller infrastructures and used in diverse sectors.