This paper introduces an enhanced Dense Passage Retrieval (DPR) framework tailored for Arabic text retrieval. The core innovation is an Attentive Relevance Scoring (ARS) mechanism that improves semantic relevance modeling between questions and passages, replacing standard interaction methods. The method integrates pre-trained Arabic language models and architectural refinements, achieving improved retrieval and ranking accuracy for Arabic question answering. Why it matters: This work addresses the underrepresentation of Arabic in NLP research by providing a novel approach and publicly available code to improve Arabic text retrieval, which can benefit various applications like Arabic search engines and question-answering systems.
MBZUAI researchers introduce ARB, the first comprehensive benchmark for evaluating step-by-step multimodal reasoning in Arabic across textual and visual modalities. The benchmark spans 11 diverse domains and includes 1,356 multimodal samples with 5,119 human-curated reasoning steps. Evaluations of 12 state-of-the-art LMMs revealed challenges in coherence, faithfulness, and cultural grounding, highlighting the need for culturally aware AI systems.
The Autonomous Robotics Research Center (ARRC) is developing underwater communication systems, including a multimode modem prototype, and has filed three patents. One key technology is the Universal Underwater Software Defined Modem (UniSDM), which supports sound, magnetic induction, light, and radio waves. ARRC also developed a network management framework for automatic network slicing (ANS) of communication resources. Why it matters: These advancements are crucial for improving underwater exploration, industrial maintenance, and marine monitoring in the region, enabling more efficient and reliable communication for underwater robots.
Abu Dhabi's Advanced Technology Research Council (ATRC) has launched AI71, a new AI company building on the Falcon generative AI models developed by TII. AI71 will focus on multi-domain specializations, offering AI data control options for companies and countries looking to self-host for greater privacy. The company will be taken to market by ATRC's VentureOne subsidiary, initially targeting the medical, educational, and legal sectors. Why it matters: AI71 aims to establish Abu Dhabi and the UAE as a major AI player by providing decentralized data ownership and promoting broader access to AI technology.
Researchers introduce AraDiCE, a benchmark for Arabic Dialect and Cultural Evaluation, comprising seven synthetic datasets in various dialects and Modern Standard Arabic (MSA). The benchmark includes approximately 45,000 post-edited samples and evaluates LLMs on dialect comprehension, generation, and cultural awareness across the Gulf, Egypt, and Levant. Results show that Arabic-specific models like Jais and AceGPT outperform multilingual models on dialectal tasks, but challenges remain in dialect identification, generation, and translation. Why it matters: This benchmark and associated datasets will help improve LLMs' ability to understand and generate diverse Arabic dialects and cultural contexts, addressing a significant gap in current models.
This research evaluates LLMs like ChatGPT, Llama, Aya, Jais, and ACEGPT on Arabic automated essay scoring (AES) using the AR-AES dataset. The study uses zero-shot, few-shot learning, and fine-tuning approaches while using a mixed-language prompting strategy. ACEGPT performed best among the LLMs with a QWK of 0.67, while a smaller BERT model achieved 0.88. Why it matters: The study highlights challenges faced by LLMs in processing Arabic and provides insights into improving LLM performance in Arabic NLP tasks.