Saudi Arabia's national AI authority, SDAIA (Saudi Data and AI Authority), is reportedly developing a National AI Index. This index aims to measure and track the Kingdom's progress and capabilities across various dimensions of artificial intelligence. It is expected to encompass areas such as research and development, talent, infrastructure, and ethical considerations. Why it matters: This initiative provides a crucial framework for evaluating Saudi Arabia's AI ecosystem, guiding future strategic investments, and benchmarking its national AI capabilities against international standards.
KAUST researchers found that sulfate ions reduce the amount of free water in aqueous batteries, mitigating the parasitic reactions that degrade the anode and shorten battery life. Adding zinc sulfate extended battery lifespan more than tenfold. The sulfate salts act as a "water glue," stabilizing the bonds of free water. Why it matters: This finding provides a cheap and scalable approach to improve the viability of aqueous batteries for sustainable energy storage, particularly for integrating renewable energy sources.
Researchers introduce UnsafeChain, a new safety alignment dataset designed to improve the safety of large reasoning models (LRMs) by focusing on "hard prompts" that elicit harmful outputs. The dataset is built by identifying unsafe completions and correcting them into safe responses, so that models are exposed to unsafe behaviors together with their corrections. LRMs fine-tuned on UnsafeChain show improved safety while preserving general reasoning ability, compared with models fine-tuned on existing datasets such as SafeChain and STAR-1.
This paper explores Dialectal Arabic (DA) to Modern Standard Arabic (MSA) machine translation using prompting and fine-tuning techniques for Levantine, Egyptian, and Gulf dialects. The study found that few-shot prompting outperformed zero-shot and chain-of-thought methods across six large language models, with GPT-4o achieving the highest performance. A quantized Gemma2-9B model achieved a chrF++ score of 49.88, outperforming zero-shot GPT-4o (44.58). Why it matters: The research provides a resource-efficient pipeline for DA-MSA translation, enabling more inclusive language technologies by addressing the challenges posed by dialectal variations in Arabic.
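For readers unfamiliar with the metric behind those scores: chrF++ is an F-score over character n-grams (up to length 6) and word n-grams (up to length 2), with recall weighted more heavily than precision (beta = 2). The sketch below is a simplified illustration of that idea, not the study's evaluation code. The function names are hypothetical, and standard tools such as sacreBLEU handle details (whitespace, short segments, corpus-level aggregation) that this version ignores, so its numbers should not be expected to match published scores.

```python
from collections import Counter


def ngrams(seq, n):
    """Count the n-grams of a sequence (characters or words)."""
    return Counter(tuple(seq[i:i + n]) for i in range(len(seq) - n + 1))


def f_beta(precision, recall, beta):
    """F-score that weights recall beta times as much as precision."""
    if precision + recall == 0:
        return 0.0
    b2 = beta ** 2
    return (1 + b2) * precision * recall / (b2 * precision + recall)


def simple_chrf_pp(hyp, ref, char_order=6, word_order=2, beta=2.0):
    """Simplified sentence-level chrF++ on a 0-100 scale (illustrative only)."""
    precisions, recalls = [], []
    streams = [
        # Character n-grams, with spaces removed (a simplifying assumption).
        (list(hyp.replace(" ", "")), list(ref.replace(" ", "")), char_order),
        # Word n-grams, split on whitespace.
        (hyp.split(), ref.split(), word_order),
    ]
    for hyp_seq, ref_seq, max_order in streams:
        for n in range(1, max_order + 1):
            hyp_counts, ref_counts = ngrams(hyp_seq, n), ngrams(ref_seq, n)
            if not hyp_counts or not ref_counts:
                continue  # segment too short for this n-gram order
            overlap = sum((hyp_counts & ref_counts).values())
            precisions.append(overlap / sum(hyp_counts.values()))
            recalls.append(overlap / sum(ref_counts.values()))
    if not precisions:
        return 0.0
    avg_p = sum(precisions) / len(precisions)
    avg_r = sum(recalls) / len(recalls)
    return 100 * f_beta(avg_p, avg_r, beta)


identical = simple_chrf_pp("the cat sat", "the cat sat")   # perfect match: 100.0
partial = simple_chrf_pp("the cat sat", "the cat sat down")
```

Because the score is built from character overlap as well as word overlap, a translation that gets most word stems right but misses an affix still earns partial credit, which is one reason character-based metrics are often preferred for morphologically rich languages such as Arabic.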
Researchers propose MS-NN-steer, a model-structured neural network for autonomous vehicle steering control that integrates nonlinear vehicle dynamics. The controller was validated using real-world data from the Abu Dhabi Autonomous Racing League (A2RL) competition. MS-NN-steer demonstrates improved accuracy, generalization, and robustness compared to general-purpose NNs and the A2RL winning team's controller. Why it matters: This research demonstrates a promising approach to developing transparent and reliable AI for safety-critical autonomous racing applications in the UAE.
KAUST is partnering with digiLab to develop AI for coral conservation within the KAUST Coral Restoration Initiative (KCRI). digiLab's AI platform will provide real-time simulations of the 100-hectare reefscape, aiding in understanding coral resilience and growth under changing conditions. The AI tools are expected to reduce coral assessment times from months to weeks and optimize sensor placement. Why it matters: This partnership sets a new standard for coral restoration by demonstrating a scalable AI-driven model for global conservation efforts.
KAUST researchers have developed deepBlastoid, a deep learning tool for evaluating blastoids, which are models of human embryo development. deepBlastoid evaluates blastoid images 1000 times faster than expert scientists, processing 273 images per second. It was trained on over 2000 microscopic blastoid images and then used to assess the impact of chemicals on blastoid development across more than 10,000 images. Why it matters: This AI tool accelerates research into early pregnancy, fertility complications, and the impact of chemicals on embryo development, with implications for reproductive technologies.
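To put the reported throughput in perspective, the short calculation below works out what 273 images per second implies for a batch of 10,000 images. It is back-of-the-envelope arithmetic only: the batch size is a round figure taken from the "over 10,000 images" in the summary, and the expert rate is inferred by reading the "1000 times faster" claim literally rather than from any measured figure.

```python
IMAGES_PER_SECOND = 273   # reported deepBlastoid throughput
SPEEDUP = 1000            # reported speedup over expert scientists
N_IMAGES = 10_000         # assumed round batch size ("over 10,000 images")

# Time for the tool to process the whole batch, in seconds.
tool_seconds = N_IMAGES / IMAGES_PER_SECOND

# Implied expert rate if the speedup is taken literally (images per second).
expert_rate = IMAGES_PER_SECOND / SPEEDUP

# Implied expert time for the same batch, in hours of continuous work.
expert_hours = N_IMAGES / expert_rate / 3600

print(f"tool:   {tool_seconds:.1f} s")    # about 36.6 s
print(f"expert: {expert_hours:.1f} h")    # about 10.2 h
```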
This paper introduces Absher, a new benchmark for evaluating LLMs' linguistic and cultural competence in Saudi dialects. The benchmark comprises over 18,000 multiple-choice questions spanning six categories, using dialectal words, phrases, and proverbs from various regions of Saudi Arabia. Evaluation of state-of-the-art LLMs reveals performance gaps, especially in cultural inference and contextual understanding, highlighting the need for dialect-aware training.
Researchers developed a two-stage AI pipeline that predicts climate-driven efficiency losses at desalination plants in the UAE, achieving 98% accuracy. The first stage forecasts aerosol optical depth (AOD); the second uses that forecast, together with meteorological data, to predict desalination efficiency. The researchers also developed dust-aware control logic to optimize plant operations and built an interactive dashboard for decision support.
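The two-stage structure can be sketched as a simple composition of functions. Everything below is a toy illustration under stated assumptions: the function names, the persistence forecast, the linear efficiency formula, and the threshold are all hypothetical placeholders, not the models or control rules from the study.

```python
def two_stage_predict(aod_history, weather, forecast_aod, predict_efficiency):
    """Stage 1 forecasts AOD; stage 2 maps that forecast plus weather to efficiency."""
    aod_next = forecast_aod(aod_history)
    return aod_next, predict_efficiency(aod_next, weather)


def dust_aware_mode(aod_forecast, threshold):
    """Pick an operating mode from the forecast (hypothetical control rule)."""
    return "protective" if aod_forecast >= threshold else "normal"


# Placeholder stage-1 model: assume tomorrow's AOD equals today's.
def persistence_forecast(history):
    return history[-1]


# Placeholder stage-2 model: made-up linear penalty, ignores weather entirely.
def toy_efficiency(aod, weather):
    return max(0.0, 1.0 - 0.2 * aod)


aod, efficiency = two_stage_predict(
    aod_history=[0.3, 0.5],
    weather={},  # meteorological inputs would go here
    forecast_aod=persistence_forecast,
    predict_efficiency=toy_efficiency,
)
mode = dust_aware_mode(aod, threshold=0.4)  # threshold is an arbitrary example value
```

Passing the two models in as arguments keeps the stages independent, so either one could be swapped for a trained forecaster or regressor without touching the control logic.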
Saudi Arabia's Ministry of Education will introduce artificial intelligence (AI) education across all grade levels starting this academic year. The initiative aims to equip students with essential AI skills and knowledge to prepare them for future job markets. The curriculum will cover fundamental AI concepts, programming, and ethics. Why it matters: This nationwide initiative signals a major push to cultivate a domestic AI talent pool and position Saudi Arabia as a regional leader in AI innovation.
MBZUAI researchers have introduced MIRA, a novel framework for improving the factual accuracy of multimodal large language models in medical applications. MIRA uses calibrated retrieval to manage factual risk and integrates image embeddings with a medical knowledge base for efficient reasoning. Evaluated on medical VQA and report generation benchmarks, MIRA achieves state-of-the-art results, with code available on GitHub.
The paper introduces ScoreAdv, a novel diffusion-model-based approach for generating natural, unrestricted adversarial examples (UAEs). It uses an adversarial guidance mechanism to shift the sampling distribution and saliency maps to inject visual information. Experiments on the ImageNet and CelebA datasets demonstrate state-of-the-art attack success rates, image quality, and robustness against defenses.
The authors introduce Nile-Chat, a collection of LLMs (4B, 3x4B-A6B, and 12B) built specifically for the Egyptian dialect and capable of understanding and generating text in both Arabic and Latin scripts. A novel language adaptation approach uses the Branch-Train-MiX strategy to merge script-specialized experts into a single MoE model. Nile-Chat models outperform multilingual and Arabic LLMs such as LLaMa, Jais, and ALLaM on newly introduced Egyptian benchmarks, with the 12B model achieving a 14.4% performance gain over Qwen2.5-14B-Instruct on Latin-script benchmarks. All resources are publicly available. Why it matters: This work addresses the overlooked problem of adapting LLMs to dual-script languages, providing a methodology for creating more inclusive and representative language models in the Arabic-speaking world.