Humain, a company backed by Saudi Arabia's Public Investment Fund (PIF), has awarded an AI data center project to MIS. This project signifies a strategic investment in developing critical infrastructure to support advanced artificial intelligence capabilities within the Kingdom. The collaboration aims to enhance Saudi Arabia's capacity for processing and storing data essential for AI development and deployment. Why it matters: This development is a key step in Saudi Arabia's broader strategy to become a leading hub for AI technology and digital transformation in the Middle East.
Saudi Arabia has reportedly entered the global top five for AI growth, marking a significant advancement in its artificial intelligence sector. This achievement reflects the nation's strategic investments and initiatives aimed at developing its AI capabilities and ecosystem. The ranking underscores the Kingdom's progress in fostering innovation and technological development within the AI domain. Why it matters: This positions Saudi Arabia as a leading player in the global AI landscape, potentially attracting further investment and talent to the Middle East region.
Saudi Arabia is making massive investments in advanced technologies, including artificial intelligence and nuclear power, as part of its Vision 2030 plan. The Kingdom has pledged nearly a trillion dollars to these initiatives. These investments aim to diversify the Saudi economy and reduce its reliance on oil. Why it matters: This commitment signals Saudi Arabia's serious intent to become a major player in the global AI landscape and a hub for technological innovation.
The paper introduces Ara-HOPE, a human-centric post-editing evaluation framework for Dialectal Arabic to Modern Standard Arabic (DA-MSA) translation. Ara-HOPE includes a five-category error taxonomy and a decision-tree annotation protocol designed to address the challenges of dialect-specific MT errors. Evaluation of Jais, GPT-3.5, and NLLB-200 shows dialect-specific terminology and semantic preservation remain key challenges. Why it matters: The new framework and public dataset will help improve the evaluation and development of dialect-aware MT systems for Arabic.
This paper details the autonomous drone racing system developed for the Abu Dhabi Autonomous Racing League (A2RL) x Drone Champions League competition. The system uses drift-corrected monocular Visual-Inertial Odometry (VIO) fused with YOLO-based gate detection for global position measurements, managed via Kalman filter. A perception-aware planner generates trajectories balancing speed and gate visibility. Why it matters: The system's podium finishes validate the effectiveness of monocular vision-based autonomous drone flight and showcases advancements in AI-powered robotics within the UAE.
KAUST and SARsatX have developed a method using Generative Adversarial Networks (GANs) to generate synthetic SAR imagery for training deep learning models to detect oil spills. Starting with just 17 real SAR images, they generated over 2,000 synthetic images to train a Multi-Attention Network (MANet) model. The MANet model, trained exclusively on synthetic data, achieved 75% accuracy in identifying oil spill areas, matching the performance of models trained on larger real datasets. Why it matters: This advancement enables faster and more reliable environmental monitoring using AI, even when real-world data is scarce, reducing the need to wait for actual disasters to occur.
The paper introduces the Prism Hypothesis, which posits a correspondence between an encoder's feature spectrum and its functional role, with semantic encoders capturing low-frequency components and pixel encoders retaining high-frequency information. Based on this, the authors propose Unified Autoencoding (UAE), a model that harmonizes semantic structure and pixel details using a frequency-band modulator. Experiments on ImageNet and MS-COCO demonstrate that UAE effectively unifies semantic abstraction and pixel-level fidelity, achieving state-of-the-art performance.
The paper introduces AraToken, an Arabic-optimized tokenizer based on the SentencePiece Unigram algorithm that incorporates a normalization pipeline to handle Arabic-specific orthographic variations. Experiments show that AraToken achieves 18% lower fertility compared to unnormalized baselines. The Language Extension Pipeline (LEP) is introduced to integrate AraToken into Qwen3-0.6B, reducing evaluation loss from 8.28 to 2.43 within 800 training steps on 100K Arabic samples. Why it matters: This research provides an efficient tokenizer tailored for Arabic, improving performance of LLMs on Arabic text and benefiting Arabic NLP research by providing released resources.
A new benchmark, LongShOTBench, is introduced for evaluating multimodal reasoning and tool use in long videos, featuring open-ended questions and diagnostic rubrics. The benchmark addresses the limitations of existing datasets by combining temporal length and multimodal richness, using human-validated samples. LongShOTAgent, an agentic system, is also presented for analyzing long videos, with both the benchmark and agent demonstrating the challenges faced by state-of-the-art MLLMs.
KAUST and the International Maize and Wheat Improvement Center (CIMMYT) have signed an MoU to collaborate on developing climate-resilient food crops. The collaboration will combine CIMMYT’s expertise in maize and wheat breeding with KAUST’s strengths in genomics and computational agriculture. The partnership will focus on genomic selection, data analytics, and digital breeding technologies, including capacity-building programs. Why it matters: The partnership aims to enhance food security in Saudi Arabia and the wider region by developing resilient, high-yielding crop varieties suited to arid environments.
The paper introduces OmniGen, a unified framework for generating aligned multimodal sensor data for autonomous driving using a shared Bird's Eye View (BEV) space. It uses a novel generalizable multimodal reconstruction method (UAE) to jointly decode LiDAR and multi-view camera data through volume rendering. The framework incorporates a Diffusion Transformer (DiT) with a ControlNet branch to enable controllable multimodal sensor generation, demonstrating good performance and multimodal consistency.
A new study compares vision-language models (VLMs) to YOLOv8 for wastewater treatment plant (WWTP) identification in satellite imagery across the MENA region. VLMs like Gemma-3 demonstrate superior zero-shot performance compared to YOLOv8, trained on a dataset of 83,566 satellite images from Egypt, Saudi Arabia, and UAE. The research suggests VLMs offer a scalable, annotation-free alternative for remote sensing of WWTPs.
The UK-Qatar Joint AI Research Commission, involving institutions such as Queen Mary University of London, has published its final report. This document consolidates findings and recommendations from collaborative research efforts between the United Kingdom and Qatar focused on artificial intelligence. The report signifies the conclusion of a structured initiative aimed at advancing AI knowledge and fostering scientific cooperation between the two nations. Why it matters: This collaboration strengthens international research ties and provides strategic insights that could influence future AI development and policy in Qatar and the wider Middle East.
KAUST researchers discovered that the red algae strain Galdieria yellowstonesis can convert sugars from chocolate-processing waste into C-phycocyanin, a valuable blue pigment. The study found that high levels of carbon dioxide promote Galdieria growth, and the resulting phycocyanin was deemed food-safe by the U.S. FDA. Mars supported the research by providing chocolate samples. Why it matters: This research offers a sustainable method for waste management and contributes to a circular economy in the region, with potential applications in food, cosmetics, and pharmaceuticals.