Phase 1 of a new 5-gigawatt (GW) AI campus has officially launched in Abu Dhabi, signaling a major advancement in the region's artificial intelligence infrastructure. This development underscores strengthening ties between the UAE and the United States in the technology sector. The campus is poised to become a significant hub for large-scale AI operations and development. Why it matters: This launch represents a substantial investment in cutting-edge AI infrastructure, positioning Abu Dhabi as a global leader in AI capabilities and fostering critical international technological collaboration.
Phase 1 of a new '5GW AI campus' has been launched in Abu Dhabi, signaling a major infrastructure development in the region. This initiative highlights strengthening ties between the UAE and the US in advanced technology. The campus is poised to significantly enhance the UAE's capabilities in large-scale artificial intelligence operations. Why it matters: This project underscores the UAE's strategic commitment to building extensive AI infrastructure and fostering international collaborations to establish itself as a prominent global AI hub.
Saudi Arabia is collaborating with NVIDIA to develop and build 'AI factories' within the Kingdom. These 'AI factories' will accelerate the development and deployment of generative AI and other advanced AI applications, providing powerful computing infrastructure. The initiative aims to support Saudi Arabia's vision of becoming a global leader in AI development, enabling what NVIDIA terms the 'Age of Reasoning.' Why it matters: This major strategic partnership signifies Saudi Arabia's significant investment in advanced AI infrastructure, positioning the Kingdom as a key player in the global AI landscape and fostering domestic AI innovation.
The UAE is implementing a nationwide program to introduce AI education to schoolchildren, demonstrating a significant national commitment to technological advancement. This initiative aims to cultivate a generation proficient in artificial intelligence from a young age. The move is part of the UAE's broader strategy to position itself at the forefront of the global AI sector. Why it matters: This comprehensive approach to AI education is crucial for building a future-ready workforce and solidifying the UAE's ambition to become a leading hub for AI innovation and development in the long term.
MBZUAI introduces Agent-X, a benchmark for evaluating multi-step reasoning in vision-centric agents across real-world, multimodal settings. Agent-X includes 828 tasks with diverse visual contexts and spans six environments, requiring tool use and stepwise decision-making. Experiments show that current LLMs struggle with multi-step vision tasks, achieving less than 50% success, highlighting areas for improvement in LMM reasoning and tool use.
This paper analyzes Arabic text generated by LLMs like ALLaM, Jais, Llama, and GPT-4 across academic and social media domains using stylometric analysis. The study found detectable linguistic patterns that differentiate human-written from machine-generated Arabic text. BERT-based detection models achieved up to 99.9% F1-score in formal contexts, though cross-domain generalization remains a challenge. Why it matters: The research lays groundwork for detecting AI-generated misinformation in Arabic, a crucial step for preserving information integrity in Arabic-language contexts.
MBZUAI researchers introduce SocialMaze, a new benchmark for evaluating social reasoning capabilities in large language models (LLMs). SocialMaze includes six diverse tasks across social reasoning games, daily-life interactions, and digital community platforms, emphasizing deep reasoning, dynamic interaction, and information uncertainty. Experiments show that LLMs vary in handling dynamic interactions, degrade under uncertainty, but can be improved via fine-tuning on curated reasoning examples.
KAUST researchers have developed a hybrid cooling technology combining nanotech plastic and biodegradable mulch that significantly enhances crop yields in arid regions. The technology lowers greenhouse temperatures by 25 degrees Celsius and doubles crop yields in tests with Chinese cabbage. The nanotech plastic coating absorbs infrared light, while the biodegradable mulch reflects sunlight to keep the soil cooler. Why it matters: This innovation promises to improve food security in arid regions like Saudi Arabia while reducing energy consumption and plastic waste associated with traditional greenhouse cooling methods.
Researchers from KAUST and KACST have developed a quantum random number generator (QRNG) that is almost 1000 times faster than existing QRNGs. The device utilizes micro-LEDs and advanced post-processing algorithms and has passed randomness tests by the National Institute of Standards and Technology. The QRNG's portability and high generation rate will benefit industries such as health, finance, and defense. Why it matters: This advancement significantly strengthens data security capabilities in Saudi Arabia, aligning with Vision 2030 goals for technological leadership and innovation.
The Qatar Computing Research Institute (QCRI) has released SpokenNativQA, a multilingual spoken question-answering dataset for evaluating LLMs in conversational settings. The dataset contains 33,000 naturally spoken questions and answers across multiple languages, including low-resource and dialect-rich languages. It aims to address the limitations of text-based QA datasets by incorporating speech variability, accents, and linguistic diversity. Why it matters: This benchmark enables more robust evaluation of LLMs in speech-based interactions, particularly for Arabic dialects and other low-resource languages.
MBZUAI researchers release 'Fann or Flop', a new benchmark for evaluating Arabic poetry understanding in LLMs. The benchmark covers 12 historical eras and 14 poetic genres, assessing semantic understanding, metaphor interpretation, and cultural context. Evaluation of state-of-the-art LLMs reveals challenges in poetic understanding despite strong performance on standard Arabic benchmarks.
MBZUAI researchers introduce ARB, the first comprehensive benchmark for evaluating step-by-step multimodal reasoning in Arabic across textual and visual modalities. The benchmark spans 11 diverse domains and includes 1,356 multimodal samples with 5,119 human-curated reasoning steps. Evaluations of 12 state-of-the-art LMMs revealed challenges in coherence, faithfulness, and cultural grounding, highlighting the need for culturally aware AI systems.
Researchers from MBZUAI have introduced UrduFactCheck, a new framework for fact-checking in Urdu, along with two datasets: UrduFactBench and UrduFactQA. The framework uses monolingual and translation-based evidence retrieval to address the lack of Urdu resources. Evaluations using twelve LLMs showed that translation-augmented methods improve performance, highlighting challenges for open-source LLMs in Urdu.
MBZUAI researchers introduce FAID, a fine-grained AI-generated text detection framework capable of classifying text as human-written, LLM-generated, or collaboratively written. FAID utilizes multi-level contrastive learning and multi-task auxiliary classification to capture authorship and model-specific characteristics, and can identify the underlying LLM family. The framework outperforms existing baselines, especially in generalizing to unseen domains and new LLMs, and includes a multilingual, multi-domain dataset called FAIDSet.
MBZUAI researchers introduce LLM-BabyBench, a benchmark suite for evaluating grounded planning and reasoning in LLMs. The suite, built on a textual adaptation of the BabyAI grid world, assesses LLMs on predicting action consequences, generating action sequences, and decomposing instructions. Datasets, evaluation harness, and metrics are publicly available to facilitate reproducible assessment.
Core42 and PresightAI, subsidiaries of Abu Dhabi-based G42, have partnered with US firms to launch the first phase of a 5GW AI campus in Abu Dhabi. The initial phase involves a 400MW data center, with the broader campus aiming to support advanced AI applications. The project aligns with the UAE's strategy to strengthen its AI infrastructure and international tech partnerships. Why it matters: This initiative underscores the UAE's commitment to becoming a leading global hub for AI research and development, attracting foreign investment and expertise.
Saudi health innovators are increasing investment in smart health solutions, using KAUST's infrastructure and expertise. The King Salman Center for Disability Research (KSCDR) and KCSH are partnering on AI-based methods to identify genetic causes of rare eye diseases. KAUST and the Saudi Food and Drug Administration (SFDA) are expanding their agreement to enhance cooperation on AI and digital innovation. Why it matters: These partnerships signal a concerted effort to leverage AI for addressing critical healthcare challenges and advancing the Kingdom's health priorities.
This paper introduces DaringFed, a novel dynamic Bayesian persuasion pricing mechanism for online federated learning (OFL) that addresses the challenge of two-sided incomplete information (TII) regarding resources. It formulates the interaction between the server and clients as a dynamic signaling and pricing allocation problem within a Bayesian persuasion game, demonstrating the existence of a unique Bayesian persuasion Nash equilibrium. Evaluations on real and synthetic datasets demonstrate that DaringFed optimizes accuracy and convergence speed and improves the server's utility.
This paper introduces a method for quantifying the transferability of architectural components in Single Image Super-Resolution (SISR) models, termed "Universality," and proposes a Universality Assessment Equation (UAE). Guided by the UAE, the authors design optimized modules, Cycle Residual Block (CRB) and Depth-Wise Cycle Residual Block (DCRB), and demonstrate their effectiveness across various datasets and low-level tasks. Results show that networks using these modules outperform state-of-the-art methods, achieving improved PSNR or parameter reduction.