The UAE Government has launched a new training program specifically designed for Chief AI Officers across its public sector. This initiative aims to enhance future technological leadership by equipping government leaders with essential skills in AI integration and strategy. The program is part of a broader effort to leverage artificial intelligence for improved government operations and service delivery. Why it matters: This program signifies the UAE's strategic investment in human capital for AI, reinforcing its national AI strategy and ambition to be a global leader in AI governance and application.
MBZUAI introduces Agent-X, a benchmark for evaluating multi-step reasoning in vision-centric agents across real-world, multimodal settings. Agent-X includes 828 tasks with diverse visual contexts and spans six environments, requiring tool use and stepwise decision-making. Experiments show that current LLMs struggle with multi-step vision tasks, achieving less than 50% success, highlighting areas for improvement in LMM reasoning and tool use.
MBZUAI researchers introduce SocialMaze, a new benchmark for evaluating social reasoning capabilities in large language models (LLMs). SocialMaze includes six diverse tasks across social reasoning games, daily-life interactions, and digital community platforms, emphasizing deep reasoning, dynamic interaction, and information uncertainty. Experiments show that LLMs vary in handling dynamic interactions, degrade under uncertainty, but can be improved via fine-tuning on curated reasoning examples.
This paper analyzes Arabic text generated by LLMs like ALLaM, Jais, Llama, and GPT-4 across academic and social media domains using stylometric analysis. The study found detectable linguistic patterns that differentiate human-written from machine-generated Arabic text. BERT-based detection models achieved up to 99.9% F1-score in formal contexts, though cross-domain generalization remains a challenge. Why it matters: The research lays groundwork for detecting AI-generated misinformation in Arabic, a crucial step for preserving information integrity in Arabic-language contexts.
KAUST researchers have developed a hybrid cooling technology combining nanotech plastic and biodegradable mulch that significantly enhances crop yields in arid regions. The technology lowers greenhouse temperatures by 25 degrees Celsius and doubles crop yields in tests with Chinese cabbage. The nanotech plastic coating absorbs infrared light, while the biodegradable mulch reflects sunlight to keep the soil cooler. Why it matters: This innovation promises to improve food security in arid regions like Saudi Arabia while reducing energy consumption and plastic waste associated with traditional greenhouse cooling methods.
Researchers from KAUST and KACST have developed a quantum random number generator (QRNG) that is almost 1000 times faster than existing QRNGs. The device utilizes micro-LEDs and advanced post-processing algorithms and has passed randomness tests by the National Institute of Standards and Technology. The QRNG's portability and high generation rate will benefit industries such as health, finance, and defense. Why it matters: This advancement significantly strengthens data security capabilities in Saudi Arabia, aligning with Vision 2030 goals for technological leadership and innovation.