Skip to content
GCC AI Research

Weekly Digest

Sep 1 – Sep 7, 2025

Top Stories

In recognition of Sheikh Khalifa’s contribution to advancing science and technology, UAE President endorses launch of K2 Think, world’s most advanced open-source reasoning model - wam.ae

WAM · · LLM Research

The UAE President has endorsed the launch of K2 Think, which is described as the world’s most advanced open-source reasoning model. This launch recognizes Sheikh Khalifa’s contributions to advancing science and technology within the UAE. The announcement signifies a major national initiative in the field of artificial intelligence development. Why it matters: This positions the UAE at the forefront of open-source AI innovation and advanced reasoning capabilities, potentially setting new benchmarks for global AI development.

President Sheikh Mohamed endorses launch of UAE's world-leading AI platform - The National

WAM · · Policy Infrastructure

The President of the UAE, Sheikh Mohamed bin Zayed Al Nahyan, has endorsed the launch of a new national artificial intelligence platform. This initiative is described as a "world-leading AI platform" aimed at advancing the UAE's technological capabilities. The platform is expected to integrate AI across various sectors, fostering innovation and enhancing the country's global standing in AI development. Why it matters: This high-level endorsement signals a strong national commitment to accelerating AI adoption and innovation, positioning the UAE as a major player in the global AI landscape and attracting further investment and talent to the region.

SPECS: Specificity-Enhanced CLIP-Score for Long Image Caption Evaluation

arXiv · · CV NLP

Researchers from MBZUAI have introduced SPECS, a new reference-free evaluation metric for long image captions that modifies CLIP to emphasize specificity. SPECS aims to improve the correlation with human judgment while maintaining computational efficiency compared to LLM-based metrics. The proposed approach is intended for iterative use during image captioning model development, offering a practical alternative to existing methods.

AraHalluEval: A Fine-grained Hallucination Evaluation Framework for Arabic LLMs

arXiv · · NLP LLM

The paper introduces AraHalluEval, a new framework for evaluating hallucinations in Arabic and multilingual large language models (LLMs). The framework uses 12 fine-grained hallucination indicators across generative question answering and summarization tasks, evaluating 12 LLMs including Arabic-specific, multilingual, and reasoning-based models. Results show factual hallucinations are more common than faithfulness errors, with the Arabic model Allam showing lower hallucination rates. Why it matters: This work addresses a critical gap in Arabic NLP by providing a comprehensive tool for assessing and mitigating hallucination in LLMs, which is essential for reliable AI applications in the Arabic-speaking world.

Continuous Saudi Sign Language Recognition: A Vision Transformer Approach

arXiv · · NLP CV

The researchers introduce KAU-CSSL, the first continuous Saudi Sign Language (SSL) dataset focusing on complete sentences. They propose a transformer-based model using ResNet-18 for spatial feature extraction and a Transformer Encoder with Bidirectional LSTM for temporal dependencies. The model achieved 99.02% accuracy in signer-dependent mode and 77.71% in signer-independent mode, advancing communication tools for the SSL community.