Skip to content
GCC AI Research

Weekly Digest

Sep 1 – Sep 7, 2025

Top Stories

In recognition of Sheikh Khalifa’s contribution to advancing science and technology, UAE President endorses launch of K2 Think, world’s most advanced open-source reasoning model - wam.ae

WAM · · LLM Research

The UAE President has endorsed the launch of K2 Think, which is described as the world’s most advanced open-source reasoning model. This launch recognizes Sheikh Khalifa’s contributions to advancing science and technology within the UAE. The announcement signifies a major national initiative in the field of artificial intelligence development. Why it matters: This positions the UAE at the forefront of open-source AI innovation and advanced reasoning capabilities, potentially setting new benchmarks for global AI development.

SPECS: Specificity-Enhanced CLIP-Score for Long Image Caption Evaluation

arXiv · · CV NLP

Researchers from MBZUAI have introduced SPECS, a new reference-free evaluation metric for long image captions that modifies CLIP to emphasize specificity. SPECS aims to improve the correlation with human judgment while maintaining computational efficiency compared to LLM-based metrics. The proposed approach is intended for iterative use during image captioning model development, offering a practical alternative to existing methods.

AraHalluEval: A Fine-grained Hallucination Evaluation Framework for Arabic LLMs

arXiv · · NLP LLM

The paper introduces AraHalluEval, a new framework for evaluating hallucinations in Arabic and multilingual large language models (LLMs). The framework uses 12 fine-grained hallucination indicators across generative question answering and summarization tasks, evaluating 12 LLMs including Arabic-specific, multilingual, and reasoning-based models. Results show factual hallucinations are more common than faithfulness errors, with the Arabic model Allam showing lower hallucination rates. Why it matters: This work addresses a critical gap in Arabic NLP by providing a comprehensive tool for assessing and mitigating hallucination in LLMs, which is essential for reliable AI applications in the Arabic-speaking world.

Continuous Saudi Sign Language Recognition: A Vision Transformer Approach

arXiv · · NLP CV

The researchers introduce KAU-CSSL, the first continuous Saudi Sign Language (SSL) dataset focusing on complete sentences. They propose a transformer-based model using ResNet-18 for spatial feature extraction and a Transformer Encoder with Bidirectional LSTM for temporal dependencies. The model achieved 99.02% accuracy in signer-dependent mode and 77.71% in signer-independent mode, advancing communication tools for the SSL community.