February 2025

26 articles

Top Stories

LLM Post-Training: A Deep Dive into Reasoning Large Language Models

arXiv · Feb 28 · NLP LLM

A new survey paper provides a deep dive into post-training methodologies for Large Language Models (LLMs), analyzing their role in refining LLMs beyond pretraining. It addresses key challenges such as catastrophic forgetting, reward hacking, and inference-time trade-offs, and highlights emerging directions in model alignment, scalable adaptation, and inference-time reasoning. The paper also provides a public repository to continually track developments in this fast-evolving field.

Palm: A Culturally Inclusive and Linguistically Diverse Dataset for Arabic LLMs

arXiv · Feb 28 · NLP LLM

A new culturally inclusive and linguistically diverse dataset called Palm for Arabic LLMs is introduced, covering 22 Arab countries and featuring instructions in both Modern Standard Arabic (MSA) and dialectal Arabic (DA) across 20 topics. The dataset was built through a year-long community-driven project involving 44 researchers from across the Arab world. Evaluation of frontier LLMs using the dataset reveals limitations in cultural and dialectal understanding, with some countries being better represented than others.

KAUST scientists link gene to pediatric heart defects

KAUST · Feb 27 · Research Healthcare

KAUST researchers have identified the gene 'CIROZ' as responsible for pediatric heart defects and misplacement of internal organs, working with institutes in Saudi Arabia and worldwide. The research examined samples from 16 patients from 10 families, including four from Saudi Arabia, revealing CIROZ's role in embryonic development symmetry. The findings provide insights into heritable diseases, which are more prevalent in Saudi Arabia. Why it matters: Identifying this gene allows for focused research on preventative strategies and curative therapies for congenital heart defects, particularly relevant in regions with higher rates of such diseases.

Language Models' Factuality Depends on the Language of Inquiry

arXiv · Feb 25 · NLP LLM

Researchers introduce a benchmark to evaluate the factual recall and knowledge transferability of multilingual language models across 13 languages. The study reveals that language models often fail to transfer knowledge between languages, even when they possess the correct information in one language. The benchmark and evaluation framework are released to drive future research in multilingual knowledge transfer.

Time Travel: A Comprehensive Benchmark to Evaluate LMMs on Historical and Cultural Artifacts

arXiv · Feb 20 · Research CV

Researchers introduce TimeTravel, a benchmark dataset for evaluating large multimodal models (LMMs) on historical and cultural artifacts. The benchmark comprises 10,250 expert-verified samples across 266 cultures and 10 historical regions, designed to assess AI in tasks like classification and interpretation of manuscripts, artworks, inscriptions, and archaeological discoveries. The goal is to establish AI as a reliable partner in preserving cultural heritage and assisting researchers.

Commonsense Reasoning in Arab Culture

arXiv · Feb 18 · NLP Arabic AI

A new dataset called ArabCulture is introduced to address the lack of culturally relevant commonsense reasoning resources in Arabic AI. The dataset covers 13 countries across the Gulf, Levant, North Africa, and the Nile Valley, spanning 12 daily life domains with 54 fine-grained subtopics. It was built from scratch by native speakers writing and validating culturally relevant questions. Why it matters: The dataset highlights the need for more culturally aware models and benchmarks tailored to the Arabic-speaking world, moving beyond machine-translated resources.

MultiProSE: A Multi-label Arabic Dataset for Propaganda, Sentiment, and Emotion Detection

arXiv · Feb 12 · NLP Arabic AI

The paper introduces MultiProSE, the first multi-label Arabic dataset for propaganda, sentiment, and emotion detection. It extends the existing ArPro dataset with sentiment and emotion annotations, resulting in 8,000 annotated news articles. Baseline models, including GPT-4o-mini and BERT-based models, were developed for each task, and the dataset, guidelines, and code are publicly available. Why it matters: This resource enables further research into Arabic language models and a better understanding of opinion dynamics within Arabic news media.

Under the patronage of His Excellency Qais bin Mohammed Al Yousef Minister of Commerce, Industry and Investment Promotion Microsoft AI Tour showcases groundbreaking AI innovations driving transformation and growth across Oman – Middle East & Afric - Microsoft Source

Oman AI · Feb 20 · Policy Product

February 2025

Top Stories

LLM Post-Training: A Deep Dive into Reasoning Large Language Models

Palm: A Culturally Inclusive and Linguistically Diverse Dataset for Arabic LLMs

KAUST scientists link gene to pediatric heart defects

Language Models' Factuality Depends on the Language of Inquiry

Time Travel: A Comprehensive Benchmark to Evaluate LMMs on Historical and Cultural Artifacts

Commonsense Reasoning in Arab Culture

MultiProSE: A Multi-label Arabic Dataset for Propaganda, Sentiment, and Emotion Detection

More This Month

H.R.H. Princess Reema celebrates 2025 KGSP convocation

Utilizing Social Media Analytics to Detect Trends in Saudi Arabias Evolving Market

Microsoft AI Tour showcases groundbreaking AI innovations driving transformation and growth across Oman - Times of Oman

Under the patronage of His Excellency Qais bin Mohammed Al Yousef Minister of Commerce, Industry and Investment Promotion Microsoft AI Tour showcases groundbreaking AI innovations driving transformation and growth across Oman – Middle East & Afric - Microsoft Source

KFAS launches ‘TechEdge’ to empower youth - Kuwait Times

New KAUST program in bioinformatics and AI

Omantel Partners with Shaffra to Introduce AI and Metaverse Solutions in Oman - Biz Today

UAE President issues resolution reconstituting Artificial Intelligence and Advanced Technology Council - وكالة وام

TAQADAM announces its latest startup cohort to receive $1 million

Celebrating 15 Years of Women and Girls in Science at KAUST

Language Shift or Maintenance? An Intergenerational Study of the Tibetan Community in Saudi Arabia

KAUST welcomes 2025 Ibn Rushd Postdoctoral Fellows, strengthening Saudi research talent

Powering KSA’s future: Alumna highlights KAUST’s role in developing Saudi energy labor force

Technology Innovation and Entrepreneurship students stand out at international competition

KAUST showcases new tools for sustainable development in Saudi Arabia

KAUST collaboration to benefit life sciences and nanotechnology research

Microsoft AI tour showcases AI innovations in Oman - Muscat Daily

Plant Science Family Night 2025

UAE strengthens its global leadership in Artificial Intelligence - وكالة وام