Topics ›

LLM

Large language models (LLMs) developed, deployed, and researched across the GCC — including Falcon (TII), Jais (MBZUAI/Inception), AceGPT (KAUST), ALLaM (SDAIA), and Fanar (QCRI).

50 articles RSS ↗

UAE enters new phase of government development built on deploying Agentic AI, says Al Gergawi - Economy Middle East

The National · Jun 19 · Policy Infrastructure

The UAE government has announced a new phase of development centered on deploying Agentic AI to enhance government services and efficiency. This strategic direction was articulated by Mohammad Al Gergawi at the World Governments Summit 2024. The initiative aims to leverage advanced AI capabilities across various public administration functions. Why it matters: This signifies a major policy commitment from a leading regional AI hub, signaling a practical and strategic move towards large-scale AI integration in public sector operations.

UAE targets 50 per cent of federal operations on Agentic AI within two years - Gulf Today

The National · Jun 19 · Policy Infrastructure

The UAE government has set an ambitious target to integrate Agentic AI into 50% of its federal operations within the next two years. This initiative aims to enhance efficiency, automate complex tasks, and improve decision-making across various government services. The move signals the nation's strong commitment to adopting advanced artificial intelligence technologies in the public sector. Why it matters: This aggressive timeline signifies the UAE's proactive strategy to leverage cutting-edge AI for public sector transformation, potentially setting a global benchmark for government AI integration.

UAE targets 50% adoption of agentic AI across government by 2028 - Sharjah24

The National · Jun 19 · Policy Infrastructure

The UAE government has announced an ambitious target to achieve 50% adoption of agentic AI across its various governmental operations. This strategic goal is set to be realized by the year 2028, aiming to significantly enhance public sector efficiency and service delivery. The initiative underscores the UAE's commitment to leveraging advanced artificial intelligence to drive digital transformation. Why it matters: This represents a major national policy initiative to integrate cutting-edge AI into public administration, positioning the UAE as a leader in AI governance and application.

Abu Dhabi's Core42 more than triples US data centre capacity to 60MW - The National

The National · Jun 2 · Infrastructure Product

Core42, an Abu Dhabi-based technology company and part of G42, has significantly expanded its data center capacity in the United States. The company more than tripled its US data center footprint, reaching a total capacity of 60 megawatts (MW). This expansion is aimed at meeting the increasing demand for high-performance computing essential for complex artificial intelligence workloads. Why it matters: This strategic infrastructure investment reinforces Core42's global capabilities, enabling greater scale for AI development and deployment, which is critical for supporting advanced AI models and services.

UAE Government partners with MBZUAI to train 80,000 federal staff in Agentic AI - EdTech Innovation Hub

The National · May 26 · Policy Partnership

The UAE Government has partnered with Mohamed bin Zayed University of Artificial Intelligence (MBZUAI) to launch a training program for 80,000 federal staff. This initiative focuses on upskilling government employees in Agentic AI technologies. The collaboration aims to enhance national AI capabilities and accelerate digital transformation across federal entities. Why it matters: This major government-led program represents a significant investment in human capital, strategically positioning the UAE to leverage advanced AI for public sector efficiency and innovation.

Sheikh Mohammed reviews UAE Agentic AI transformation plans at government retreat - Gulf News

Gulf News News · May 20 · Policy Infrastructure

Sheikh Mohammed bin Rashid Al Maktoum, UAE Vice President and Prime Minister, reviewed the nation's Agentic AI transformation plans during a government retreat. This strategic review aims to integrate advanced AI agent technologies across various sectors in the United Arab Emirates. The discussions focused on how these plans would accelerate the UAE's development and adoption of cutting-edge AI capabilities. Why it matters: This signifies a high-level commitment from the UAE leadership to a proactive national AI strategy, particularly in advanced areas like Agentic AI, reinforcing the country's ambition to be a global leader in AI innovation and implementation.

UAE to train 80,000 federal employees in Agentic AI - Utilities Middle East

The National · May 19 · Policy Infrastructure

The UAE government plans a major initiative to train 80,000 federal employees in Agentic AI. This program aims to significantly enhance the capabilities of the public sector workforce in advanced artificial intelligence technologies. The initiative underscores the UAE's strategic commitment to integrating cutting-edge AI into its operational framework. Why it matters: This large-scale training effort represents a national investment in AI human capital, solidifying the UAE's position as a leader in government AI adoption and implementation within the Middle East.

'UAE Government 4.0': Cabinet approves project to train 80,000 employees in Agentic AI - Khaleej Times

The National · May 18 · Policy Infrastructure

The UAE Cabinet has approved a new initiative under its 'Government 4.0' framework. This project aims to train 80,000 government employees in Agentic AI technologies. The goal is to enhance operational efficiency, improve decision-making processes, and elevate the quality of government services. Why it matters: This initiative underscores the UAE's proactive strategy to integrate advanced AI into public administration, positioning it as a leader in AI-driven governance and digital transformation in the region.

UAE to train 80,000 government workers in Agentic AI under high-tech drive - The National

The National · May 18 · Policy Infrastructure

The UAE government has announced a comprehensive program aimed at training 80,000 government workers in Agentic AI. This initiative is a core component of the nation's broader high-tech drive to enhance digital capabilities and foster innovation within the public sector. The program seeks to equip a significant portion of the workforce with advanced AI skills to transform government operations and service delivery. Why it matters: This program represents a major strategic investment by the UAE in human capital development, signifying a strong commitment to accelerate AI adoption and digital transformation across its public services.

New gen AI guide: UAE aims to accelerate AI adoption across govt, business - Gulf Business

The National · May 11 · Policy Infrastructure

The UAE has launched a new guide focusing on generative AI, aimed at accelerating the adoption of these advanced technologies across its government and business sectors. This initiative seeks to integrate AI tools and frameworks to enhance operational efficiencies and foster innovation. The guide likely provides strategic direction and best practices for the responsible and effective deployment of generative AI solutions nationwide. Why it matters: This national policy underscores the UAE's proactive strategy to solidify its position as a global leader in AI adoption and leverage cutting-edge technologies for economic growth and digital transformation.

New gen AI guide: UAE aims to accelerate AI adoption across govt, business - Gulf Business

The National · May 11 · Policy LLM

The UAE has reportedly released a new guide focused on generative AI, aiming to accelerate its adoption across both government entities and private businesses. This initiative outlines strategies and frameworks for integrating advanced AI technologies into various sectors. The guide is designed to foster innovation and enhance efficiency throughout the UAE's economy. Why it matters: This development underscores the UAE's strategic commitment to establishing itself as a global leader in AI development and deployment, driving digital transformation and economic competitiveness.

UAE receives first shipment of Nvidia's advanced AI chips - The National

The National · May 8 · Infrastructure Policy

The UAE has received its initial shipment of advanced AI chips from Nvidia, marking a significant milestone in its national AI strategy. These chips are essential for powering the country's growing supercomputing capabilities and accelerating the development of large language models. This delivery underscores the UAE's commitment to establishing itself as a global leader in AI innovation. Why it matters: This acquisition directly enhances the UAE's capacity for advanced AI research and development, solidifying its competitive position in the global AI landscape and fostering local technological growth.

UAE’s G42 unit launches sovereign enterprise AI assistant - Gulf Business

The National · May 5 · Product LLM

G42, a prominent UAE-based AI technology holding company, has launched a new sovereign enterprise AI assistant. This product is designed to offer secure, localized AI capabilities for businesses and government entities within the region. It aims to prioritize data privacy and cater to the specific regional context, expanding G42's offerings in specialized enterprise solutions. Why it matters: This launch underscores the UAE's strategic drive to develop secure, locally-controlled AI solutions for critical sectors, thereby bolstering its digital sovereignty and reducing reliance on external AI infrastructure.

UAE’s G42 unit launches sovereign enterprise AI assistant - Gulf Business

The National · May 5 · Product LLM

A unit of UAE's G42 has launched a new 'sovereign enterprise AI assistant' aimed at supporting businesses with secure, localized artificial intelligence solutions. This product is designed to offer robust AI capabilities while maintaining data sovereignty and control for enterprises. The launch signifies G42's continued expansion into the enterprise AI market within the UAE and the broader region. Why it matters: This initiative reinforces the UAE's commitment to developing secure, in-country AI capabilities, enhancing operational efficiency for regional enterprises, and fostering data sovereignty.

Agentic AI to run 50% of UAE govt services: What this means for residents - Khaleej Times

Khaleej Times News · Apr 24 · Policy Product

The UAE government has announced an ambitious target to integrate Agentic AI into 50% of its federal government services within the next year. This initiative aims to enhance efficiency, personalize citizen interactions, and streamline government operations across various sectors. It involves deploying autonomous AI agents capable of handling tasks, responding to inquiries, and automating service delivery. Why it matters: This ambitious target positions the UAE as a global leader in AI governance and public service automation, potentially setting a precedent for other nations.

Abu Dhabi’s Technology Innovation Institute and AI71 Honored with UAE AI Award for Emirati AI Solutions

TII · Mar 17 · Policy LLM

Technology Innovation Institute (TII) won the UAE AI Award for Emirati AI Solutions for its Falcon LLM series. AI71 also won for LAW71, an AI-powered legal solution, and RAZI71, an AI-powered healthcare solution. The award recognizes AI innovations made in the UAE that demonstrate innovation, AI ethics compliance, maturity, and scalability. Why it matters: The award highlights the UAE's commitment to developing local AI talent and solutions, particularly in open-source models, for global collaboration and positive transformation.

UAE’s Technology Innovation Institute Launches ‘Falcon Foundation’ to Champion Open-sourcing of Generative AI Models

TII · Mar 17 · LLM Funding

The Technology Innovation Institute (TII) in Abu Dhabi has launched the Falcon Foundation, a non-profit dedicated to advancing open-source generative AI models. TII is committing $300 million to fund open-source AI projects, beginning with its Falcon AI models. The foundation aims to foster collaboration among stakeholders, developers, academia, and industry to promote transparent governance and knowledge exchange in AI. Why it matters: This initiative signals the UAE's commitment to leading in AI development through open-source innovation and collaboration, potentially accelerating AI adoption and customization across various sectors.

Abu Dhabi’s Advanced Technology Research Council launches ‘AI71’: New AI Company Pioneering Decentralised Data Control for Companies & Countries

TII · Mar 17 · AI LLM

Abu Dhabi's Advanced Technology Research Council (ATRC) has launched AI71, a new AI company building on the Falcon generative AI models developed by TII. AI71 will focus on multi-domain specializations, offering AI data control options for companies and countries looking to self-host for greater privacy. The company will be taken to market by ATRC's VentureOne subsidiary, initially targeting the medical, educational, and legal sectors. Why it matters: AI71 aims to establish Abu Dhabi and the UAE as a major AI player by providing decentralized data ownership and promoting broader access to AI technology.

Technology Innovation Institute Introduces World’s Most Powerful Open LLM: Falcon 180B

TII · Mar 17 · LLM Research

Technology Innovation Institute (TII) in the UAE has launched Falcon 180B, an open access large language model with 180 billion parameters trained on 3.5 trillion tokens. Falcon 180B ranks first on the Hugging Face Leaderboard for pretrained LLMs, outperforming Meta's LLaMA 2 and nearing the performance of OpenAI's GPT-4 and Google's PaLM 2. The model is available for research and commercial use under the 'Falcon 180B TII License', based upon Apache 2.0. Why it matters: This release strengthens the UAE's position in AI development and promotes open access to advanced AI technology, fostering innovation and collaboration.

UAE’s Falcon 40B Dominates Leaderboard: Ranks #1 Globally in Latest Hugging Face Independent Verification of Open-source AI Models

TII · Mar 17 · LLM Research

TII's Falcon 40B, a 40-billion-parameter open-source AI model, has ranked #1 on Hugging Face's Open LLM Leaderboard, surpassing models like LLaMA and StableLM. The leaderboard uses benchmarks like AI2 Reasoning Challenge, HellaSwag, MMLU, and TruthfulQA. Trained on one trillion tokens, Falcon 40B's weights are available for research and commercial use. Why it matters: This achievement positions the UAE as a leader in generative AI and promotes transparent, inclusive AI development.

Saudi Arabia’s HUMAIN invests $3 billion in xAI Series E ahead of SpaceX acquisition - Al Arabiya English

Al Arabiya News · Feb 19 · Funding Partnership

Saudi Arabia’s HUMAIN, an investment firm, has invested $3 billion in xAI's Series E funding round. This investment precedes xAI's anticipated acquisition by SpaceX. The funding will support xAI's endeavors in infrastructure development and advanced technologies. Why it matters: This marks a significant commitment from Saudi Arabia towards AI infrastructure, potentially fostering further technological advancements in the region.

Top-ranked Arab university unveils Middle East’s most powerful supercomputer

KAUST · Nov 17 · Infrastructure Research

KAUST has unveiled Shaheen III, the most powerful supercomputer in the Middle East and 18th globally, built by HPE. The system uses 2,800 NVIDIA GH200 Grace Hopper Superchips, tripling the processing power of its predecessor. Shaheen III will support research in Arabic LLMs, climate modeling, remote sensing, automated chemistry, and AI-driven healthcare. Why it matters: This infrastructure investment strengthens Saudi Arabia's position in AI and computational research, enabling advances tailored to the region's needs and priorities.

UAE President meets OpenAI CEO to discuss AI cooperation - Dubai Eye 103.8

WAM News · Sep 28 · Policy Partnership

UAE President Sheikh Mohamed bin Zayed Al Nahyan met with OpenAI CEO Sam Altman to discuss cooperation in advanced technology, particularly AI. The meeting focused on leveraging AI to accelerate development and benefit humanity. This high-level discussion underscores the UAE's strategic commitment to becoming a global leader in AI innovation. Why it matters: This direct engagement between the head of state and a leading AI figure signals the UAE's intent to forge top-tier partnerships and influence the future direction of AI development on a national and global scale.

President Sheikh Mohamed receives OpenAI CEO Sam Altman - The National

WAM News · Sep 27 · Policy Partnership

President Sheikh Mohamed bin Zayed Al Nahyan of the UAE received Sam Altman, the CEO of OpenAI. The high-level meeting likely focused on strategic discussions regarding artificial intelligence development and collaboration. This engagement highlights the UAE's proactive approach to integrating advanced AI technologies into its national agenda. Why it matters: Interactions between national leaders and prominent AI industry figures often signal future policy directions, potential investments, and significant technological partnerships for the region.

In recognition of Sheikh Khalifa’s contribution to advancing science and technology, UAE President endorses launch of K2 Think, world’s most advanced open-source reasoning model - wam.ae

WAM News · Sep 7 · LLM Research

The UAE President has endorsed the launch of K2 Think, which is described as the world’s most advanced open-source reasoning model. This launch recognizes Sheikh Khalifa’s contributions to advancing science and technology within the UAE. The announcement signifies a major national initiative in the field of artificial intelligence development. Why it matters: This positions the UAE at the forefront of open-source AI innovation and advanced reasoning capabilities, potentially setting new benchmarks for global AI development.

Saudi Arabia and NVIDIA to Build AI Factories to Power Next Wave of Intelligence for the Age of Reasoning - NVIDIA Newsroom

SDAIA · May 13 · Infrastructure Partnership

Saudi Arabia is collaborating with NVIDIA to develop and build 'AI factories' within the Kingdom. These 'AI factories' will accelerate the development and deployment of generative AI and other advanced AI applications, providing powerful computing infrastructure. The initiative aims to support Saudi Arabia's vision of becoming a global leader in AI development, enabling what NVIDIA terms the 'Age of Reasoning.' Why it matters: This major strategic partnership signifies Saudi Arabia's significant investment in advanced AI infrastructure, positioning the Kingdom as a key player in the global AI landscape and fostering domestic AI innovation.

UAE launches world's first AI-powered legislative intelligence office - TV BRICS

WAM News · Apr 15 · Policy Infrastructure

The UAE has launched the world's first AI-powered legislative intelligence office, aiming to integrate artificial intelligence into legislative processes. This new office is designed to enhance the efficiency, foresight, and data-driven capabilities of lawmaking within the country. Its establishment marks a significant step towards leveraging advanced technology for public administration. Why it matters: This initiative positions the UAE as a global leader in AI governance and demonstrates a strong national commitment to utilizing AI for improving governmental operations and policy development.

Inception launches InceptionClaw, a sovereign AI super assistant

G42 · May 11 · Product LLM

Inception, a G42 company, has launched InceptionClaw, an enterprise-grade AI super assistant designed for enterprise leaders and government officials. Built on Inception's Catalyst platform and powered by Compass models, InceptionClaw actively manages workloads by monitoring calendars and emails, delivering structured briefs, alerts, and audio summaries. It operates with UAE-native sovereignty, ensuring all data remains within UAE jurisdiction under Greenshield sovereign controls, addressing critical data residency and trust requirements. The assistant also includes features like tamper-proof audit trails, code-reviewed skills, spending limits, and human approval queues for high-stakes actions.

G42 & R/GA Launch Alpha.G42.ai: A World-First Generative Interface, Prototyping the Future of the Web

G42 · Apr 20 · Product LLM

G42, a global leader in artificial intelligence based in Abu Dhabi, partnered with creative innovation company R/GA to launch alpha.G42.ai, a generative interface designed to transform traditional websites into dynamic, conversational systems. This prototype redefines a brand's digital presence by employing an intelligent agent powered by integrated large language models (LLMs) to generate and curate personalized content for each visitor in real-time. The system processes various content types as knowledge, which it then synthesizes to produce dynamic, tailored outputs for users interacting via voice or text, moving beyond static content management. Why it matters: This initiative from a major UAE AI firm pioneers a novel approach to web interfaces, potentially influencing future digital interactions and content delivery globally.

Evaluation of Small Language Models for Arabic Language Processing

arXiv · Jun 19 · NLP LLM

A new paper evaluated twelve Small Language Models (SLMs) on Arabic natural language processing tasks, utilizing a benchmark of 240 Arabic test items across eight domains and ten language skills. The models were assessed in a zero-shot setting, with responses scored using a multi-model LLM-as-a-judge framework involving GPT-4.1 Mini, Claude Haiku 4.5, and DeepSeek-Chat. Gemma 3 (12B) achieved the highest overall score (4.548/5), followed by Aya and C4AI Command Arabic, with results suggesting that strong Arabic alignment and instruction-following are crucial for performance. Why it matters: This benchmark offers a standardized method for evaluating compact Arabic language models, guiding future development towards more efficient, reliable, and culturally relevant Arabic AI systems.

Almieyar-Oryx-BloomBench: A Bilingual Multimodal Benchmark for Cognitively Informed Evaluation of Vision-Language Models

arXiv · Jun 4 · Research LLM

Researchers have introduced BloomBench, a new cognitively human-grounded, bilingual (English-Arabic) multimodal benchmark for Vision-Language Models (VLMs), as part of the Almieyar benchmarking series. Grounded in Bloom's Taxonomy, it systematically evaluates six levels of cognition—Remember, Understand, Apply, Analyze, Evaluate, Create—through carefully designed image-question-answer tasks. A comprehensive study using BloomBench revealed that state-of-the-art VLMs exhibit strong semantic understanding but struggle significantly with factual recall and creative synthesis, alongside a critical performance gap between Arabic and English. Why it matters: This benchmark provides a crucial tool for diagnosing cognitive weaknesses in current VLMs and lays the groundwork for developing more cognitively aligned and inclusive multimodal AI, particularly for cross-lingual applications.

An NLP-Driven Framework for Curriculum-Labor Market Alignment: Schema-Constrained LLM Extraction, ESCO-Anchored Semantic Matching, and Multi-Dimensional Gap Quantification

arXiv · Jun 1 · NLP LLM

Researchers proposed a four-stage NLP framework combining schema-constrained LLM extraction, Sentence-BERT (SBERT) alignment with ESCO, an adjudication protocol, and a verification mechanism for curriculum-labor market alignment. The framework was instantiated for the ABET-accredited BSc Computer Science program at the United Arab Emirates University (UAEU), extracting 400 competency records from the study plan and aligning them with 30 job postings. The extractor achieved a Cohen's kappa of 0.79 on the skill slot and surfaced interpretable supply-demand gaps in general, transversal, algorithms, and software engineering skills, with a minimal gap in AI and data science. Why it matters: This framework provides a robust, NLP-driven method to identify crucial skill gaps in higher education curricula, directly supporting quality assurance and workforce development initiatives in the region.

Uncovering Temporal Framing in the News

arXiv · May 29 · NLP Research

Researchers from MBZUAI have proposed a new taxonomy of eight temporal frames and studied their persuasive use in news discourse. They created a multilingual dataset by expertly annotating 458 English and German news articles, identifying over 2,000 temporally framed sentences and approximately 3,000 annotations. Their experiments demonstrated that temporal framing is learnable at the sentence level, with supervised models significantly outperforming zero-shot classification approaches. Why it matters: This research provides a valuable dataset and methodology for understanding how time-related language shapes interpretation in news, contributing to advancements in NLP for media analysis and potentially countering disinformation.

LLM-Based Financial Sentiment Analysis in Arabic: Evidence from Saudi Markets

arXiv · May 19 · NLP LLM

Researchers developed an Arabic NLP framework designed for large-scale financial sentiment analysis specifically tailored to the Saudi market. The framework integrates official financial news and social media, constructing an 84K-sample Arabic financial corpus through a multi-stage pipeline encompassing data collection, cleaning, and sentiment annotation. It employs Transformer-based NER and a curated company lexicon to link textual mentions to canonical company identifiers, assigning five-class sentiment labels for analyzing sentiment dynamics relative to stock market behavior on the Saudi Exchange. Why it matters: This research addresses a critical gap in Arabic financial NLP resources, offering a scalable method to understand investor sentiment in a key Middle Eastern market.

Yasi One launches from Abu Dhabi: A new AI that thinks before it responds - Gulf News

Gulf News News · May 12 · Product LLM

Yasi One, a new artificial intelligence system, has been launched from Abu Dhabi, UAE. This AI is specifically noted for its unique capability to 'think before it responds,' suggesting advanced processing and reasoning functionalities. The launch of Yasi One was reported by Gulf News. Why it matters: This development underscores Abu Dhabi's growing ambition to develop and deploy cutting-edge AI technologies, potentially contributing to more sophisticated and contextually aware AI applications in the region.

The Geopolitics of AI Safety: A Causal Analysis of Regional LLM Bias

arXiv · May 6 · LLM Research

This study introduces a Probabilistic Graphical Model (PGM) framework utilizing Pearl's do-operator to causally audit LLM safety mechanisms, specifically isolating the effect of injecting cultural demographics into prompts. A large-scale empirical analysis was conducted across seven instruction-tuned models from diverse origins, including the UAE's Falcon3-7B, as well as models from the US, Europe, China, and India, using ToxiGen and BOLD datasets. The findings revealed a disparity between observational and interventional bias, demonstrating that standard fairness metrics can overestimate demographic bias. Western models exhibited higher causal refusal rates for specific demographic groups, while Eastern models showed low overall intervention rates with targeted sensitivities toward regional demographics. Why it matters: This research highlights the geopolitical nuances of LLM safety alignment and the potential for demographic-sensitive over-triggering to restrict benign discourse, which is particularly relevant for diverse regions like the Middle East in developing culturally-aware AI.

The Cylindrical Representation Hypothesis for Language Model Steering

arXiv · May 3 · NLP LLM

Researchers have proposed the Cylindrical Representation Hypothesis (CRH) to address the instability and unpredictability observed in steering large language models, an issue not fully explained by the existing Linear Representation Hypothesis (LRH). CRH suggests that overlapping concept contributions lead to a sample-specific axis-orthogonal structure, comprising a central axis for concept generation and a surrounding normal plane for steering sensitivity. This framework identifies intrinsic uncertainty at the 'sensitive sector' level within the plane, providing a principled explanation for fluctuations in steering outcomes. Experiments verify the existence of this cylindrical structure and demonstrate CRH's practical utility in interpreting real-world model steering behavior, with code available on GitHub from mbzuai-nlp. Why it matters: This research from MBZUAI offers a crucial theoretical advancement in understanding and potentially improving the control and reliability of large language models.

The Cylindrical Representation Hypothesis for Language Model Steering

arXiv · May 3 · LLM NLP

Researchers from MBZUAI have proposed the Cylindrical Representation Hypothesis (CRH) to explain the instability and unpredictability observed in large language model steering. CRH relaxes the orthogonality assumption of the existing Linear Representation Hypothesis, positing a cylindrical structure where a central axis captures concept differences and a surrounding normal plane controls steering sensitivity. The hypothesis suggests that the intrinsic uncertainty in identifying specific sensitive sectors within this normal plane accounts for why steering outcomes frequently fluctuate even with well-aligned directions. Why it matters: This research offers a more robust theoretical framework for understanding and potentially improving the control and reliability of large language models.

Instruction-Guided Poetry Generation in Arabic and Its Dialects

arXiv · Apr 30 · NLP LLM

Researchers at MBZUAI have developed a new method for controllable poetry generation in Arabic and its dialects, moving beyond traditional analysis tasks for Arabic poetry within Large Language Models (LLMs). They introduce a large-scale, instruction-based dataset in Modern Standard Arabic (MSA) and various Arabic dialects, enabling LLMs to perform tasks like writing, revising, and continuing poems based on user criteria. Experiments show that fine-tuning LLMs on this dataset results in models capable of generating poetry aligned with user requirements, validated by automated metrics and human evaluation. Why it matters: This work represents a significant advancement in Arabic Natural Language Processing, offering tools for creative expression and cultural preservation while opening new avenues for user-guided content generation in culturally rich text forms.

Agentic AI is coming to UAE government services: What will change for residents - Gulf News

Gulf News News · Apr 24 · Policy Product

Agentic AI is set to be integrated into UAE government services, indicating a shift in how residents will interact with public sector offerings. This initiative aims to leverage advanced AI capabilities to transform government operations and service delivery. The changes are expected to impact various aspects of resident engagement with government platforms. Why it matters: This move signifies a major step in the UAE's strategy to modernize its public services through cutting-edge AI, potentially setting a new standard for citizen-centric digital governance in the region.

New Google AI feature lets your data power smarter answers — now in UAE - Gulf News

Gulf News News · Apr 15 · Product LLM

Google has introduced a new AI feature in the United Arab Emirates, designed to provide more intelligent and personalized answers to users. This feature reportedly leverages user data, with consent, to enhance its responsiveness and relevance. The rollout in the UAE signifies the expansion of Google's advanced AI services into the Middle East market. Why it matters: This launch represents increased access to sophisticated AI tools for consumers and businesses in the UAE, potentially accelerating AI adoption and innovation in the local digital economy.

RightNow-Arabic-0.5B-Turbo: An Open Sub-1B Arabic Language Model via Vocabulary Injection and Edge-First Deployment

arXiv · Apr 10 · LLM Arabic AI

RightNow-Arabic-0.5B-Turbo is a new 518M-parameter Arabic-specialized decoder LLM, built on Qwen2.5-0.5B, designed to bridge the gap between small multilingual and large Arabic-specialized models. Its development pipeline included adding 27,032 Arabic tokens via vocabulary injection, continued pretraining on 504M Arabic tokens, and fine-tuning with supervised instruction and direct preference optimization. The model achieved a 35.9% mean accuracy on three Arabic benchmarks (COPA-ar, Arabic HellaSwag, ArabicMMLU), outperforming all same-class open models and recovering 67% of SILMA-9B's mean accuracy at 1/18 the parameters, with all code and weights publicly released. Why it matters: This model significantly advances efficient Arabic NLP by providing a powerful, specialized sub-1B LLM suitable for edge deployment, making advanced Arabic AI more accessible and performant on resource-constrained devices.

Severity-Aware Weighted Loss for Arabic Medical Text Generation

arXiv · Apr 7 · NLP LLM

Researchers proposed a severity-aware weighted loss method to fine-tune Arabic language models for medical text generation, prioritizing severe clinical cases. This approach utilizes soft severity probabilities, derived from an AraBERT-based classifier, to dynamically scale token-level loss contributions during optimization on the MAQA dataset. The method consistently improved performance across ten Arabic LLMs, with AraGPT2-Base increasing from 54.04% to 66.14% and AraGPT2-Medium from 59.16% to 67.18%. Why it matters: This novel fine-tuning strategy addresses a critical limitation in medical AI by enhancing the safety and reliability of Arabic medical large language models, particularly in high-stakes clinical scenarios.

State-of-the-Art Arabic Language Modeling with Sparse MoE Fine-Tuning and Chain-of-Thought Distillation

arXiv · Apr 7 · NLP LLM

Arabic-DeepSeek-R1 is an application-driven, open-source Arabic Large Language Model (LLM) that has achieved a new state-of-the-art (SOTA) across the Open Arabic LLM Leaderboard (OALL). The model utilizes a sparse Mixture-of-Experts (MoE) backbone and a four-phase Chain-of-Thought (CoT) distillation scheme, which incorporates Arabic-specific linguistic verification and regional ethical norms. It records the highest average score on the OALL suite and outperforms proprietary frontier systems like GPT-5.1 on a majority of benchmarks evaluating comprehensive Arabic language-specific tasks. Why it matters: This work offers a validated and cost-effective framework for developing high-performing, culturally-grounded AI for under-represented languages, addressing the digital equity gap.

Beyond LLM-as-a-Judge: Deterministic Metrics for Multilingual Generative Text Evaluation

arXiv · Apr 6 · NLP Research

Researchers have developed OmniScore, a family of deterministic learned metrics designed to evaluate generative text as an alternative to Large Language Models (LLMs) used as judges. OmniScore leverages small parameter models (<1B) and was trained on approximately 564,000 synthetic instances across 107 languages, then evaluated using 8,617 manually annotated instances. It approximates LLM-judge behavior while offering low latency and consistency for various evaluation settings like reference-based and source-grounded assessments in tasks like QA, translation, and summarization. Why it matters: This development provides a practical, scalable, and reproducible method for multilingual generative text evaluation, addressing key limitations of LLM-as-a-judge approaches and offering significant benefits for AI development in linguistically diverse regions.

Are Arabic Benchmarks Reliable? QIMMA's Quality-First Approach to LLM Evaluation

arXiv · Apr 3 · LLM NLP

QIMMA is introduced as a quality-assured Arabic LLM leaderboard that places systematic benchmark validation at its core. It employs a multi-model assessment pipeline combining automated LLM judgment with human review to identify and resolve quality issues in established Arabic benchmarks. The resulting evaluation suite comprises over 52,000 samples, predominantly grounded in native Arabic content, with transparent implementation via LightEval and EvalPlus. Why it matters: This initiative provides a more reliable and reproducible foundation for evaluating Arabic Large Language Models, addressing critical quality concerns in existing benchmarks.

World Reasoning Arena

arXiv · Mar 26 · Research LLM

Researchers from MBZUAI have introduced WR-Arena, a new comprehensive benchmark designed to evaluate World Models (WMs) beyond traditional next-state prediction and visual fidelity. WR-Arena assesses WMs across three core dimensions: Action Simulation Fidelity, Long-horizon Forecast, and Simulative Reasoning and Planning, using a curated task taxonomy and diverse datasets. Extensive experiments with state-of-the-art WMs revealed a significant gap between current models' capabilities and human-level hypothetical reasoning. Why it matters: This benchmark provides a critical diagnostic tool and guideline for developing more robust and intelligent world models capable of advanced understanding, forecasting, and purposeful action, particularly for AI research in the region.

UAE's use of American GPUs makes US tech a 'dominant standard', Trump's AI adviser says - thenationalnews.com

The National · Mar 26 · Infrastructure Policy

The UAE's extensive utilization of American Graphics Processing Units (GPUs) for its artificial intelligence development has established US technology as a "dominant standard" in the region. This observation was made by Michael Kratsios, former US Chief Technology Officer and AI adviser to Donald Trump. The reliance highlights the critical role of hardware supply chains in shaping global AI capabilities. Why it matters: This underscores the geopolitical implications of technological dependency and the strategic advantage held by nations controlling essential AI infrastructure.

Grounding Arabic LLMs in the Doha Historical Dictionary: Retrieval-Augmented Understanding of Quran and Hadith

arXiv · Mar 25 · NLP LLM

Researchers developed a retrieval-augmented generation (RAG) framework to improve Arabic Large Language Models (LLMs) in understanding complex historical and religious texts like the Quran and Hadith. This framework grounds LLMs in the Doha Historical Dictionary of Arabic (DHDA) through hybrid retrieval and intent-based routing. The approach significantly boosted the accuracy of Arabic-native LLMs such as Fanar and ALLaM to over 85%, closing the performance gap with proprietary models like Gemini. Why it matters: This research offers a novel method for enhancing Arabic NLP capabilities for historically nuanced texts, demonstrating the value of integrating diachronic lexicographic resources into RAG systems for deeper language understanding.

CoVR-R:Reason-Aware Composed Video Retrieval

arXiv · Mar 20 · CV RL

A new approach to composed video retrieval (CoVR) is presented, which leverages large multimodal models to infer causal and temporal consequences implied by an edit. The method aligns reasoned queries to candidate videos without task-specific finetuning. A new benchmark, CoVR-Reason, is introduced to evaluate reasoning in CoVR.

Older →