Skip to content
GCC AI Research

Search

Results for "reversal curse"

Overcoming the ‘reversal curse’ in LLMs with ReCall

MBZUAI ·

MBZUAI researchers identified 'self-referencing causal cycles' in LLM training data that can mitigate the 'reversal curse,' where LLMs struggle with information presented in reverse order. The study, to be presented at ACL, explains that the transformer architecture's unidirectional token generation causes this issue. By leveraging the repetitive nature of information in training texts, the team developed an efficient solution to improve LLM performance. Why it matters: Overcoming the reversal curse can significantly enhance LLM accuracy and reliability, especially in tasks requiring bidirectional reasoning and understanding of context.

Turning failure into success

KAUST ·

Dr. Samuel West, curator of the Museum of Failure, delivered a keynote lecture at KAUST on learning from innovation failure. He emphasized accepting failure, encouraging innovation, and framing work as learning problems. West used case studies like TwitterPeek and the Vasa warship to illustrate learning from past mistakes. Why it matters: This promotes a culture of experimentation and resilience, crucial for advancing AI and technology innovation in Saudi Arabia.

Saving ghost cities

KAUST ·

In a 2018 KAUST lecture, MIT professor Kamal Youcef-Toumi discussed the case of Ordos Kangbashi, a Chinese city designed for a million residents that became a near-ghost town. Despite government incentives, the city struggled due to an economic downturn and lack of social and economic balance. Youcef-Toumi emphasized the importance of the public realm and a balance between social and economic development for successful cities. Why it matters: The analysis provides insights relevant to urban planning in Saudi Arabia and the broader GCC region, where new cities and megaprojects are being developed.

Rational Counterfactuals

arXiv ·

This paper introduces rational counterfactuals, a method for identifying counterfactuals that maximize the attainment of a desired consequent. The approach aims to identify the antecedent that leads to a specific outcome for rational decision-making. The theory is applied to identify variable values that contribute to peace, such as Allies, Contingency, Distance, Major Power, Capability, Democracy, and Economic Interdependency. Why it matters: The research provides a framework for analyzing and promoting conditions conducive to peace using counterfactual reasoning.

Overcoming the curse of dimensionality

MBZUAI ·

MBZUAI Professor Fakhri Karray and co-authors from the University of Waterloo have published "Elements of Dimensionality Reduction and Manifold Learning," a textbook on methods for extracting useful components from large datasets. The book addresses the challenge of the "curse of dimensionality," where growth in datasets complicates their use in machine learning. Karray developed the material from a popular course he taught at Waterloo. Why it matters: The textbook provides a unified resource for students and researchers in machine learning and AI, addressing a foundational challenge in processing high-dimensional data, relevant to diverse applications in the region.

Shorter but not Worse: Frugal Reasoning via Easy Samples as Length Regularizers in Math RLVR

arXiv ·

A new method is proposed to reduce the verbosity of LLMs in step-by-step reasoning by retaining moderately easy problems during Reinforcement Learning with Verifiable Rewards (RLVR) training. This approach acts as an implicit length regularizer, preventing the model from excessively increasing output length on harder problems. Experiments using Qwen3-4B-Thinking-2507 show the model achieves baseline accuracy with nearly twice shorter solutions.

Diffusion-BBO: Diffusion-Based Inverse Modeling for Online Black-Box Optimization

arXiv ·

This paper introduces Diffusion-BBO, a new online black-box optimization (BBO) framework that uses a conditional diffusion model as an inverse surrogate model. The framework employs an Uncertainty-aware Exploration (UaE) acquisition function to propose scores in the objective space for conditional sampling. The approach is shown theoretically to achieve a near-optimal solution and empirically outperforms existing online BBO baselines across 6 scientific discovery tasks.

When Benchmarks are Targets: Revealing the Sensitivity of Large Language Model Leaderboards

arXiv ·

Researchers from the National Center for AI in Saudi Arabia investigated the sensitivity of Large Language Model (LLM) leaderboards to minor benchmark perturbations. They found that small changes, like choice order, can shift rankings by up to 8 positions. The study recommends hybrid scoring and warns against over-reliance on simple benchmark evaluations, providing code for further research.