Search

Results for "reversal curse"

Overcoming the ‘reversal curse’ in LLMs with ReCall

MBZUAI · Invalid Date

MBZUAI researchers identified 'self-referencing causal cycles' in LLM training data that can mitigate the 'reversal curse,' where LLMs struggle with information presented in reverse order. The study, to be presented at ACL, explains that the transformer architecture's unidirectional token generation causes this issue. By leveraging the repetitive nature of information in training texts, the team developed an efficient solution to improve LLM performance. Why it matters: Overcoming the reversal curse can significantly enhance LLM accuracy and reliability, especially in tasks requiring bidirectional reasoning and understanding of context.

Turning failure into success

KAUST · Oct 26

Dr. Samuel West, curator of the Museum of Failure, delivered a keynote lecture at KAUST on learning from innovation failure. He emphasized accepting failure, encouraging innovation, and framing work as learning problems. West used case studies like TwitterPeek and the Vasa warship to illustrate learning from past mistakes. Why it matters: This promotes a culture of experimentation and resilience, crucial for advancing AI and technology innovation in Saudi Arabia.

Saving ghost cities

KAUST · May 15

In a 2018 KAUST lecture, MIT professor Kamal Youcef-Toumi discussed the case of Ordos Kangbashi, a Chinese city designed for a million residents that became a near-ghost town. Despite government incentives, the city struggled due to an economic downturn and lack of social and economic balance. Youcef-Toumi emphasized the importance of the public realm and a balance between social and economic development for successful cities. Why it matters: The analysis provides insights relevant to urban planning in Saudi Arabia and the broader GCC region, where new cities and megaprojects are being developed.

Shorter but not Worse: Frugal Reasoning via Easy Samples as Length Regularizers in Math RLVR

arXiv · Nov 2

A new method is proposed to reduce the verbosity of LLMs in step-by-step reasoning by retaining moderately easy problems during Reinforcement Learning with Verifiable Rewards (RLVR) training. This approach acts as an implicit length regularizer, preventing the model from excessively increasing output length on harder problems. Experiments using Qwen3-4B-Thinking-2507 show the model achieves baseline accuracy with nearly twice shorter solutions.

Diffusion-BBO: Diffusion-Based Inverse Modeling for Online Black-Box Optimization

arXiv · Jun 30

This paper introduces Diffusion-BBO, a new online black-box optimization (BBO) framework that uses a conditional diffusion model as an inverse surrogate model. The framework employs an Uncertainty-aware Exploration (UaE) acquisition function to propose scores in the objective space for conditional sampling. The approach is shown theoretically to achieve a near-optimal solution and empirically outperforms existing online BBO baselines across 6 scientific discovery tasks.

Overcoming the curse of dimensionality

MBZUAI · Invalid Date

MBZUAI Professor Fakhri Karray and co-authors from the University of Waterloo have published "Elements of Dimensionality Reduction and Manifold Learning," a textbook on methods for extracting useful components from large datasets. The book addresses the challenge of the "curse of dimensionality," where growth in datasets complicates their use in machine learning. Karray developed the material from a popular course he taught at Waterloo. Why it matters: The textbook provides a unified resource for students and researchers in machine learning and AI, addressing a foundational challenge in processing high-dimensional data, relevant to diverse applications in the region.