Skip to content
GCC AI Research

Search

Results for "math word problems"

LLMs tackle math word problems

MBZUAI ·

MBZUAI researchers presented a study at NAACL 2024 analyzing errors made by open-source LLMs when solving math word problems. The study, led by Ekaterina Kochmar and KV Aditya Srivatsa, investigates characteristics that make math word problems difficult for machines. Llama2-70B was used to test the ability of LLMs to solve these problems, revealing that LLMs can perform math operations correctly but still give the wrong answer. Why it matters: The research aims to improve AI's ability to understand and solve math word problems, potentially leading to better educational applications and teaching methods.

Shorter but not Worse: Frugal Reasoning via Easy Samples as Length Regularizers in Math RLVR

arXiv ·

A new method is proposed to reduce the verbosity of LLMs in step-by-step reasoning by retaining moderately easy problems during Reinforcement Learning with Verifiable Rewards (RLVR) training. This approach acts as an implicit length regularizer, preventing the model from excessively increasing output length on harder problems. Experiments using Qwen3-4B-Thinking-2507 show the model achieves baseline accuracy with nearly twice shorter solutions.

VideoMathQA: Benchmarking Mathematical Reasoning via Multimodal Understanding in Videos

arXiv ·

MBZUAI researchers introduce VideoMathQA, a new benchmark for evaluating mathematical reasoning in videos, requiring models to interpret visual information, text, and spoken cues. The dataset spans 10 mathematical domains with videos ranging from 10 seconds to over 1 hour, and includes multi-step reasoning annotations. The benchmark aims to evaluate temporal cross-modal reasoning and highlights the limitations of existing approaches in complex video-based mathematical problem solving.

An algorithm for success

KAUST ·

The article mentions several KAUST faculty and staff, including Matteo Parsani (Assistant Professor of Applied Mathematics), Teofilo Abrajano (Director of Sponsored Research), and David Keyes (Director of the Extreme Computing Research Center). It also references a talk by NASA Senior Scientist Mark Carpenter at the SIAM CSE 2017 conference. The article includes a photograph of King Abdullah bin Abdulaziz Al Saud. Why it matters: This appears to be general information about KAUST faculty and activities, but lacks specific details on research or AI developments.

The role of applied mathematics in finance

KAUST ·

KAUST's Stochastic Numerics Research Group is developing methods for pricing European options. Their approach, detailed in an upcoming Journal of Computational Finance article, focuses on systematically tuning parameters to achieve accuracy while minimizing computational effort. The goal is to enable automated computation of fair prices for options contracts, similar to how insurance companies determine premiums. Why it matters: This research advances computational finance in the region, potentially improving risk management and investment strategies.

Why the World Cup is a random process with a drift

KAUST ·

KAUST Professor Peter Markowich discusses the role of mathematics in football, describing a match as a random process with a drift. The randomness stems from player conditions, referee decisions, weather, and more, while the drift represents the higher probability of the better team winning. He notes that the complexity arising from 11 players on each side increases the randomness compared to sports like tennis. Why it matters: This perspective highlights the interplay of chance and skill in sports, offering a mathematical lens for understanding game dynamics.