Reasoning with interactive guidance

MBZUAI · Notable

Summary

Niket Tandon from the Allen Institute for AI presented a talk at MBZUAI on enabling large language models to focus on human needs and continuously learn from interactions. He proposed a memory architecture inspired by the theory of recursive reminding to guide models in avoiding past errors. The talk addressed who to ask, what to ask, when to ask, and how to apply the obtained guidance.

Why it matters: The research explores how to align LLMs with human feedback, a key challenge for practical and ethical AI deployment.

Keywords

LLM · interactive guidance · MBZUAI · Allen Institute for AI · recursive reminding


Related

Video-CoM: Interactive Video Reasoning via Chain of Manipulations

arXiv · Nov 28

Researchers at MBZUAI introduce "Interactive Video Reasoning," a new paradigm that enables models to actively "think with videos" by performing iterative visual actions to gather and refine evidence. They developed Video-CoM, which reasons through a Chain of Manipulations (CoM), and constructed Video-CoM-Instruct, an 18K instruction-tuning dataset for multi-step manipulation reasoning. The model is further optimized via reinforcement learning with reasoning-aware Group Relative Policy Optimization (GRPO), achieving strong results across nine video reasoning benchmarks.
