Skip to content
GCC AI Research

Search

Results for "Voodoo XP"

VENOM: Text-driven Unrestricted Adversarial Example Generation with Diffusion Models

arXiv ·

The paper introduces VENOM, a text-driven framework for generating high-quality unrestricted adversarial examples using diffusion models. VENOM unifies image content generation and adversarial synthesis into a single reverse diffusion process, enhancing both attack success rate and image quality. The framework incorporates an adaptive adversarial guidance strategy with momentum to ensure the generated adversarial examples align with the distribution of natural images.

Video-R2: Reinforcing Consistent and Grounded Reasoning in Multimodal Language Models

arXiv ·

Researchers at MBZUAI have introduced Video-R2, a reinforcement learning approach to improve the consistency and visual grounding of reasoning in multimodal language models. Video-R2 combines timestamp-aware supervised fine-tuning with Group Relative Policy Optimization (GRPO) guided by a Temporal Alignment Reward (TAR). The model demonstrates higher Think Answer Consistency (TAC), Video Attention Score (VAS), and accuracy across multiple benchmarks, showing improved temporal alignment and reasoning coherence for video understanding.

AI and Biomedicine: the Hospital of the Future

MBZUAI ·

Pierre Baldi from UC Irvine presented applications of AI to biomedicine, covering molecular-level analysis of circadian rhythms, real-time polyp detection in colonoscopy videos, and prediction of post-operative adverse outcomes. He discussed integrating AI in future AI-driven hospitals. The presentation was likely part of a panel discussion hosted by MBZUAI in collaboration with the Manara Center for Coexistence and Dialogue. Why it matters: This highlights the growing interest in AI applications within the healthcare sector in the UAE, particularly through institutions like MBZUAI.

Research on supervolcanoes gives clues to current, future climate change conditions

KAUST ·

KAUST researchers are studying ancient supervolcanoes, like the Toba eruption 75,000 years ago, to understand current and future climate conditions. Volcanic eruptions serve as natural experiments that push the climate system to its limits, helping scientists understand climate's physical mechanisms. Research shows that volcanic eruptions delayed global warming by about 30% starting from 1850. Why it matters: Understanding the impact of volcanic activity on climate change can improve predictions of future global warming, particularly in regions like the Middle East which are strongly affected by volcanic events.

At the forefront of programming models

KAUST ·

KAUST held its second hackathon and third NVIDIA workshop. Attendees listened to lectures from international experts. Participants worked on porting their scientific applications to a GPU accelerator. Why it matters: Such events help build regional expertise in accelerated computing and attract international collaboration.

A Benchmark and Agentic Framework for Omni-Modal Reasoning and Tool Use in Long Videos

arXiv ·

A new benchmark, LongShOTBench, is introduced for evaluating multimodal reasoning and tool use in long videos, featuring open-ended questions and diagnostic rubrics. The benchmark addresses the limitations of existing datasets by combining temporal length and multimodal richness, using human-validated samples. LongShOTAgent, an agentic system, is also presented for analyzing long videos, with both the benchmark and agent demonstrating the challenges faced by state-of-the-art MLLMs.