Skip to content
GCC AI Research

Search

Results for "focal modulation"

Making sense of space and time in video

MBZUAI ·

MBZUAI researchers presented a new approach to video analysis at ICCV in Paris, led by Syed Talal Wasim. The approach builds on still image processing techniques like focal modulation to analyze spatial and temporal information in video separately. It aims to improve temporal aggregation while avoiding the computational complexity of transformers. Why it matters: This research advances video understanding in computer vision by offering a more efficient method for temporal modeling, crucial for applications like activity recognition and video surveillance.

The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding

arXiv ·

The paper introduces the Prism Hypothesis, which posits a correspondence between an encoder's feature spectrum and its functional role, with semantic encoders capturing low-frequency components and pixel encoders retaining high-frequency information. Based on this, the authors propose Unified Autoencoding (UAE), a model that harmonizes semantic structure and pixel details using a frequency-band modulator. Experiments on ImageNet and MS-COCO demonstrate that UAE effectively unifies semantic abstraction and pixel-level fidelity, achieving state-of-the-art performance.

Fernando Albarracin, Young Scientist Award, URSI GASS 2020

TII ·

Dr. Fernando Albarracin from the Technology Innovation Institute has presented a novel microwave applicator design for hyperthermia, potentially useful in cancer treatment. The design combines two flat dielectric graded-index (GRIN) lenses to localize electromagnetic energy within a specific spot in the tissue. This system offers a suitable alternative to conventional antenna-based applicators by considering the interface between free space and human tissue. Why it matters: This research introduces a new approach to hyperthermia treatment that could improve the precision and effectiveness of cancer therapy in the region.

When AI stops playing “spot the difference” and starts understanding changes in MRIs

MBZUAI ·

MBZUAI researchers presented DEFUSE-MS at MICCAI 2025, a novel AI system for analyzing changes in MRI scans of multiple sclerosis (MS) patients. DEFUSE-MS uses a deformation field-guided spatiotemporal graph-based framework to identify new lesions by reasoning about how the brain has changed. The model constructs graphs of small regions within baseline and follow-up MRIs, linking them across time with edges enriched with learned embeddings of the deformation field. Why it matters: DEFUSE-MS reframes the task from simple "spot the difference" to understanding structural changes, potentially improving the speed and accuracy of MS diagnosis and treatment monitoring.

Foundations of Multisensory Artificial Intelligence

MBZUAI ·

Paul Liang from CMU presented on machine learning foundations for multisensory AI, discussing a theoretical framework for modality interactions. The talk covered cross-modal attention and multimodal transformer architectures, and applications in mental health, pathology, and robotics. Liang's research aims to enable AI systems to integrate and learn from diverse real-world sensory modalities. Why it matters: This highlights the growing importance of multimodal AI research and its potential for advancements across various sectors in the region, including healthcare and robotics.

Peeking inside the brain

KAUST ·

KAUST Discovery highlights the contributions of Magistretti to the field of neuroenergetics. His research explores the cellular and molecular basis of brain energy metabolism and brain imaging. Magistretti's group discovered mechanisms underlying the coupling between neuronal activity and energy consumption, revealing the role of astrocytes. Why it matters: Understanding brain energy metabolism and the role of glial cells can advance brain imaging techniques and our understanding of neuronal processes.

BRIQA: Balanced Reweighting in Image Quality Assessment of Pediatric Brain MRI

arXiv ·

This paper introduces BRIQA, a new method for automated assessment of artifact severity in pediatric brain MRI, which is important for diagnostic accuracy. BRIQA uses gradient-based loss reweighting and a rotating batching scheme to handle class imbalance in artifact severity levels. Experiments show BRIQA improves average macro F1 score from 0.659 to 0.706, especially for Noise, Zipper, Positioning and Contrast artifacts.