MBZUAI researchers developed a new approach called Multimodal Optimal Transport via Grounded Retrieval (MOTOR) to improve the accuracy of vision-language models for medical image analysis. MOTOR combines retrieval-augmented generation (RAG) with an optimal transport algorithm to retrieve and rank relevant image and textual data. Testing on two medical datasets showed that MOTOR improved average performance by 6.45%. Why it matters: This technique addresses the challenges of limited specialized medical datasets and computational costs associated with training AI models for medical image interpretation, offering a more efficient and accurate solution.
MBZUAI researchers developed a method to adapt Meta's Segment Anything Model (SAM) for medical image segmentation, addressing its performance gap with natural images. Their approach improves SAM's accuracy without requiring extensive retraining or large medical image datasets. The research, led by Chao Qin, was nominated for the Best Paper Award at the MICCAI conference in Marrakesh. Why it matters: This offers a more efficient and effective way to leverage foundation models in specialized medical imaging applications, potentially improving diagnostic accuracy and reducing the need for large-scale, domain-specific training data.
MBZUAI researchers developed a new deep learning method for rapid and accurate estimation of clinical measurements from echocardiograms. The method focuses on improving the measurement of the left ventricle ejection fraction, a key indicator of heart health. Their deep learning approach improves upon previous methods by better organizing data representation, enhancing performance and transferability. Why it matters: The AI-driven solution can potentially reduce analysis time for cardiologists, improve patient care, and be particularly beneficial in regions with limited healthcare resources.
MBZUAI researchers presented DEFUSE-MS at MICCAI 2025, a novel AI system for analyzing changes in MRI scans of multiple sclerosis (MS) patients. DEFUSE-MS uses a deformation field-guided spatiotemporal graph-based framework to identify new lesions by reasoning about how the brain has changed. The model constructs graphs of small regions within baseline and follow-up MRIs, linking them across time with edges enriched with learned embeddings of the deformation field. Why it matters: DEFUSE-MS reframes the task from simple "spot the difference" to understanding structural changes, potentially improving the speed and accuracy of MS diagnosis and treatment monitoring.
MBZUAI Ph.D. student Raza Imam and colleagues presented a new benchmark called MediMeta-C to test the robustness of medical vision-language models (MVLMs) under real-world image corruptions. They found that top-performing MVLMs on clean data often fail under mild corruption, with fundoscopy models particularly vulnerable. To address this, they developed RobustMedCLIP (RMC), a lightweight defense using few-shot LoRA tuning to improve model robustness. Why it matters: This research highlights the critical need for robustness testing in medical AI to ensure reliability in clinical settings, particularly in resource-constrained environments where image quality may be compromised.