Skip to content
GCC AI Research

Search

Results for "ACVA"

Towards Practical Remote Photoplethysmography Detector

MBZUAI ·

Pong C Yuen from Hong Kong Baptist University will present a talk on remote photoplethysmography (rPPG) detection. The talk will review the development of rPPG detection, share recent research, and discuss future directions. rPPG is a technology for non-contact computer vision and healthcare applications like heart rate estimation. Why it matters: Advancements in rPPG could enable new remote patient monitoring and diagnostic tools in the region, reducing the need for physical contact.

MBZUAI students win award for study presented at Asian Conference on Computer Vision

MBZUAI ·

MBZUAI students won an award at the Asian Conference on Computer Vision (ACCV) for their ObjectCompose method. ObjectCompose generates object-to-background variations of images to validate neural network performance. It helps developers test AI systems by adding variability to validation datasets without distorting the main object. Why it matters: This research offers a new approach to improve the robustness and reliability of computer vision models, which is crucial for real-world applications in the region.

Computer vision: Teaching computers how to see the world

KAUST ·

KAUST's Visual Computing Center (VCC) is researching computer vision, image processing, and machine learning, with applications in self-driving cars, surveillance, and security. Professor Bernard Ghanem is working on teaching machines to understand visual data semantically, similar to how humans perceive the world. Self-driving cars use visual sensors to interpret traffic signals and detect obstacles, while computer vision also assists governments and corporations with security applications like facial recognition and detecting unattended luggage. Why it matters: Advancements in computer vision at KAUST can contribute to innovations in autonomous vehicles and enhance security measures in the region.

To Make Just-Noticeable Difference (JND) Computable toward Visual Intelligence

MBZUAI ·

A professor from Nanyang Technological University (NTU), Singapore gave a talk at MBZUAI about "Just-Noticeable Difference (JND)" models in visual intelligence. The talk covered visual JND models, research and applications, and future opportunities for JND modeling. JND can help tackle big data challenges with limited resources by focusing on user-centric and green systems. Why it matters: Exploring JND could lead to advancements in AI applications related to visual signal processing, image synthesis, and generative AI in the region.

Improving patient care with computer vision

MBZUAI ·

MBZUAI's BioMedIA lab, led by Mohammad Yaqub, is developing AI solutions for healthcare challenges in cardiology, pulmonology, and oncology using computer vision. Yaqub's previous research analyzed fetal ultrasound images to correlate bone development with maternal vitamin D levels. The lab is now applying image analysis to improve the treatment of head and neck cancer using PET and CT scans. Why it matters: This research demonstrates the potential of AI and computer vision to improve diagnostic accuracy and accessibility of healthcare in the region and beyond.

Visualizing the future

KAUST ·

KAUST's Visual Computing Center (VCC) hosted an Open House event on March 28, showcasing its interdisciplinary research in visual computing. Demonstrations included a virtual reality driving simulator by FalconViz, intended for driver education in Saudi Arabia. Researchers also presented a drone trained to autonomously navigate race courses and a neural network for autonomous driving using image-based technology without GPS. Why it matters: The VCC's work highlights KAUST's role in advancing visual computing applications relevant to Saudi Arabia, from driver training to autonomous systems.

Video search gets closer to how humans look for clips

MBZUAI ·

A new paper at ICCV 2025, co-authored by MBZUAI Ph.D. student Dmitry Demidov, introduces Dense-WebVid-CoVR, a 1.6-million sample benchmark for composed video retrieval (CoVR). The benchmark features longer, context-rich descriptions and modification texts, generated using Gemini Pro and GPT-4o, with manual verification. The paper also presents a unified fusion approach that jointly reasons across video and text inputs, improving performance on fine-grained edit details. Why it matters: This work advances video search capabilities by enabling more human-like queries, which is crucial for creative and analytic workflows that require nuanced video retrieval.

Breathing new life into medical applications

MBZUAI ·

MBZUAI graduate Ahmed Sharshar developed a computer vision application that assesses lung health from a video of a person breathing, estimating Forced Vital Capacity (FVC), Forced Expiratory Volume in 1 second (FEV1), and Peak Expiratory Flow (PEF). The model achieved up to 100% accuracy using thermal video data from 60 participants. Sharshar aims to create lightweight models applicable in developing countries without high-end GPUs. Why it matters: This research showcases the potential of AI to democratize healthcare access through non-invasive, accessible diagnostic tools.