GCC AI Research

Results for "webcam"

Real-time Few-shot Realistic Avatars

MBZUAI ·

Ekaterina Radionova from Smarter AI (formerly Samsung AI Center) presented an approach to generating lifelike real-time avatars. The work focuses on generating high-quality video with authentic facial features to support online generation. Radionova holds a master's degree in Data Science from Skoltech and a bachelor's degree in Applied Mathematics from the Moscow Institute of Physics and Technology. Why it matters: Achieving realistic real-time avatars is critical for applications in online communication, entertainment, and virtual reality within the region.

Metaverse healthcare in red, green, and blue

MBZUAI ·

Researchers at MBZUAI developed a method to measure vital signs with webcams by analyzing changes in facial skin color intensity caused by blood flow. They built a digital twin system that uses machine learning to combine heart rate, respiratory rate, and blood oxygen level measurements. The system displays real-time vital sign information, enabling remote patient triage. Why it matters: This research contributes to the advancement of telemedicine, potentially improving healthcare access in underserved regions and aligning with UN Sustainable Development Goal #3.
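
As a rough illustration of how a pulse rate can be read from color intensity changes, the sketch below scans a plausible heart-rate band for the strongest periodic component in a per-frame brightness trace. It is not the MBZUAI method, whose details the summary does not give. The assumptions here are ours: that the input is the mean green-channel value of a face region per frame, that face detection has already been done elsewhere, and that the band limits (0.7 to 4.0 Hz) are reasonable. The function names `estimate_heart_rate_bpm` and `synthetic_trace` are hypothetical.

```python
import math


def estimate_heart_rate_bpm(green_means, fps, lo_hz=0.7, hi_hz=4.0, step_hz=0.01):
    """Estimate pulse rate from per-frame mean green intensity of a face region.

    Assumption: green_means holds one value per video frame, sampled at fps.
    """
    n = len(green_means)
    if n < 2 or fps <= 0:
        raise ValueError("need at least two samples and a positive frame rate")

    # Remove the constant skin-tone level so only the fluctuation remains.
    mean = sum(green_means) / n
    centered = [g - mean for g in green_means]

    # Scan candidate frequencies and keep the one with the highest spectral power.
    best_hz, best_power = lo_hz, -1.0
    steps = int(round((hi_hz - lo_hz) / step_hz))
    for k in range(steps + 1):
        hz = lo_hz + k * step_hz
        w = 2.0 * math.pi * hz / fps
        re = sum(c * math.cos(w * i) for i, c in enumerate(centered))
        im = sum(c * math.sin(w * i) for i, c in enumerate(centered))
        power = re * re + im * im
        if power > best_power:
            best_hz, best_power = hz, power
    return best_hz * 60.0


def synthetic_trace(bpm, fps, seconds):
    """Hypothetical test signal: constant skin tone plus a small pulse ripple."""
    hz = bpm / 60.0
    return [120.0 + 0.5 * math.sin(2.0 * math.pi * hz * i / fps)
            for i in range(int(fps * seconds))]


if __name__ == "__main__":
    trace = synthetic_trace(bpm=72, fps=30, seconds=10)
    print(round(estimate_heart_rate_bpm(trace, fps=30), 1))
```

On a clean synthetic trace the estimate should land close to the simulated rate. Real webcam footage is much noisier, because motion, lighting changes, and compression all disturb the signal, so a practical system would need filtering and tracking steps that are omitted here.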

High-quality Neural Reconstruction in Real-world Scenes

MBZUAI ·

A researcher at the University of Oxford presented new findings on 3D neural reconstruction. The talk introduced a dataset comprising real-world video captures with perfect 3D models. A novel joint optimization method refines camera poses during the reconstruction process. Why it matters: High-quality 3D reconstruction has broad applicability to robotics and computer vision applications in the region.

Al-Balad: Architectural gem of Old Jeddah showcased at WEP photography exhibition

KAUST ·

A photography exhibition at KAUST's 2015 Winter Enrichment Program (WEP) showcased Al-Balad in Jeddah through the lenses of Andrea Bachofen-Echt and Marina Kochetyga. The photographers captured the unique architecture of Al-Balad, a UNESCO World Heritage Site, during multiple visits over a month. Andrea noted Al-Balad's authenticity due to limited tourism, making it a unique subject. Why it matters: This highlights the importance of preserving and promoting the cultural heritage of Saudi Arabia through art and photography.

How to be a successful scientist-entrepreneur

KAUST ·

Dr. Eric Fossum, professor at Dartmouth and inventor of CMOS active pixel image sensors, spoke at KAUST's 2017 Enrichment in the Spring Program. The lecture focused on how to be a successful scientist-entrepreneur. He received a gift from the KAUST Enrichment Programs team. Why it matters: This highlights KAUST's efforts to engage with leading international experts to foster innovation and entrepreneurship among its researchers and students.

Cross-modal understanding and generation of multimodal content

MBZUAI ·

Nicu Sebe from the University of Trento presented recent work on video generation, focusing on animating objects in a source image using external information like labels, driving videos, or text. He introduced a Learnable Game Engine (LGE) trained from monocular annotated videos, which maintains states of scenes, objects, and agents to render controllable viewpoints. Why it matters: This talk highlights advancements in cross-modal AI, potentially enabling new applications in gaming, simulation, and content creation within the region.

Computer vision: Teaching computers how to see the world

KAUST ·

KAUST's Visual Computing Center (VCC) is researching computer vision, image processing, and machine learning, with applications in self-driving cars, surveillance, and security. Professor Bernard Ghanem is working on teaching machines to understand visual data semantically, similar to how humans perceive the world. Self-driving cars use visual sensors to interpret traffic signals and detect obstacles, while computer vision also assists governments and corporations with security applications like facial recognition and detecting unattended luggage. Why it matters: Advancements in computer vision at KAUST can contribute to innovations in autonomous vehicles and enhance security measures in the region.

Domain Adaptable Fine-Tune Distillation Framework For Advancing Farm Surveillance

arXiv ·

The paper introduces a framework for camel farm monitoring using a combination of automated annotation and fine-tune distillation. The Unified Auto-Annotation framework uses GroundingDINO and SAM to automatically annotate surveillance video data. The Fine-Tune Distillation framework then fine-tunes student models like YOLOv8, transferring knowledge from a larger teacher model, using data from Al-Marmoom Camel Farm in Dubai.