Skip to content
GCC AI Research

Search

Results for "motion tracking"

Tracking Meets Large Multimodal Models for Driving Scenario Understanding

arXiv ·

Researchers at MBZUAI have introduced a novel approach to enhance Large Multimodal Models (LMMs) for autonomous driving by integrating 3D tracking information. This method uses a track encoder to embed spatial and temporal data, enriching visual queries and improving the LMM's understanding of driving scenarios. Experiments on DriveLM-nuScenes and DriveLM-CARLA benchmarks demonstrate significant improvements in perception, planning, and prediction tasks compared to baseline models.

Teaching algorithms to see

KAUST ·

KAUST's Image and Video Understanding Lab is developing machine learning algorithms for computer vision and object tracking, with applications in video content search and UAV navigation. Their algorithms can detect specific activities in videos, helping platforms detect unwanted content and deliver relevant ads. The object tracking algorithm is also used to empower UAVs, enabling them to follow objects autonomously. Why it matters: This research enhances video content analysis and UAV capabilities, positioning KAUST as a leader in computer vision and AI applications within the region.

This AI can reconstruct motion blurred Human faces - Digital Information World

Inception ·

An artificial intelligence system has been developed that can reconstruct human faces from images affected by motion blur. This technology leverages advanced algorithms to reverse the blurring effect, enhancing facial clarity and detail. The system aims to improve image quality in various applications where motion artifacts are common. Why it matters: This advancement holds significant potential for applications in forensics, security surveillance, and improving consumer photography by recovering lost detail in images.