KAUST researchers in the Image and Video Understanding Lab are applying machine learning to computer vision for automated navigation, including self-driving cars and UAVs. They have tested their algorithms on KAUST roads, aiming to replicate the brain's efficiency in tasks like activity and object recognition. The team is also exploring the possibility of creative algorithms that can transfer skills without direct training. Why it matters: This research, conducted within the GCC region, contributes to the advancement of autonomous systems and explores fundamental questions about replicating human intelligence in machines.
Tailin Wu from Stanford presented research at MBZUAI on using machine learning to accelerate scientific discovery and simulation. The work covers learning theories from dynamical systems with improved accuracy and interpretability. It also introduces LAMP, a deep learning model that optimizes spatial resolution in simulations. Why it matters: Efficient AI-driven scientific simulation has broad implications for research in physics, biomedicine, materials science, and engineering across the region.
Nobuyuki Umetani from the University of Tokyo presented a talk on using AI to accelerate simulations and optimization for 3D shape designs. The talk covered interactive approaches integrating physical simulation into geometric modeling. Specific applications discussed included musical instruments, garment design, aerodynamic design, and floor plan design. Why it matters: This highlights growing interest in AI techniques at MBZUAI and across the GCC for streamlining engineering design and simulation processes.
Jorge Amador, a PhD student at KAUST's Visual Computing Center, gave a talk on physically-based simulation for generative AI models. The talk covered the use of synthetic data generation and physical priors to address the need for high-quality datasets. Applications discussed included photo editing, navigation, digital humans, and cosmological simulations. Why it matters: This research explores a promising technique for overcoming data scarcity in AI, which is particularly relevant in resource-constrained environments or for sensitive applications.
Patrick van der Smagt, Director of AI Research at Volkswagen Group, discussed the use of generative machine learning models for predicting and controlling complex stochastic systems in robotics. The talk highlighted examples in robotics and beyond and addressed the challenges of achieving quality and trust in AI systems. He also mentioned his involvement in a European industry initiative on trust in AI and his membership in the AI Council of the State of Bavaria. Why it matters: Control in robotics and trust in AI are key issues for the further development of autonomous systems, especially in industrial applications within the GCC region.
Fudan University's Zhongyu Wei presented research on social simulation driven by LLMs, covering the simulation of both individuals and large-scale social movements. Wei directs the Data Intelligence and Social Computing Lab (Fudan DISC) and has published extensively on multimodal large models and social computing. His work includes the Volcano multimodal model, DISC-MedLLM, and ElectionSim. Why it matters: Using LLMs for social simulation could provide new tools for understanding and potentially predicting social dynamics in the Arab world.
MBZUAI Professor Salman Khan is researching continuous, lifelong learning systems for computer vision, aiming to mimic human learning processes like curiosity and discovery. His work focuses on learning from limited data and adversarial robustness of deep neural networks. Khan, along with MBZUAI professors Fahad Khan and Rao Anwer, and partners from other universities, presented research at CVPR 2022. Why it matters: This research has the potential to significantly improve the ability of AI systems to understand and adapt to the real world, enabling more intelligent autonomous systems.
MBZUAI researchers developed MedAgentSim, a simulated hospital environment to evaluate AI diagnostic abilities. The simulation uses LLM-powered agents to mimic doctor-patient conversations, providing a dynamic assessment of diagnostic skills. The system includes doctor, patient, and evaluator agents that interact within the simulated hospital, making real-time decisions. Why it matters: This research offers a more realistic evaluation of AI in clinical settings, addressing limitations of current benchmarks and potentially improving AI's use in healthcare.