MBZUAI researchers introduce SocialMaze, a new benchmark for evaluating social reasoning capabilities in large language models (LLMs). SocialMaze includes six diverse tasks across social reasoning games, daily-life interactions, and digital community platforms, emphasizing deep reasoning, dynamic interaction, and information uncertainty. Experiments show that LLMs vary in how well they handle dynamic interactions and degrade under uncertainty, but that fine-tuning on curated reasoning examples can improve their performance.
Fudan University's Zhongyu Wei presented research on social simulation driven by LLMs, covering individual and large-scale social movement simulation. Wei directs the Data Intelligence and Social Computing Lab (Fudan DISC) and has published extensively on multimodal large models and social computing. His work includes the Volcano multimodal model, DISC-MedLLM, and ElectionSim. Why it matters: Using LLMs for social simulation could provide new tools for understanding and potentially predicting social dynamics in the Arab world.
KAUST encouraged attendees of the 2015 Winter Enrichment Program (WEP) to share their experiences on social media using the hashtag #wep2015. The university provided tips for participants to effectively use platforms like Facebook, Twitter, and Instagram during the event. KAUST emphasized responsible sharing and respect for the university's multicultural community when posting. Why it matters: This initiative aimed to amplify the reach of WEP's activities and engage a broader audience in KAUST's community and knowledge-sharing efforts.
This paper explores how AI and social media analytics can identify and track trends in Saudi Arabia across sectors such as construction, food and beverage, tourism, technology, and entertainment. The study analyzed millions of social media posts each month, classifying discussions and calculating scores to track trends. The AI-driven methodology predicted the emergence and growth of trends from social media data.
This paper presents a decentralized multi-agent unmanned aerial system designed for search, pickup, and relocation of objects. The system integrates multi-agent aerial exploration, object detection/tracking, and aerial gripping. The decentralized system uses global state estimation, reactive collision avoidance, and sweep planning for exploration. Why it matters: The system's successful deployment in demonstrations and competitions like MBZIRC highlights the potential of integrated robotic solutions for complex tasks such as search and rescue in the region.
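To give a feel for the sweep planning mentioned above, here is a minimal sketch of a generic boustrophedon ("lawnmower") sweep over a rectangular area. The summary does not describe the paper's actual planner, so this is an illustration under stated assumptions rather than the authors' method: the function name `sweep_waypoints`, the flat rectangular search area, and the fixed row `spacing` are all hypothetical.

```python
import math
from typing import List, Tuple

Waypoint = Tuple[float, float]


def sweep_waypoints(width: float, height: float, spacing: float) -> List[Waypoint]:
    """Return back-and-forth waypoints covering a width x height rectangle.

    Hypothetical helper for illustration only. Rows are flown parallel to
    the x-axis and are `spacing` apart along y, starting at y = 0. If
    `height` is not a multiple of `spacing`, the last row stops short of
    the top edge.
    """
    if width <= 0 or height <= 0 or spacing <= 0:
        raise ValueError("width, height and spacing must be positive")

    n_rows = int(math.floor(height / spacing)) + 1
    waypoints: List[Waypoint] = []
    for row in range(n_rows):
        y = row * spacing
        # Alternate direction on every row so the vehicle never flies
        # back across ground it has already covered.
        if row % 2 == 0:
            waypoints.append((0.0, y))
            waypoints.append((width, y))
        else:
            waypoints.append((width, y))
            waypoints.append((0.0, y))
    return waypoints


if __name__ == "__main__":
    for wp in sweep_waypoints(10.0, 4.0, 2.0):
        print(wp)
```

In a multi-agent setting, one simple option would be to split the area into strips and run a sweep like this per vehicle; how the system in the paper actually divides the work is not stated in this summary.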
KAUST hosted the "Human-Machine Networks and Intelligent Infrastructures" conference, co-organized by Prof. Jeff Shamma and Asst. Prof. Meriem Laleg. The conference explored the blend of engineered devices and human elements in large-scale systems like smart grids. Keynote speaker Dr. Pramod Khargonekar discussed cyber-physical-social systems and emerging trends. Why it matters: The conference highlights the growing importance of understanding the interplay between AI, infrastructure, and human behavior in the development of smart cities and intelligent systems in the region.
MBZUAI researchers developed MedAgentSim, a simulated hospital environment to evaluate AI diagnostic abilities. The simulation uses LLM-powered agents to mimic doctor-patient conversations, providing a dynamic assessment of diagnostic skills. The system includes doctor, patient, and evaluator agents that interact within the simulated hospital, making real-time decisions. Why it matters: This research offers a more realistic evaluation of AI in clinical settings, addressing limitations of current benchmarks and potentially improving AI's use in healthcare.