Search

Results for "social simulation"

From Individual to Society: Social Simulation Driven by LLM-based Agent

MBZUAI · Invalid Date

Fudan University's Zhongyu Wei presented research on social simulation driven by LLMs, covering individual and large-scale social movement simulation. Wei directs the Data Intelligence and Social Computing Lab (Fudan DISC) and has published extensively on multimodal large models and social computing. His work includes the Volcano multimodal model, DISC-MedLLM, and ElectionSim. Why it matters: Using LLMs for social simulation could provide new tools for understanding and potentially predicting social dynamics in the Arab world.

SocialMaze: A Benchmark for Evaluating Social Reasoning in Large Language Models

arXiv · May 29

MBZUAI researchers introduce SocialMaze, a new benchmark for evaluating social reasoning capabilities in large language models (LLMs). SocialMaze includes six diverse tasks across social reasoning games, daily-life interactions, and digital community platforms, emphasizing deep reasoning, dynamic interaction, and information uncertainty. Experiments show that LLMs vary in handling dynamic interactions, degrade under uncertainty, but can be improved via fine-tuning on curated reasoning examples.

The diagnosis game: A simulated hospital environment to measure AI agents’ diagnostic abilities

MBZUAI · Invalid Date

MBZUAI researchers developed MedAgentSim, a simulated hospital environment to evaluate AI diagnostic abilities. The simulation uses LLM-powered agents to mimic doctor-patient conversations, providing a dynamic assessment of diagnostic skills. The system includes doctor, patient, and evaluator agents that interact within the simulated hospital, making real-time decisions. Why it matters: This research offers a more realistic evaluation of AI in clinical settings, addressing limitations of current benchmarks and potentially improving AI's use in healthcare.

MIRAGE: Exploring How Large Language Models Perform in Complex Social Interactive Environments

arXiv · Jan 3

The paper introduces MIRAGE, a framework for evaluating LLMs' ability to simulate human behaviors in murder mystery games. MIRAGE uses four methods: TII, CIC, ICI and SCI to assess the LLMs' role-playing proficiency. Experiments show that even GPT-4 struggles with the complexities of the MIRAGE framework.

The chameleon effect in education with social AI: can children learn by subconsciously mimicking a social robot?

MBZUAI · Invalid Date

Maha Elgarf from NYU Abu Dhabi presented research on using social robots to stimulate creativity in children through subconscious mimicry, leveraging the 'chameleon effect'. The research involved a series of studies where children engaged in storytelling with a social robot, and their creativity was assessed. Elgarf also discussed using Large Language Models (LLMs) in education and challenges in the field. Why it matters: This explores innovative applications of social robotics and AI in education within the UAE, potentially enhancing children's learning and creativity.

Advance Simulation Method for Wheel-Terrain Interactions of Space Rovers: A Case Study on the UAE Rashid Rover

arXiv · Aug 23

This paper introduces a virtual wheel-terrain interaction model developed and validated for the UAE Rashid rover to enhance simulation accuracy for space rovers. The model incorporates wheel grouser properties, slippage, soil properties, and interaction mechanics, validated via lunar soil simulation. Experiments tested a Grouser-Rashid rover wheel at slip ratios of 0, 0.25, 0.50, and 0.75. Why it matters: This simulation method advances rover design and control, crucial for the UAE's space exploration program and lunar mission success.