MBZUAI researchers introduced CausalVerse, a new benchmark for causal representation learning (CRL) presented at NeurIPS 2025. CausalVerse combines high-fidelity visual complexity with access to underlying causal variables and graphs, featuring 200,000 images and 300 million video frames across 24 sub-scenes in four domains. It aims to provide a realistic and precise testbed to evaluate whether CRL methods can truly learn the right causes. Why it matters: By bridging the gap between toy datasets and real-world data, CausalVerse can drive advances in AI systems capable of understanding causality in complex scenarios.
MBZUAI hosted a talk on causal AI, featuring Professor Jin Tian from Iowa State University. The talk covered enriching AI systems with causal reasoning capabilities, moving AI beyond prediction to understanding. Professor Tian shared research on causal inference and estimating causal effects from data, using a novel estimator with double/debiased machine learning (DML) properties. Why it matters: Causal AI can improve the explainability, robustness, and adaptability of AI systems, addressing limitations of purely statistical models.
MBZUAI's Kun Zhang is applying causal machine learning to improve drug development and precision medicine, focusing on answering 'why' questions. Traditional drug development is costly (est. $2B) due to extensive studies needed to determine drug toxicity and efficacy. Zhang is combining causal ML with organs-on-chips technology to improve pre-clinical drug testing, aiming to reduce the failure rate of drugs in human trials. Why it matters: By improving the accuracy of pre-clinical drug testing, this research could significantly reduce the cost and time required to bring new medicines to market in the region and worldwide.
KAUST researchers developed a new algorithm for detecting cause and effect in large datasets. The algorithm aims to find underlying models that generate data, helping uncover cause-and-effect dynamics. It could aid researchers across fields like cell biology and genetics by answering questions that typical machine learning cannot. Why it matters: This advancement could equip current machine learning methods with abilities to better deal with abstraction, inference, and concepts such as cause and effect.
MBZUAI Professor Kun Zhang is developing machine learning techniques to identify hidden causal variables, which are underlying concepts driving cause-and-effect relationships. Zhang and colleagues from Carnegie Mellon University are presenting a new approach for this at ICML 2024. Their method, causal representation learning, assumes that measured variables are generated by unobserved latent variables. Why it matters: Uncovering hidden causal relationships can significantly advance understanding in various fields by revealing the underlying mechanisms driving observed phenomena.
MBZUAI researchers presented a new causal representation learning framework at ICLR for identifying latent causal variables in multimodal biological data. The framework addresses the challenge of uncovering underlying causal factors from lab tests, genetic information, and medical images. The new approach can identify latent causal variables and their influence on observed biological outcomes across modalities. Why it matters: The model's ability to analyze causal mechanisms between modalities can lead to more complete insights in biomedical research.
Saber Salehkaleybar from EPFL presented a talk on causal discovery, focusing on learning causal relationships from observational data and through interventions. He discussed an approximation algorithm for experiment design under budget constraints, with applications in gene-regulatory networks. The talk also covered improvements to reduce the computational complexity of experiment design algorithms. Why it matters: Causal AI systems can lead to more intelligent decision-making in various fields.
MBZUAI researchers presented a study at ICML 2024 examining how data aggregation distorts causal discovery. The study argues that current methods are misled because real-world interactions happen at a micro level while observations are aggregated. Using the example of ice cream sales and temperature, they highlight how aggregation introduces "instantaneous causality" where time-lags exist. Why it matters: The research identifies a fundamental limitation in current causal discovery methods, potentially impacting disciplines relying on accurate causal inference from observational data.