MBZUAI researchers introduced CausalVerse, a new benchmark for causal representation learning (CRL) presented at NeurIPS 2025. CausalVerse combines high-fidelity visual complexity with access to underlying causal variables and graphs, featuring 200,000 images and 300 million video frames across 24 sub-scenes in four domains. It aims to provide a realistic and precise testbed to evaluate whether CRL methods can truly learn the right causes. Why it matters: By bridging the gap between toy datasets and real-world data, CausalVerse can drive advances in AI systems capable of understanding causality in complex scenarios.
MBZUAI hosted a talk on causal AI, featuring Professor Jin Tian from Iowa State University. The talk covered enriching AI systems with causal reasoning capabilities, moving AI beyond prediction to understanding. Professor Tian shared research on causal inference and estimating causal effects from data, using a novel estimator with double/debiased machine learning (DML) properties. Why it matters: Causal AI can improve the explainability, robustness, and adaptability of AI systems, addressing limitations of purely statistical models.
MBZUAI researchers presented a new causal representation learning framework at ICLR for identifying latent causal variables in multimodal biological data. The framework addresses the challenge of uncovering underlying causal factors from lab tests, genetic information, and medical images. The new approach can identify latent causal variables and their influence on observed biological outcomes across modalities. Why it matters: The model's ability to analyze causal mechanisms between modalities can lead to more complete insights in biomedical research.
Researchers from MBZUAI presented a new algorithm at ICLR 2024 that identifies causal relationships involving both observed and latent variables. The algorithm addresses limitations of existing methods that struggle with latent variables or assume observed variables don't directly influence latent variables. The proposed algorithm can accommodate both scenarios, offering a more generalizable approach to causal discovery. Why it matters: This research advances the development of AI systems that can analyze complex data and identify causal relationships, with potential applications in fields like medicine where understanding causality is crucial for developing treatments and preventative measures.
KAUST researchers developed a new algorithm for detecting cause and effect in large datasets. The algorithm aims to find underlying models that generate data, helping uncover cause-and-effect dynamics. It could aid researchers across fields like cell biology and genetics by answering questions that typical machine learning cannot. Why it matters: This advancement could equip current machine learning methods with abilities to better deal with abstraction, inference, and concepts such as cause and effect.
MBZUAI researchers presented a study at ICML 2024 examining how data aggregation distorts causal discovery. The study argues that current methods are misled because real-world interactions happen at a micro level while observations are aggregated. Using the example of ice cream sales and temperature, they highlight how aggregation introduces "instantaneous causality" where time-lags exist. Why it matters: The research identifies a fundamental limitation in current causal discovery methods, potentially impacting disciplines relying on accurate causal inference from observational data.
MBZUAI's Kun Zhang is applying causal machine learning to improve drug development and precision medicine, focusing on answering 'why' questions. Traditional drug development is costly (est. $2B) due to extensive studies needed to determine drug toxicity and efficacy. Zhang is combining causal ML with organs-on-chips technology to improve pre-clinical drug testing, aiming to reduce the failure rate of drugs in human trials. Why it matters: By improving the accuracy of pre-clinical drug testing, this research could significantly reduce the cost and time required to bring new medicines to market in the region and worldwide.
MBZUAI Professor Kun Zhang is developing machine learning techniques to identify hidden causal variables, which are underlying concepts driving cause-and-effect relationships. Zhang and colleagues from Carnegie Mellon University are presenting a new approach for this at ICML 2024. Their method, causal representation learning, assumes that measured variables are generated by unobserved latent variables. Why it matters: Uncovering hidden causal relationships can significantly advance understanding in various fields by revealing the underlying mechanisms driving observed phenomena.