MBZUAI researchers collaborated with Carnegie Mellon University and the Broad Institute of MIT and Harvard to develop a new statistical method for analyzing data used for gene regulatory network inference. The method addresses the challenge of distinguishing true zero expression values from dropouts in single-cell RNA sequencing data. This research will be presented at the Twelfth International Conference on Learning Representations (ICLR 2024). Why it matters: Improving gene regulatory network inference can lead to better understanding of disease mechanisms and inform the development of new medicines.
KAUST's Environmental Epigenetics Program (KEEP), led by Prof. Valerio Orlando, focuses on understanding how cells acquire and maintain memory, particularly in response to environmental factors. The research investigates the role of non-coding RNA and chromosomal components in regulating gene expression beyond the DNA sequence. Epigenetics explains how the same genome can be interpreted differently, allowing cells and organs to adapt to changing conditions. Why it matters: This research could provide insights into how environmental factors impact gene expression and cell function, potentially leading to advances in understanding and treating diseases.
Munther Dahleh, director at the MIT Institute for Data, Systems, and Society (IDSS), discussed his group's research on network systems at the KAUST 2018 Winter Enrichment Program. The research focuses on the fragility of large networked systems, like highway systems, in response to disruptions that may lead to catastrophic failures. Dahleh's team studies transportation networks, electrical grids, and financial markets to understand system interconnection in causing systemic risk. Why it matters: Understanding networked systems is crucial for building resilient infrastructure and mitigating risks in critical sectors across the GCC region.
Carlo Maj from the University of Marburg will discuss using polygenic modeling to analyze the genetic architecture of multifactorial traits. He will present how these approaches can be used to predict the genetically driven components of complex phenotypes. The talk highlights the potential of these methods to bridge genomic research and genetic epidemiology using biobank data. Why it matters: Such methods could improve disease risk assessment and advance personalized risk management in the region if applied to local biobanks or datasets.
This article discusses a talk by Gábor Lugosi on "network archaeology," specifically the problems of root finding and broadcasting in large networks. The talk addresses discovering the past of dynamically growing networks when only a present-day snapshot is observed. Lugosi's research interests include machine learning theory, nonparametric statistics, and random structures. Why it matters: Understanding the evolution and origins of networks is crucial for various applications, including analyzing social networks, biological systems, and the spread of information.
Researchers at the Rosalind Franklin Institute are using generative AI, including GANs, to augment limited biological datasets, specifically mirtron data from mirtronDB. The synthetic data created mimics real-world samples, facilitating more comprehensive training of machine learning models, leading to improved mirtron identification tools. They also plan to apply Large Language Models (LLMs) to predict unknown patterns in sequence and structure biology problems. Why it matters: This research explores AI techniques to tackle data scarcity in biological research, potentially accelerating discoveries in noncoding RNA and transposable elements.
KAUST researchers developed a new algorithm for detecting cause and effect in large datasets. The algorithm aims to find underlying models that generate data, helping uncover cause-and-effect dynamics. It could aid researchers across fields like cell biology and genetics by answering questions that typical machine learning cannot. Why it matters: This advancement could equip current machine learning methods with abilities to better deal with abstraction, inference, and concepts such as cause and effect.
Khaled Alsayegh at the King Abdullah International Medical Research Center is creating a Saudi Stem Cell Donor Registry, with 80,000 potential donors identified. The aim is to identify universal donors, reprogram their cells into induced pluripotent stem (iPS) cells, and create a gene bank for matched tissue transplants. Alsayegh is collaborating with Jesper Tegnér at KAUST to create pacemaker cells using single-cell RNA sequencing. Why it matters: This initiative could revolutionize precision medicine in KSA by providing readily available, matched cells for transplants, reducing the need for patient-specific reprogramming and improving treatment outcomes.