KAUST researchers organized a week-long workshop on bioinformatics, covering genomics and transcriptomics data analysis. The workshop targeted students, postdocs, and senior researchers, providing hands-on training in coding and analysis using tools like R, Python, and shell scripts. Attendees with little prior computational biology experience were introduced to fundamental concepts and tools for handling large sequencing datasets. Why it matters: The workshop addresses the increasing need for bioinformatics expertise at KAUST and in the region, crucial for advancing research in fields like evolution and complex diseases.
KAUST Professor Xin Gao, lead of the Structural and Functional Bioinformatics Group, advocates for interdisciplinarity in academic research, specifically merging AI and bioinformatics. Gao, formally trained in computer science with no formal biology training, integrated biological knowledge independently. At KAUST, he synchronized bioinformatics, machine learning, and AI, despite the challenges of dividing efforts between disciplines. Why it matters: Gao's success highlights the growing importance of interdisciplinary approaches in AI research, particularly in bridging computational methods with specialized domains like biomedicine to drive innovation.
Søren Brunak presented deep learning approaches for analyzing disease trajectories using data from 7-10 million patients in Denmark and the USA. The models predict future outcomes like mortality and specific diagnoses, such as pancreatic cancer, using 15-40 years of patient data. Disease trajectories and explainable AI can generate hypotheses for molecular-level investigations into causal aspects of disease progression. Why it matters: This research demonstrates the potential of large-scale patient data and AI to improve disease prediction and generate hypotheses for further investigation into disease mechanisms relevant to regional healthcare systems.
KAUST and Abdulrahman Nasser Alagil Sons Foundation (Jarir Investment) have signed an MoU to establish an endowment supporting AI and bioinformatics training programs at the KAUST Academy. These programs target undergraduate students across Saudi Arabia. The endowment aims to develop curriculum, provide resources, and expand training opportunities. Why it matters: This partnership aligns with Saudi Vision 2030's goals of advancing R&D, promoting innovation, and building national AI capabilities.
KAUST's Computational Bioscience Research Center (CBRC) held a Research Conference on Big Data Analyses in Evolutionary Biology. The conference focused on the impact of large "omics" datasets on evolutionary biology, requiring big data approaches for analysis. Researchers discussed how computer science can contribute to biology and vice versa. Why it matters: Such interdisciplinary events at KAUST can foster innovation at the intersection of computational science and biology, advancing research in both fields.
KAUST researchers developed a statistical approach to improve the identification of cancer-related protein mutations by reducing false positives. The method uses Bayesian statistics to analyze protein domain data from tumor samples, accounting for potential errors due to limited data. The team tested their method on prostate cancer data, successfully identifying a known cancer-linked mutation in the DNA binding protein cd00083. Why it matters: This enhances the reliability of cancer research at the molecular level, potentially accelerating the discovery of new therapeutic targets.
KAUST Professor Takashi Gojobori has been elected as a Fellow of the International Society for Computational Biology (ISCB). ISCB is a scholarly society for computational biology and bioinformatics. Gojobori's research interests include comparative genomics and gene expression of neural cells, as well as the marine metagenomics of microorganisms. Why it matters: The recognition highlights KAUST's contributions to computational biology and bioinformatics and strengthens its position as a research hub in the region.
Researchers at the Rosalind Franklin Institute are using generative AI, including GANs, to augment limited biological datasets, specifically mirtron data from mirtronDB. The synthetic data created mimics real-world samples, facilitating more comprehensive training of machine learning models, leading to improved mirtron identification tools. They also plan to apply Large Language Models (LLMs) to predict unknown patterns in sequence and structure biology problems. Why it matters: This research explores AI techniques to tackle data scarcity in biological research, potentially accelerating discoveries in noncoding RNA and transposable elements.