Researchers at the Rosalind Franklin Institute are using generative AI, including GANs, to augment limited biological datasets, specifically mirtron data from mirtronDB. The synthetic data created mimics real-world samples, facilitating more comprehensive training of machine learning models, leading to improved mirtron identification tools. They also plan to apply Large Language Models (LLMs) to predict unknown patterns in sequence and structure biology problems. Why it matters: This research explores AI techniques to tackle data scarcity in biological research, potentially accelerating discoveries in noncoding RNA and transposable elements.
A KAUST team developed piRNAi, a gene-silencing tool in nematode worms using synthetic RNA sequences interacting with the piRNA pathway. They successfully silenced genes involved in sex determination and other functions, demonstrating multiplexed gene silencing. The gene silencing lasted for varying durations across generations, up to six generations. Why it matters: This expands the molecular toolkit for gene manipulation and offers potential therapeutic applications in humans, given the presence of the same gene-silencing pathway.
KAUST and King Faisal Specialist Hospital and Research Centre (KFSHRC) are collaborating to develop an RNA sequencing tool to improve the diagnosis rate of genetic diseases. The tool analyzes RNA data to find aberrant transcripts and mutations, building on KFSHRC's clinical data and KAUST's computational expertise. The team has already solved cases that DNA sequencing alone could not, including a case of a young child with brain damage caused by a recessive gene mutation. Why it matters: This collaboration can improve disease management and preventative services in the region, directly contributing to Saudi Arabia’s national research priority of health and wellness.
KAUST researchers developed CovMT, a COVID-19 mutation tracking system for authorities and scientists to detect variants. CovMT tracks mutation fingerprints using daily data from the GISAID database of over 1.5 million viral genomes. The system identifies mutation hot spots, enabling public health authorities to stay ahead of new variants. Why it matters: This system provides a tool for rapid variant detection and informed public health decision-making in the region and globally.
KAUST professors Samir Hamdan and Nina Fedoroff collaborated on research published in Nucleic Acids Research focusing on microRNA (miRNA) biogenesis in plants. The study examined miRNA production in Arabidopsis thaliana and found that the protein SERRATE (SE) is integral to the processing of pri-miRNA by DCL1. They characterized the interactions of SE with RNA and DCL1, elucidating the mechanism by which SE promotes DCL1 activity. Why it matters: Understanding miRNA biogenesis could help modify crop plants to better tolerate stressful conditions, potentially increasing crop yields and productivity in the region.
The paper introduces Guided Deep List, a tool for automating the generation of epidemiological line lists from open source reports. The tool uses distributed vector representations and dependency parsing to extract tabular data on disease outbreaks. It was evaluated on MERS outbreak data in Saudi Arabia, demonstrating improved accuracy over baseline methods and enabling epidemiological inferences.
MBZUAI's Eduardo da Veiga Beltrame is developing machine learning tools for analyzing single-cell RNA sequencing data, which measures RNA in thousands of individual cells. Sequencing costs have decreased faster than Moore's Law, enabling large-scale data collection in biology. RNA sequencing provides insights into gene expression and cellular activity, crucial for personalized medicine. Why it matters: Advancements in single-cell RNA sequencing and ML analysis will accelerate personalized medicine by providing detailed insights into cellular mechanisms and disease pathways.
MBZUAI's Assistant Professor of Computational Biology, Eduardo Beltrame, is researching single-cell RNA sequencing to advance personalized medicine. He is also designing MBZUAI’s new master’s and Ph.D. programs in computational biology, set to launch in 2026, under the guidance of Professor Eran Segal. MBZUAI's research agenda includes foundational initiatives like AIDO and the Human Phenotype Project, leveraging vast datasets such as the Emirati Genome Project. Why it matters: This highlights MBZUAI's commitment to cutting-edge research and education in computational biology, positioning it as a potential rival to top global institutions in the field.