KAUST, Tufts, and JIHS researchers created pangenome graphs using Saudi and Japanese samples, named JaSaPaGe. These graphs address the underrepresentation of these populations in existing pangenome databases, which are used as references for understanding individual DNA. The population-specific pangenomes are expected to improve variant calling and diagnostic accuracy for genetic disorders in these groups. Why it matters: This work promotes precision medicine and reduces diagnostic gaps for underrepresented populations by providing more relevant genetic baselines.
Carlo Maj from the University of Marburg will discuss using polygenic modeling to analyze the genetic architecture of multifactorial traits. He will present how these approaches can be used to predict the genetically driven components of complex phenotypes. The talk highlights the potential of these methods to bridge genomic research and genetic epidemiology using biobank data. Why it matters: Such methods could improve disease risk assessment and advance personalized risk management in the region if applied to local biobanks or datasets.
KAUST researchers have developed a genomic resource for Tausch’s goatgrass (Aegilops tauschii), a wild relative of wheat, by creating 46 high-quality genome assemblies. They compiled 493 genetically distinct accessions from an initial 900, collaborating with the Open Wild Wheat Consortium to select accessions with traits of interest, such as disease resistance and stress tolerance. Screening these assemblies helped identify rust resistance genes, including mapping a stem rust resistance gene to the Sr33 locus. Why it matters: This genomic resource will accelerate gene discovery in wheat, potentially improving modern wheat varieties and enhancing global food security.
Eran Segal from Weizmann Institute of Science presented The Human Phenotype Project, a large-scale prospective cohort with over 10,000 participants. The project aims to identify novel molecular markers and develop prediction models for disease onset using deep profiling. The profiling includes medical history, lifestyle, blood tests, and molecular profiling of the transcriptome, genetics, microbiome, metabolome and immune system. Why it matters: Such projects demonstrate the growing focus on personalized medicine in the region, utilizing advanced AI and machine learning techniques for disease prevention and treatment.
A talk discusses the challenges of single-cell data analysis, such as feature sparsity and the effects of rare cells. AI/ML strategies are uniquely positioned to model this data. ImYoo, a startup founded in 2021, is applying single-cell model architectures for unsupervised discovery of patient groupings and predicting sample-level phenotypical data in autoimmune disease. Why it matters: This highlights the growing application of AI/ML in analyzing single-cell data for population-scale human health studies, an area ripe for innovation and improvement in the Middle East's growing biotech sector.
Professor Eran Segal presented The Human Phenotype Project, a longitudinal cohort study with over 10,000 participants. The project aims to identify molecular markers and develop prediction models for disease using deep profiling techniques including medical history, lifestyle, blood tests, and microbiome analysis. The study provides insights into drivers of obesity, diabetes, and heart disease, identifying novel markers at the microbiome, metabolite, and immune system level. Why it matters: Such large-scale phenotyping initiatives could inform personalized medicine approaches relevant to the Middle East's specific health challenges.
Janet Kelso from the Max Planck Institute and Sudhir Kumar from Temple University discussed evolutionary biology in a KAUST Facebook Live interview. Kelso's research focuses on interactions between modern humans and Neanderthals, finding similarities in DNA and benefits for environmental adaptation. Kumar's work, highly cited, involves big data analyses in evolutionary biology. Why it matters: The interview highlights KAUST's engagement with international experts in bioinformatics and evolutionary biology, promoting interdisciplinary research and knowledge dissemination.
Dr. Mikhail Burtsev of the London Institute presented research on GENA-LM, a suite of transformer-based DNA language models. The talk addressed the challenge of scaling transformers for genomic sequences, proposing recurrent memory augmentation to handle long input sequences efficiently. This approach improves language modeling performance and holds promise for memory-intensive applications in bioinformatics. Why it matters: This research can significantly advance AI's capabilities in genomics by enabling the processing of much larger DNA sequences, with potential breakthroughs in understanding and treating diseases.