A talk discusses the challenges of single-cell data analysis, such as feature sparsity and the effects of rare cells. AI/ML strategies are uniquely positioned to model this data. ImYoo, a startup founded in 2021, is applying single-cell model architectures for unsupervised discovery of patient groupings and predicting sample-level phenotypical data in autoimmune disease. Why it matters: This highlights the growing application of AI/ML in analyzing single-cell data for population-scale human health studies, an area ripe for innovation and improvement in the Middle East's growing biotech sector.
MBZUAI's Eduardo da Veiga Beltrame is developing machine learning tools for analyzing single-cell RNA sequencing data, which measures RNA in thousands of individual cells. Sequencing costs have decreased faster than Moore's Law, enabling large-scale data collection in biology. RNA sequencing provides insights into gene expression and cellular activity, crucial for personalized medicine. Why it matters: Advancements in single-cell RNA sequencing and ML analysis will accelerate personalized medicine by providing detailed insights into cellular mechanisms and disease pathways.
MBZUAI researchers collaborated with Carnegie Mellon University and the Broad Institute of MIT and Harvard to develop a new statistical method for analyzing data used for gene regulatory network inference. The method addresses the challenge of distinguishing true zero expression values from dropouts in single-cell RNA sequencing data. This research will be presented at the Twelfth International Conference on Learning Representations (ICLR 2024). Why it matters: Improving gene regulatory network inference can lead to better understanding of disease mechanisms and inform the development of new medicines.
Dr. Mikhail Burtsev of the London Institute presented research on GENA-LM, a suite of transformer-based DNA language models. The talk addressed the challenge of scaling transformers for genomic sequences, proposing recurrent memory augmentation to handle long input sequences efficiently. This approach improves language modeling performance and holds promise for memory-intensive applications in bioinformatics. Why it matters: This research can significantly advance AI's capabilities in genomics by enabling the processing of much larger DNA sequences, with potential breakthroughs in understanding and treating diseases.
Eduardo da Veiga Beltrame, bioinformatics lead at ImYoo (a Caltech spinout), presented on scalable methods for single-cell omics data analysis, including kallisto|bustools and scvi-tools. He highlighted their use in ImYoo's decentralized longitudinal study on Inflammatory Bowel Disease (IBD), where patients self-collect capillary blood samples. Beltrame also discussed his research on STEM education programs in Brazil as a visiting scholar at UC Berkeley. Why it matters: This highlights the growing trend of decentralized clinical studies leveraging advanced single-cell technologies for precision medicine, showcasing the potential of remote data collection and analysis in understanding complex diseases.