Toward universal cell embeddings: integrating single-cell RNA-seq datasets across species with SATURN

Rosen, Yanay; Brbić, Maria; Roohani, Yusuf; Swanson, Kyle; Li, Ziang; Leskovec, Jure

doi:10.1038/s41592-024-02191-z

Citation Details

Toward universal cell embeddings: integrating single-cell RNA-seq datasets across species with SATURN

Abstract Analysis of single-cell datasets generated from diverse organisms offers unprecedented opportunities to unravel fundamental evolutionary processes of conservation and diversification of cell types. However, interspecies genomic differences limit the joint analysis of cross-species datasets to homologous genes. Here we present SATURN, a deep learning method for learning universal cell embeddings that encodes genes’ biological properties using protein language models. By coupling protein embeddings from language models with RNA expression, SATURN integrates datasets profiled from different species regardless of their genomic similarity. SATURN can detect functionally related genes coexpressed across species, redefining differential expression for cross-species analysis. Applying SATURN to three species whole-organism atlases and frog and zebrafish embryogenesis datasets, we show that SATURN can effectively transfer annotations across species, even when they are evolutionarily remote. We also demonstrate that SATURN can be used to find potentially divergent gene functions between glaucoma-associated genes in humans and four other species. more »

Award ID(s):: 1918940 1835598

PAR ID:: 10497823

Author(s) / Creator(s):: Rosen, Yanay; Brbić, Maria; Roohani, Yusuf; Swanson, Kyle; Li, Ziang; Leskovec, Jure

Publisher / Repository:: Nature Methods

Date Published:: 2024-02-16

Journal Name:: Nature Methods

ISSN:: 1548-7091

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1038/s41592-024-02191-z

More Like this