skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


This content will become publicly available on June 1, 2026

Title: Deep learning inference of miRNA expression from bulk and single-cell mRNA expression
Studying miRNA activity at the single-cell level presents a significant challenge due to the limitations of existing single-cell technologies in capturing miRNAs. To address this, we introduce two deep learning models: Cross-modality (CM) and single-modality (SM), both based on encoder-decoder architectures. These models predict miRNA expression at both bulk and single-cell levels using mRNA data. We evaluated the performance of CM and SM against the state-of-the-art miRSCAPE approach, using both bulk and single-cell datasets. Our results demonstrate that both CM and SM outperform miRSCAPE in accuracy. Furthermore, incorporating miRNA target information substantially enhanced performance compared to models that utilized all genes. These models provide powerful tools for predicting miRNA expression from single-cell mRNA data.  more » « less
Award ID(s):
2015838
PAR ID:
10640690
Author(s) / Creator(s):
; ; ;
Publisher / Repository:
World Scientific Connect
Date Published:
Journal Name:
Journal of Bioinformatics and Computational Biology
Volume:
23
Issue:
03
ISSN:
0219-7200
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. null (Ed.)
    Deregulation of gene expression is associated with the pathogenesis of numerous human diseases including cancer. Current data analyses on gene expression are mostly focused on differential gene/transcript expression in big data-driven studies. However, a poor connection to the proteome changes is a widespread problem in current data analyses. This is partly due to the complexity of gene regulatory pathways at the post-transcriptional level. In this study, we overcome these limitations and introduce a graph-based learning model, PTNet, which simulates the microRNAs (miRNAs) that regulate gene expression post-transcriptionally in silico. Our model does not require large-scale proteomics studies to measure the protein expression and can successfully predict the protein levels by considering the miRNA–mRNA interaction network, the mRNA expression, and the miRNA expression. Large-scale experiments on simulations and real cancer high-throughput datasets using PTNet validated that (i) the miRNA-mediated interaction network affects the abundance of corresponding proteins and (ii) the predicted protein expression has a higher correlation with the proteomics data (ground-truth) than the mRNA expression data. The classification performance also shows that the predicted protein expression has an improved prediction power on cancer outcomes compared to the prediction done by the mRNA expression data only or considering both mRNA and miRNA. Availability: PTNet toolbox is available at http://github.com/CompbioLabUCF/PTNet 
    more » « less
  2. Abstract To understand phenotypic variations and key factors which affect disease susceptibility of complex traits, it is important to decipher cell‐type tissue compositions. To study cellular compositions of bulk tissue samples, one can evaluate cellular abundances and cell‐type‐specific gene expression patterns from the tissue transcriptome profiles. We develop both fixed and mixed models to reconstruct cellular expression fractions for bulk‐profiled samples by using reference single‐cell (sc) RNA‐sequencing (RNA‐seq) reference data. In benchmark evaluations of estimating cellular expression fractions, the mixed‐effect models provide similar results as an elegant machine learning algorithm named cell‐type identification by estimating relative subsets of RNA transcripts (CIBERSORTx), which is a well‐known and reliable procedure to reconstruct cell‐type abundances and cell‐type‐specific gene expression profiles. In real data analysis, the mixed‐effect models outperform or perform similarly as CIBERSORTx. The mixed models perform better than the fixed models in both benchmark evaluations and data analysis. In simulation studies, we show that if the heterogeneity exists in scRNA‐seq data, it is better to use mixed models with heterogeneous mean and variance–covariance. As a byproduct, the mixed models provide fractions of covariance between subject‐specific gene expression and cell types to measure their correlations. The proposed mixed models provide a complementary tool to dissect bulk tissues using scRNA‐seq data. 
    more » « less
  3. Abstract We present Bisque, a tool for estimating cell type proportions in bulk expression. Bisque implements a regression-based approach that utilizes single-cell RNA-seq (scRNA-seq) or single-nucleus RNA-seq (snRNA-seq) data to generate a reference expression profile and learn gene-specific bulk expression transformations to robustly decompose RNA-seq data. These transformations significantly improve decomposition performance compared to existing methods when there is significant technical variation in the generation of the reference profile and observed bulk expression. Importantly, compared to existing methods, our approach is extremely efficient, making it suitable for the analysis of large genomic datasets that are becoming ubiquitous. When applied to subcutaneous adipose and dorsolateral prefrontal cortex expression datasets with both bulk RNA-seq and snRNA-seq data, Bisque replicates previously reported associations between cell type proportions and measured phenotypes across abundant and rare cell types. We further propose an additional mode of operation that merely requires a set of known marker genes. 
    more » « less
  4. Abstract We evaluated miRNA and mRNA expression differences in head tissues between avid-biting vs. reluctant-biting Aedes albopictus (Skuse) females from a single population over a 20-min timescale. We found no differences in miRNA expression between avid vs. reluctant biters, indicating that translational modulation of blood-feeding behavior occurs on a longer timescale than mRNA transcription. In contrast, we detected 19 differentially expressed mRNAs. Of the 19 differentially expressed genes at the mRNA level between avid-biting vs. reluctant-biting A. albopictus, 9 are implicated in olfaction, consistent with the well-documented role of olfaction in mosquito host-seeking. Additionally, several of the genes that we identified as differentially expressed in association with phenotypic variation in biting behavior share similar functions with or are inferred orthologues of, genes associated with evolutionary variation in biting behaviors of Wyeomyia smithii (Coq.) and Culex pipiens (Lin.). A future goal is to determine whether these genes are involved in the evolutionary transition from a biting to a non-biting life history. 
    more » « less
  5. Bardoni, Barbara (Ed.)
    Fragile X syndrome results from the loss of expression of the Fragile X Mental Retardation Protein (FMRP). FMRP and RNA helicase Moloney Leukemia virus 10 (MOV10) are important Argonaute (AGO) cofactors for miRNA-mediated translation regulation. We previously showed that MOV10 functionally associates with FMRP. Here we quantify the effect of reduced MOV10 and FMRP expression on dendritic morphology. Murine neurons with reduced MOV10 and FMRP phenocopied Dicer1 KO neurons which exhibit impaired dendritic maturation Hong J (2013), leading us to hypothesize that MOV10 and FMRP regulate DICER expression. In cells and tissues expressing reduced MOV10 or no FMRP, DICER expression was significantly reduced. Moreover, the Dicer1 mRNA is a Cross-Linking Immunoprecipitation (CLIP) target of FMRP Darnell JC (2011), MOV10 Skariah G (2017) and AGO2 Kenny PJ (2020). MOV10 and FMRP modulate expression of DICER1 mRNA through its 3’untranslated region (UTR) and introduction of a DICER1 transgene restores normal neurite outgrowth in the Mov10 KO neuroblastoma Neuro2A cell line and branching in MOV10 heterozygote neurons. Moreover, we observe a global reduction in AGO2-associated microRNAs isolated from Fmr1 KO brain. We conclude that the MOV10-FMRP-AGO2 complex regulates DICER expression, revealing a novel mechanism for regulation of miRNA production required for normal neuronal morphology. 
    more » « less