skip to main content


Title: Gene Regulation Analysis Reveals Perturbations of Autism Spectrum Disorder during Neural System Development
Autism spectrum disorder (ASD) is a neurodevelopmental disorder that impedes patients’ cognition, social, speech and communication skills. ASD is highly heterogeneous with a variety of etiologies and clinical manifestations. The prevalence rate of ASD increased steadily in recent years. Presently, molecular mechanisms underlying ASD occurrence and development remain to be elucidated. Here, we integrated multi-layer genomics data to investigate the transcriptome and pathway dysregulations in ASD development. The RNA sequencing (RNA-seq) expression profiles of induced pluripotent stem cells (iPSCs), neural progenitor cells (NPCs) and neuron cells from ASD and normal samples were compared in our study. We found that substantially more genes were differentially expressed in the NPCs than the iPSCs. Consistently, gene set variation analysis revealed that the activity of the known ASD pathways in NPCs and neural cells were significantly different from the iPSCs, suggesting that ASD occurred at the early stage of neural system development. We further constructed comprehensive brain- and neural-specific regulatory networks by incorporating transcription factor (TF) and gene interactions with long 5 non-coding RNA(lncRNA) and protein interactions. We then overlaid the transcriptomes of different cell types on the regulatory networks to infer the regulatory cascades. The variations of the regulatory cascades between ASD and normal samples uncovered a set of novel disease-associated genes and gene interactions, particularly highlighting the functional roles of ELF3 and the interaction between STAT1 and lncRNA ELF3-AS 1 in the disease development. These new findings extend our understanding of ASD and offer putative new therapeutic targets for further studies.  more » « less
Award ID(s):
1946391
NSF-PAR ID:
10321733
Author(s) / Creator(s):
; ;
Date Published:
Journal Name:
Genes
Volume:
12
Issue:
12
ISSN:
2073-4425
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Aims

    Dissecting complex interactions among transcription factors (TFs), microRNAs (miRNAs) and long noncoding RNAs (lncRNAs) are central for understanding heart development and function. Although computational approaches and platforms have been described to infer relationships among regulatory factors and genes, current approaches do not adequately account for how highly diverse, interacting regulators that include noncoding RNAs (ncRNAs) control cardiac gene expression dynamics over time.

    Methods

    To overcome this limitation, we devised an integrated framework, cardiac gene regulatory modeling (CGRM) that integrates LogicTRN and regulatory component analysis bioinformatics modeling platforms to infer complex regulatory mechanisms. We then used CGRM to identify and compare the TF-ncRNA gene regulatory networks that govern early- and late-stage cardiomyocytes (CMs) generated by in vitro differentiation of human pluripotent stem cells (hPSC) and ventricular and atrial CMs isolated during in vivo human cardiac development.

    Results

    Comparisons of in vitro versus in vivo derived CMs revealed conserved regulatory networks among TFs and ncRNAs in early cells that significantly diverged in late staged cells. We report that cardiac genes (“heart targets”) expressed in early-stage hPSC-CMs are primarily regulated by MESP1, miR-1, miR-23, lncRNAs NEAT1 and MALAT1, while GATA6, HAND2, miR-200c, NEAT1 and MALAT1 are critical for late hPSC-CMs. The inferred TF-miRNA-lncRNA networks regulating heart development and contraction were similar among early-stage CMs, among individual hPSC-CM datasets and between in vitro and in vivo samples. However, genes related to apoptosis, cell cycle and proliferation, and transmembrane transport showed a high degree of divergence between in vitro and in vivo derived late-stage CMs. Overall, late-, but not early-stage CMs diverged greatly in the expression of “heart target” transcripts and their regulatory mechanisms.

    Conclusions

    In conclusion, we find that hPSC-CMs are regulated in a cell autonomous manner during early development that diverges significantly as a function of time when compared to in vivo derived CMs. These findings demonstrate the feasibility of using CGRM to reveal dynamic and complex transcriptional and posttranscriptional regulatory interactions that underlie cell directed versus environment-dependent CM development. These results with in vitro versus in vivo derived CMs thus establish this approach for detailed analyses of heart disease and for the analysis of cell regulatory systems in other biomedical fields.

     
    more » « less
  2. The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the etiological agent responsible for coronavirus disease 2019 (COVID-19), has affected the lives of billions and killed millions of infected people. This virus has been demonstrated to have different outcomes among individuals, with some of them presenting a mild infection, while others present severe symptoms or even death. The identification of the molecular states related to the severity of a COVID-19 infection has become of the utmost importance to understanding the differences in critical immune response. In this study, we computationally processed a set of publicly available single-cell RNA-Seq (scRNA-Seq) data of 12 Bronchoalveolar Lavage Fluid (BALF) samples diagnosed as having a mild, severe, or no infection, and generated a high-quality dataset that consists of 63,734 cells, each with 23,916 genes. We extended the cell-type and sub-type composition identification and our analysis showed significant differences in cell-type composition in mild and severe groups compared to the normal. Importantly, inflammatory responses were dramatically elevated in the severe group, which was evidenced by the significant increase in macrophages, from 10.56% in the normal group to 20.97% in the mild group and 34.15% in the severe group. As an indicator of immune defense, populations of T cells accounted for 24.76% in the mild group and decreased to 7.35% in the severe group. To verify these findings, we developed several artificial neural networks (ANNs) and graph convolutional neural network (GCNN) models. We showed that the GCNN models reach a prediction accuracy of the infection of 91.16% using data from subtypes of macrophages. Overall, our study indicates significant differences in the gene expression profiles of inflammatory response and immune cells of severely infected patients. 
    more » « less
  3. Rationale: NAA15 (N-alpha-acetyltransferase 15) is a component of the NatA (N-terminal acetyltransferase complex). The mechanism by which NAA15 haploinsufficiency causes congenital heart disease remains unknown. To better understand molecular processes by which NAA15 haploinsufficiency perturbs cardiac development, we introduced NAA15 variants into human induced pluripotent stem cells (iPSCs) and assessed the consequences of these mutations on RNA and protein expression. Objective: We aim to understand the role of NAA15 haploinsufficiency in cardiac development by investigating proteomic effects on NatA complex activity and identifying proteins dependent upon a full amount of NAA15. Methods and Results: We introduced heterozygous loss of function, compound heterozygous, and missense residues (R276W) in iPSCs using CRISPR/Cas9. Haploinsufficient NAA15 iPSCs differentiate into cardiomyocytes, unlike NAA15 -null iPSCs, presumably due to altered composition of NatA. Mass spectrometry analyses reveal ≈80% of identified iPSC NatA targeted proteins displayed partial or complete N-terminal acetylation. Between null and haploinsufficient NAA15 cells, N-terminal acetylation levels of 32 and 9 NatA-specific targeted proteins were reduced, respectively. Similar acetylation loss in few proteins occurred in NAA15 R276W induced pluripotent stem cells. In addition, steady-state protein levels of 562 proteins were altered in both null and haploinsufficient NAA15 cells; 18 were ribosomal-associated proteins. At least 4 proteins were encoded by genes known to cause autosomal dominant congenital heart disease. Conclusions: These studies define a set of human proteins that requires a full NAA15 complement for normal synthesis and development. A 50% reduction in the amount of NAA15 alters levels of at least 562 proteins and N-terminal acetylation of only 9 proteins. One or more modulated proteins are likely responsible for NAA15-haploinsufficiency mediated congenital heart disease. Additionally, genetically engineered induced pluripotent stem cells provide a platform for evaluating the consequences of amino acid sequence variants of unknown significance on NAA15 function. 
    more » « less
  4. Although epithelial-mesenchymal transition (EMT) is a common feature of fibrotic lung disease, its role in fibrogenesis is controversial. Recently, aberrant basaloid cells were identified in fibrotic lung tissue as a novel epithelial cell type displaying a partial EMT phenotype. The developmental origin of these cells remains unknown. To elucidate the role of EMT in the development of aberrant basaloid cells from the bronchial epithelium, we mapped EMT-induced transcriptional changes at the population and single-cell levels. Human bronchial epithelial cells grown as submerged or air-liquid interface (ALI) cultures with or without EMT induction were analyzed by bulk and single-cell RNA-Sequencing. Comparison of submerged and ALI cultures revealed differential expression of 8,247 protein coding (PC) and 1,621 long noncoding RNA (lncRNA) genes and revealed epithelial cell-type-specific lncRNAs. Similarly, EMT induction in ALI cultures resulted in robust transcriptional reprogramming of 6,020 PC and 907 lncRNA genes. Although there was no evidence for fibroblast/myofibroblast conversion following EMT induction, cells displayed a partial EMT gene signature and an aberrant basaloid-like cell phenotype. The substantial transcriptional differences between submerged and ALI cultures highlight that care must be taken when interpreting data from submerged cultures. This work supports that lung epithelial EMT does not generate fibroblasts/myofibroblasts and confirms ALI cultures provide a physiologically relevant system to study aberrant basaloid-like cells and mechanisms of EMT. We provide a catalog of PC and lncRNA genes and an interactive browser ( https://bronc-epi-in-vitro.cells.ucsc.edu/ ) of single-cell RNA-Seq data for further exploration of potential roles in the lung epithelium in health and lung disease. 
    more » « less
  5. INTRODUCTION Neurons are by far the most diverse of all cell types in animals, to the extent that “cell types” in mammalian brains are still mostly heterogeneous groups, and there is no consensus definition of the term. The Drosophila optic lobes, with approximately 200 well-defined cell types, provides a tractable system with which to address the genetic basis of neuronal type diversity. We previously characterized the distinct developmental gene expression program of each of these types using single-cell RNA sequencing (scRNA-seq), with one-to-one correspondence to the known morphological types. RATIONALE The identity of fly neurons is determined by temporal and spatial patterning mechanisms in stem cell progenitors, but it remained unclear how these cell fate decisions are implemented and maintained in postmitotic neurons. It was proposed in Caenorhabditis elegans that unique combinations of terminal selector transcription factors (TFs) that are continuously expressed in each neuron control nearly all of its type-specific gene expression. This model implies that it should be possible to engineer predictable and complete switches of identity between different neurons just by modifying these sustained TFs. We aimed to test this prediction in the Drosophila visual system. RESULTS Here, we used our developmental scRNA-seq atlases to identify the potential terminal selector genes in all optic lobe neurons. We found unique combinations of, on average, 10 differentially expressed and stably maintained (across all stages of development) TFs in each neuron. Through genetic gain- and loss-of-function experiments in postmitotic neurons, we showed that modifications of these selector codes are sufficient to induce predictable switches of identity between various cell types. Combinations of terminal selectors jointly control both developmental (e.g., morphology) and functional (e.g., neurotransmitters and their receptors) features of neurons. The closely related Transmedullary 1 (Tm1), Tm2, Tm4, and Tm6 neurons (see the figure) share a similar code of terminal selectors, but can be distinguished from each other by three TFs that are continuously and specifically expressed in one of these cell types: Drgx in Tm1, Pdm3 in Tm2, and SoxN in Tm6. We showed that the removal of each of these selectors in these cell types reprograms them to the default Tm4 fate. We validated these conversions using both morphological features and molecular markers. In addition, we performed scRNA-seq to show that ectopic expression of pdm3 in Tm4 and Tm6 neurons converts them to neurons with transcriptomes that are nearly indistinguishable from that of wild-type Tm2 neurons. We also show that Drgx expression in Tm1 neurons is regulated by Klumpfuss, a TF expressed in stem cells that instructs this fate in progenitors, establishing a link between the regulatory programs that specify neuronal fates and those that implement them. We identified an intronic enhancer in the Drgx locus whose chromatin is specifically accessible in Tm1 neurons and in which Klu motifs are enriched. Genomic deletion of this region knocked down Drgx expression specifically in Tm1 neurons, leaving it intact in the other cell types that normally express it. We further validated this concept by demonstrating that ectopic expression of Vsx (visual system homeobox) genes in Mi15 neurons not only converts them morphologically to Dm2 neurons, but also leads to the loss of their aminergic identity. Our results suggest that selector combinations can be further sculpted by receptor tyrosine kinase signaling after neurogenesis, providing a potential mechanism for postmitotic plasticity of neuronal fates. Finally, we combined our transcriptomic datasets with previously generated chromatin accessibility datasets to understand the mechanisms that control brain wiring downstream of terminal selectors. We built predictive computational models of gene regulatory networks using the Inferelator framework. Experimental validations of these networks revealed how selectors interact with ecdysone-responsive TFs to activate a large and specific repertoire of cell surface proteins and other effectors in each neuron at the onset of synapse formation. We showed that these network models can be used to identify downstream effectors that mediate specific cellular decisions during circuit formation. For instance, reduced levels of cut expression in Tm2 neurons, because of its negative regulation by pdm3 , controls the synaptic layer targeting of their axons. Knockdown of cut in Tm1 neurons is sufficient to redirect their axons to the Tm2 layer in the lobula neuropil without affecting other morphological features. CONCLUSION Our results support a model in which neuronal type identity is primarily determined by a relatively simple code of continuously expressed terminal selector TFs in each cell type throughout development. Our results provide a unified framework of how specific fates are initiated and maintained in postmitotic neurons and open new avenues to understanding synaptic specificity through gene regulatory networks. The conservation of this regulatory logic in both C. elegans and Drosophila makes it likely that the terminal selector concept will also be useful in understanding and manipulating the neuronal diversity of mammalian brains. Terminal selectors enable predictive cell fate reprogramming. Tm1, Tm2, Tm4, and Tm6 neurons of the Drosophila visual system share a core set of TFs continuously expressed by each cell type (simplified). The default Tm4 fate is overridden by the expression of a single additional terminal selector to generate Tm1 ( Drgx ), Tm2 ( pdm3 ), or Tm6 ( SoxN ) fates. 
    more » « less