skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: A small protein encoded by a putative lncRNA regulates apoptosis and tumorigenicity in human colorectal cancer cells
Long noncoding RNAs (lncRNAs) are often associated with polysomes, indicating coding potential. However, only a handful of endogenous proteins encoded by putative lncRNAs have been identified and assigned a function. Here, we report the discovery of a putative gastrointestinal-tract-specific lncRNA ( LINC00675 ) that is regulated by the pioneer transcription factor FOXA1 and encodes a conserved small protein of 79 amino acids which we termed FORCP ( FO XA1- R egulated C onserved Small P rotein). FORCP transcript is undetectable in most cell types but is abundant in well-differentiated colorectal cancer (CRC) cells where it functions to inhibit proliferation, clonogenicity, and tumorigenesis. The epitope-tagged and endogenous FORCP protein predominantly localizes to the endoplasmic reticulum (ER). In response to ER stress, FORCP depletion results in decreased apoptosis. Our findings on the initial characterization of FORCP demonstrate that FORCP is a novel, conserved small protein encoded by a mis-annotated lncRNA that regulates apoptosis and tumorigenicity in well-differentiated CRC cells.  more » « less
Award ID(s):
1723008
PAR ID:
10226079
Author(s) / Creator(s):
; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; more » ; ; ; ; ; ; ; ; ; « less
Date Published:
Journal Name:
eLife
Volume:
9
ISSN:
2050-084X
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Simon, Anne E. (Ed.)
    ABSTRACT Long noncoding RNAs (lncRNAs) of virus origin accumulate in cells infected by many positive-strand (+) RNA viruses to bolster viral infectivity. Their biogenesis mostly utilizes exoribonucleases of host cells that degrade viral genomic or subgenomic RNAs in the 5′-to-3′ direction until being stalled by well-defined RNA structures. Here, we report a viral lncRNA that is produced by a novel replication-dependent mechanism. This lncRNA corresponds to the last 283 nucleotides of the turnip crinkle virus (TCV) genome and hence is designated tiny TCV subgenomic RNA (ttsgR). ttsgR accumulated to high levels in TCV-infected Nicotiana benthamiana cells when the TCV-encoded RNA-dependent RNA polymerase (RdRp), also known as p88, was overexpressed. Both (+) and (−) strand forms of ttsgR were produced in a manner dependent on the RdRp functionality. Strikingly, templates as short as ttsgR itself were sufficient to program ttsgR amplification, as long as the TCV-encoded replication proteins p28 and p88 were provided in trans . Consistent with its replicational origin, ttsgR accumulation required a 5′ terminal carmovirus consensus sequence (CCS), a sequence motif shared by genomic and subgenomic RNAs of many viruses phylogenetically related to TCV. More importantly, introducing a new CCS motif elsewhere in the TCV genome was alone sufficient to cause the emergence of another lncRNA. Finally, abolishing ttsgR by mutating its 5′ CCS gave rise to a TCV mutant that failed to compete with wild-type TCV in Arabidopsis . Collectively, our results unveil a replication-dependent mechanism for the biogenesis of viral lncRNAs, thus suggesting that multiple mechanisms, individually or in combination, may be responsible for viral lncRNA production. IMPORTANCE Many positive-strand (+) RNA viruses produce long noncoding RNAs (lncRNAs) during the process of cellular infections and mobilize these lncRNAs to counteract antiviral defenses, as well as coordinate the translation of viral proteins. Most viral lncRNAs arise from 5′-to-3′ degradation of longer viral RNAs being stalled at stable secondary structures. Here, we report a viral lncRNA that is produced by the replication machinery of turnip crinkle virus (TCV). This lncRNA, designated ttsgR, shares the terminal characteristics with TCV genomic and subgenomic RNAs and overaccumulates in the presence of moderately overexpressed TCV RNA-dependent RNA polymerase (RdRp). Furthermore, templates that are of similar sizes as ttsgR are readily replicated by TCV replication proteins (p28 and RdRp) provided from nonviral sources. In summary, this study establishes an approach for uncovering low abundance viral lncRNAs, and characterizes a replicating TCV lncRNA. Similar investigations on human-pathogenic (+) RNA viruses could yield novel therapeutic targets. 
    more » « less
  2. Long noncoding RNA (lncRNA) genes outnumber protein coding genes in the human genome and the majority remain uncharacterized. A major difficulty in generalizing understanding of lncRNA function is the dearth of gross sequence conservation, both for lncRNAs across species and for lncRNAs that perform similar functions within a species. Machine learning based methods which harness vast amounts of information on RNAs are increasingly used to impute certain biological characteristics. This includes interactions with proteins that are important mediators of RNA function, thus enabling the generation of knowledge in contexts for which experimental data are lacking. Here, we applied a natural language-based machine learning approach that enabled us to identify RNA binding protein interactions in lncRNA transcripts, using only RNA sequence as an input. We found that this predictive method is a powerful approach to infer conserved binding across species as distant as human and opossum, even in the absence of sequence conservation, thus informing on sequence-function relationships for these poorly understood RNAs. 
    more » « less
  3. Abstract Previously, we have shown that apoplastic wash fluid (AWF) purified from Arabidopsis leaves contains small RNAs (sRNAs). To investigate whether these sRNAs are encapsulated inside extracellular vesicles (EVs), we treated EVs isolated from Arabidopsis leaves with the protease trypsin and RNase A, which should degrade RNAs located outside EVs but not those located inside. These analyses revealed that apoplastic RNAs are mostly located outside and are associated with proteins. Further analyses of these extracellular RNAs (exRNAs) revealed that they include both sRNAs and long noncoding RNAs (lncRNAs), including circular RNAs (circRNAs). We also found that exRNAs are highly enriched in the posttranscriptional modification N6-methyladenine (m6A). Consistent with this, we identified a putative m6A-binding protein in AWF, GLYCINE-RICH RNA-BINDING PROTEIN 7 (GRP7), as well as the sRNA-binding protein ARGONAUTE2 (AGO2). These two proteins coimmunoprecipitated with lncRNAs, including circRNAs. Mutation of GRP7 or AGO2 caused changes in both the sRNA and lncRNA content of AWF, suggesting that these proteins contribute to the secretion and/or stabilization of exRNAs. We propose that exRNAs located outside of EVs mediate host-induced gene silencing, rather than RNA located inside EVs. 
    more » « less
  4. Background: Long non-coding Ribonucleic Acids (lncRNAs) can be localized to different cellular compartments, such as the nuclear and the cytoplasmic regions. Their biological functions are influenced by the region of the cell where they are located. Compared to the vast number of lncRNAs, only a relatively small proportion have annotations regarding their subcellular localization. It would be helpful if those few annotated lncRNAs could be leveraged to develop predictive models for localization of other lncRNAs. Methods: Conventional computational methods use q-mer profiles from lncRNA sequences and train machine learning models such as support vector machines and logistic regression with the profiles. These methods focus on the exact q-mer. Given possible sequence mutations and other uncertainties in genomic sequences and their role in biological function, a consideration of these variabilities might improve our ability to model lncRNAs and their localization. Thus, we build on inexact q-mers and use machine learning/deep learning techniques to study three specific problems in lncRNA subcellular localization, namely, prediction of lncRNA localization using inexact q-mers, the issue of whether lncRNA localization is cell-type-specific, and the notion of switching (lncRNA) genes. Results: We performed our analysis using data on lncRNA localization across 15 cell lines. Our results showed that using inexact q-mers (with q = 6) can improve the lncRNA localization prediction performance compared to using exact q-mers. Further, we showed that lncRNA localization, in general, is not cell-line-specific. We also identified a category of LncRNAs which switch cellular compartments between different cell lines (we call them switching lncRNAs). These switching lncRNAs complicate the problem of predicting lncRNA localization using machine learning models, showing that lncRNA localization is still a major challenge. 
    more » « less
  5. Abstract Long noncoding RNAs (lncRNAs) are RNA transcripts longer than 200 nucleotides that do not code for proteins. LncRNAs play crucial regulatory roles in several biological processes via diverse mechanisms and their aberrant expression is associated with various diseases. LncRNA genes are further subcategorized based on their relative organization in the genome. MicroRNA (miRNA)‐host‐gene‐derived lncRNAs (lnc‐MIRHGs) refer to lncRNAs whose genes also harbor miRNAs. There exists crosstalk between the processing of lnc‐MIRHGs and the biogenesis of the encoded miRNAs. Although the functions of the encoded miRNAs are usually well understood, whether those lnc‐MIRHGs play independent functions are not fully elucidated. Here, we review our current understanding of lnc‐MIRHGs, including their biogenesis, function, and mechanism of action, with a focus on discussing the miRNA‐independent functions of lnc‐MIRHGs, including their involvement in cancer. Our current understanding of lnc‐MIRHGsstrongly indicates that this class of lncRNAs could play important roles in basic cellular events as well as in diseases. This article is categorized under:Regulatory RNAs/RNAi/Riboswitches > Regulatory RNAsRegulatory RNAs/RNAi/Riboswitches > Biogenesis of Effector Small RNAs 
    more » « less