skip to main content

Title: tRFtarget: a database for transfer RNA-derived fragment targets
Abstract Transfer RNA-derived fragments (tRFs) are a new class of small non-coding RNAs and play important roles in biological and physiological processes. Prediction of tRF target genes and binding sites is crucial in understanding the biological functions of tRFs in the molecular mechanisms of human diseases. We developed a publicly accessible web-based database, tRFtarget (, for tRF target prediction. It contains the computationally predicted interactions between tRFs and mRNA transcripts using the two state-of-the-art prediction tools RNAhybrid and IntaRNA, including location of the binding sites on the target, the binding region, and free energy of the binding stability with graphic illustration. tRFtarget covers 936 tRFs and 135 thousand predicted targets in eight species. It allows researchers to search either target genes by tRF IDs or tRFs by gene symbols/transcript names. We also integrated the manually curated experimental evidence of the predicted interactions into the database. Furthermore, we provided a convenient link to the DAVID® web server to perform downstream functional pathway analysis and gene ontology annotation on the predicted target genes. This database provides useful information for the scientific community to experimentally validate tRF target genes and facilitate the investigation of the molecular functions and mechanisms of tRFs.
; ; ;
Award ID(s):
Publication Date:
Journal Name:
Nucleic Acids Research
Page Range or eLocation-ID:
D254 to D260
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Deciphering gene regulatory networks (GRNs) is both a promise and challenge of systems biology. The promise lies in identifying key transcription factors (TFs) that enable an organism to react to changes in its environment. The challenge lies in validating GRNs that involve hundreds of TFs with hundreds of thousands of interactions with their genome-wide targets experimentally determined by high-throughput sequencing. To address this challenge, we developed ConnecTF, a species-independent, web-based platform that integrates genome-wide studies of TF–target binding, TF–target regulation, and other TF-centric omic datasets and uses these to build and refine validated or inferred GRNs. We demonstrate themore »functionality of ConnecTF by showing how integration within and across TF–target datasets uncovers biological insights. Case study 1 uses integration of TF–target gene regulation and binding datasets to uncover TF mode-of-action and identify potential TF partners for 14 TFs in abscisic acid signaling. Case study 2 demonstrates how genome-wide TF–target data and automated functions in ConnecTF are used in precision/recall analysis and pruning of an inferred GRN for nitrogen signaling. Case study 3 uses ConnecTF to chart a network path from NLP7, a master TF in nitrogen signaling, to direct secondary TF2s and to its indirect targets in a Network Walking approach. The public version of ConnecTF ( contains 3,738,278 TF–target interactions for 423 TFs in Arabidopsis, 839,210 TF–target interactions for 139 TFs in maize (Zea mays), and 293,094 TF–target interactions for 26 TFs in rice (Oryza sativa). The database and tools in ConnecTF will advance the exploration of GRNs in plant systems biology applications for model and crop species.« less
  2. ABSTRACT DeoR-type helix-turn-helix (HTH) domain proteins are transcriptional regulators of sugar and nucleoside metabolism in diverse bacteria and also occur in select archaea. In the model archaeon Haloferax volcanii , previous work implicated GlpR, a DeoR-type transcriptional regulator, in the transcriptional repression of glpR and the gene encoding the fructose-specific phosphofructokinase ( pfkB ) during growth on glycerol. However, the global regulon governed by GlpR remained unclear. Here, we compared transcriptomes of wild-type and Δ glpR mutant strains grown on glycerol and glucose to detect significant transcript level differences for nearly 50 new genes regulated by GlpR. By coupling computationalmore »prediction of GlpR binding sequences with in vivo and in vitro DNA binding experiments, we determined that GlpR directly controls genes encoding enzymes involved in fructose degradation, including fructose bisphosphate aldolase, a central control point in glycolysis. GlpR also directly controls other transcription factors. In contrast, other metabolic pathways appear to be under the indirect influence of GlpR. In vitro experiments demonstrated that GlpR purifies to function as a tetramer that binds the effector molecule fructose-1-phosphate (F1P). These results suggest that H. volcanii GlpR functions as a direct negative regulator of fructose degradation during growth on carbon sources other than fructose, such as glucose and glycerol, and that GlpR bears striking functional similarity to bacterial DeoR-type regulators. IMPORTANCE Many archaea are extremophiles, able to thrive in habitats of extreme salinity, pH and temperature. These biological properties are ideal for applications in biotechnology. However, limited knowledge of archaeal metabolism is a bottleneck that prevents the broad use of archaea as microbial factories for industrial products. Here, we characterize how sugar uptake and use are regulated in a species that lives in high salinity. We demonstrate that a key sugar regulatory protein in this archaeal species functions using molecular mechanisms conserved with distantly related bacterial species.« less
  3. Abstract AGAMOUS-Like 18 (AGL18) is a MADS domain transcription factor (TF) that is structurally related to AGL15. Here we show that, like AGL15, AGL18 can promote somatic embryogenesis (SE) when ectopically expressed in Arabidopsis (Arabidopsis thaliana). Based on loss-of-function mutants, AGL15 and AGL18 have redundant functions in developmental processes such as SE. To understand the nature of this redundancy, we undertook a number of studies to look at the interaction between these factors. We studied the genome-wide direct targets of AGL18 to characterize its roles at the molecular level using chromatin immunoprecipitation (ChIP)-SEQ combined with RNA-SEQ. The results demonstrated thatmore »AGL18 binds to thousands of sites in the genome. Comparison of ChIP-SEQ data for AGL15 and AGL18 revealed substantial numbers of genes bound by both AGL15 and AGL18, but there were also differences. Gene ontology analysis revealed that target genes were enriched for seed, embryo, and reproductive development as well as hormone and stress responses. The results also demonstrated that AGL15 and AGL18 interact in a complex regulatory loop, where AGL15 inhibited transcript accumulation of AGL18, while AGL18 increased AGL15 transcript accumulation. Co-immunoprecipitation revealed an interaction between AGL18 and AGL15 in somatic embryo tissue. The binding and expression analyses revealed a complex crosstalk and interactions among embryo TFs and their target genes. In addition, our study also revealed that phosphorylation of AGL18 and AGL15 was crucial for the promotion of SE.« less
  4. Abstract Background Identification of genes responsible for anatomical entities is a major requirement in many fields including developmental biology, medicine, and agriculture. Current wet lab techniques used for this purpose, such as gene knockout, are high in resource and time consumption. Protein–protein interaction (PPI) networks are frequently used to predict disease genes for humans and gene candidates for molecular functions, but they are rarely used to predict genes for anatomical entities. Moreover, PPI networks suffer from network quality issues, which can be a limitation for their usage in predicting candidate genes. Therefore, we developed an integrative framework to improve themore »candidate gene prediction accuracy for anatomical entities by combining existing experimental knowledge about gene-anatomical entity relationships with PPI networks using anatomy ontology annotations. We hypothesized that this integration improves the quality of the PPI networks by reducing the number of false positive and false negative interactions and is better optimized to predict candidate genes for anatomical entities. We used existing Uberon anatomical entity annotations for zebrafish and mouse genes to construct gene networks by calculating semantic similarity between the genes. These anatomy-based gene networks were semantic networks, as they were constructed based on the anatomy ontology annotations that were obtained from the experimental data in the literature. We integrated these anatomy-based gene networks with mouse and zebrafish PPI networks retrieved from the STRING database and compared the performance of their network-based candidate gene predictions. Results According to evaluations of candidate gene prediction performance tested under four different semantic similarity calculation methods (Lin, Resnik, Schlicker, and Wang), the integrated networks, which were semantically improved PPI networks, showed better performances by having higher area under the curve values for receiver operating characteristic and precision-recall curves than PPI networks for both zebrafish and mouse. Conclusion Integration of existing experimental knowledge about gene-anatomical entity relationships with PPI networks via anatomy ontology improved the candidate gene prediction accuracy and optimized them for predicting candidate genes for anatomical entities.« less
  5. Abstract Traditional techniques to identify macromolecular targets for drugs utilize solely the information on a query drug and a putative target. Nonetheless, the mechanisms of action of many drugs depend not only on their binding affinity toward a single protein, but also on the signal transduction through cascades of molecular interactions leading to certain phenotypes. Although using protein-protein interaction networks and drug-perturbed gene expression profiles can facilitate system-level investigations of drug-target interactions, utilizing such large and heterogeneous data poses notable challenges. To improve the state-of-the-art in drug target identification, we developed GraphDTI, a robust machine learning framework integrating the molecular-levelmore »information on drugs, proteins, and binding sites with the system-level information on gene expression and protein-protein interactions. In order to properly evaluate the performance of GraphDTI, we compiled a high-quality benchmarking dataset and devised a new cluster-based cross-validation protocol. Encouragingly, GraphDTI not only yields an AUC of 0.996 against the validation dataset, but it also generalizes well to unseen data with an AUC of 0.939, significantly outperforming other predictors. Finally, selected examples of identified drugtarget interactions are validated against the biomedical literature. Numerous applications of GraphDTI include the investigation of drug polypharmacological effects, side effects through offtarget binding, and repositioning opportunities.« less