The Fusarium oxysporum species complex (FOSC) includes both plant and human pathogens that cause devastating plant vascular wilt diseases and threaten public health. Each F. oxysporum genome comprises core chromosomes (CCs) for housekeeping functions and accessory chromosomes (ACs) that contribute to host-specific adaptation. This study inspects global transcription factor profiles (TFomes) and their potential roles in coordinating CC and AC functions to accomplish host-specific interactions. Remarkably, we found a clear positive correlation between the sizes of TFomes and the proteomes of an organism. With the acquisition of ACs, the FOSC TFomes were larger than the other fungal genomes included in this study. Among a total of 48 classified TF families, 14 families involved in transcription/translation regulations and cell cycle controls were highly conserved. Among the 30 FOSC expanded families, Zn2-C6 and Znf_C2H2 were most significantly expanded to 671 and 167 genes per family including well-characterized homologs of Ftf1 (Zn2-C6) and PacC (Znf_C2H2) that are involved in host-specific interactions. Manual curation of characterized TFs increased the TFome repertoires by 3% including a disordered protein Ren1. RNA-Seq revealed a steady pattern of expression for conserved TF families and specific activation for AC TFs. Functional characterization of these TFs could enhance our understanding of transcriptional regulation involved in FOSC cross-kingdom interactions, disentangle species-specific adaptation, and identify targets to combat diverse diseases caused by this group of fungal pathogens. 
                        more » 
                        « less   
                    
                            
                            A systematic study of HIF1A cofactors in hypoxic cancer cells
                        
                    
    
            Abstract Hypoxia inducible factor 1 alpha (HIF1A) is a transcription factor (TF) that forms highly structural and functional protein–protein interactions with other TFs to promote gene expression in hypoxic cancer cells. However, despite the importance of these TF-TF interactions, we still lack a comprehensive view of many of the TF cofactors involved and how they cooperate. In this study, we systematically studied HIF1A cofactors in eight cancer cell lines using the computational motif mining tool, SIOMICS, and discovered 201 potential HIF1A cofactors, which included 21 of the 29 known HIF1A cofactors in public databases. These 201 cofactors were statistically and biologically significant, with 19 of the top 37 cofactors in our study directly validated in the literature. The remaining 18 were novel cofactors. These discovered cofactors can be essential to HIF1A’s regulatory functions and may lead to the discovery of new therapeutic targets in cancer treatment. 
        more » 
        « less   
        
    
                            - Award ID(s):
- 2015838
- PAR ID:
- 10379370
- Publisher / Repository:
- Nature Publishing Group
- Date Published:
- Journal Name:
- Scientific Reports
- Volume:
- 12
- Issue:
- 1
- ISSN:
- 2045-2322
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
- 
            
- 
            Transcription factor (TF) target search on genome is highly essential for gene expression and regulation. High-resolution determination of TF diffusion along DNA remains technically challenging. Here, we constructed a TF model system using the plant WRKY domain protein in complex with DNA from crystallography and demonstrated microsecond diffusion dynamics of WRKY on DNA by employing all-atom molecular-dynamics (MD) simulations. Notably, we found that WRKY preferentially binds to one strand of DNA with significant energetic bias compared with the other, or nonpreferred strand. The preferential DNA-strand binding becomes most prominent in the static process, from nonspecific to specific DNA binding, but less distinct during diffusive movements of the domain protein on the DNA. Remarkably, without employing acceleration forces or bias, we captured a complete one-base-pair stepping cycle of the protein tracking along major groove of DNA with a homogeneous poly-adenosine sequence, as individual hydrogen bonds break and reform at the protein–DNA binding interface. Further DNA-groove tracking motions of the protein forward or backward, with occasional sliding as well as strand crossing to minor groove of DNA, were also captured. The processive diffusion of WRKY along DNA has been further sampled via coarse-grained MD simulations. The study thus provides structural dynamics details on diffusion of a small TF domain protein, suggests how the protein approaches a specific recognition site on DNA, and supports further high-precision experimental detection. The stochastic movements revealed in the TF diffusion also provide general clues about how other protein walkers step and slide along DNA.more » « less
- 
            Breast cancer treatment can be improved with biomarkers for early detection and individualized therapy. A set of 86 microRNAs (miRNAs) were identified to separate breast cancer tumors from normal breast tissues (n = 52) with an overall accuracy of 90.4%. Six miRNAs had concordant expression in both tumors and breast cancer patient blood samples compared with the normal control samples. Twelve miRNAs showed concordant expression in tumors vs. normal breast tissues and patient survival (n = 1093), with seven as potential tumor suppressors and five as potential oncomiRs. From experimentally validated target genes of these 86 miRNAs, pan-sensitive and pan-resistant genes with concordant mRNA and protein expression associated with in-vitro drug response to 19 NCCN-recommended breast cancer drugs were selected. Combined with in-vitro proliferation assays using CRISPR-Cas9/RNAi and patient survival analysis, MEK inhibitors PD19830 and BRD-K12244279, pilocarpine, and tremorine were discovered as potential new drug options for treating breast cancer. Multi-omics biomarkers of response to the discovered drugs were identified using human breast cancer cell lines. This study presented an artificial intelligence pipeline of miRNA-based discovery of biomarkers, therapeutic targets, and repositioning drugs that can be applied to many cancer types.more » « less
- 
            Abstract The discovery of cancer driver mutations is a fundamental goal in cancer research. While many cancer driver mutations have been discovered in the protein-coding genome, research into potential cancer drivers in the non-coding regions showed limited success so far. Here, we present a novel comprehensive framework Dr.Nod for detection of non-coding cis-regulatory candidate driver mutations that are associated with dysregulated gene expression using tissue-matched enhancer-gene annotations. Applying the framework to data from over 1500 tumours across eight tissues revealed a 4.4-fold enrichment of candidate driver mutations in regulatory regions of known cancer driver genes. An overarching conclusion that emerges is that the non-coding driver mutations contribute to cancer by significantly altering transcription factor binding sites, leading to upregulation of tissue-matched oncogenes and down-regulation of tumour-suppressor genes. Interestingly, more than half of the detected cancer-promoting non-coding regulatory driver mutations are over 20 kb distant from the cancer-associated genes they regulate. Our results show the importance of tissue-matched enhancer-gene maps, functional impact of mutations, and complex background mutagenesis model for the prediction of non-coding regulatory drivers. In conclusion, our study demonstrates that non-coding mutations in enhancers play a previously underappreciated role in cancer and dysregulation of clinically relevant target genes.more » « less
- 
            Abstract Machine learning approaches have been applied to identify transcription factor (TF)–DNA interaction important for gene regulation and expression. However, due to the enormous search space of the genome, it is challenging to build models capable of surveying entire reference genomes, especially in species where models were not trained. In this study, we surveyed a variety of methods for classification of epigenomics data in an attempt to improve the detection for 12 members of the auxin response factor (ARF)-binding DNAs from maize and soybean as assessed by DNA Affinity Purification and sequencing (DAP-seq). We used the classification for prediction by minimizing the genome search space by only surveying unmethylated regions (UMRs). For identification of DAP-seq-binding events within the UMRs, we achieved 78.72 % accuracy rate across 12 members of ARFs of maize on average by encoding DNA with count vectorization for k-mer with a logistic regression classifier with up-sampling and feature selection. Importantly, feature selection helps to uncover known and potentially novel ARF-binding motifs. This demonstrates an independent method for identification of TF-binding sites. Finally, we tested the model built with maize DAP-seq data and applied it directly to the soybean genome and found high false-negative rates, which accounted for more than 40 % across the ARF TFs tested. The findings in this study suggest the potential use of various methods to predict TF–DNA interactions within and between species with varying degrees of success.more » « less
 An official website of the United States government
An official website of the United States government 
				
			 
					 
					
