Search for: All records

Award ID contains: 1741090

  1. Abstract

    We applied several highly effective algorithms to public transcriptomic data to identify key transcription factors (TFs) that regulate regeneration in Arabidopsis thaliana. We first used CollaborativeNet, also known as TF-Cluster, to construct a collaborative network of all TFs, which was then decomposed into many subnetworks using the Triple-Link and Compound Spring Embedder (CoSE) algorithms. Functional analysis of these subnetworks identified nine that were closely associated with regeneration. Principal component analysis and gene ontology (GO) enrichment analysis narrowed these nine subnetworks to three, namely subnetworks 1, 12, and 17. To rank the TFs within each of these three subnetworks, we searched for TF-binding sites in the promoters of the co-expressed and co-regulated genes (CCGs) of all TFs in the subnetworks and performed Triple-Gene Mutual Interaction analysis of those TFs with the CCGs involved in regeneration. Finally, six candidate TFs from subnetwork 1 (WOX9A, LEC2, PGA37, WIP5, PEI1, and AIL1) were identified, and their roles in somatic embryogenesis (GO:0010262) and regeneration (GO:0031099) were discussed, as were the regeneration-associated TFs in subnetworks 12 and 17. The identified TFs were also assessed using the CIS-BP database and Expression Atlas. Our analyses suggest some novel TFs that may have regulatory roles in regeneration and embryogenesis and provide valuable data and insights into the regulatory mechanisms of regeneration. The tools and procedures used here are instrumental for analyzing high-throughput transcriptomic data and advancing our understanding of the regulation of various biological processes of interest.
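As a rough illustration of the GO enrichment step in this abstract, the sketch below computes a one-sided hypergeometric p-value for one GO term in one subnetwork. The abstract does not say which test was used, so the choice of the hypergeometric test, the function name `enrichment_pvalue`, and every count in the usage line are assumptions made for illustration only.

```python
from math import comb


def enrichment_pvalue(hits, subnetwork_size, annotated, universe):
    """Return P(X >= hits) for a hypergeometric draw.

    universe        -- total number of genes considered (hypothetical)
    annotated       -- genes in the universe carrying the GO term
    subnetwork_size -- genes in the subnetwork being tested
    hits            -- subnetwork genes carrying the GO term
    """
    total = comb(universe, subnetwork_size)
    upper = min(annotated, subnetwork_size)
    # Sum the probability mass of every outcome at least as extreme as `hits`.
    tail = sum(
        comb(annotated, x) * comb(universe - annotated, subnetwork_size - x)
        for x in range(hits, upper + 1)
    )
    return tail / total


# Hypothetical counts, not taken from the study.
p_value = enrichment_pvalue(hits=5, subnetwork_size=20, annotated=40, universe=1700)
print(f"enrichment p-value: {p_value:.3g}")
```

In practice a p-value like this would be computed for every GO term and corrected for multiple testing before subnetworks are compared.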

     
    Free, publicly-accessible full text available November 22, 2024
  2. Abstract

    Four statistical selection methods for inferring transcription factor (TF)–target gene (TG) pairs were developed by coupling a mean squared error (MSE) or Huber loss function with an elastic net (ENET) or least absolute shrinkage and selection operator (Lasso) penalty. Two methods were also developed for inferring pathway gene regulatory networks (GRNs) by combining the Huber or MSE loss function with a network (Net)-based penalty. To solve these regressions, we improved an accelerated proximal gradient descent (APGD) algorithm to optimize the parameter selection process, resulting in an algorithm that is equally effective but much faster than the commonly used convex optimization solver. Synthetic data generated in a general setting were used to test the four TF–TG identification methods; the ENET-based methods performed better than the Lasso-based methods. Synthetic data generated from two network settings were used to test Huber-Net and MSE-Net, which outperformed all other methods. The TF–TG identification methods were also tested with SND1 and gl3 overexpression transcriptomic data; Huber-ENET and MSE-ENET outperformed all other methods when genome-wide predictions were performed. The TF–TG identification methods fill the need for a method for genome-wide TG prediction of a TF and have potential for validating ChIP/DAP-seq results, while the two Net-based methods are instrumental for predicting pathway GRNs.
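To make the Huber-ENET idea concrete, here is a minimal sketch of accelerated proximal gradient descent for a Huber loss with an elastic net penalty. It is a generic textbook (FISTA-style) version, not the authors' improved algorithm. The function names, the default values of `lam`, `alpha`, and `delta`, the fixed step size, and the simulated data are all assumptions chosen for illustration.

```python
import math
import random


def huber_grad(r, delta):
    """Derivative of the Huber loss with respect to the residual r."""
    return r if abs(r) <= delta else math.copysign(delta, r)


def soft_threshold(v, t):
    """Proximal operator of t * |v| (the Lasso part of the penalty)."""
    return math.copysign(max(abs(v) - t, 0.0), v)


def objective(X, y, beta, lam, alpha, delta):
    """Mean Huber loss plus the elastic net penalty."""
    loss = 0.0
    for row, yi in zip(X, y):
        r = yi - sum(x * b for x, b in zip(row, beta))
        loss += 0.5 * r * r if abs(r) <= delta else delta * (abs(r) - 0.5 * delta)
    l1 = sum(abs(b) for b in beta)
    l2 = sum(b * b for b in beta)
    return loss / len(y) + lam * (alpha * l1 + 0.5 * (1.0 - alpha) * l2)


def apgd_huber_enet(X, y, lam=0.1, alpha=0.5, delta=1.345, n_iter=500):
    """Accelerated proximal gradient descent with a fixed step size."""
    n, p = len(X), len(X[0])
    # The squared Frobenius norm of X divided by n upper-bounds the Lipschitz
    # constant of the smooth part, so its inverse is a safe step size.
    step = n / sum(x * x for row in X for x in row)
    beta, z, t = [0.0] * p, [0.0] * p, 1.0
    for _ in range(n_iter):
        # Gradient of the mean Huber loss at the extrapolated point z.
        psi = [huber_grad(yi - sum(x * b for x, b in zip(row, z)), delta)
               for row, yi in zip(X, y)]
        grad = [-sum(X[i][j] * psi[i] for i in range(n)) / n for j in range(p)]
        # Proximal step for the elastic net: soft-threshold, then shrink.
        new_beta = [
            soft_threshold(z[j] - step * grad[j], step * lam * alpha)
            / (1.0 + step * lam * (1.0 - alpha))
            for j in range(p)
        ]
        # Nesterov momentum update.
        new_t = (1.0 + math.sqrt(1.0 + 4.0 * t * t)) / 2.0
        z = [nb + (t - 1.0) / new_t * (nb - b) for nb, b in zip(new_beta, beta)]
        beta, t = new_beta, new_t
    return beta


def simulate(n=100, seed=0):
    """Toy data: one 'target gene' driven by 2 of 5 hypothetical TFs."""
    rng = random.Random(seed)
    true_beta = [2.0, 0.0, -1.5, 0.0, 0.0]
    X = [[rng.gauss(0.0, 1.0) for _ in true_beta] for _ in range(n)]
    y = [sum(x * b for x, b in zip(row, true_beta)) + rng.gauss(0.0, 0.1)
         for row in X]
    for i in range(3):
        y[i] += 10.0  # a few gross outliers, where the Huber loss should help
    return X, y, true_beta


X, y, true_beta = simulate()
beta_hat = apgd_huber_enet(X, y)
print([round(b, 3) for b in beta_hat])
```

Replacing `huber_grad` with the plain residual gives the MSE variant, and setting `alpha=1.0` gives the Lasso variant, so the four loss and penalty combinations named in the abstract differ only in those two pieces.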

     
  3. Abstract

    Goss's wilt, caused by the Gram-positive actinobacterium Clavibacter nebraskensis, is an important bacterial disease of maize. The molecular and genetic mechanisms of resistance to the bacterium, or, in general, Gram-positive bacteria causing plant diseases, remain poorly understood. Here, we examined the genetic basis of Goss's wilt through differential gene expression, standard genome-wide association mapping (GWAS), extreme phenotype (XP) GWAS using highly resistant (R) and highly susceptible (S) lines, and quantitative trait locus (QTL) mapping using 3 bi-parental populations, identifying 11 disease association loci. Three loci were validated using near-isogenic lines or recombinant inbred lines. Our analysis indicates that Goss's wilt resistance is highly complex and major resistance genes are not commonly present. RNA sequencing of samples separately pooled from R and S lines with or without bacterial inoculation was performed, enabling identification of common and differential gene responses in R and S lines. Based on expression, in both R and S lines, the photosynthesis pathway was silenced upon infection, while stress-responsive pathways and phytohormone pathways, namely, abscisic acid, auxin, ethylene, jasmonate, and gibberellin, were markedly activated. In addition, 65 genes showed differential responses (up- or down-regulated) to infection in R and S lines. Combining genetic mapping and transcriptional data, individual candidate genes conferring Goss's wilt resistance were identified. Collectively, aspects of the genetic architecture of Goss's wilt resistance were revealed, providing foundational data for mechanistic studies.

     
  4. Abstract

    Understanding gene regulatory networks is essential to elucidate developmental processes and environmental responses. Here, we studied regulation of a maize (Zea mays) transcription factor gene using designer transcription activator-like effectors (dTALes), which are synthetic Type III TALes of the bacterial genus Xanthomonas and serve as inducers of disease susceptibility gene transcription in host cells. The maize pathogen Xanthomonas vasicola pv. vasculorum was used to introduce 2 independent dTALes into maize cells to induce expression of the gene glossy3 (gl3), which encodes a MYB transcription factor involved in biosynthesis of cuticular wax. RNA-seq analysis of leaf samples identified, in addition to gl3, 146 genes altered in expression by the 2 dTALes. Nine of the 10 genes known to be involved in cuticular wax biosynthesis were upregulated by at least 1 of the 2 dTALes. A gene previously unknown to be associated with gl3, Zm00001d017418, which encodes aldehyde dehydrogenase, was also expressed in a dTALe-dependent manner. A chemically induced mutant and a CRISPR-Cas9 mutant of Zm00001d017418 both exhibited glossy leaf phenotypes, indicating that Zm00001d017418 is involved in biosynthesis of cuticular waxes. Bacterial protein delivery of dTALes proved to be a straightforward and practical approach for the analysis and discovery of pathway-specific genes in maize.

     
  5. Abstract

    Background

    The maize inbred line A188 is an attractive model for elucidation of gene function and improvement because of its high embryogenic capacity and its many traits that contrast with those of B73, the source of the first maize reference genome, and other elite lines. The lack of a genome assembly of A188 limits its use as a model for functional studies.

    Results

    Here, we present a chromosome-level genome assembly of A188 using long reads and optical maps. Comparison of A188 with B73 using both whole-genome alignments and read depths from sequencing reads identifies approximately 1.1 Gb of syntenic sequences as well as extensive structural variation, including a 1.8-Mb duplication containing the Gametophyte factor1 locus for unilateral cross-incompatibility, and six inversions of 0.7 Mb or greater. Increased copy number of carotenoid cleavage dioxygenase 1 (ccd1) in A188 is associated with elevated expression during seed development. High ccd1 expression in seeds together with low expression of yellow endosperm 1 (y1) reduces carotenoid accumulation, accounting for the white seed phenotype of A188. Furthermore, transcriptome and epigenome analyses reveal enhanced expression of defense pathways and altered DNA methylation patterns of the embryonic callus.

    Conclusions

    The A188 genome assembly provides a high-resolution sequence for a complex genome species and a foundational resource for analyses of genome variation and gene function in maize. The genome, in comparison to B73, contains extensive intra-species structural variations and other genetic differences. Expression and network analyses identify discrete profiles for embryonic callus and other tissues.

     
  6. Abstract

    The wheat wild relative Aegilops tauschii was previously used to transfer the Lr42 leaf rust resistance gene into bread wheat. Lr42 confers resistance at both seedling and adult stages, and it is broadly effective against all leaf rust races tested to date. Lr42 has been used extensively in the CIMMYT international wheat breeding program, with resulting cultivars deployed in several countries. Here, using a bulked segregant RNA-Seq (BSR-Seq) mapping strategy, we identify three candidate genes for Lr42. Overexpression of a nucleotide-binding site leucine-rich repeat (NLR) gene, AET1Gv20040300, induces strong resistance to leaf rust in wheat, and a mutation of the gene disrupts the resistance. The Lr42 resistance allele is rare in Ae. tauschii and likely arose from ectopic recombination. Cloning of Lr42 provides diagnostic markers, and over 1000 CIMMYT wheat lines carrying Lr42 have been developed, documenting its widespread use and impact in crop improvement.

     
  7. Summary

    Increasing populations and temperatures are expected to escalate food demands beyond production capacities, and the development of maize lines with better performance under heat stress is desirable. Here, we report that constitutive ectopic expression of a heterologous glutaredoxin S17 from Arabidopsis thaliana (AtGRXS17) can provide thermotolerance in maize through enhanced chaperone activity and modulation of heat stress-associated gene expression. The thermotolerant maize lines had increased protection against protein damage and yielded a sixfold increase in grain production in comparison to the non-transgenic counterparts under heat stress field conditions. The maize lines also displayed thermotolerance in the reproductive stages, resulting in improved pollen germination and higher fidelity of fertilized ovules under heat stress conditions. Our results present a robust and simple strategy for meeting rising yield demands in maize and, possibly, other crop species in a warming global environment.

     
  8. Abstract

    Drought stress is a major constraint in global maize production, causing yield losses of roughly 30–90% depending on the growth stage and the degree and duration of the stress. Here, we report that ectopic expression of Arabidopsis glutaredoxin S17 (AtGRXS17) in field-grown maize conferred tolerance to drought stress during the reproductive stage, which is the most drought-sensitive stage for seed set and, consequently, grain yield. AtGRXS17-expressing maize lines displayed higher seed set in the field, resulting in 2-fold and 1.5-fold increases in yield in comparison to the non-transgenic plants when challenged with drought stress at the tasseling and silking/pollination stages, respectively. AtGRXS17-expressing lines showed higher relative water content, higher chlorophyll content, and less hydrogen peroxide accumulation than wild-type (WT) control plants under drought conditions. AtGRXS17-expressing lines also exhibited at least 2-fold more pollen germination than WT plants under drought stress. Compared to the transgenic maize, WT controls accumulated higher amounts of proline, indicating that WT plants were more stressed over the same period. The results present a robust and simple strategy for meeting rising yield demands in maize under water-limiting conditions.
  9. Abstract

    Genome sequences provide genomic maps with single-base resolution for exploring genetic content. Sequencing technologies, particularly long-read technologies, have revolutionized genome assembly by producing highly continuous genome sequences. However, current long-read sequencing technologies generate inaccurate reads that contain many errors. Some errors are retained in assembled sequences and are typically not completely corrected by using either long reads or more accurate short reads. The issue is common, but few tools are dedicated to computing error rates or determining error locations. In this study, we developed a novel approach, referred to as k-mer abundance difference (KAD), to compare the inferred copy number of each k-mer indicated by short reads with the observed copy number in the assembly. Simple KAD metrics enable classification of k-mers into categories that reflect the quality of the assembly. Specifically, the KAD method can be used to identify base errors and estimate the overall error rate. In addition, sequence insertions and deletions as well as sequence redundancy can also be detected. Collectively, KAD is valuable for quality evaluation of genome assemblies and, potentially, provides a diagnostic tool to aid in precise error correction. KAD software has been developed to facilitate public use.
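The comparison described in this abstract can be sketched as a per-k-mer score that contrasts read support with assembly copy number. The abstract does not give a formula, so the score used below, log2((c + m) / (m (n + 1))), is an assumed form for illustration. Here `c` is the k-mer count in the reads, `n` is its copy number in the assembly, and `m` is the typical read count of a single-copy k-mer, estimated crudely as the most common read count. The toy genome, the error-free reads, and the function names are also hypothetical, and reverse complements are ignored.

```python
import math
from collections import Counter


def count_kmers(sequences, k):
    """Count every k-mer (forward strand only) across a list of sequences."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - k + 1):
            counts[seq[i:i + k]] += 1
    return counts


def kad(c, m, n):
    """Assumed score: 0 when read support matches the assembly copy number,
    negative when the assembly has more copies than the reads support,
    positive when it has fewer."""
    return math.log2((c + m) / (m * (n + 1)))


def kad_profile(reads, assembly, k):
    """Score every k-mer seen in either the reads or the assembly."""
    read_counts = count_kmers(reads, k)
    asm_counts = count_kmers([assembly], k)
    # Crude estimate of single-copy depth: the most common read k-mer count.
    m = Counter(read_counts.values()).most_common(1)[0][0]
    kmers = set(read_counts) | set(asm_counts)
    return {
        kmer: kad(read_counts.get(kmer, 0), m, asm_counts.get(kmer, 0))
        for kmer in kmers
    }


# Toy example: error-free reads at 10x depth and an assembly with one wrong base.
genome = "ACGTTGCAAGGCTTACCGATAGTC"
reads = [genome] * 10
assembly = genome[:11] + "A" + genome[12:]

profile = kad_profile(reads, assembly, k=5)
summary = Counter(round(score, 6) for score in profile.values())
for score, n_kmers in sorted(summary.items()):
    print(f"score {score:+.1f}: {n_kmers} k-mers")
```

Under these assumptions, k-mers that exist only in the assembly get a score of -1 and flag the likely position of the base error, while k-mers supported by reads but absent from the assembly get a positive score.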