skip to main content


This content will become publicly available on June 1, 2024

Title: Codon Optimization Improves the Prediction of Xylose Metabolism from Gene Content in Budding Yeasts
Abstract Xylose is the second most abundant monomeric sugar in plant biomass. Consequently, xylose catabolism is an ecologically important trait for saprotrophic organisms, as well as a fundamentally important trait for industries that hope to convert plant mass to renewable fuels and other bioproducts using microbial metabolism. Although common across fungi, xylose catabolism is rare within Saccharomycotina, the subphylum that contains most industrially relevant fermentative yeast species. The genomes of several yeasts unable to consume xylose have been previously reported to contain the full set of genes in the XYL pathway, suggesting the absence of a gene–trait correlation for xylose metabolism. Here, we measured growth on xylose and systematically identified XYL pathway orthologs across the genomes of 332 budding yeast species. Although the XYL pathway coevolved with xylose metabolism, we found that pathway presence only predicted xylose catabolism about half of the time, demonstrating that a complete XYL pathway is necessary, but not sufficient, for xylose catabolism. We also found that XYL1 copy number was positively correlated, after phylogenetic correction, with xylose utilization. We then quantified codon usage bias of XYL genes and found that XYL3 codon optimization was significantly higher, after phylogenetic correction, in species able to consume xylose. Finally, we showed that codon optimization of XYL2 was positively correlated, after phylogenetic correction, with growth rates in xylose medium. We conclude that gene content alone is a weak predictor of xylose metabolism and that using codon optimization enhances the prediction of xylose metabolism from yeast genome sequence data.  more » « less
Award ID(s):
2110403 1442148
NSF-PAR ID:
10426545
Author(s) / Creator(s):
; ; ; ; ; ; ;
Editor(s):
Townsend, Jeffrey
Date Published:
Journal Name:
Molecular Biology and Evolution
Volume:
40
Issue:
6
ISSN:
0737-4038
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Alexandre, Gladys (Ed.)
    ABSTRACT Xylella fastidiosa infects several economically important crops in the Americas, and it also recently emerged in Europe. Here, using a set of Xylella genomes reflective of the genus-wide diversity, we performed a pan-genome analysis based on both core and accessory genes for two purposes: (i) to test associations between genetic divergence and plant host species and (ii) to identify positively selected genes that are potentially involved in arms-race dynamics. For the former, tests yielded significant evidence for the specialization of X. fastidiosa to plant host species. This observation contributes to a growing literature suggesting that the phylogenetic history of X. fastidiosa lineages affects the host range. For the latter, our analyses uncovered evidence of positive selection across codons for 5.3% (67 of 1,257) of the core genes and 5.4% (201 of 3,691) of the accessory genes. These genes are candidates to encode interacting factors with plant and insect hosts. Most of these genes had unknown functions, but we did identify some tractable candidates, including nagZ_2 , which encodes a beta-glucosidase that is important for Neisseria gonorrhoeae biofilm formation; cya , which modulates gene expression in pathogenic bacteria, and barA , a membrane associated histidine kinase that has roles in cell division, metabolism, and pili formation. IMPORTANCE Xylella fastidiosa causes devasting diseases to several critical crops. Because X. fastidiosa colonizes and infects many plant species, it is important to understand whether the genome of X. fastidiosa has genetic determinants that underlie specialization to specific host plants. We analyzed genome sequences of X. fastidiosa to investigate evolutionary relationships and to test for evidence of positive selection on specific genes. We found a significant signal between genome diversity and host plants, consistent with bacterial specialization to specific plant hosts. By screening for positive selection, we identified both core and accessory genes that may affect pathogenicity, including genes involved in biofilm formation. 
    more » « less
  2. null (Ed.)
    Factor for inversion stimulation is a global regulator that is highly expressed during exponential phase growth and undetectable in stationary phase growth. Quorum sensing (QS) is a global regulatory mechanism that controls gene expression in response to changes in cell density and growth phase. In Vibrio parahaemolyticus, a marine species and a significant human pathogen, the QS regulatory sRNAs, Qrr1 to Qrr5, are expressed during exponential growth and negatively regulate the high cell density QS master regulator OpaR. OpaR is a positive regulator of capsule polysaccharide (CPS) formation, which is required for biofilm formation, and is a repressor of lateral flagella required for swarming motility. In V. parahaemolyticus, we show that Fis is a positive regulator of the qrr sRNAs expression. In an in-frame fis deletion mutant, qrr expression was repressed and opaR expression was induced. The Δfis mutant produced CPS and biofilm, but swarming motility was abolished. Also, the fis deletion mutant was more sensitive to polymyxin B. Swarming motility requires expression of both the surface sensing scrABC operon and lateral flagella laf operon. Our data showed that in the Δfis mutant both laf and scrABC genes were repressed. Fis controlled swarming motility indirectly through the QS pathway and directly through the surface sensing pathway. To determine the effects of Fis on cellular metabolism, we performed in vitro growth competition assays, and found that Δfis was outcompeted by wild type in minimal media supplemented with intestinal mucus as a sole nutrient source. The data showed that Fis positively modulated mucus components L-arabinose, D-gluconate and N-acetyl-D-glucosamine catabolism gene expression. In an in vivo colonization competition assay, Δfis was outcompeted by wild type, indicating Fis is required for fitness. Overall, these data demonstrate a global regulatory role for Fis in V. parahaemolyticus that includes QS, motility, and metabolism. 
    more » « less
  3. Background Cellulolytic, hemicellulolytic, and amylolytic (CHA) enzyme-producing halophiles are understudied. The recently defined taxon Iocasia fonsfrigidae consists of one well-described anaerobic bacterial strain: NS-1 T . Prior to characterization of strain NS-1 T , an isolate designated Halocella sp. SP3-1 was isolated and its genome was published. Based on physiological and genetic comparisons, it was suggested that Halocella sp. SP3-1 may be another isolate of I. fronsfrigidae . Despite being geographic variants of the same species, data indicate that strain SP3-1 exhibits genetic, genomic, and physiological characteristics that distinguish it from strain NS-1 T . In this study, we examine the halophilic and alkaliphilic nature of strain SP3-1 and the genetic substrates underlying phenotypic differences between strains SP3-1 and NS-1 T with focus on sugar metabolism and CHA enzyme expression. Methods Standard methods in anaerobic cell culture were used to grow strains SP3-1 as well as other comparator species. Morphological characterization was done via electron microscopy and Schaeffer-Fulton staining. Data for sequence comparisons ( e.g. , 16S rRNA) were retrieved via BLAST and EzBioCloud. Alignments and phylogenetic trees were generated via CLUTAL_X and neighbor joining functions in MEGA (version 11). Genomes were assembled/annotated via the Prokka annotation pipeline. Clusters of Orthologous Groups (COGs) were defined by eegNOG 4.5. DNA-DNA hybridization calculations were performed by the ANI Calculator web service. Results Cells of strain SP3-1 are rods. SP3-1 cells grow at NaCl concentrations of 5-30% (w/v). Optimal growth occurs at 37 °C, pH 8.0, and 20% NaCl (w/v). Although phylogenetic analysis based on 16S rRNA gene indicates that strain SP3-1 belongs to the genus Iocasia with 99.58% average nucleotide sequence identity to Iocasia fonsfrigida NS-1 T , strain SP3-1 is uniquely an extreme haloalkaliphile. Moreover, strain SP3-1 ferments D-glucose to acetate, butyrate, carbon dioxide, hydrogen, ethanol, and butanol and will grow on L-arabinose, D-fructose, D-galactose, D-glucose, D-mannose, D-raffinose, D-xylose, cellobiose, lactose, maltose, sucrose, starch, xylan and phosphoric acid swollen cellulose (PASC). D-rhamnose, alginate, and lignin do not serve as suitable culture substrates for strain SP3-1. Thus, the carbon utilization profile of strain SP3-1 differs from that of I. fronsfrigidae strain NS-1 T . Differences between these two strains are also noted in their lipid composition. Genomic data reveal key differences between the genetic profiles of strain SP3-1 and NS-1 T that likely account for differences in morphology, sugar metabolism, and CHA-enzyme potential. Important to this study, I. fonsfrigidae SP3-1 produces and extracellularly secretes CHA enzymes at different levels and composition than type strain NS-1 T . The high salt tolerance and pH range of SP3-1 makes it an ideal candidate for salt and pH tolerant enzyme discovery. 
    more » « less
  4. INTRODUCTION During the independent process of cereal evolution, many trait shifts appear to have been under convergent selection to meet the specific needs of humans. Identification of convergently selected genes across cereals could help to clarify the evolution of crop species and to accelerate breeding programs. In the past several decades, researchers have debated whether convergent phenotypic selection in distinct lineages is driven by conserved molecular changes or by diverse molecular pathways. Two of the most economically important crops, maize and rice, display some conserved phenotypic shifts—including loss of seed dispersal, decreased seed dormancy, and increased grain number during evolution—even though they experienced independent selection. Hence, maize and rice can serve as an excellent system for understanding the extent of convergent selection among cereals. RATIONALE Despite the identification of a few convergently selected genes, our understanding of the extent of molecular convergence on a genome-wide scale between maize and rice is very limited. To learn how often selection acts on orthologous genes, we investigated the functions and molecular evolution of the grain yield quantitative trait locus KRN2 in maize and its rice ortholog OsKRN2 . We also identified convergently selected genes on a genome-wide scale in maize and rice, using two large datasets. RESULTS We identified a selected gene, KRN2 ( kernel row number2 ), that differs between domesticated maize and its wild ancestor, teosinte. This gene underlies a major quantitative trait locus for kernel row number in maize. Selection in the noncoding upstream regions resulted in a reduction of KRN2 expression and an increased grain number through an increase in kernel rows. The rice ortholog, OsKRN2 , also underwent selection and negatively regulates grain number via control of secondary panicle branches. These orthologs encode WD40 proteins and function synergistically with a gene of unknown function, DUF1644, which suggests that a conserved protein interaction controls grain number in maize and rice. Field tests show that knockout of KRN2 in maize or OsKRN2 in rice increased grain yield by ~10% and ~8%, respectively, with no apparent trade-off in other agronomic traits. This suggests potential applications of KRN2 and its orthologs for crop improvement. On a genome-wide scale, we identified a set of 490 orthologous genes that underwent convergent selection during maize and rice evolution, including KRN2/OsKRN2 . We found that the convergently selected orthologous genes appear to be significantly enriched in two specific pathways in both maize and rice: starch and sucrose metabolism, and biosynthesis of cofactors. A deep analysis of convergently selected genes in the starch metabolic pathway indicates that the degree of genetic convergence via convergent selection is related to the conservation and complexity of the gene network for a given selection. CONCLUSION Our findings show that common phenotypic shifts during maize and rice evolution acting on conserved genes are driven at least in part by convergent selection, which in maize and rice likely occurred both during and after domestication. We provide evolutionary and functional evidence on the convergent selection of KRN2/OsKRN2 for grain number between maize and rice. We further found that a complete loss-of-function allele of KRN2/OsKRN2 increased grain yield without an apparent negative impact on other agronomic traits. Exploring the role of KRN2/OsKRN2 and other convergently selected genes across the cereals could provide new opportunities to enhance the production of other global crops. Shared selected orthologous genes in maize and rice for convergent phenotypic shifts during domestication and improvement. By comparing 3163 selected genes in maize and 18,755 selected genes in rice, we identified 490 orthologous gene pairs, including KRN2 and its rice ortholog OsKRN2 , as having been convergently selected. Knockout of KRN2 in maize or OsKRN2 in rice increased grain yield by increasing kernel rows and secondary panicle branches, respectively. 
    more » « less
  5. Ruvinsky, Ilya (Ed.)
    Abstract Developmental polyphenism, the ability to switch between phenotypes in response to environmental variation, involves the alternating activation of environmentally sensitive genes. Consequently, to understand how a polyphenic response evolves requires a comparative analysis of the components that make up environmentally sensitive networks. Here, we inferred coexpression networks for a morphological polyphenism, the feeding-structure dimorphism of the nematode Pristionchus pacificus. In this species, individuals produce alternative forms of a novel trait—moveable teeth, which in one morph enable predatory feeding—in response to environmental cues. To identify the origins of polyphenism network components, we independently inferred coexpression modules for more conserved transcriptional responses, including in an ancestrally nonpolyphenic nematode species. Further, through genome-wide analyses of these components across the nematode family (Diplogastridae) in which the polyphenism arose, we reconstructed how network components have changed. To achieve this, we assembled and resolved the phylogenetic context for five genomes of species representing the breadth of Diplogastridae and a hypothesized outgroup. We found that gene networks instructing alternative forms arose from ancestral plastic responses to environment, specifically starvation-induced metabolism and the formation of a conserved diapause (dauer) stage. Moreover, loci from rapidly evolving gene families were integrated into these networks with higher connectivity than throughout the rest of the P. pacificus transcriptome. In summary, we show that the modular regulatory outputs of a polyphenic response evolved through the integration of conserved plastic responses into networks with genes of high evolutionary turnover. 
    more » « less