skip to main content


Title: An updated catalogue of diverse type II polyketide synthase biosynthetic gene clusters captured from large-scale nucleotide databases
Nature serves as a rich source of molecules with immense chemical diversity. Aptly named, these ‘natural products’ boast a wide variety of environmental, medicinal and industrial applications. Type II polyketides, in particular, confer substantial medicinal benefits, including antibacterial, antifungal, anticancer and anti-inflammatory properties. These molecules are produced by enzyme assemblies known as type II polyketide synthases (PKSs), which use domains such as the ketosynthase chain-length factor and acyl carrier protein to produce polyketides with varying lengths, cyclization patterns and oxidation states. In this work, we use a novel bioinformatic workflow to identify biosynthetic gene clusters (BGCs) that code for the core type II PKS enzymes. This method does not rely on annotation and thus was able to unearth previously ‘hidden’ type II PKS BGCs. This work led us to identify over 6000 putative type II PKS BGCs spanning a diverse set of microbial phyla, nearly double those found in most recent studies. Notably, many of these newly identified BGCs were found in non-actinobacteria, which are relatively underexplored as sources of type II polyketides. Results from this work lay an important foundation for future bioprospecting and engineering efforts that will enable sustainable access to diverse and structurally complex molecules with medicinally relevant properties.  more » « less
Award ID(s):
1652424 2201984
NSF-PAR ID:
10405264
Author(s) / Creator(s):
; ;
Date Published:
Journal Name:
Microbial Genomics
Volume:
9
Issue:
3
ISSN:
2057-5858
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Animal cytoplasmic fatty acid synthase (FAS) represents a unique family of enzymes that are classically thought to be most closely related to fungal polyketide synthase (PKS). Recently, a widespread family of animal lipid metabolic enzymes has been described that bridges the gap between these two ubiquitous and important enzyme classes: the animal FAS–like PKSs (AFPKs). Although very similar in sequence to FAS enzymes that produce saturated lipids widely found in animals, AFPKs instead produce structurally diverse compounds that resemble bioactive polyketides. Little is known about the factors that bridge lipid and polyketide synthesis in the animals. Here, we describe the function of EcPKS2 fromElysia chlorotica, which synthesizes a complex polypropionate natural product found in this mollusc. EcPKS2 starter unit promiscuity potentially explains the high diversity of polyketides found in and among molluscan species. Biochemical comparison of EcPKS2 with the previously described EcPKS1 reveals molecular principles governing substrate selectivity that should apply to related enzymes encoded within the genomes of photosynthetic gastropods. Hybridization experiments combining EcPKS1 and EcPKS2 demonstrate the interactions between the ketoreductase and ketosynthase domains in governing the product outcomes. Overall, these findings enable an understanding of the molecular principles of structural diversity underlying the many molluscan polyketides likely produced by the diverse AFPK enzyme family.

     
    more » « less
  2. Simmons, Lyle A. ; Bush, Karen (Ed.)
    ABSTRACT Unique DNA repair enzymes that provide self-resistance against therapeutically important, genotoxic natural products have been discovered in bacterial biosynthetic gene clusters (BGCs). Among these, the DNA glycosylase AlkZ is essential for azinomycin B production and belongs to the HTH_42 superfamily of uncharacterized proteins. Despite their widespread existence in antibiotic producers and pathogens, the roles of these proteins in production of other natural products are unknown. Here, we determine the evolutionary relationship and genomic distribution of all HTH_42 proteins from Streptomyces and use a resistance-based genome mining approach to identify homologs associated with known and uncharacterized BGCs. We find that AlkZ-like (AZL) proteins constitute one distinct HTH_42 subfamily and are highly enriched in BGCs and variable in sequence, suggesting each has evolved to protect against a specific secondary metabolite. As a validation of the approach, we show that the AZL protein, HedH4, associated with biosynthesis of the alkylating agent hedamycin, excises hedamycin-DNA adducts with exquisite specificity and provides resistance to the natural product in cells. We also identify a second, phylogenetically and functionally distinct subfamily whose proteins are never associated with BGCs, are highly conserved with respect to sequence and genomic neighborhood, and repair DNA lesions not associated with a particular natural product. This work delineates two related families of DNA repair enzymes—one specific for complex alkyl-DNA lesions and involved in self-resistance to antimicrobials and the other likely involved in protection against an array of genotoxins—and provides a framework for targeted discovery of new genotoxic compounds with therapeutic potential. IMPORTANCE Bacteria are rich sources of secondary metabolites that include DNA-damaging genotoxins with antitumor/antibiotic properties. Although Streptomyces produce a diverse number of therapeutic genotoxins, efforts toward targeted discovery of biosynthetic gene clusters (BGCs) producing DNA-damaging agents is lacking. Moreover, work on toxin-resistance genes has lagged behind our understanding of those involved in natural product synthesis. Here, we identified over 70 uncharacterized BGCs producing potentially novel genotoxins through resistance-based genome mining using the azinomycin B-resistance DNA glycosylase AlkZ. We validate our analysis by characterizing the enzymatic activity and cellular resistance of one AlkZ ortholog in the BGC of hedamycin, a potent DNA alkylating agent. Moreover, we uncover a second, phylogenetically distinct family of proteins related to Escherichia coli YcaQ, a DNA glycosylase capable of unhooking interstrand DNA cross-links, which differs from the AlkZ-like family in sequence, genomic location, proximity to BGCs, and substrate specificity. This work defines two families of DNA glycosylase for specialized repair of complex genotoxic natural products and generalized repair of a broad range of alkyl-DNA adducts and provides a framework for targeted discovery of new compounds with therapeutic potential. 
    more » « less
  3. Abstract  

    Secondary metabolites (SMs) are biologically active small molecules, many of which are medically valuable. Fungal genomes contain vast numbers of SM biosynthetic gene clusters (BGCs) with unknown products, suggesting that huge numbers of valuable SMs remain to be discovered. It is challenging, however, to identify SM BGCs, among the millions present in fungi, that produce useful compounds. One solution is resistance gene-guided genome mining, which takes advantage of the fact that some BGCs contain a gene encoding a resistant version of the protein targeted by the compound produced by the BGC. The bioinformatic signature of such BGCs is that they contain an allele of an essential gene with no SM biosynthetic function, and there is a second allele elsewhere in the genome. We have developed a computer-assisted approach to resistance gene-guided genome mining that allows users to query large databases for BGCs that putatively make compounds that have targets of therapeutic interest. Working with the MycoCosm genome database, we have applied this approach to look for SM BGCs that target the proteasome β6 subunit, the target of the proteasome inhibitor fellutamide B, or HMG-CoA reductase, the target of cholesterol reducing therapeutics such as lovastatin. Our approach proved effective, finding known fellutamide and lovastatin BGCs as well as fellutamide- and lovastatin-related BGCs with variations in the SM genes that suggest they may produce structural variants of fellutamides and lovastatin. Gratifyingly, we also found BGCs that are not closely related to lovastatin BGCs but putatively produce novel HMG-CoA reductase inhibitors.

    One-Sentence Summary

    A new computer-assisted approach to resistance gene-directed genome mining is reported along with its use to identify fungal biosynthetic gene clusters that putatively produce proteasome and HMG-CoA reductase inhibitors.

     
    more » « less
  4. Goldman, Gustavo H. (Ed.)
    ABSTRACT Fungal secondary metabolites are widely used as therapeutics and are vital components of drug discovery programs. A major challenge hindering discovery of novel secondary metabolites is that the underlying pathways involved in their biosynthesis are transcriptionally silent under typical laboratory growth conditions, making it difficult to identify the transcriptional networks that they are embedded in. Furthermore, while the genes participating in secondary metabolic pathways are typically found in contiguous clusters on the genome, known as biosynthetic gene clusters (BGCs), this is not always the case, especially for global and pathway-specific regulators of pathways’ activities. To address these challenges, we used 283 genome-wide gene expression data sets of the ascomycete cell factory Aspergillus niger generated during growth under 155 different conditions to construct two gene coexpression networks based on Spearman’s correlation coefficients (SCCs) and on mutual rank-transformed Pearson’s correlation coefficients (MR-PCCs). By mining these networks, we predicted six transcription factors, named MjkA to MjkF, to regulate secondary metabolism in A. niger . Overexpression of each transcription factor using the Tet-On cassette modulated the production of multiple secondary metabolites. We found that the SCC and MR-PCC approaches complemented each other, enabling the delineation of putative global (SCC) and pathway-specific (MR-PCC) transcription factors. These results highlight the potential of coexpression network approaches to identify and activate fungal secondary metabolic pathways and their products. More broadly, we argue that drug discovery programs in fungi should move beyond the BGC paradigm and focus on understanding the global regulatory networks in which secondary metabolic pathways are embedded. IMPORTANCE There is an urgent need for novel bioactive molecules in both agriculture and medicine. The genomes of fungi are thought to contain vast numbers of metabolic pathways involved in the biosynthesis of secondary metabolites with diverse bioactivities. Because these metabolites are biosynthesized only under specific conditions, the vast majority of the fungal pharmacopeia awaits discovery. To discover the genetic networks that regulate the activity of secondary metabolites, we examined the genome-wide profiles of gene activity of the cell factory Aspergillus niger across hundreds of conditions. By constructing global networks that link genes with similar activities across conditions, we identified six putative global and pathway-specific regulators of secondary metabolite biosynthesis. Our study shows that elucidating the behavior of the genetic networks of fungi under diverse conditions harbors enormous promise for understanding fungal secondary metabolism, which ultimately may lead to novel drug candidates. 
    more » « less
  5. Abstract

    Fungal polyketides display remarkable structural diversity and bioactivity, and therefore the biosynthesis and engineering of this large class of molecules is therapeutically significant. Here, we successfully recode, construct and characterize the biosynthetic pathway of bikaverin, a tetracyclic polyketide with antibiotic, antifungal and anticancer properties, inS. cerevisiae. We use a green fluorescent protein (GFP) mapping strategy to identify the low expression of Bik1 (polyketide synthase) as a major bottleneck step in the pathway, and a promoter exchange strategy is used to increase expression of Bik1 and bikaverin titer. Then, we use an enzyme-fusion strategy to directly couple the monooxygenase (Bik2) and methyltransferase (Bik3) to efficiently channel intermediates between modifying enzymes, leading to an improved titer of bikaverin at 202.75 mg/L with flask fermentation (273-fold higher than the initial titer). This study demonstrates that the biosynthesis of complex fungal polyketides can be established and efficiently engineered inS. cerevisiae, highlighting the potential for natural product synthesis and large-scale fermentation in yeast.

     
    more » « less