skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: An updated catalogue of diverse type II polyketide synthase biosynthetic gene clusters captured from large-scale nucleotide databases
Nature serves as a rich source of molecules with immense chemical diversity. Aptly named, these ‘natural products’ boast a wide variety of environmental, medicinal and industrial applications. Type II polyketides, in particular, confer substantial medicinal benefits, including antibacterial, antifungal, anticancer and anti-inflammatory properties. These molecules are produced by enzyme assemblies known as type II polyketide synthases (PKSs), which use domains such as the ketosynthase chain-length factor and acyl carrier protein to produce polyketides with varying lengths, cyclization patterns and oxidation states. In this work, we use a novel bioinformatic workflow to identify biosynthetic gene clusters (BGCs) that code for the core type II PKS enzymes. This method does not rely on annotation and thus was able to unearth previously ‘hidden’ type II PKS BGCs. This work led us to identify over 6000 putative type II PKS BGCs spanning a diverse set of microbial phyla, nearly double those found in most recent studies. Notably, many of these newly identified BGCs were found in non-actinobacteria, which are relatively underexplored as sources of type II polyketides. Results from this work lay an important foundation for future bioprospecting and engineering efforts that will enable sustainable access to diverse and structurally complex molecules with medicinally relevant properties.  more » « less
Award ID(s):
1652424 2201984
PAR ID:
10405264
Author(s) / Creator(s):
; ;
Date Published:
Journal Name:
Microbial Genomics
Volume:
9
Issue:
3
ISSN:
2057-5858
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Animal cytoplasmic fatty acid synthase (FAS) represents a unique family of enzymes that are classically thought to be most closely related to fungal polyketide synthase (PKS). Recently, a widespread family of animal lipid metabolic enzymes has been described that bridges the gap between these two ubiquitous and important enzyme classes: the animal FAS–like PKSs (AFPKs). Although very similar in sequence to FAS enzymes that produce saturated lipids widely found in animals, AFPKs instead produce structurally diverse compounds that resemble bioactive polyketides. Little is known about the factors that bridge lipid and polyketide synthesis in the animals. Here, we describe the function of EcPKS2 fromElysia chlorotica, which synthesizes a complex polypropionate natural product found in this mollusc. EcPKS2 starter unit promiscuity potentially explains the high diversity of polyketides found in and among molluscan species. Biochemical comparison of EcPKS2 with the previously described EcPKS1 reveals molecular principles governing substrate selectivity that should apply to related enzymes encoded within the genomes of photosynthetic gastropods. Hybridization experiments combining EcPKS1 and EcPKS2 demonstrate the interactions between the ketoreductase and ketosynthase domains in governing the product outcomes. Overall, these findings enable an understanding of the molecular principles of structural diversity underlying the many molluscan polyketides likely produced by the diverse AFPK enzyme family. 
    more » « less
  2. Abstract  Secondary metabolites (SMs) are biologically active small molecules, many of which are medically valuable. Fungal genomes contain vast numbers of SM biosynthetic gene clusters (BGCs) with unknown products, suggesting that huge numbers of valuable SMs remain to be discovered. It is challenging, however, to identify SM BGCs, among the millions present in fungi, that produce useful compounds. One solution is resistance gene-guided genome mining, which takes advantage of the fact that some BGCs contain a gene encoding a resistant version of the protein targeted by the compound produced by the BGC. The bioinformatic signature of such BGCs is that they contain an allele of an essential gene with no SM biosynthetic function, and there is a second allele elsewhere in the genome. We have developed a computer-assisted approach to resistance gene-guided genome mining that allows users to query large databases for BGCs that putatively make compounds that have targets of therapeutic interest. Working with the MycoCosm genome database, we have applied this approach to look for SM BGCs that target the proteasome β6 subunit, the target of the proteasome inhibitor fellutamide B, or HMG-CoA reductase, the target of cholesterol reducing therapeutics such as lovastatin. Our approach proved effective, finding known fellutamide and lovastatin BGCs as well as fellutamide- and lovastatin-related BGCs with variations in the SM genes that suggest they may produce structural variants of fellutamides and lovastatin. Gratifyingly, we also found BGCs that are not closely related to lovastatin BGCs but putatively produce novel HMG-CoA reductase inhibitors. One-Sentence SummaryA new computer-assisted approach to resistance gene-directed genome mining is reported along with its use to identify fungal biosynthetic gene clusters that putatively produce proteasome and HMG-CoA reductase inhibitors. 
    more » « less
  3. Simmons, Lyle A.; Bush, Karen (Ed.)
    ABSTRACT Unique DNA repair enzymes that provide self-resistance against therapeutically important, genotoxic natural products have been discovered in bacterial biosynthetic gene clusters (BGCs). Among these, the DNA glycosylase AlkZ is essential for azinomycin B production and belongs to the HTH_42 superfamily of uncharacterized proteins. Despite their widespread existence in antibiotic producers and pathogens, the roles of these proteins in production of other natural products are unknown. Here, we determine the evolutionary relationship and genomic distribution of all HTH_42 proteins from Streptomyces and use a resistance-based genome mining approach to identify homologs associated with known and uncharacterized BGCs. We find that AlkZ-like (AZL) proteins constitute one distinct HTH_42 subfamily and are highly enriched in BGCs and variable in sequence, suggesting each has evolved to protect against a specific secondary metabolite. As a validation of the approach, we show that the AZL protein, HedH4, associated with biosynthesis of the alkylating agent hedamycin, excises hedamycin-DNA adducts with exquisite specificity and provides resistance to the natural product in cells. We also identify a second, phylogenetically and functionally distinct subfamily whose proteins are never associated with BGCs, are highly conserved with respect to sequence and genomic neighborhood, and repair DNA lesions not associated with a particular natural product. This work delineates two related families of DNA repair enzymes—one specific for complex alkyl-DNA lesions and involved in self-resistance to antimicrobials and the other likely involved in protection against an array of genotoxins—and provides a framework for targeted discovery of new genotoxic compounds with therapeutic potential. IMPORTANCE Bacteria are rich sources of secondary metabolites that include DNA-damaging genotoxins with antitumor/antibiotic properties. Although Streptomyces produce a diverse number of therapeutic genotoxins, efforts toward targeted discovery of biosynthetic gene clusters (BGCs) producing DNA-damaging agents is lacking. Moreover, work on toxin-resistance genes has lagged behind our understanding of those involved in natural product synthesis. Here, we identified over 70 uncharacterized BGCs producing potentially novel genotoxins through resistance-based genome mining using the azinomycin B-resistance DNA glycosylase AlkZ. We validate our analysis by characterizing the enzymatic activity and cellular resistance of one AlkZ ortholog in the BGC of hedamycin, a potent DNA alkylating agent. Moreover, we uncover a second, phylogenetically distinct family of proteins related to Escherichia coli YcaQ, a DNA glycosylase capable of unhooking interstrand DNA cross-links, which differs from the AlkZ-like family in sequence, genomic location, proximity to BGCs, and substrate specificity. This work defines two families of DNA glycosylase for specialized repair of complex genotoxic natural products and generalized repair of a broad range of alkyl-DNA adducts and provides a framework for targeted discovery of new compounds with therapeutic potential. 
    more » « less
  4. Abstract Streptomycesbacteria are known for their prolific production of secondary metabolites, many of which have been widely used in human medicine, agriculture and animal health. To guide the effective prioritization of specific biosynthetic gene clusters (BGCs) for drug development and targeting the most prolific producer strains, knowledge about phylogenetic relationships ofStreptomycesspecies, genome-wide diversity and distribution patterns of BGCs is critical. We used genomic and phylogenetic methods to elucidate the diversity of major classes of BGCs in 1,110 publicly availableStreptomycesgenomes. Genome mining ofStreptomycesreveals high diversity of BGCs and variable distribution patterns in theStreptomycesphylogeny, even among very closely related strains. The most common BGCs are non-ribosomal peptide synthetases, type 1 polyketide synthases, terpenes, and lantipeptides. We also found that numerousStreptomycesspecies harbor BGCs known to encode antitumor compounds. We observed that strains that are considered the same species can vary tremendously in the BGCs they carry, suggesting that strain-level genome sequencing can uncover high levels of BGC diversity and potentially useful derivatives of any one compound. These findings suggest that a strain-level strategy for exploring secondary metabolites for clinical use provides an alternative or complementary approach to discovering novel pharmaceutical compounds from microbes. 
    more » « less
  5. The successful engineering of biosynthetic pathways hinges on understanding the factors that influence acyl carrier protein (ACP) stability and function. The stability and structure of ACPs can be influenced by the presence of divalent cations, but how this relates to primary sequence remains poorly understood. As part of a course‐based undergraduate research experience, we investigated the thermostability of type II polyketide synthase (PKS) ACPs. We observed an approximate 40 °C range in the thermostability among the 14 ACPs studied, as well as an increase in stability (5–26 °C) of the ACPs in the presence of divalent cations. Distribution of charges in the helix II‐loop–helix III region was found to impact the enthalpy of denaturation. Taken together, our results reveal clues as to how the sequence of type II PKS ACPs relates to their structural stability, information that can be used to study how ACP sequence relates to function. © 2018 American Institute of Chemical EngineersAIChE J, 64: 4308–4318, 2018 
    more » « less