skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Identifying widespread and recurrent variants of genetic parts to improve annotation of engineered DNA sequences
Engineered plasmids have been workhorses of recombinant DNA technology for nearly half a century. Plasmids are used to clone DNA sequences encoding new genetic parts and to reprogram cells by combining these parts in new ways. Historically, many genetic parts on plasmids were copied and reused without routinely checking their DNA sequences. With the widespread use of high-throughput DNA sequencing technologies, we now know that plasmids often contain variants of common genetic parts that differ slightly from their canonical sequences. Because the exact provenance of a genetic part on a particular plasmid is usually unknown, it is difficult to determine whether these differences arose due to mutations during plasmid construction and propagation or due to intentional editing by researchers. In either case, it is important to understand how the sequence changes alter the properties of the genetic part. We analyzed the sequences of over 50,000 engineered plasmids using depositor metadata and a metric inspired by the natural language processing field. We detected 217 uncatalogued genetic part variants that were especially widespread or were likely the result of convergent evolution or engineering. Several of these uncatalogued variants are known mutants of plasmid origins of replication or antibiotic resistance genes that are missing from current annotation databases. However, most are uncharacterized, and 3/5 of the plasmids we analyzed contained at least one of the uncatalogued variants. Our results include a list of genetic parts to prioritize for refining engineered plasmid annotation pipelines, highlight widespread variants of parts that warrant further investigation to see whether they have altered characteristics, and suggest cases where unintentional evolution of plasmid parts may be affecting the reliability and reproducibility of science.  more » « less
Award ID(s):
2103208 1554179
PAR ID:
10514273
Author(s) / Creator(s):
;
Editor(s):
Mienda, Bashir Sajo
Publisher / Repository:
PLOS
Date Published:
Journal Name:
PLOS ONE
Volume:
19
Issue:
5
ISSN:
1932-6203
Page Range / eLocation ID:
e0304164
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. null (Ed.)
    Staphylococci can cause a wide array of infections that can be life threatening. These infections become more deadly when the isolates are antibiotic resistant and thus harder to treat. Many resistance determinants are plasmid-mediated; however, staphylococcal plasmids have not yet been fully characterized. In particular, plasmids and their contributions to antibiotic resistance have not been investigated within the Arab states, where antibiotic use is not universally regulated. Here, we characterized the putative plasmid content among 56 Staphylococcus aureus and 10 Staphylococcus haemolyticus clinical isolates from Alexandria, Egypt. Putative plasmid sequences were detected in over half of our collection. In total, we identified 72 putative plasmid sequences in 27 S. aureus and 1 S. haemolyticus isolates. While these isolates typically carried one or two plasmids, we identified one isolate— S. aureus AA53—with 11 putative plasmids. The plasmid sequences most frequently encoded a Rep_1, RepL, or PriCT_1 type replication protein. As expected, antibiotic resistance genes were widespread among the identified plasmid sequences. Related plasmids were identified amongst our clinical isolates; homologous plasmids present in multiple isolates clustered into 11 groups based upon sequence similarity. Plasmids from the same cluster often shared antibiotic resistance genes, including blaZ , which is associated with β-lactam resistance. Our analyses suggest that plasmids are a key factor in the pathology and epidemiology of S. aureus in Egypt. A better characterization of plasmids and the role they contribute to the success of Staphylococci as pathogens will guide the design of effective control strategies to limit their spread. 
    more » « less
  2. ABSTRACT Horizontal gene transfer is responsible for the exchange of many types of genetic elements, including plasmids. Properties of the exchanged genetic element are known to influence the efficiency of transfer via the mechanisms of conjugation, transduction, and transformation. Recently, an alternative general pathway of horizontal gene transfer has been identified, namely, gene exchange by extracellular vesicles. Although extracellular vesicles have been shown to facilitate the exchange of several types of plasmids, the influence of plasmid characteristics on genetic exchange within vesicles is unclear. Here, a set of different plasmids was constructed to systematically test the impact of plasmid properties, specifically, plasmid copy number, size, and origin of replication, on gene transfer in vesicles. The influence of each property on the production, packaging, and uptake of vesicles containing bacterial plasmids was quantified, revealing how plasmid properties modulate vesicle-mediated horizontal gene transfer. The loading of plasmids into vesicles correlates with the plasmid copy number and is influenced by characteristics that help set the number of plasmids within a cell, including size and origin of replication. Plasmid origin also has a separate impact on both vesicle loading and uptake, demonstrating that the origin of replication is a major determinant of the propensity of specific plasmids to transfer within extracellular vesicles. IMPORTANCE Extracellular vesicle formation and exchange are common within bacterial populations. Vesicles package multiple types of biomolecules, including genetic material. The exchange of extracellular vesicles containing genetic material facilitates interspecies DNA transfer and may be a promiscuous mechanism of horizontal gene transfer. Unlike other mechanisms of horizontal gene transfer, it is unclear whether characteristics of the exchanged DNA impact the likelihood of transfer in vesicles. Here, we systematically examine the influence of plasmid copy number, size, and origin of replication on the loading of DNA into vesicles and the uptake of DNA containing vesicles by recipient cells. These results reveal how each plasmid characteristic impacts gene transfer in vesicles and contribute to a greater understanding of the importance of vesicle-mediated gene exchange in the landscape of horizontal gene transfer. 
    more » « less
  3. McMahon, Katherine (Ed.)
    ABSTRACT Mobile genetic elements (MGEs) drive bacterial evolution, alter gene availability within microbial communities, and facilitate adaptation to ecological niches. In natural systems, bacteria simultaneously possess or encounter multiple MGEs, yet their combined influences on microbial communities are poorly understood. Here, we investigate interactions among MGEs in the marine bacterium Sulfitobacter pontiacus . Two related strains, CB-D and CB-A, each harbor a single prophage. These prophages share high sequence identity with one another and an integration site within the host genome, yet these strains exhibit differences in “spontaneous” prophage induction (SPI) and consequent fitness. To better understand mechanisms underlying variation in SPI between these lysogens, we closed their genomes, which revealed that in addition to harboring different prophage genotypes, CB-A lacks two of the four large, low-copy-number plasmids possessed by CB-D. To assess the relative roles of plasmid content versus prophage genotype on host physiology, a panel of derivative strains varying in MGE content were generated. Characterization of these derivatives revealed a robust link between plasmid content and SPI, regardless of prophage genotype. Strains possessing all four plasmids had undetectable phage in cell-free lysates, while strains lacking either one plasmid (pSpoCB-1) or a combination of two plasmids (pSpoCB-2 and pSpoCB-4) produced high (>10 5 PFU/mL) phage titers. Homologous plasmid sequences were identified in related bacteria, and plasmid and phage genes were found to be widespread in Tara Oceans metagenomic data sets. This suggests that plasmid-dependent stabilization of prophages may be commonplace throughout the oceans. IMPORTANCE The consequences of prophage induction on the physiology of microbial populations are varied and include enhanced biofilm formation, conferral of virulence, and increased opportunity for horizontal gene transfer. These traits lead to competitive advantages for lysogenized bacteria and influence bacterial lifestyles in a variety of niches. However, biological controls of “spontaneous” prophage induction, the initiation of phage replication and phage-mediated cell lysis without an overt stressor, are not well understood. In this study, we observed a novel interaction between plasmids and prophages in the marine bacterium Sulfitobacter pontiacus . We found that loss of one or more distinct plasmids—which we show carry genes ubiquitous in the world’s oceans—resulted in a marked increase in prophage induction within lysogenized strains. These results demonstrate cross talk between different mobile genetic elements and have implications for our understanding of the lysogenic-lytic switches of prophages found not only in marine environments, but throughout all ecosystems. 
    more » « less
  4. Nojiri, Hideaki (Ed.)
    ABSTRACT Bacterial mobile genetic elements (MGEs) encode functional modules that perform both core and accessory functions for the element, the latter of which are often only transiently associated with the element. The presence of these accessory genes, which are often close homologs to primarily immobile genes, incur high rates of false positives and, therefore, limits the usability of these databases for MGE annotation. To overcome this limitation, we analyzed 10,776,849 protein sequences derived from eight MGE databases to compile a comprehensive set of 6,140 manually curated protein families that are linked to the “life cycle” (integration/excision, replication/recombination/repair, transfer, stability/transfer/defense, and phage-specific processes) of plasmids, phages, integrative, transposable, and conjugative elements. We overlay experimental information where available to create a tiered annotation scheme of high-quality annotations and annotations inferred exclusively through bioinformatic evidence. We additionally provide an MGE-class label for each entry (e.g., plasmid or integrative element), and assign to each entry a major and minor category. The resulting database, mobileOG-db (for mobile orthologous groups), comprises over 700,000 deduplicated sequences encompassing five major mobileOG categories and more than 50 minor categories, providing a structured language and interpretable basis for an array of MGE-centered analyses. mobileOG-db can be accessed at mobileogdb.flsi.cloud.vt.edu/, where users can select, refine, and analyze custom subsets of the dynamic mobilome. IMPORTANCE The analysis of bacterial mobile genetic elements (MGEs) in genomic data is a critical step toward profiling the root causes of antibiotic resistance, phenotypic or metabolic diversity, and the evolution of bacterial genera. Existing methods for MGE annotation pose high barriers of biological and computational expertise to properly harness. To bridge this gap, we systematically analyzed 10,776,849 proteins derived from eight databases of MGEs to identify 6,140 MGE protein families that can serve as candidate hallmarks, i.e., proteins that can be used as “signatures” of MGEs to aid annotation. The resulting resource, mobileOG-db, provides a multilevel classification scheme that encompasses plasmid, phage, integrative, and transposable element protein families categorized into five major mobileOG categories and more than 50 minor categories. mobileOG-db thus provides a rich resource for simple and intuitive element annotation that can be integrated seamlessly into existing MGE detection pipelines and colocalization analyses. 
    more » « less
  5. Abstract Engineered DNA will slow the growth of a host cell if it redirects limiting resources or otherwise interferes with homeostasis. Escape mutants that alleviate this burden can rapidly evolve and take over cell populations, making genetic engineering less reliable and predictable. Synthetic biologists often use genetic parts encoded on plasmids, but their burden is rarely characterized. We measured how 301 BioBrick plasmids affectedEscherichia coligrowth and found that 59 (19.6%) were burdensome, primarily because they depleted the limited gene expression resources of host cells. Overall, no BioBricks reduced the growth rate ofE. coliby >45%, which agreed with a population genetic model that predicts such plasmids should be unclonable. We made this model available online for education (https://barricklab.org/burden-model) and added our burden measurements to the iGEM Registry. Our results establish a fundamental limit on what DNA constructs and genetic modifications can be successfully engineered into cells. 
    more » « less