Title: Challenges and opportunities for strain verification by whole-genome sequencing
Abstract: Laboratory strains, cell lines, and other genetic materials change hands frequently in the life sciences. Despite evidence that such materials are subject to mix-ups, contamination, and accumulation of secondary mutations, verification of strains and samples is not an established part of many experimental workflows. With the plummeting cost of next generation technologies, it is conceivable that whole genome sequencing (WGS) could be applied to routine strain and sample verification in the future. To demonstrate the need for strain validation by WGS, we sequenced haploid yeast segregants derived from a popular commercial mutant collection and identified several unexpected mutations. We determined that available bioinformatics tools may be ill-suited for verification and highlight the importance of finishing reference genomes for commonly used laboratory strains.
Award ID(s):
1832320 1759900
PAR ID:
10154436
Author(s) / Creator(s):
; ; ;
Publisher / Repository:
Nature Publishing Group
Date Published:
Journal Name:
Scientific Reports
Volume:
10
Issue:
1
ISSN:
2045-2322
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Nikel, Pablo Ivan (Ed.)
    ABSTRACT Archaeal molecular biology has been a topic of intense research in recent decades as their role in global ecosystems, nutrient cycles, and eukaryotic evolution comes to light. The hypersaline-adapted archaeal species Halobacterium salinarum and Haloferax volcanii serve as important model organisms for understanding archaeal genomics, genetics, and biochemistry, in part because efficient tools enable genetic manipulation. As a result, the number of strains in circulation among the haloarchaeal research community has increased in recent decades. However, the degree of genetic divergence and effects on genetic integrity resulting from the creation and inter-lab transfer of novel lab stock strains remain unclear. To address this, we performed whole-genome re-sequencing on a cross-section of wild-type, parental, and knockout strains in both model species. Integrating these data with existing repositories of re-sequencing data, we identify mutations that have arisen in a collection of 60 strains, sampled from two species across eight different labs. Independent of sequencing, we construct strain lineages, identifying branch points and significant genetic events in strain history. Combining this with our sequencing data, we identify small clusters of mutations that definitively separate lab strains. Additionally, an analysis of gene knockout strains suggests that roughly one in three strains currently in use harbors second-site mutations of potential phenotypic impact. Overall, we find that divergence among lab strains is thus far minimal, though as the archaeal research community continues to grow, careful strain provenance and genomic re-sequencing are required to keep inter-lab divergence to a minimum, prevent the compounding of mutations into fully independent lineages, and maintain the current high degree of reproducible research between lab groups.
IMPORTANCE Archaea are a domain of microbial life whose member species play a critical role in the global carbon cycle, climate regulation, the human microbiome, and persistence in extreme habitats. In particular, hypersaline-adapted archaea are important, genetically tractable model organisms for studying archaeal genetics, genomics, and biochemistry. As the archaeal research community grows, keeping track of the genetic integrity of strains of interest is necessary. In particular, routine genetic manipulations and the common practice of sharing strains between labs allow mutations to arise in lab stocks. If these mutations affect cellular processes, they may jeopardize the reproducibility of work between research groups and confound the results of future studies. In this work, we examine DNA sequences from 60 strains across two species of archaea. We identify shared and unique mutations occurring between and within strains. Independently, we trace the lineage of each strain, identifying which genetic manipulations lead to observed off-target mutations. While overall divergence across labs is minimal so far, our work highlights the need for labs to continue proper strain husbandry.
  2. Cao, Yi (Ed.)
    Raphidocelis subcapitata is one of the most frequently used species for algal growth inhibition tests. Accordingly, many microalgal culture collections worldwide maintain R. subcapitata for distribution to users. All R. subcapitata strains maintained in these collections are derived from the same cultured strain, NIVA-CHL1. However, considering that 61 years have passed since this strain was isolated, we suspected that NIVA-CHL1 in culture collections might have acquired various mutations. In this study, we compared the genome sequences among NIVA-CHL1 from 8 microalgal culture collections and one laboratory in Japan to evaluate the presence of mutations. We found single-nucleotide polymorphisms or indels at 19,576 to 28,212 sites per strain in comparison with the genome sequence of R. subcapitata NIES-35, maintained at the National Institute for Environmental Studies, Tsukuba, Japan. These mutations were detected not only in non-coding but also in coding regions; some of the latter mutations may affect protein function. In growth inhibition test with 3,5-dichlorophenol, EC50 values varied 2.6-fold among the 9 strains. In the ATCC 22662–2 and CCAP 278/4 strains, we also detected a mutation in the gene encoding small-conductance mechanosensitive ion channel, which may lead to protein truncation and loss of function. Growth inhibition test with sodium chloride suggested that osmotic regulation has changed in ATCC 22662–2 and CCAP 278/4 in comparison with NIES-35.
  3. Canine distemper virus (CDV) is a multi-host pathogen with variable clinical outcomes of infection across and within species. We used whole-genome sequencing (WGS) to search for viral markers correlated with clinical distemper in African lions. To identify candidate markers, we first documented single-nucleotide polymorphisms (SNPs) differentiating CDV strains associated with different clinical outcomes in lions in East Africa. We then conducted evolutionary analyses on WGS from all global CDV lineages to identify loci subject to selection. SNPs that both differentiated East African strains and were under selection were mapped to a phylogenetic tree representing global CDV diversity to assess if candidate markers correlated with documented outbreaks of clinical distemper in lions (n = 3). Of 54 SNPs differentiating East African strains, ten were under positive or episodic diversifying selection and 20 occurred in the clinical strain despite strong purifying selection at those loci. Candidate markers were in functional domains of the RNP complex (n = 19), the matrix protein (n = 4), on CDV glycoproteins (n = 5), and on the V protein (n = 1). We found mutations at two loci in common between sequences from three CDV outbreaks of clinical distemper in African lions; one in the signaling lymphocytic activation molecule receptor (SLAM)-binding region of the hemagglutinin protein and another in the catalytic center of phosphodiester bond formation on the large polymerase protein. These results suggest convergent evolution at these sites may have a functional role in clinical distemper outbreaks in African lions and uncover potential novel barriers to pathogenicity in this species. 
  4. Abstract Elucidating how individual mutations affect the protein energy landscape is crucial for understanding how proteins evolve. However, predicting mutational effects remains challenging because of epistasis—the nonadditive interactions between mutations. Here, we investigate the biophysical mechanism of strain-specific epistasis in the nonstructural protein 1 (NS1) of influenza A viruses (IAVs). We integrate structural, kinetic, thermodynamic, and conformational dynamics analyses of four NS1s of influenza strains that emerged between 1918 and 2004. Although functionally near-neutral, strain-specific NS1 mutations exhibit long-range epistatic interactions with residues at the p85β-binding interface. We reveal that strain-specific mutations reshaped the NS1 energy landscape during evolution. Using NMR spin dynamics, we find that the strain-specific mutations altered the conformational dynamics of the hidden network of tightly packed residues, underlying the evolution of long-range epistasis. This work shows how near-neutral mutations silently alter the biophysical energy landscapes, resulting in diverse background effects during molecular evolution. 
  5. Didelot, Xavier (Ed.)
    Organelles and endosymbionts have naturally evolved dramatically reduced genome sizes compared to their free-living ancestors. Synthetic biologists have purposefully engineered streamlined microbial genomes to create more efficient cellular chassis and define the minimal components of cellular life. During natural or engineered genome streamlining, deletion of many non-essential genes in combination often reduces bacterial fitness for idiosyncratic or unknown reasons. We investigated how and to what extent laboratory evolution could overcome these defects in six variants of the transposon-free Acinetobacter baylyi strain ADP1-ISx that each had a deletion of a different 22- to 42-kilobase region and two strains with larger deletions of 70 and 293 kilobases. We evolved replicate populations of ADP1-ISx and each deletion strain for ~300 generations in a chemically defined minimal medium or a complex medium and sequenced the genomes of endpoint clonal isolates. Fitness increased in all cases that were examined except for two ancestors that each failed to improve in one of the two environments. Mutations affecting nine protein-coding genes and two small RNAs were significantly associated with one of the two environments or with certain deletion ancestors. The global post-transcriptional regulators rnd (ribonuclease D), csrA (RNA-binding carbon storage regulator), and hfq (RNA-binding protein and chaperone) were frequently mutated across all strains, though the incidence and effects of these mutations on gene function and bacterial fitness varied with the ancestral deletion and evolution environment. Mutations in this regulatory network likely compensate for how an earlier deletion of a transposon in the ADP1-ISx ancestor of all the deletion strains restored csrA function.
More generally, our results demonstrate that fitness lost during genome streamlining can usually be regained rapidly through laboratory evolution and that recovery tends to occur through a combination of deletion-specific compensation and global regulatory adjustments. 