Germline mutations provide the raw material for all evolutionary processes and contribute to the occurrence of spontaneous human diseases and disorders. Yet despite the daily interaction of humans and other organisms with an increasing number of chemicals that are potentially mutagenic, precise measurements of chemically induced changes to the genome-wide rate and spectrum of germline mutation are lacking. A large-scale mutation-accumulation experiment was propagated in the presence and absence of an environmentally relevant cadmium concentration to quantify the influence of cadmium on germline mutation rates and spectra. Cadmium exposure dramatically changed the genome-wide rates and regional spectra of germline mutations. In comparison with those in control conditions, exposed to cadmium had a higher overall mutation rates and a lower overall mutation rate. exposed to cadmium had a higher intergenic mutation rate and a lower exonic mutation rate. The higher intergenic mutation rate under cadmium exposure was the result of an elevated intergenic rate, whereas the lower exon mutation rate in cadmium was the result of a complete loss of exonic mutations-mutations that are known to be enriched at 5-hydroxymethylcytosine. We experimentally show that cadmium exposure significantly reduced 5-hydroxymethylcytosine levels. These results provide evidence that cadmium changes regional mutation rates and can influence regional rates by interfering with an epigenetic process in the germline. We further suggest these observed cadmium-induced changes to the germline mutation rate may be explained by cadmium's inhibition of zinc-containing domains. The cadmium-induced changes to germline mutation rates and spectra we report provide a comprehensive view of the mutagenic perils of cadmium and give insight into its potential impact on human population health. https://doi.org/10.1289/EHP8932.
more »
« less
The Mutationathon highlights the importance of reaching standardization in estimates of pedigree-based germline mutation rates
In the past decade, several studies have estimated the human per-generation germline mutation rate using large pedigrees. More recently, estimates for various nonhuman species have been published. However, methodological differences among studies in detecting germline mutations and estimating mutation rates make direct comparisons difficult. Here, we describe the many different steps involved in estimating pedigree-based mutation rates, including sampling, sequencing, mapping, variant calling, filtering, and appropriately accounting for false-positive and false-negative rates. For each step, we review the different methods and parameter choices that have been used in the recent literature. Additionally, we present the results from a ‘Mutationathon,’ a competition organized among five research labs to compare germline mutation rate estimates for a single pedigree of rhesus macaques. We report almost a twofold variation in the final estimated rate among groups using different post-alignment processing, calling, and filtering criteria, and provide details into the sources of variation across studies. Though the difference among estimates is not statistically significant, this discrepancy emphasizes the need for standardized methods in mutation rate estimations and the difficulty in comparing rates from different studies. Finally, this work aims to provide guidelines for computational and statistical benchmarks for future studies interested in identifying germline mutations from pedigrees.
more »
« less
- Award ID(s):
- 2045343
- NSF-PAR ID:
- 10390243
- Date Published:
- Journal Name:
- eLife
- Volume:
- 11
- ISSN:
- 2050-084X
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Harris, Kelley (Ed.)Abstract A common assumption in dating patrilineal events using Y-chromosome sequencing data is that the Y-chromosome mutation rate is invariant across haplogroups. Previous studies revealed interhaplogroup heterogeneity in phylogenetic branch length. Whether this heterogeneity is caused by interhaplogroup mutation rate variation or nongenetic confounders remains unknown. Here, we analyzed whole-genome sequences from cultured cells derived from >1,700 males. We confirmed the presence of branch length heterogeneity. We demonstrate that sex-chromosome mutations that appear within cell lines, which likely occurred somatically or in vitro (and are thus not influenced by nongenetic confounders) are informative for germline mutational processes. Using within-cell-line mutations, we computed a relative Y-chromosome somatic mutation rate, and uncovered substantial variation (up to 83.3%) in this proxy for germline mutation rate among haplogroups. This rate positively correlates with phylogenetic branch length, indicating that interhaplogroup mutation rate variation is a likely cause of branch length heterogeneity.more » « less
-
The mutation rate and mutations' effects on fitness are crucial to evolution. Mutation rates are under selection due to linkage between mutation rate modifiers and mutations' effects on fitness. The linkage between a higher mutation rate and more beneficial mutations selects for higher mutation rates, while the linkage between a higher mutation rate and more deleterious mutations selects for lower mutation rates. The net direction of selection on mutations rates depends on the fitness landscape, and a great deal of work has elucidated the fitness landscapes of mutations. However, tests of the effect of varying a mutation rate on evolution in a single organism in a single environment have been difficult. This has been studied using strains of antimutators and mutators, but these strains may differ in additional ways and typically do not allow for continuous variation of the mutation rate. To help investigate the effects of the mutation rate on evolution, we have genetically engineered a strain of E. coli with a point mutation rate that can be smoothly varied over two orders of magnitude. We did this by engineering a strain with inducible control of the mismatch repair proteins MutH and MutL. We used this strain in an approximately 350 generation evolution experiment with controlled variation of the mutation rate. We confirmed the construct and the mutation rate were stable over this time. Sequencing evolved strains revealed a higher number of single nucleotide polymorphisms at higher mutations rates, likely due to either the beneficial effects of these mutations or their linkage to beneficial mutations.more » « less
-
Mitochondrial mutations in Caenorhabditis elegans show signatures of oxidative damage and an AT-biasSurtees, J A (Ed.)Abstract Rapid mutation rates are typical of mitochondrial genomes (mtDNAs) in animals, but it is not clear why. The difficulty of obtaining measurements of mtDNA mutation that are not biased by natural selection has stymied efforts to distinguish between competing hypotheses about the causes of high mtDNA mutation rates. Several studies which have measured mtDNA mutations in nematodes have yielded small datasets with conflicting conclusions about the relative abundance of different substitution classes (i.e., the mutation spectrum). We therefore leveraged Duplex Sequencing, a high-fidelity DNA sequencing technique, to characterize de novo mtDNA mutations in Caenorhabditis elegans. This approach detected nearly an order of magnitude more mtDNA mutations than documented in any previous nematode mutation study. Despite an existing extreme AT bias in the C. elegans mtDNA (75.6% AT), we found that a significant majority of mutations increase genomic AT content. Compared to some prior studies in nematodes and other animals, the mutation spectrum reported here contains an abundance of CG→AT transversions, supporting the hypothesis that oxidative damage may be a driver of mtDNA mutations in nematodes. Furthermore, we found an excess of G→T and C→T changes on the coding DNA strand relative to the template strand, consistent with increased exposure to oxidative damage. Analysis of the distribution of mutations across the mtDNA revealed significant variation among protein-coding genes and as well as among neighboring nucleotides. This high-resolution view of mitochondrial mutations in C. elegans highlights the value of this system for understanding relationships among oxidative damage, replication error, and mtDNA mutation.more » « less
-
de Visser, J. Arjan (Ed.)The rate of adaptive evolution depends on the rate at which beneficial mutations are introduced into a population and the fitness effects of those mutations. The rate of beneficial mutations and their expected fitness effects is often difficult to empirically quantify. As these 2 parameters determine the pace of evolutionary change in a population, the dynamics of adaptive evolution may enable inference of their values. Copy number variants (CNVs) are a pervasive source of heritable variation that can facilitate rapid adaptive evolution. Previously, we developed a locus-specific fluorescent CNV reporter to quantify CNV dynamics in evolving populations maintained in nutrient-limiting conditions using chemostats. Here, we use CNV adaptation dynamics to estimate the rate at which beneficial CNVs are introduced through de novo mutation and their fitness effects using simulation-based likelihood–free inference approaches. We tested the suitability of 2 evolutionary models: a standard Wright–Fisher model and a chemostat model. We evaluated 2 likelihood-free inference algorithms: the well-established Approximate Bayesian Computation with Sequential Monte Carlo (ABC-SMC) algorithm, and the recently developed Neural Posterior Estimation (NPE) algorithm, which applies an artificial neural network to directly estimate the posterior distribution. By systematically evaluating the suitability of different inference methods and models, we show that NPE has several advantages over ABC-SMC and that a Wright–Fisher evolutionary model suffices in most cases. Using our validated inference framework, we estimate the CNV formation rate at the GAP1 locus in the yeast Saccharomyces cerevisiae to be 10 −4.7 to 10 −4 CNVs per cell division and a fitness coefficient of 0.04 to 0.1 per generation for GAP1 CNVs in glutamine-limited chemostats. We experimentally validated our inference-based estimates using 2 distinct experimental methods—barcode lineage tracking and pairwise fitness assays—which provide independent confirmation of the accuracy of our approach. Our results are consistent with a beneficial CNV supply rate that is 10-fold greater than the estimated rates of beneficial single-nucleotide mutations, explaining the outsized importance of CNVs in rapid adaptive evolution. More generally, our study demonstrates the utility of novel neural network–based likelihood–free inference methods for inferring the rates and effects of evolutionary processes from empirical data with possible applications ranging from tumor to viral evolution.more » « less