Title: CHIIMP : An automated high‐throughput microsatellite genotyping platform reveals greater allelic diversity in wild chimpanzees
Abstract
Short tandem repeats (STRs), also known as microsatellites, are commonly used to noninvasively genotype wild‐living endangered species, including African apes. Until recently, capillary electrophoresis has been the method of choice to determine the length of polymorphicSTRloci. However, this technique is labor intensive, difficult to compare across platforms, and notoriously imprecise. Here we developed a MiSeq‐based approach and tested its performance using previously genotyped fecal samples from long‐term studied chimpanzees in Gombe National Park, Tanzania. Using data from eight microsatellite loci as a reference, we designed a bioinformatics platform that converts raw MiSeq reads into locus‐specific files and automatically calls alleles after filtering stutter sequences and otherPCRartifacts. Applying this method to the entire Gombe population, we confirmed previously reported genotypes, but also identified 31 new alleles that had been missed due to sequence differences and size homoplasy. The new genotypes, which increased the allelic diversity and heterozygosity in Gombe by 61% and 8%, respectively, were validated by replicate amplification and pedigree analyses. This demonstrated inheritance and resolved one case of an ambiguous paternity. Using both singleplex and multiplex locus amplification, we also genotyped fecal samples from chimpanzees in the Greater Mahale Ecosystem in Tanzania, demonstrating the utility of the MiSeq‐based approach for genotyping nonhabituated populations and performing comparative analyses across field sites. The new automated high‐throughput analysis platform (available athttps://github.com/ShawHahnLab/chiimp) will allow biologists to more accurately and effectively determine wildlife population size and structure, and thus obtain information critical for conservation efforts.
Chuanromanee, Tya S.; Cohen, James I.; Ryan, Gillian L.(
, Applications in Plant Sciences)
Premise
Morphometric analysis is a common approach for comparing and categorizing botanical samples; however, completing a suite of analyses using existing tools may require a multi‐stage, multi‐program process. To facilitate streamlined analysis within a single program, Morphological Analysis of Size and Shape (MASS) for leaves was developed. Its utility is demonstrated using exemplar leaf samples fromAcer saccharum,Malus domestica, andLithospermum.
Methods
Exemplar samples were obtained from across a single tree (Acer saccharum), three trees in the same species (Malus domestica), and online, digitized herbarium specimens (Lithospermum).MASSwas used to complete simple geometric measurements of samples, such as length and area, as well as geometric morphological analyses including elliptical Fourier and Procrustes analyses. Principal component analysis (PCA) of data was also completed within the same program.
Results
MASSis capable of making desired measurements and analyzing traditional morphometric data as well as landmark and outline data.
Discussion
UsingMASS, differences were observed among leaves of the three studied taxa, but only inMalus domesticawere differences statistically significant or correlated with other morphological features. In the future,MASScould be applied for analysis of other two‐dimensional organs and structures.MASSis available for download athttps://github.com/gillianlynnryan/MASS.
Anderson, Sarah N.; Stitzer, Michelle C.; Brohammer, Alex B.; Zhou, Peng; Noshay, Jaclyn M.; O'Connor, Christine H.; Hirsch, Cory D.; Ross‐Ibarra, Jeffrey; Hirsch, Candice N.; Springer, Nathan M.(
, The Plant Journal)
Summary
Transposable elements (TEs) are ubiquitous components of eukaryotic genomes and can create variation in genome organization and content. Most maize genomes are composed ofTEs. We developed an approach to define shared and variableTEinsertions across genome assemblies and applied this method to four maize genomes (B73, W22, Mo17 andPH207) with uniform structural annotations ofTEs. Among these genomes we identified approximately 400 000TEs that are polymorphic, encompassing 1.6 Gb of variableTEsequence. These polymorphicTEs include a combination of recent transposition events as well as deletions of olderTEs. There are examples of polymorphicTEs within each of the superfamilies ofTEs and they are found distributed across the genome, including in regions of recent shared ancestry among individuals. There are many examples of polymorphicTEs within or near maize genes. In addition, there are 2380 gene annotations in the B73 genome that are located within variableTEs, providing evidence for the role ofTEs in contributing to the substantial differences in annotated gene content among these genotypes.TEs are highly variable in our survey of four temperate maize genomes, highlighting the major contribution ofTEs in driving variation in genome organization and gene content.
Open Research Badges
This article has earned an Open Data Badge for making publicly available the digitally‐shareable data necessary to reproduce the reported results. The data is available athttps://github.com/SNAnderson/maizeTE_variation;https://mcstitzer.github.io/maize_TEs.
Baetscher, Diana S.; Clemento, Anthony J.; Ng, Thomas C.; Anderson, Eric C.; Garza, John C.(
, Molecular Ecology Resources)
Abstract
The accelerating rate at whichDNAsequence data are now generated by high‐throughput sequencing instruments provides both opportunities and challenges for population genetic and ecological investigations of animals and plants. We show here how the common practice of calling genotypes from a singleSNPper sequenced region ignores substantial additional information in the phased short‐read sequences that are provided by these sequencing instruments. We target sequenced regions with multipleSNPs in kelp rockfish (Sebastes atrovirens) to determine “microhaplotypes” and then call these microhaplotypes as alleles at each locus. We then demonstrate how these multi‐allelic marker data from such loci dramatically increase power for relationship inference. The microhaplotype approach decreases false‐positive rates by several orders of magnitude, relative to calling bi‐allelicSNPs, for two challenging analytical procedures, full‐sibling and single parent–offspring pair identification. We also show how the identification of half‐sibling pairs requires so much data that physical linkage becomes a consideration, and that most published studies that attempt to do so are dramatically underpowered. The advent of phased short‐readDNAsequence data, in conjunction with emerging analytical tools for their analysis, promises to improve efficiency by reducing the number of loci necessary for a particular level of statistical confidence, thereby lowering the cost of data collection and reducing the degree of physical linkage amongst markers used for relationship estimation. Such advances will facilitate collaborative research and management for migratory and other widespread species.
Nelson, Thomas C.; Monnahan, Patrick J.; McIntosh, Mariah K.; Anderson, Kayli; MacArthur‐Waltz, Evan; Finseth, Findley R.; Kelly, John K.; Fishman, Lila(
, Molecular Ecology)
Abstract
Copy number variation (CNV) is a major part of the genetic diversity segregating within populations, but remains poorly understood relative to single nucleotide variation. Here, we report on atRNAligase gene (Migut.N02091;RLG1a) exhibiting unprecedented, and fitness‐relevant,CNVwithin an annual population of the yellow monkeyflowerMimulus guttatus.RLG1a variation was associated with multiple traits in pooled population sequencing (PoolSeq) scans of phenotypic and phenological cohorts. Resequencing of inbred lines revealed intermediate‐frequency three‐copy variants ofRLG1a (trip+;5/35 = 14%), andtrip+lines exhibited elevatedRLG1a expression under multiple conditions.trip+carriers, in addition to being over‐represented in late‐flowering and large‐flowered PoolSeq populations, flowered later under stressful conditions in a greenhouse experiment (p < 0.05). In wild population samples, we discovered an additional rareRLG1a variant (high+) that carries 250–300 copies ofRLG1a totalling ~5.7 Mb (20–40% of a chromosome). In the progeny of ahigh+carrier, Mendelian segregation of diagnostic alleles andqPCR‐based copy counts indicate thathigh+is a single tandem array unlinked to the single‐copyRLG1a locus. In the wild,high+carriers had highest fitness in two particularly dry and/or hot years (2015 and 2017; bothp < 0.01), while single‐copy individuals were twice as fecund as eitherCNVtype in a lush year (2016:p < 0.005). Our results demonstrate fluctuating selection onCNVs affecting phenological traits in a wild population, suggest that planttRNAligases mediate stress‐responsive life‐history traits, and introduce a novel system for investigating the molecular mechanisms of gene amplification.
The paleback darter,Etheostoma pallididorsum, is considered imperilled and has recently been petitioned for listing under the Endangered Species Act. Previous allozyme‐based studies found evidence of a small effective population size, warranting conservation concern. The objective of this study was to assess the population dynamics and the phylogeographical history of the paleback darter, using a multilocus microsatellite approach and mitochondrial DNA.
The predictions of this study were that: paleback darter populations will exhibit low genetic diversity and minimal gene flow; population structure will correspond to the river systems from which the samples are derived; reservoir dams impounding the reaches between the Caddo and Ouachita rivers would serve as effective barriers to gene flow; and the Caddo and Ouachita rivers are reciprocally monophyletic.
Microsatellite DNA loci revealed significant structure among sampled localities (globalFst= 0.17,P< 0.001), with evidence of two distinct populations representing the Caddo and Ouachita rivers. However, Bayesian phylogeographical analyses resulted in three distinct clades: Caddo River, Ouachita River, and Mazarn Creek. Divergence from the most recent ancestor shared among the river drainages was estimated at 60 Kya. Population genetic diversity was relatively low (He= 0.65; mean alleles per locus,A= 6.26), but was comparable with the population genetic diversity found in the close relatives slackwater darter,Etheostoma boschungi(He= 0.65;A= 6.74), and Tuscumbia darter,Etheostoma tuscumbia(He= 0.57;A= 5.53).
These results have conservation implications for paleback darter populations and can be informative for other headwater specialist species. Like other headwater species with population structuring and relatively low genetic diversity, the persistence of paleback darter populations is likely to be tied to the persistence and connectivity of local breeding and non‐breeding habitat. These results do not raise conservation concern for a population decline; however, the restricted distribution and endemic status of the species still renders paleback darter populations vulnerable to extirpation or extinction.
Barbian, Hannah J., Connell, Andrew Jesse, Avitto, Alexa N., Russell, Ronnie M., Smith, Andrew G., Gundlapally, Madhurima S., Shazad, Alexander L., Li, Yingying, Bibollet‐Ruche, Frederic, Wroblewski, Emily E., Mjungu, Deus, Lonsdorf, Elizabeth V., Stewart, Fiona A., Piel, Alexander K., Pusey, Anne E., Sharp, Paul M., and Hahn, Beatrice H. CHIIMP : An automated high‐throughput microsatellite genotyping platform reveals greater allelic diversity in wild chimpanzees. Ecology and Evolution 8.16 Web. doi:10.1002/ece3.4302.
Barbian, Hannah J., Connell, Andrew Jesse, Avitto, Alexa N., Russell, Ronnie M., Smith, Andrew G., Gundlapally, Madhurima S., Shazad, Alexander L., Li, Yingying, Bibollet‐Ruche, Frederic, Wroblewski, Emily E., Mjungu, Deus, Lonsdorf, Elizabeth V., Stewart, Fiona A., Piel, Alexander K., Pusey, Anne E., Sharp, Paul M., & Hahn, Beatrice H. CHIIMP : An automated high‐throughput microsatellite genotyping platform reveals greater allelic diversity in wild chimpanzees. Ecology and Evolution, 8 (16). https://doi.org/10.1002/ece3.4302
Barbian, Hannah J., Connell, Andrew Jesse, Avitto, Alexa N., Russell, Ronnie M., Smith, Andrew G., Gundlapally, Madhurima S., Shazad, Alexander L., Li, Yingying, Bibollet‐Ruche, Frederic, Wroblewski, Emily E., Mjungu, Deus, Lonsdorf, Elizabeth V., Stewart, Fiona A., Piel, Alexander K., Pusey, Anne E., Sharp, Paul M., and Hahn, Beatrice H.
"CHIIMP : An automated high‐throughput microsatellite genotyping platform reveals greater allelic diversity in wild chimpanzees". Ecology and Evolution 8 (16). Country unknown/Code not available: Wiley Blackwell (John Wiley & Sons). https://doi.org/10.1002/ece3.4302.https://par.nsf.gov/biblio/10064528.
@article{osti_10064528,
place = {Country unknown/Code not available},
title = {CHIIMP : An automated high‐throughput microsatellite genotyping platform reveals greater allelic diversity in wild chimpanzees},
url = {https://par.nsf.gov/biblio/10064528},
DOI = {10.1002/ece3.4302},
abstractNote = {Abstract Short tandem repeats (STRs), also known as microsatellites, are commonly used to noninvasively genotype wild‐living endangered species, including African apes. Until recently, capillary electrophoresis has been the method of choice to determine the length of polymorphicSTRloci. However, this technique is labor intensive, difficult to compare across platforms, and notoriously imprecise. Here we developed a MiSeq‐based approach and tested its performance using previously genotyped fecal samples from long‐term studied chimpanzees in Gombe National Park, Tanzania. Using data from eight microsatellite loci as a reference, we designed a bioinformatics platform that converts raw MiSeq reads into locus‐specific files and automatically calls alleles after filtering stutter sequences and otherPCRartifacts. Applying this method to the entire Gombe population, we confirmed previously reported genotypes, but also identified 31 new alleles that had been missed due to sequence differences and size homoplasy. The new genotypes, which increased the allelic diversity and heterozygosity in Gombe by 61% and 8%, respectively, were validated by replicate amplification and pedigree analyses. This demonstrated inheritance and resolved one case of an ambiguous paternity. Using both singleplex and multiplex locus amplification, we also genotyped fecal samples from chimpanzees in the Greater Mahale Ecosystem in Tanzania, demonstrating the utility of the MiSeq‐based approach for genotyping nonhabituated populations and performing comparative analyses across field sites. The new automated high‐throughput analysis platform (available athttps://github.com/ShawHahnLab/chiimp) will allow biologists to more accurately and effectively determine wildlife population size and structure, and thus obtain information critical for conservation efforts.},
journal = {Ecology and Evolution},
volume = {8},
number = {16},
publisher = {Wiley Blackwell (John Wiley & Sons)},
author = {Barbian, Hannah J. and Connell, Andrew Jesse and Avitto, Alexa N. and Russell, Ronnie M. and Smith, Andrew G. and Gundlapally, Madhurima S. and Shazad, Alexander L. and Li, Yingying and Bibollet‐Ruche, Frederic and Wroblewski, Emily E. and Mjungu, Deus and Lonsdorf, Elizabeth V. and Stewart, Fiona A. and Piel, Alexander K. and Pusey, Anne E. and Sharp, Paul M. and Hahn, Beatrice H.},
}
Warning: Leaving National Science Foundation Website
You are now leaving the National Science Foundation website to go to a non-government website.
Website:
NSF takes no responsibility for and exercises no control over the views expressed or the accuracy of
the information contained on this site. Also be aware that NSF's privacy policy does not apply to this site.