skip to main content


Title: CHIIMP : An automated high‐throughput microsatellite genotyping platform reveals greater allelic diversity in wild chimpanzees
Abstract

Short tandem repeats (STRs), also known as microsatellites, are commonly used to noninvasively genotype wild‐living endangered species, including African apes. Until recently, capillary electrophoresis has been the method of choice to determine the length of polymorphicSTRloci. However, this technique is labor intensive, difficult to compare across platforms, and notoriously imprecise. Here we developed a MiSeq‐based approach and tested its performance using previously genotyped fecal samples from long‐term studied chimpanzees in Gombe National Park, Tanzania. Using data from eight microsatellite loci as a reference, we designed a bioinformatics platform that converts raw MiSeq reads into locus‐specific files and automatically calls alleles after filtering stutter sequences and otherPCRartifacts. Applying this method to the entire Gombe population, we confirmed previously reported genotypes, but also identified 31 new alleles that had been missed due to sequence differences and size homoplasy. The new genotypes, which increased the allelic diversity and heterozygosity in Gombe by 61% and 8%, respectively, were validated by replicate amplification and pedigree analyses. This demonstrated inheritance and resolved one case of an ambiguous paternity. Using both singleplex and multiplex locus amplification, we also genotyped fecal samples from chimpanzees in the Greater Mahale Ecosystem in Tanzania, demonstrating the utility of the MiSeq‐based approach for genotyping nonhabituated populations and performing comparative analyses across field sites. The new automated high‐throughput analysis platform (available athttps://github.com/ShawHahnLab/chiimp) will allow biologists to more accurately and effectively determine wildlife population size and structure, and thus obtain information critical for conservation efforts.

 
more » « less
NSF-PAR ID:
10064528
Author(s) / Creator(s):
 ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  
Publisher / Repository:
Wiley Blackwell (John Wiley & Sons)
Date Published:
Journal Name:
Ecology and Evolution
Volume:
8
Issue:
16
ISSN:
2045-7758
Page Range / eLocation ID:
p. 7946-7963
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Premise

    Morphometric analysis is a common approach for comparing and categorizing botanical samples; however, completing a suite of analyses using existing tools may require a multi‐stage, multi‐program process. To facilitate streamlined analysis within a single program, Morphological Analysis of Size and Shape (MASS) for leaves was developed. Its utility is demonstrated using exemplar leaf samples fromAcer saccharum,Malus domestica, andLithospermum.

    Methods

    Exemplar samples were obtained from across a single tree (Acer saccharum), three trees in the same species (Malus domestica), and online, digitized herbarium specimens (Lithospermum).MASSwas used to complete simple geometric measurements of samples, such as length and area, as well as geometric morphological analyses including elliptical Fourier and Procrustes analyses. Principal component analysis (PCA) of data was also completed within the same program.

    Results

    MASSis capable of making desired measurements and analyzing traditional morphometric data as well as landmark and outline data.

    Discussion

    UsingMASS, differences were observed among leaves of the three studied taxa, but only inMalus domesticawere differences statistically significant or correlated with other morphological features. In the future,MASScould be applied for analysis of other two‐dimensional organs and structures.MASSis available for download athttps://github.com/gillianlynnryan/MASS.

     
    more » « less
  2. Summary Open Research Badges

    This article has earned an Open Data Badge for making publicly available the digitally‐shareable data necessary to reproduce the reported results. The data is available athttps://github.com/SNAnderson/maizeTE_variation;https://mcstitzer.github.io/maize_TEs.

     
    more » « less
  3. Abstract

    The accelerating rate at whichDNAsequence data are now generated by high‐throughput sequencing instruments provides both opportunities and challenges for population genetic and ecological investigations of animals and plants. We show here how the common practice of calling genotypes from a singleSNPper sequenced region ignores substantial additional information in the phased short‐read sequences that are provided by these sequencing instruments. We target sequenced regions with multipleSNPs in kelp rockfish (Sebastes atrovirens) to determine “microhaplotypes” and then call these microhaplotypes as alleles at each locus. We then demonstrate how these multi‐allelic marker data from such loci dramatically increase power for relationship inference. The microhaplotype approach decreases false‐positive rates by several orders of magnitude, relative to calling bi‐allelicSNPs, for two challenging analytical procedures, full‐sibling and single parent–offspring pair identification. We also show how the identification of half‐sibling pairs requires so much data that physical linkage becomes a consideration, and that most published studies that attempt to do so are dramatically underpowered. The advent of phased short‐readDNAsequence data, in conjunction with emerging analytical tools for their analysis, promises to improve efficiency by reducing the number of loci necessary for a particular level of statistical confidence, thereby lowering the cost of data collection and reducing the degree of physical linkage amongst markers used for relationship estimation. Such advances will facilitate collaborative research and management for migratory and other widespread species.

     
    more » « less
  4. Abstract

    Copy number variation (CNV) is a major part of the genetic diversity segregating within populations, but remains poorly understood relative to single nucleotide variation. Here, we report on atRNAligase gene (Migut.N02091;RLG1a) exhibiting unprecedented, and fitness‐relevant,CNVwithin an annual population of the yellow monkeyflowerMimulus guttatus.RLG1a variation was associated with multiple traits in pooled population sequencing (PoolSeq) scans of phenotypic and phenological cohorts. Resequencing of inbred lines revealed intermediate‐frequency three‐copy variants ofRLG1a (trip+;5/35 = 14%), andtrip+lines exhibited elevatedRLG1a expression under multiple conditions.trip+carriers, in addition to being over‐represented in late‐flowering and large‐flowered PoolSeq populations, flowered later under stressful conditions in a greenhouse experiment (p < 0.05). In wild population samples, we discovered an additional rareRLG1a variant (high+) that carries 250–300 copies ofRLG1a totalling ~5.7 Mb (20–40% of a chromosome). In the progeny of ahigh+carrier, Mendelian segregation of diagnostic alleles andqPCR‐based copy counts indicate thathigh+is a single tandem array unlinked to the single‐copyRLG1a locus. In the wild,high+carriers had highest fitness in two particularly dry and/or hot years (2015 and 2017; bothp < 0.01), while single‐copy individuals were twice as fecund as eitherCNVtype in a lush year (2016:p < 0.005). Our results demonstrate fluctuating selection onCNVs affecting phenological traits in a wild population, suggest that planttRNAligases mediate stress‐responsive life‐history traits, and introduce a novel system for investigating the molecular mechanisms of gene amplification.

     
    more » « less
  5. Abstract

    The paleback darter,Etheostoma pallididorsum, is considered imperilled and has recently been petitioned for listing under the Endangered Species Act. Previous allozyme‐based studies found evidence of a small effective population size, warranting conservation concern. The objective of this study was to assess the population dynamics and the phylogeographical history of the paleback darter, using a multilocus microsatellite approach and mitochondrial DNA.

    The predictions of this study were that: paleback darter populations will exhibit low genetic diversity and minimal gene flow; population structure will correspond to the river systems from which the samples are derived; reservoir dams impounding the reaches between the Caddo and Ouachita rivers would serve as effective barriers to gene flow; and the Caddo and Ouachita rivers are reciprocally monophyletic.

    Microsatellite DNA loci revealed significant structure among sampled localities (globalFst= 0.17,P< 0.001), with evidence of two distinct populations representing the Caddo and Ouachita rivers. However, Bayesian phylogeographical analyses resulted in three distinct clades: Caddo River, Ouachita River, and Mazarn Creek. Divergence from the most recent ancestor shared among the river drainages was estimated at 60 Kya. Population genetic diversity was relatively low (He= 0.65; mean alleles per locus,A= 6.26), but was comparable with the population genetic diversity found in the close relatives slackwater darter,Etheostoma boschungi(He= 0.65;A= 6.74), and Tuscumbia darter,Etheostoma tuscumbia(He= 0.57;A= 5.53).

    These results have conservation implications for paleback darter populations and can be informative for other headwater specialist species. Like other headwater species with population structuring and relatively low genetic diversity, the persistence of paleback darter populations is likely to be tied to the persistence and connectivity of local breeding and non‐breeding habitat. These results do not raise conservation concern for a population decline; however, the restricted distribution and endemic status of the species still renders paleback darter populations vulnerable to extirpation or extinction.

     
    more » « less