skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Comparing genome‐based estimates of relatedness for use in pedigree‐based conservation management
Abstract Researchers have long debated which estimator of relatedness best captures the degree of relationship between two individuals. In the genomics era, this debate continues, with relatedness estimates being sensitive to the methods used to generate markers, marker quality, and levels of diversity in sampled individuals. Here, we compare six commonly used genome‐based relatedness estimators (kinship genetic distance [KGD], Wang maximum likelihood [TrioML], Queller and Goodnight [Rxy], Kinship INference for Genome‐wide association studies [KING‐robust), and pairwise relatedness [RAB], allele‐sharing coancestry [AS]) across five species bred in captivity–including three birds and two mammals–with varying degrees of reliable pedigree data, using reduced‐representation and whole genome resequencing data. Genome‐based relatedness estimates varied widely across estimators, sequencing methods, and species, yet the most consistent results for known first order relationships were found usingRxy,RAB, and AS. However, AS was found to be less consistently correlated with known pedigree relatedness than eitherRxyorRAB. Our combined results indicate there is not a single genome‐based estimator that is ideal across different species and data types. To determine the most appropriate genome‐based relatedness estimator for each new data set, we recommend assessing the relative: (1) correlation of candidate estimators with known relationships in the pedigree and (2) precision of candidate estimators with known first‐order relationships. These recommendations are broadly applicable to conservation breeding programmes, particularly where genome‐based estimates of relatedness can complement and complete poorly pedigreed populations. Given a growing interest in the application of wild pedigrees, our results are also applicable to in situ wildlife management.  more » « less
Award ID(s):
1826801
PAR ID:
10371086
Author(s) / Creator(s):
 ;  ;  ;  ;  
Publisher / Repository:
Wiley-Blackwell
Date Published:
Journal Name:
Molecular Ecology Resources
Volume:
22
Issue:
7
ISSN:
1755-098X
Page Range / eLocation ID:
p. 2546-2558
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Kinship plays a fundamental role in the evolution of social systems and is considered a key driver of group living. To understand the role of kinship in the formation and maintenance of social bonds, accurate measures of genetic relatedness are critical. Genotype‐by‐sequencing technologies are rapidly advancing the accuracy and precision of genetic relatedness estimates for wild populations. The ability to assign kinship from genetic data varies depending on a species’ or population's mating system and pattern of dispersal, and empirical data from longitudinal studies are crucial to validate these methods. We use data from a long‐term behavioural study of a polygynandrous, bisexually philopatric marine mammal to measure accuracy and precision of parentage and genetic relatedness estimation against a known partial pedigree. We show that with moderate but obtainable sample sizes of approximately 4,235 SNPs and 272 individuals, highly accurate parentage assignments and genetic relatedness coefficients can be obtained. Additionally, we subsample our data to quantify how data availability affects relatedness estimation and kinship assignment. Lastly, we conduct a social network analysis to investigate the extent to which accuracy and precision of relatedness estimation improve statistical power to detect an effect of relatedness on social structure. Our results provide practical guidance for minimum sample sizes and sequencing depth for future studies, as well as thresholds for post hoc interpretation of previous analyses. 
    more » « less
  2. Abstract Non‐random mating among individuals can lead to spatial clustering of genetically similar individuals and population stratification. This deviation from panmixia is commonly observed in natural populations. Consequently, individuals can have parentage in single populations or involving hybridization between differentiated populations. Accounting for this mixture and structure is important when mapping the genetics of traits and learning about the formative evolutionary processes that shape genetic variation among individuals and populations. Stratified genetic relatedness among individuals is commonly quantified using estimates of ancestry that are derived from a statistical model. Development of these models for polyploid and mixed‐ploidy individuals and populations has lagged behind those for diploids. Here, we extend and test a hierarchical Bayesian model, calledentropy, which can use low‐depth sequence data to estimate genotype and ancestry parameters in autopolyploid and mixed‐ploidy individuals (including sex chromosomes and autosomes within individuals). Our analysis of simulated data illustrated the trade‐off between sequencing depth and genome coverage and found lower error associated with low‐depth sequencing across a larger fraction of the genome than with high‐depth sequencing across a smaller fraction of the genome. The model has high accuracy and sensitivity as verified with simulated data and through analysis of admixture among populations of diploid and tetraploidArabidopsis arenosa. 
    more » « less
  3. Kinship relationship estimation plays a significant role in today's genome studies. Since genetic data are mostly stored and protected in different silos, retrieving the desirable kinship relationships across federated data warehouses is a non-trivial problem. The ability to identify and connect related individuals is important for both research and clinical applications. In this work, we propose a new privacy-preserving kinship relationship estimation framework: Incremental Update Kinship Identification (INK). The proposed framework includes three key components that allow us to control the balance between privacy and accuracy (of kinship estimation): an incremental process coupled with the use of auxiliary information and informative scores. Our empirical evaluation shows that INK can achieve higher kinship identification correctness while exposing fewer genetic markers. 
    more » « less
  4. Pyhäjärvi, T (Ed.)
    Abstract Blackberries (Rubus spp.) are the fourth most economically important berry crop worldwide. Genome assemblies and annotations have been developed for Rubus species in subgenus Idaeobatus, including black raspberry (R. occidentalis), red raspberry (R. idaeus), and R. chingii, but very few genomic resources exist for blackberries and their relatives in subgenus Rubus. Here we present a chromosome-length assembly and annotation of the diploid blackberry germplasm accession “Hillquist” (R. argutus). “Hillquist” is the only known source of primocane-fruiting (annual-fruiting) in tetraploid fresh-market blackberry breeding programs and is represented in the pedigree of many important cultivars worldwide. The “Hillquist” assembly, generated using Pacific Biosciences long reads scaffolded with high-throughput chromosome conformation capture sequencing, consisted of 298 Mb, of which 270 Mb (90%) was placed on 7 chromosome-length scaffolds with an average length of 38.6 Mb. Approximately 52.8% of the genome was composed of repetitive elements. The genome sequence was highly collinear with a novel maternal haplotype-resolved linkage map of the tetraploid blackberry selection A-2551TN and genome assemblies of R. chingii and red raspberry. A total of 38,503 protein-coding genes were predicted, of which 72% were functionally annotated. Eighteen flowering gene homologs within a previously mapped locus aligning to an 11.2 Mb region on chromosome Ra02 were identified as potential candidate genes for primocane-fruiting. The utility of the “Hillquist” genome has been demonstrated here by the development of the first genotyping-by-sequencing-based linkage map of tetraploid blackberry and the identification of possible candidate genes for primocane-fruiting. This chromosome-length assembly will facilitate future studies in Rubus biology, genetics, and genomics and strengthen applied breeding programs. 
    more » « less
  5. Abstract Managing endangered species in fragmented landscapes requires estimating dispersal rates between populations over contemporary timescales. Here, we developed a new method for quantifying recent dispersal using genetic pedigree data for close and distant kin. Specifically, we describe an approach that infers missing shared ancestors between pairs of kin in habitat patches across a fragmented landscape. We then applied a stepping‐stone model to assign unsampled individuals in the pedigree to probable locations based on minimizing the number of movements required to produce the observed locations in sampled kin pairs. Finally, we used all pairs of reconstructed parent‐offspring sets to estimate dispersal rates between habitat patches under a Bayesian model. Our approach measures connectivity over the timescale represented by the small number of generations contained within the pedigree and so is appropriate for estimating the impacts of recent habitat changes due to human activity. We used our method to estimate recent movement between newly discovered populations of threatened Eastern Massasauga rattlesnakes (Sistrurus catenatus) using data from 2996 RAD‐based genetic loci. Our pedigree analyses found no evidence for contemporary connectivity between five genetic groups, but, as validation of our approach, showed high dispersal rates between sample sites within a single genetic cluster. We conclude that these five genetic clusters of Eastern Massasauga rattlesnakes have small numbers of resident snakes and are demographically isolated conservation units. More broadly, our methodology can be widely applied to determine contemporary connectivity rates, independent of bias from shared genetic similarity due to ancestry that impacts other approaches. 
    more » « less