

Title: Discussion of ‘A Unified Framework for De-Duplication and Population Size Estimation’ by Tancredi, Steorts, and Liseo.
Population size estimation techniques, such as multiple-systems or capture-recapture estimation, typically require multiple samples from the study population, along with information on which individuals are included in which samples. In many contexts, these samples come from existing data sources that contain certain information on the individuals but no unique identifiers. The goal of record linkage and duplicate detection techniques is to identify unique individuals across and within samples based on the information collected on them, which might correspond to basic partial identifiers, such as given and family name, and other demographic information. Therefore, record linkage and duplicate detection are often needed to generate the input for a given population size estimation technique that a researcher might want to use. Linkage decisions, however, are subject to uncertainty when partial identifiers are limited or contain errors and missingness, and therefore, intuitively, uncertainty in the linkage and deduplication process should be taken into account at the population size estimation stage.
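As a hedged illustration of the two-sample case (not the discussed paper's method), the classical Lincoln-Petersen estimator, in Chapman's bias-corrected form, infers total population size from two overlapping samples; the function name and the numbers below are ours:

```python
# Hypothetical illustration: two-sample capture-recapture.
# Chapman's bias-corrected version of the Lincoln-Petersen estimator.
def chapman_estimate(n1, n2, m):
    """n1, n2: sizes of the two samples; m: individuals identified in both.
    Returns an integer estimate of the total population size."""
    if m < 0 or m > min(n1, n2):
        raise ValueError("overlap must satisfy 0 <= m <= min(n1, n2)")
    return (n1 + 1) * (n2 + 1) // (m + 1) - 1

# e.g. samples of 200 and 150 individuals, with 30 linked across both:
estimate = chapman_estimate(200, 150, 30)  # -> 978
```

Note that the overlap count m is exactly what record linkage and duplicate detection produce, so linkage errors feed directly into the estimate — which is why the linkage uncertainty discussed above matters.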
Award ID(s): 1852841
PAR ID: 10389235
Journal Name: Bayesian Analysis
Volume: 15
Issue: 2
ISSN: 1931-6690
Page Range / eLocation ID: 659-663
Sponsoring Org: National Science Foundation
More Like this
  1. Merging datafiles containing information on overlapping sets of entities is a challenging task in the absence of unique identifiers, and is further complicated when some entities are duplicated in the datafiles. Most approaches to this problem have focused on linking two files assumed to be free of duplicates, or on detecting which records in a single file are duplicates. However, it is common in practice to encounter scenarios that fit somewhere in between or beyond these two settings. We propose a Bayesian approach for the general setting of multifile record linkage and duplicate detection. We use a novel partition representation to propose a structured prior for partitions that can incorporate prior information about the data collection processes of the datafiles in a flexible manner, and extend previous models for comparison data to accommodate the multifile setting. We also introduce a family of loss functions to derive Bayes estimates of partitions that allow uncertain portions of the partitions to be left unresolved. The performance of our proposed methodology is explored through extensive simulations. Supplementary materials for this article are available online. 
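As a toy illustration of what a "partition of records" means in the multifile setting (this naive exact-match grouping is a crude stand-in for the Bayesian model; all names and data are ours):

```python
# Naive stand-in for multifile record linkage + duplicate detection:
# cluster records from any number of files by exact agreement on key fields.
# The resulting partition is encoded as a record-id -> cluster-label map.
def naive_partition(records, key_fields):
    labels, seen = {}, {}
    for rid, rec in records.items():
        key = tuple(rec[f] for f in key_fields)
        if key not in seen:
            seen[key] = len(seen)  # open a new cluster
        labels[rid] = seen[key]
    return labels

recs = {
    ("file1", 0): {"name": "ana", "year": 1980},
    ("file2", 0): {"name": "ana", "year": 1980},  # cross-file link
    ("file2", 1): {"name": "ana", "year": 1980},  # within-file duplicate
    ("file3", 0): {"name": "bo",  "year": 1975},
}
labels = naive_partition(recs, ["name", "year"])
```

The Bayesian approach replaces this deterministic rule with a posterior distribution over such partitions, which is what allows uncertain portions to be left unresolved.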
  2. Summary: Computerised record linkage methods help us combine multiple data sets from different sources when a single data set with all necessary information is unavailable or when data collection on additional variables is time consuming and extremely costly. Linkage errors are inevitable in the linked data set because of the unavailability of error‐free unique identifiers. A small amount of linkage errors can lead to substantial bias and increased variability in estimating parameters of a statistical model. In this paper, we propose a unified theory for statistical analysis with linked data. Our proposed method, unlike the ones available for secondary data analysis of linked data, exploits record linkage process data as an alternative to taking a costly sample to evaluate error rates from the record linkage procedure. A jackknife method is introduced to estimate bias, covariance matrix and mean squared error of our proposed estimators. Simulation results are presented to evaluate the performance of the proposed estimators that account for linkage errors. 
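The delete-one jackknife mentioned above can be sketched generically; this minimal version (our notation, applied to a toy estimator rather than the paper's linkage-error-adjusted ones) returns a bias-corrected estimate and a variance estimate:

```python
# Delete-one jackknife for bias and variance of a generic estimator.
def jackknife(data, estimator):
    n = len(data)
    theta_hat = estimator(data)
    # leave-one-out replicates of the estimator
    loo = [estimator(data[:i] + data[i + 1:]) for i in range(n)]
    theta_bar = sum(loo) / n
    bias = (n - 1) * (theta_bar - theta_hat)
    variance = (n - 1) / n * sum((t - theta_bar) ** 2 for t in loo)
    return theta_hat - bias, variance  # bias-corrected estimate, variance

mean = lambda xs: sum(xs) / len(xs)
est, var = jackknife([1.0, 2.0, 3.0, 4.0], mean)  # est = 2.5 (mean is unbiased)
```

For the sample mean, the jackknife bias correction is zero and the variance estimate reduces to the usual s²/n, which is a quick sanity check on the implementation.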
  3. Abstract: The nocturnal aye-aye, Daubentonia madagascariensis, is one of the most elusive lemurs on the island of Madagascar. The timing of its activity and arboreal lifestyle has generally made it difficult to obtain accurate assessments of population size using traditional census methods. Therefore, alternative estimates provided by population genetic inference are essential for yielding much-needed information for conservation measures and for enabling ecological and evolutionary studies of this species. Here, we utilize genomic data from 17 individuals—including 5 newly sequenced, high-coverage genomes—to estimate this demographic history. Essential to this estimation are recently published annotations of the aye-aye genome which allow for variation at putatively neutral genomic regions to be included in the estimation procedures, and regions subject to selective constraints, or in linkage to such sites, to be excluded owing to the biasing effects of selection on demographic inference. By comparing a variety of demographic estimation tools to develop a well-supported model of population history, we find strong support for two demes, separating northern Madagascar from the rest of the island. Additionally, we find that the aye-aye has experienced two severe reductions in population size. The first occurred rapidly, ∼3,000 to 5,000 years ago, and likely corresponded with the arrival of humans to Madagascar. The second occurred over the past few decades and is likely related to substantial habitat loss, suggesting that the species is still undergoing population decline and remains at great risk for extinction. 
  4. We consider causal inference for observational studies with data spread over two files. One file includes the treatment, outcome, and some covariates measured on a set of individuals, and the other file includes additional causally relevant covariates measured on a partially overlapping set of individuals. By linking records in the two databases, the analyst can control for more covariates, thereby reducing the risk of bias compared to using one file alone. When analysts do not have access to a unique identifier that enables perfect, error-free linkages, they typically rely on probabilistic record linkage to construct a single linked data set, and estimate causal effects using these linked data. This typical practice does not propagate uncertainty from imperfect linkages to the causal inferences. Further, it does not take advantage of relationships among the variables to improve the linkage quality. We address these shortcomings by fusing regression-assisted, Bayesian probabilistic record linkage with causal inference. The Markov chain Monte Carlo sampler generates multiple plausible linked data files as byproducts that analysts can use for multiple imputation inferences. Here, we show results for two causal estimators based on propensity score overlap weights. Using simulations and data from the Italy Survey on Household Income and Wealth, we show that our approach can improve the accuracy of estimated treatment effects. 
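The "multiple plausible linked data files" device above leads to multiple-imputation-style pooling. A minimal sketch of Rubin's combining rules (our notation and numbers; the paper's actual estimators use propensity score overlap weights, not shown here):

```python
# Rubin's rules: pool an estimate computed on M plausible linked data sets.
def rubin_combine(estimates, variances):
    M = len(estimates)
    q_bar = sum(estimates) / M                   # pooled point estimate
    u_bar = sum(variances) / M                   # within-imputation variance
    b = sum((q - q_bar) ** 2 for q in estimates) / (M - 1)  # between-imputation
    total_variance = u_bar + (1 + 1 / M) * b     # linkage uncertainty enters via b
    return q_bar, total_variance

# e.g. a treatment effect estimated on three plausible linked files:
effect, var = rubin_combine([1.0, 1.2, 0.9], [0.04, 0.05, 0.04])
```

The between-imputation term b is what carries the linkage uncertainty into the final standard error, which is exactly what the single-linked-file practice omits.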
  5. Wren, Jonathan (Ed.)
    Abstract
    Motivation: Mathematical models in systems biology help generate hypotheses, guide experimental design, and infer the dynamics of gene regulatory networks. These models are characterized by phenomenological or mechanistic parameters, which are typically hard to measure. Therefore, efficient parameter estimation is central to model development. Global optimization techniques, such as evolutionary algorithms (EAs), are applied to estimate model parameters by inverse modeling, i.e. calibrating models by minimizing a function that evaluates a measure of the error between model predictions and experimental data. EAs search for the "fittest individuals" by generating a large population of individuals using strategies like recombination and mutation over multiple "generations." Typically, only a few individuals from each generation are used to create new individuals in the next generation. Improved Evolutionary Strategy by Stochastic Ranking (ISRES), proposed by Runarsson and Yao, is one such EA that is widely used in systems biology to estimate parameters. ISRES uses information from at most a pair of individuals in any generation to create a new population to minimize the error. In this article, we propose an efficient evolutionary strategy, ISRES+, which builds on ISRES by combining information from all individuals across the population and across all generations to develop a better understanding of the fitness landscape.
    Results: ISRES+ uses the additional information generated by the algorithm during evolution to approximate the local neighborhood around the best-fit individual using linear least-squares fits in one and two dimensions, enabling efficient parameter estimation. ISRES+ outperforms ISRES and results in fitter individuals with a tighter distribution over multiple runs, such that a typical run of ISRES+ estimates parameters with a higher goodness-of-fit compared with ISRES. 
    Availability and implementation: Algorithm and implementation on GitHub: https://github.com/gtreeves/isres-plus-bandodkar-2022. 
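As a hedged sketch of the general device described above, here is a bare-bones (mu, lambda) evolution strategy with Gaussian mutation. This is a deliberately simplified relative of ISRES for illustration only: it omits recombination, stochastic ranking, and the ISRES+ least-squares machinery, and all names are ours:

```python
import random

def es_minimize(loss, dim, mu=5, lam=20, sigma=0.3, generations=60, seed=0):
    """Bare-bones (mu, lambda) evolution strategy: each generation, keep the
    mu fittest individuals and mutate them to produce lam new offspring."""
    rng = random.Random(seed)
    pop = [[rng.uniform(-1.0, 1.0) for _ in range(dim)] for _ in range(lam)]
    for _ in range(generations):
        parents = sorted(pop, key=loss)[:mu]      # rank by model-vs-data error
        pop = [[x + rng.gauss(0.0, sigma) for x in rng.choice(parents)]
               for _ in range(lam)]               # mutation only, no recombination
        sigma *= 0.95                             # simple step-size decay
    return min(pop, key=loss)

# Toy "inverse modeling" objective with known optimum at (0.5, -0.25):
def sq_error(p):
    return (p[0] - 0.5) ** 2 + (p[1] + 0.25) ** 2

best = es_minimize(sq_error, dim=2)
```

Like ISRES, this calibrates parameters by minimizing an error function between predictions and data; the ISRES+ contribution is to reuse the evaluations accumulated across all generations, rather than discarding them as this sketch does.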