skip to main content


The NSF Public Access Repository (NSF-PAR) system and access will be unavailable from 11:00 PM ET on Thursday, May 23 until 2:00 AM ET on Friday, May 24 due to maintenance. We apologize for the inconvenience.

Search for: All records

Creators/Authors contains: "Chan-Golston, Alec"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Abstract

    Community-based public health interventions often rely on representative, spatially referenced outcome data to draw conclusions about a finite population. To estimate finite-population parameters, we are posed with two challenges: to correctly account for spatial association among the sampled and nonsampled participants and to correctly model missingness in key covariates, which may be also spatially associated. To accomplish this, we take inspiration from the preferential sampling literature and develop a general Bayesian framework that can specifically account for preferential non-response. This framework is first applied to three missing data scenarios in a simulation study. It is then used to account for missing data patterns seen in reported annual household income in a corner-store intervention project. Through this, we are able to construct finite-population estimates of the percent of income spent on fruits and vegetables. Such a framework provides a flexible way to account for spatial association and complex missing data structures in finite populations.

    more » « less
  2. Abstract

    We develop a Bayesian model–based approach to finite population estimation accounting for spatial dependence. Our innovation here is a framework that achieves inference for finite population quantities in spatial process settings. A key distinction from the small area estimation setting is that we analyze finite populations referenced by their geographic coordinates. Specifically, we consider a two‐stage sampling design in which the primary units are geographic regions, the secondary units are point‐referenced locations, and the measured values are assumed to be a partial realization of a spatial process. Estimation of finite population quantities from geostatistical models does not account for sampling designs, which can impair inferential performance, whereas design‐based estimates ignore the spatial dependence in the finite population. We demonstrate by using simulation experiments that process‐based finite population sampling models improve model fit and inference over models that fail to account for spatial correlation. Furthermore, the process‐based models offer richer inference with spatially interpolated maps over the entire region. We reinforce these improvements and demonstrate scalable inference for groundwater nitrate levels in the population of California Central Valley wells by offering estimates of mean nitrate levels and their spatially interpolated maps.

    more » « less