
Title: Demonstrating a systems approach for integrating disparate data streams to inform decisions on children’s environmental health
Abstract Background

The use of systems science methodologies to understand complex environmental and human health relationships is increasing. Requirements for advanced datasets, models, and expertise limit current application of these approaches by many environmental and public health practitioners.


A conceptual system-of-systems model was applied for children in North Carolina counties that includes example indicators of children’s physical environment (home age, Brownfield sites, Superfund sites), social environment (caregiver’s income, education, insurance), and health (low birthweight, asthma, blood lead levels). The web-based Toxicological Prioritization Index (ToxPi) tool was used to normalize the data, rank the resulting vulnerability index, and visualize impacts from each indicator in a county. Hierarchical clustering was used to sort the 100 North Carolina counties into groups based on similar ToxPi model results. The ToxPi charts for each county were also superimposed over a map of percentage county population under age 5 to visualize spatial distribution of vulnerability clusters across the state.
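
The normalize, score, rank, and cluster steps described above can be sketched in a few lines of Python. This is a simplified illustration and not the ToxPi tool's implementation: the four counties, the indicator values, the equal slice weights, the use of a within-slice mean, and the average-linkage clustering rule are all assumptions made for the example.

```python
from itertools import combinations

# Hypothetical indicator values for four made-up counties (not study data).
DATA = {
    "A": {"home_age": 0.8, "superfund": 2, "poverty": 0.30, "low_birthweight": 0.11},
    "B": {"home_age": 0.7, "superfund": 2, "poverty": 0.28, "low_birthweight": 0.10},
    "C": {"home_age": 0.2, "superfund": 0, "poverty": 0.10, "low_birthweight": 0.07},
    "D": {"home_age": 0.3, "superfund": 0, "poverty": 0.12, "low_birthweight": 0.07},
}
# Slices group indicators by domain; equal weights are an assumption.
SLICES = {
    "physical": ["home_age", "superfund"],
    "social": ["poverty"],
    "health": ["low_birthweight"],
}
WEIGHTS = {"physical": 1.0, "social": 1.0, "health": 1.0}


def min_max(values):
    """Rescale a list of numbers to the range [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def score_counties(data, slices, weights):
    """Return per-county slice profiles and a weighted overall score."""
    counties = sorted(data)
    norm = {c: {} for c in counties}
    for members in slices.values():
        for ind in members:
            scaled = min_max([data[c][ind] for c in counties])
            for c, s in zip(counties, scaled):
                norm[c][ind] = s
    total_w = sum(weights.values())
    profiles, scores = {}, {}
    for c in counties:
        # Slice value = mean of its normalized indicators (a simplification).
        profiles[c] = {
            name: sum(norm[c][i] for i in members) / len(members)
            for name, members in slices.items()
        }
        scores[c] = sum(weights[n] * profiles[c][n] for n in slices) / total_w
    return profiles, scores


def cluster(profiles, k):
    """Average-linkage agglomerative clustering of slice profiles into k groups."""
    vec = {c: list(p.values()) for c, p in profiles.items()}

    def euclid(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5

    def dist(g1, g2):
        return sum(euclid(vec[a], vec[b]) for a in g1 for b in g2) / (len(g1) * len(g2))

    groups = [[c] for c in sorted(profiles)]
    while len(groups) > k:
        # Merge the two closest groups (i < j, so deleting j is safe).
        i, j = min(
            combinations(range(len(groups)), 2),
            key=lambda p: dist(groups[p[0]], groups[p[1]]),
        )
        groups[i] = groups[i] + groups[j]
        del groups[j]
    return groups


profiles, scores = score_counties(DATA, SLICES, WEIGHTS)
ranking = sorted(scores, key=scores.get, reverse=True)  # most vulnerable first
clusters = cluster(profiles, k=2)
print(ranking)
print(clusters)
```

Clustering on the slice profiles rather than on the single overall score keeps counties with similar totals but different drivers (for example, home age versus Superfund sites) in separate groups, which is the kind of distinction the results below draw.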


Data-driven clustering for this systems model suggests 5 groups of counties. One group includes 6 counties with the highest vulnerability scores showing strong influences from all three categories of indicators (social environment, physical environment, and health). A second group contains 15 counties with high vulnerability scores driven by strong influences from home age in the physical environment and poverty in the social environment. A third group is driven by data on Superfund sites in the physical environment.


This analysis demonstrated how systems science principles can be used to synthesize holistic insights for decision making using publicly available data and computational tools, focusing on a children’s environmental health example. Where more traditional reductionist approaches can elucidate individual relationships between environmental variables and health, the study of collective, system-wide interactions can enable insights into the factors that contribute to regional vulnerabilities and interventions that better address complex real-world conditions.

Journal Name: BMC Public Health
Publisher: Springer Science + Business Media
Sponsoring Org: National Science Foundation
More Like this
  1. Abstract

    Unregulated private wells are understudied potential sources of community-acquired Legionnaires’ disease. Here we conducted a comprehensive survey of 44 homes supplied by private wells in Wake County, North Carolina, quantifying Legionella spp. DNA, Legionella pneumophila DNA, and total bacterial 16S rRNA genes via real-time polymerase chain reaction in hot and cold drinking water samples, along with culturable L. pneumophila via IDEXX Legiolert in cold drinking water samples. Legionella spp. DNA, L. pneumophila DNA and culturable L. pneumophila were detected in 100, 65·5 and 15·9% of the 44 homes, respectively, and culturable levels were comparable to some municipal surveys applying the same methods. Total coliforms and Escherichia coli were monitored as representative faecal indicators and were found in 20·4 and 0·0% of homes. Within certain sample types, Legionella spp. and L. pneumophila gene copy numbers were positively associated with total bacteria (i.e. total 16S rRNA genes) and water softener use, but were not associated with faecal indicator bacteria, inorganic water parameters or other well characteristics. These findings confirm that occurrence of Legionella and L. pneumophila is highly variable in private wells.

    Significance and Impact of the Study

    Legionella is the leading identified cause of waterborne disease outbreaks associated with US municipal water systems. While Legionella is known to occur naturally in groundwater, prior efforts to characterize its occurrence in unregulated private wells are limited to sampling at the wellhead and not in the home plumbing where Legionella can thrive. This work documents much higher levels of Legionella in home plumbing versus water directly from private wells and examines factors associated with higher Legionella occurrence.

  2. Midlife non-Hispanic white mortality in the United States is rising, particularly in small metro and rural counties. This article responds to calls for county-level studies. We examine social determinants of morbidity and mortality among adult non-Hispanic whites in Yavapai County, Arizona, as part of an integrative study. We report overall mortality trends in Yavapai County using CDC Wonder data and then examine social determinants of reported physical health and mental distress in Yavapai County data using 6 years (2011–2016) of the Arizona Behavioral Risk Factor Surveillance System (BRFSS). The BRFSS includes 1,024 non-Hispanic white respondents aged 25–64. We also present data from the recently established Yavapai County Overdose Fatality Review Board (YCOFRB). Mortality trends indicate that suicide and drug- and alcohol-related mortality have all increased since 1999. These increases affect all 5-year age groups from 25 to 64 and both men and women. BRFSS data show that low education and unemployment, but not number of children or home ownership, are significantly associated with worse reported health and frequent mental distress in multivariate analyses. The YCOFRB data point to the importance of homelessness and mental health. The mortality crisis in Yavapai County is not restricted to midlife or to drug-related deaths. The unemployed and those with low levels of education are particularly at risk. There is a need for integrative approaches that use local data to elucidate social determinants of morbidity and mortality and to reveal structural determinants.
  3. Abstract Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the causal agent for COVID-19, is a communicable disease spread through close contact. It is known to disproportionately impact certain communities due to both biological susceptibility and inequitable exposure. In this study, we investigate the most important health, social, and environmental factors impacting the early phases (before July 2020) of per capita COVID-19 transmission and per capita all-cause mortality in US counties. We aggregate county-level physical and mental health, environmental pollution, access to health care, demographic characteristics, vulnerable population scores, and other epidemiological data to create a large feature set to analyze per capita COVID-19 outcomes. Because of the high dimensionality, multicollinearity, and unknown interactions of the data, we use ensemble machine learning and marginal prediction methods to identify the most salient factors associated with several COVID-19 outbreak measures. Our variable importance results show that measures of ethnicity, public transportation and preventable diseases are the strongest predictors for both per capita COVID-19 incidence and mortality. Specifically, the CDC measures for minority populations, CDC measures for limited English, and proportion of Black- and/or African-American individuals in a county were the most important features for per capita COVID-19 cases within a month after the pandemic started in a county and also at the latest date examined. For per capita all-cause mortality at day 100 and total to date, we find that public transportation use and proportion of Black- and/or African-American individuals in a county are the strongest predictors. The methods predict that a 10% increase in public transportation use, with all other factors held fixed at their observed values, is associated with an increase in mortality at day 100 of 2012 individuals (95% CI [1972, 2356]), and likewise a 10% increase in the proportion of Black- and/or African-American individuals in a county is associated with an increase in total deaths at end of study of 2067 (95% CI [1189, 2654]). Using data until the end of study, the same metric suggests ethnicity has twice the association of the next most important factors, which are location, disease prevalence, and transit factors. Our findings shed light on societal patterns that have been reported and experienced in the U.S. by using robust methods to understand the features most responsible for transmission and sectors of society most vulnerable to infection and mortality. In particular, our results provide evidence of the disproportionate impact of the COVID-19 pandemic on minority populations. Our results suggest that mitigation measures, including how vaccines are distributed, could have the greatest impact if they are given with priority to the highest risk communities.
  4. Abstract Background

    We conducted a large-scale, passive regional survey of ticks associated with wildlife of the eastern United States. Our primary goals were to better assess the current geographical distribution of exotic Haemaphysalis longicornis and to identify potential wild mammalian and avian host species. However, this large-scale survey also provided valuable information regarding the distribution and host associations for many other important tick species that utilize wildlife as hosts.


    Ticks were opportunistically collected by cooperating state and federal wildlife agencies. All ticks were placed in the supplied vials and host information was recorded, including host species, age, sex, examination date, location (at least county and state), and estimated tick burden. All ticks were identified to species using morphology, and suspect H. longicornis were confirmed through molecular techniques.


    In total, 1940 hosts were examined from across 369 counties from 23 states in the eastern USA. From these submissions, 20,626 ticks were collected and identified belonging to 11 different species. Our passive surveillance efforts detected exotic H. longicornis from nine host species from eight states. Notably, some of the earliest detections of H. longicornis in the USA were collected from wildlife through this passive surveillance network. In addition, numerous new county reports were generated for Amblyomma americanum, Amblyomma maculatum, Dermacentor albipictus, Dermacentor variabilis, and Ixodes scapularis.

    Conclusions

    This study provided data on ticks collected from animals from 23 different states in the eastern USA between 2010 and 2021, with the primary goal of better characterizing the distribution and host associations of the exotic tick H. longicornis; however, new distribution data on tick species of veterinary or medical importance were also obtained. Collectively, our passive surveillance has detected numerous new county reports for H. longicornis as well as I. scapularis. Our study utilizing passive wildlife surveillance for ticks across the eastern USA is an effective method for surveying a diversity of wildlife host species, allowing us to better collect data on current tick distributions relevant to human and animal health.

  5. Turner, Richard (Ed.)
    Background With the availability of multiple Coronavirus Disease 2019 (COVID-19) vaccines and the predicted shortages in supply for the near future, it is necessary to allocate vaccines in a manner that minimizes severe outcomes, particularly deaths. To date, vaccination strategies in the United States have focused on individual characteristics such as age and occupation. Here, we assess the utility of population-level health and socioeconomic indicators as additional criteria for geographical allocation of vaccines. Methods and findings County-level estimates of 14 indicators associated with COVID-19 mortality were extracted from public data sources. Effect estimates of the individual indicators were calculated with univariate models. Presence of spatial autocorrelation was established using Moran’s I statistic. Spatial simultaneous autoregressive (SAR) models that account for spatial autocorrelation in response and predictors were used to assess (i) the proportion of variance in county-level COVID-19 mortality that can be explained by identified health/socioeconomic indicators (R²); and (ii) effect estimates of each predictor. Adjusting for case rates, the selected indicators individually explain 24%–29% of the variability in mortality. Prevalence of chronic kidney disease and proportion of population residing in nursing homes have the highest R². Mortality is estimated to increase by 43 per thousand residents (95% CI: 37–49; p < 0.001) with a 1% increase in the prevalence of chronic kidney disease and by 39 deaths per thousand (95% CI: 34–44; p < 0.001) with a 1% increase in population living in nursing homes. SAR models using multiple health/socioeconomic indicators explain 43% of the variability in COVID-19 mortality in US counties, adjusting for case rates. R² was found to be insensitive to the choice of SAR model form. Study limitations include the use of mortality rates that are not age standardized, a spatial adjacency matrix that does not capture human flows among counties, and insufficient accounting for interaction among predictors. Conclusions Significant spatial autocorrelation exists in COVID-19 mortality in the US, and population health/socioeconomic indicators account for considerable variability in county-level mortality. In the context of vaccine rollout in the US and globally, national and subnational estimates of burden of disease could inform optimal geographical allocation of vaccines.
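
As a rough illustration of the Moran's I statistic mentioned in the abstract above, the sketch below computes global Moran's I for a handful of areas. The rate values and the binary adjacency matrix are hypothetical and chosen only for the example; they are not the authors' data or their spatial weights, and no significance test is included.

```python
def morans_i(values, weights):
    """Global Moran's I for a list of values and a square spatial weight matrix.

    I = (n / W) * sum_ij w_ij (x_i - mean)(x_j - mean) / sum_i (x_i - mean)^2,
    where W is the sum of all weights.
    """
    n = len(values)
    if n == 0 or any(len(row) != n for row in weights) or len(weights) != n:
        raise ValueError("weights must be an n x n matrix matching values")
    mean = sum(values) / n
    dev = [v - mean for v in values]
    w_sum = sum(sum(row) for row in weights)
    den = sum(d * d for d in dev)
    if w_sum == 0 or den == 0:
        raise ValueError("Moran's I is undefined for zero weights or constant values")
    num = sum(weights[i][j] * dev[i] * dev[j] for i in range(n) for j in range(n))
    return (n / w_sum) * (num / den)


# Hypothetical example: four areas in a row, each adjacent to its neighbours.
ADJACENCY = [
    [0, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [0, 0, 1, 0],
]
clustered = morans_i([10, 9, 2, 1], ADJACENCY)    # similar values sit side by side
alternating = morans_i([10, 1, 9, 2], ADJACENCY)  # high and low values alternate
print(clustered, alternating)
```

Positive values indicate that neighbouring areas tend to have similar rates, and negative values indicate that neighbours tend to differ. A binary adjacency matrix like the one assumed here cannot capture movement between non-adjacent areas, which is one of the limitations the abstract notes.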