skip to main content

Title: Dynamics of data availability in disease modeling: An example evaluating the trade-offs of ultra-fine-scale factors applied to human West Nile virus disease models in the Chicago area, USA
Background Since 1999, West Nile virus (WNV) has moved rapidly across the United States, resulting in tens of thousands of human cases. Both the number of human cases and the minimum infection rate (MIR) in vector mosquitoes vary across time and space and are driven by numerous abiotic and biotic forces, ranging from differences in microclimates to socio-demographic factors. Because the interactions among these multiple factors affect the locally variable risk of WNV illness, it has been especially difficult to model human disease risk across varying spatial and temporal scales. Cook and DuPage Counties, comprising the city of Chicago and surrounding suburbs, experience some of the highest numbers of human neuroinvasive cases of WNV in the United States. Despite active mosquito control efforts, there is consistent annual WNV presence, resulting in more than 285 confirmed WNV human cases and 20 deaths from the years 2014–2018 in Cook County alone. Methods A previous Chicago-area WNV model identified the fifty-five most high and low risk locations in the Northwest Mosquito Abatement District (NWMAD), an enclave ¼ the size of the combined Cook and DuPage county area. In these locations, human WNV risk was stratified by model performance, as indicated by differences in more » studentized residuals. Within these areas, an additional two-years of field collections and data processing was added to a 12-year WNV dataset that includes human cases, MIR, vector abundance, and land-use, historical climate, and socio-economic and demographic variables, and was assessed by an ultra-fine-scale (1 km spatial x 1 week temporal resolution) multivariate logistic regression model. Results Multivariate statistical methods applied to the ultra-fine-scale model identified fewer explanatory variables while improving upon the fit of the previous model. Beyond MIR and climatic factors, efforts to acquire additional covariates only slightly improved model predictive performance. Conclusions These results suggest human WNV illness in the Chicago area may be associated with fewer, but increasingly critical, key variables at finer scales. Given limited resources, these findings suggest large variations in model performance occur, depending on covariate availability, and provide guidance in variable selection for optimal WNV human illness modeling. « less
; ; ; ; ; ;
Wen, Feng
Award ID(s):
Publication Date:
Journal Name:
Page Range or eLocation-ID:
Sponsoring Org:
National Science Foundation
More Like this
  1. ABSTRACT Mosquito surveillance is critical to reduce the risk of West Nile virus (WNV) transmission to humans. In response to surveillance indicators such as elevated mosquito abundance or increased WNV levels, many mosquito control programs will perform truck-mounted ultra-low volume (ULV) adulticide application to reduce the number of mosquitoes and associated virus transmission. Despite the common use of truck-based ULV adulticiding as a public health measure to reduce WNV prevalence, limited evidence exists to support a role in reducing viral transmission to humans. We use a generalized additive and fused ridge regression model to quantify the location-specific impact of truck-mountedmore »ULV adulticide spray efforts from 2010 to 2018 in the North Shore Mosquito Abatement District (NSMAD) in metropolitan Chicago, IL, on commonly assessed risk factors from NSMAD surveillance gravid traps: Culex abundance, infection rate, and vector index. Our model also takes into account environmental variables commonly associated with WNV, including temperature, precipitation, wind speed, location, and week of year. Since it is unlikely ULV adulticide spraying will have the same impact at each trap location, we use a spatially varying spray effect with a fused ridge penalty to determine how the effect varies by trap location. We found that ULV adulticide spraying has an immediate temporary reduction in abundance followed by an increase after 5 days. It is estimated that mosquito abundance increased more in sprayed areas than if left unsprayed in all but 3 trap locations. The impact on infection rate and vector index were inconclusive due to the large error associated with estimating trap-specific infection rates.« less
  2. Abstract

    Mosquito-borne diseases (MBD) threaten over 80% of the world’s population, and are increasing in intensity and shifting in geographical range with land use and climate change. Mitigation hinges on understanding disease-specific risk profiles, but current risk maps are severely limited in spatial resolution. One important determinant of MBD risk is temperature, and though the relationships between temperature and risk have been extensively studied, maps are often created using sparse data that fail to capture microclimatic conditions. Here, we leverage high resolution land surface temperature (LST) measurements, in conjunction with established relationships between air temperature and MBD risk factors likemore »mosquito biting rate and transmission probability, to produce fine resolution (70 m) maps of MBD risk components. We focus our case study on West Nile virus (WNV) in the San Joaquin Valley of California, where temperatures vary widely across the day and the diverse agricultural/urban landscape. We first use field measurements to establish a relationship between LST and air temperature, and apply it to Ecosystem Spaceborne Thermal Radiometer Experiment data (2018–2020) in peak WNV transmission months (June–September). We then use the previously derived equations to estimate spatially explicit mosquito biting and WNV transmission rates. We use these maps to uncover significant differences in risk across land cover types, and identify the times of day which contribute to high risk for different land covers. Additionally, we evaluate the value of high resolution spatial and temporal data in avoiding biased risk estimates due to Jensen’s inequality, and find that using aggregate data leads to significant biases of up to 40.5% in the possible range of risk values. Through this analysis, we show that the synergy between novel remote sensing technology and fundamental principles of disease ecology can unlock new insights into the spatio-temporal dynamics of MBDs.

    « less
  3. Temperature is widely known to influence the spatio-temporal dynamics of vector-borne disease transmission, particularly as temperatures vary across critical thermal thresholds. When temperature conditions exhibit such ‘transcritical variation’, abrupt spatial or temporal discontinuities may result, generating sharp geographical or seasonal boundaries in transmission. Here, we develop a spatio-temporal machine learning algorithm to examine the implications of transcritical variation for West Nile virus (WNV) transmission in the Los Angeles metropolitan area (LA). Analysing a large vector and WNV surveillance dataset spanning 2006–2016, we found that mean temperatures in the previous month strongly predicted the probability of WNV presence in pools ofmore »Culex quinquefasciatus mosquitoes, forming distinctive inhibitory (10.0–21.0°C) and favourable (22.7–30.2°C) mean temperature ranges that bound a narrow 1.7°C transitional zone (21–22.7°C). Temperatures during the most intense months of WNV transmission (August/September) were more strongly associated with infection probability in Cx. quinquefasciatus pools in coastal LA, where temperature variation more frequently traversed the narrow transitional temperature range compared to warmer inland locations. This contributed to a pronounced expansion in the geographical distribution of human cases near the coast during warmer-than-average periods. Our findings suggest that transcritical variation may influence the sensitivity of transmission to climate warming, and that especially vulnerable locations may occur where present climatic fluctuations traverse critical temperature thresholds.« less
  4. We present an interpretable high-resolution spatio-temporal model to estimate COVID-19 deaths together with confirmed cases 1 week ahead of the current time, at the county level and weekly aggregated, in the United States. A notable feature of our spatio-temporal model is that it considers the (1) temporal auto- and pairwise correlation of the two local time series (confirmed cases and deaths from the COVID-19), (2) correlation between locations (propagation between counties), and (3) covariates such as local within-community mobility and social demographic factors. The within-community mobility and demographic factors, such as total population and the proportion of the elderly, aremore »included as important predictors since they are hypothesized to be important in determining the dynamics of COVID-19. To reduce the model’s high dimensionality, we impose sparsity structures as constraints and emphasize the impact of the top 10 metropolitan areas in the nation, which we refer to (and treat within our models) as hubs in spreading the disease. Our retrospective out-of-sample county-level predictions were able to forecast the subsequently observed COVID-19 activity accurately. The proposed multivariate predictive models were designed to be highly interpretable, with clear identification and quantification of the most important factors that determine the dynamics of COVID-19. Ongoing work involves incorporating more covariates, such as education and income, to improve prediction accuracy and model interpretability.« less
  5. The deployment of vaccines across the US provides significant defense against serious illness and death from COVID-19. Over 70% of vaccine-eligible Americans are at least partially vaccinated, but there are pockets of the population that are under-vaccinated, such as in rural areas and some demographic groups (e.g. age, race, ethnicity). These unvaccinated pockets are extremely susceptible to the Delta variant, exacerbating the healthcare crisis and increasing the risk of new variants. In this paper, we describe a data-driven model that provides real-time support to Virginia public health officials by recommending mobile vaccination site placement in order to target under-vaccinated populations.more »Our strategy uses fine-grained mobility data, along with US Census and vaccination uptake data, to identify locations that are most likely to be visited by unvaccinated individuals. We further extend our model to choose locations that maximize vaccine uptake among hesitant groups. We show that the top recommended sites vary substantially across some demographics, demonstrating the value of developing customized recommendation models that integrate fine-grained, heterogeneous data sources. In addition, we used a statistically equivalent Synthetic Population to study the effect of combined demographics (eg, people of a particular race and age), which is not possible using US Census data alone. We validate our recommendations by analyzing the success rates of deployed vaccine sites, and show that sites placed closer to our recommended areas administered higher numbers of doses. Our model is the first of its kind to consider evolving mobility patterns in real-time for suggesting placement strategies customized for different targeted demographic groups. Our results will be presented at IAAI-22, but given the critical nature of the pandemic, we offer this extended version of that paper for more timely consideration of our approach and to cover additional findings.« less