Abstract India is largely devoid of high‐quality and reliable on‐the‐ground measurements of fine particulate matter (PM2.5). Ground‐level PM2.5concentrations are estimated from publicly available satellite Aerosol Optical Depth (AOD) products combined with other information. Prior research has largely overlooked the possibility of gaining additional accuracy and insights into the sources of PM using satellite retrievals of tropospheric trace gas columns. We evaluate the information content of tropospheric trace gas columns for PM2.5estimates over India within a modeling testbed using an Automated Machine Learning (AutoML) approach, which selects from a menu of different machine learning tools based on the data set. We then quantify the relative information content of tropospheric trace gas columns, AOD, meteorological fields, and emissions for estimating PM2.5over four Indian sub‐regions on daily and monthly time scales. Our findings suggest that, regardless of the specific machine learning model assumptions, incorporating trace gas modeled columns improves PM2.5estimates. We use the ranking scores produced from the AutoML algorithm and Spearman’s rank correlation to infer or link the possible relative importance of primary versus secondary sources of PM2.5as a first step toward estimating particle composition. Our comparison of AutoML‐derived models to selected baseline machine learning models demonstrates that AutoML is at least as good as user‐chosen models. The idealized pseudo‐observations (chemical‐transport model simulations) used in this work lay the groundwork for applying satellite retrievals of tropospheric trace gases to estimate fine particle concentrations in India and serve to illustrate the promise of AutoML applications in atmospheric and environmental research.
more »
« less
Developing high-resolution PM2.5 exposure models by integrating low-cost sensors, automated machine learning, and big human mobility data
Introduction:Traditional methods to estimate exposure to PM2.5(particulate matter with less than 2.5 µm in diameter) have typically relied on limited regulatory monitors and do not consider human mobility and travel. However, the limited spatial coverage of regulatory monitors and the lack of consideration of mobility limit the ability to capture actual air pollution exposure. Methods:This study aims to improve traditional exposure assessment methods for PM2.5by incorporating the measurements from a low-cost sensor network (PurpleAir) and regulatory monitors, an automated machine learning modeling framework, and big human mobility data. We develop a monthly-aggregated hourly land use regression (LUR) model based on automated machine learning (AutoML) and assess the model performance across eight metropolitan areas within the US. Results:Our results show that integrating low-cost sensor with regulatory monitor measurements generally improves the AutoML-LUR model accuracy and produces higher spatial variation in PM2.5concentration maps compared to using regulatory monitor measurements alone. Feature importance analysis shows factors highly correlated with PM2.5concentrations, including satellite aerosol optical depth, meteorological variables, vegetation, and land use. In addition, we incorporate human mobility data on exposure estimates regarding where people visit to identify spatiotemporal hotspots of places with higher risks of exposure, emphasizing the need to consider both visitor numbers and PM2.5concentrations when developing exposure reduction strategies. Discussion:This research provides important insights for further public health studies on air pollution by comprehensively assessing the performance of AutoML-LUR models and incorporating human mobility into considering human exposure to air pollution.
more »
« less
- Award ID(s):
- 1852428
- PAR ID:
- 10541225
- Publisher / Repository:
- Frontiers
- Date Published:
- Journal Name:
- Frontiers in Environmental Science
- Volume:
- 11
- ISSN:
- 2296-665X
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Abstract The paucity of fine particulate matter (PM2.5) measurements limits estimates of air pollution mortality in Sub‐Saharan Africa. Well calibrated low‐cost sensors can provide reliable data especially where reference monitors are unavailable. We evaluate the performance of Clarity Node‐S PM monitors against a Tapered element oscillating microbalance (TEOM) 1400a and develop a calibration model in Mombasa, Kenya's second largest city. As‐reported Clarity Node‐S data from January 2023 through April 2023 was moderately correlated with the TEOM‐1400a measurements (R2 = 0.61) and exhibited a mean absolute error (MAE) of 7.03 μg m−3. Employing three calibration models, namely, multiple linear regression (MLR), Gaussian mixture regression and random forest (RF) decreased the MAE to 4.28, 3.93, and 4.40 μg m−3respectively. TheR2value improved to 0.63 for the MLR model but all other models registered a decrease (R2 = 0.44 and 0.60 respectively). Applying the correction factor to a five‐sensor network in Mombasa that was operated between July 2021 and July 2022 gave insights to the air quality in the city. The average daily concentrations of PM2.5within the city ranged from 12 to 18 μg m−3. The concentrations exceeded the WHO daily PM2.5limits more than 50% of the time, in particular at the sites nearby frequent industrial activity. Higher averages were observed during the dry and cold seasons and during early morning and evening periods of high activity. These results represent some of the first air quality monitoring measurements in Mombasa and highlight the need for more study.more » « less
-
Abstract AimTwo important environmental hazards for nocturnally migrating birds are artificial light at night (ALAN) and air pollution, with ambient fine particulate matter (PM2.5) considered to be especially harmful. Nocturnally migrating birds are attracted to ALAN during seasonal migration, which could increase exposure to PM2.5. Here, we examine PM2.5concentrations and PM2.5trends and the spatial correlation between ALAN and PM2.5within the geographical ranges of the world’s nocturnally migrating birds. LocationGlobal. Time period1998–2018. Major taxa studiedNocturnally migrating birds. MethodsWe intersected a global database of annual mean PM2.5concentrations over a 21‐year period (1998–2018) with the geographical ranges (breeding, non‐breeding and regions of passage) of 225 nocturnally migrating bird species in three migration flyways (Americas,n = 143; Africa–Europe,n = 36; and East Asia–Australia,n = 46). For each species, we estimated PM2.5concentrations and trends and measured the correlation between ALAN and PM2.5, which we summarized by season and flyway. ResultsCorrelations between ALAN and PM2.5were significantly positive across all seasons and flyways. The East Asia–Australia flyway had the strongest ALAN–PM2.5correlations within regions of passage, the highest PM2.5concentrations across all three seasons and the strongest positive PM2.5trends on the non‐breeding grounds and within regions of passage. The Americas flyway had the strongest negative air pollution trends on the non‐breeding grounds and within regions of passage. The breeding grounds had similarly negative air pollution trends within the three flyways. Main conclusionsThe combined threats of ALAN and air pollution are greatest and likely to be increasing within the East Asia–Australia flyway and lowest and likely to be decreasing within the Americas and Africa–Europe flyways. Reversing PM2.5trends in the East Asia–Australia flyway and maintaining negative PM2.5trends in the Americas and Africa–Europe flyways while reducing ALAN levels would likely be beneficial for the nocturnally migrating bird populations in each region.more » « less
-
Abstract Particulate matter poses significant risks to respiratory and cardiovascular health. Monitoring ambient particulate matter concentrations can provide information on potential exposures and inform mitigation strategies, but ground-based measurements are sparse. Data fusion approaches that integrate data from multiple sources can complement existing observation networks and reveal insights that single-sensor data might miss to better manage pollutant exposure risks. However, data fusion approaches face multiple challenges, including incompatible measurement units, varying data resolutions, and differing levels of uncertainty. As a result, the optimal method for data fusion remains an open question. Here, we propose a probabilistic spatiotemporal model, based on the stochastic advection–diffusion (SAD) equation, as a data fusion method to process multimodal air quality data to predict hourly concentrations of fine particulate matter (PM2.5). We employ a variational inference method to calibrate the probabilistic model using ground-level observations and the numerical output of two simulation models. We then evaluate the prediction performance of our model for two scenarios: (1) incorporating simulation outputs and ground-level observations from sparse regulatory-grade stations and (2) using ground-level observations from both low-cost and regulatory-grade stations. For the first scenario, the data fusion method reduces prediction error by 14% compared to the nearest regulatory-grade air monitor located 20 km away. For the second scenario, error is reduced by 40% compared to the nearest regulatory-grade monitor and 11% compared to the nearest low-cost sensor located approximately 1 km away. The model captures 78% of observed data within a 75% confidence interval across both scenarios, demonstrating its ability to accurately represent uncertainty. Our findings demonstrate that the proposed SAD model can effectively integrate multimodal data to provide improved prediction of particulate matter concentrations at high spatial resolution. Model outputs can inform individual and community-level decision-making to mitigate air pollutant exposures.more » « less
-
Abstract Wildfires cause elevated air pollution that can be detrimental to human health. However, health impact assessments associated with emissions from wildfire events are subject to uncertainty arising from different sources. Here, we quantify and compare major uncertainties in mortality and morbidity outcomes of exposure to fine particulate matter (PM2.5) pollution estimated for a series of wildfires in the Southeastern U.S. We present an approach to compare uncertainty in estimated health impacts specifically due to two driving factors, wildfire‐related smoke PM2.5fields and variability in concentration‐response parameters from epidemiologic studies of ambient and smoke PM2.5. This analysis, focused on the 2016 Southeastern wildfires, suggests that emissions from these fires had public health consequences in North Carolina. Using several methods based on publicly available monitor data and atmospheric models to represent wildfire‐attributable PM2.5, we estimate impacts on several health outcomes and quantify associated uncertainty. Multiple concentration‐response parameters derived from studies of ambient and wildfire‐specific PM2.5are used to assess health‐related uncertainty. Results show large variability and uncertainty in wildfire impact estimates, with comparable uncertainties due to the smoke pollution fields and health response parameters for some outcomes, but substantially larger health‐related uncertainty for several outcomes. Consideration of these uncertainties can support efforts to improve estimates of wildfire impacts and inform fire‐related decision‐making.more » « less
An official website of the United States government

