Breathing in fine particulate matter of diameter less than 2.5 µm (PM2.5) greatly increases an individual’s risk of cardiovascular and respiratory diseases. As climate change progresses, extreme weather events, including wildfires, are expected to increase, exacerbating air pollution. However, models often struggle to capture extreme pollution events due to the rarity of high PM2.5 levels in training datasets. To address this, we implemented cluster-based undersampling and trained Transformer models to improve extreme event prediction using various cutoff thresholds (12.1 µg/m3 and 35.5 µg/m3) and partial sampling ratios (10/90, 20/80, 30/70, 40/60, 50/50). Our results demonstrate that the 35.5 µg/m3 threshold, paired with a 20/80 partial sampling ratio, achieved the best performance, with an RMSE of 2.080, MAE of 1.386, and R2 of 0.914, particularly excelling in forecasting high PM2.5 events. Overall, models trained on augmented data significantly outperformed those trained on original data, highlighting the importance of resampling techniques in improving air quality forecasting accuracy, especially for high-pollution scenarios. These findings provide critical insights into optimizing air quality forecasting models, enabling more reliable predictions of extreme pollution events. By advancing the ability to forecast high PM2.5 levels, this study contributes to the development of more informed public health and environmental policies to mitigate the impacts of air pollution, and advanced the technology for building better air quality digital twins. 
                        more » 
                        « less   
                    
                            
                            Performance evaluation of the Alphasense OPC-N3 and Plantower PMS5003 sensor in measuring dust events in the Salt Lake Valley, Utah
                        
                    
    
            Abstract. As the changing climate expands the extent of arid andsemi-arid lands, the number of, severity of, and health effects associated with dust events are likely to increase. However, regulatory measurements capable of capturing dust (PM10, particulate matter smaller than10 µm in diameter) are sparse, sparser than measurements of PM2.5 (PM smaller than 2.5 µm in diameter). Although low-cost sensors couldsupplement regulatory monitors, as numerous studies have shown forPM2.5 concentrations, most of these sensors are not effective atmeasuring PM10 despite claims by sensor manufacturers. This studyfocuses on the Salt Lake Valley, adjacent to the Great Salt Lake, whichrecently reached historic lows exposing 1865 km2 of dry lake bed. Itevaluated the field performance of the Plantower PMS5003, a common low-costPM sensor, and the Alphasense OPC-N3, a promising candidate for low-costmeasurement of PM10, against a federal equivalent method (FEM, betaattenuation) and research measurements (GRIMM aerosol spectrometer model1.109) at three different locations. During a month-long field study thatincluded five dust events in the Salt Lake Valley with PM10 concentrations reaching 311 µg m−3, the OPC-N3 exhibited strong correlation with FEM PM10 measurements (R2 = 0.865, RMSE = 12.4 µg m−3) and GRIMM (R2 = 0.937, RMSE = 17.7 µg m−3). The PMS exhibited poor to moderate correlations(R2 < 0.49, RMSE = 33–45 µg m−3) withreference or research monitors and severely underestimated the PM10concentrations (slope < 0.099) for PM10. We also evaluated aPM-ratio-based correction method to improve the estimated PM10concentration from PMSs. After applying this method, PMS PM10concentrations correlated reasonably well with FEM measurements (R2 > 0.63) and GRIMM measurements (R2 > 0.76), andthe RMSE decreased to 15–25 µg m−3. Our results suggest that itmay be possible to obtain better resolved spatial estimates of PM10concentration using a combination of PMSs (often publicly availablein communities) and measurements of PM2.5 and PM10, such as thoseprovided by FEMs, research-grade instrumentation, or the OPC-N3. 
        more » 
        « less   
        
    
    
                            - PAR ID:
- 10422410
- Date Published:
- Journal Name:
- Atmospheric Measurement Techniques
- Volume:
- 16
- Issue:
- 10
- ISSN:
- 1867-8548
- Page Range / eLocation ID:
- 2455 to 2470
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
- 
            
- 
            The emergence of low-cost air quality sensors as viable tools for the monitoring of air quality at population and individual levels necessitates the evaluation of these instruments. The Flow air quality tracker, a product of Plume Labs, is one such sensor. To evaluate these sensors, we assessed 34 of them in a controlled laboratory setting by exposing them to PM10 and PM2.5 and compared the response with Plantower A003 measurements. The overall coefficient of determination (R2) of measured PM2.5 was 0.76 and of PM10 it was 0.73, but the Flows’ accuracy improved after each introduction of incense. Overall, these findings suggest that the Flow can be a useful air quality monitoring tool in air pollution areas with higher concentrations, when incorporated into other monitoring frameworks and when used in aggregate. The broader environmental implications of this work are that it is possible for individuals and groups to monitor their individual exposure to particulate matter pollution.more » « less
- 
            null (Ed.)Short-term exposure to fine particulate matter (PM2.5) pollution is linked to numerous adverse health effects. Pollution episodes, such as wildfires, can lead to substantial increases in PM2.5 levels. However, sparse regulatory measurements provide an incomplete understanding of pollution gradients. Here, we demonstrate an infrastructure that integrates community-based measurements from a network of low-cost PM2.5 sensors with rigorous calibration and a Gaussian process model to understand neighborhood-scale PM2.5 concentrations during three pollution episodes (July 4, 2018, fireworks; July 5 and 6, 2018, wildfire; Jan 3−7, 2019, persistent cold air pool, PCAP). The firework/wildfire events included 118 sensors in 84 locations, while the PCAP event included 218 sensors in 138 locations. The model results accurately predict reference measurements during the fireworks (n: 16, hourly root-mean-square error, RMSE, 12.3−21.5 μg/m3, n(normalized)-RMSE: 9−24%), the wildfire (n: 46, RMSE: 2.6−4.0 μg/m3; nRMSE: 13.1−22.9%), and the PCAP (n: 96, RMSE: 4.9−5.7 μg/m3; nRMSE: 20.2−21.3%). They also revealed dramatic geospatial differences in PM2.5 concentrations that are not apparent when only considering government measurements or viewing the US Environmental Protection Agency’s AirNow’s visualizations. Complementing the PM2.5 estimates and visualizations are highly resolved uncertainty maps. Together, these results illustrate the potential for low-cost sensor networks that combined with a data-fusion algorithm and appropriate calibration and training can dynamically and with improved accuracy estimate PM2.5 concentrations during pollution episodes. These highly resolved uncertainty estimates can provide a much-needed strategy to communicate uncertainty to end users.more » « less
- 
            As the importance of non-tailpipe particles (NTP) over tailpipe emissions from urban traffic has been increasing, there is a need to evaluate NTP contributions to ambient particulate matter (PM) using representative source profiles. The Brake and Tire Wear Study conducted in Los Angeles, California in the winter of 2020 collected 64 PM2.5 and 64 PM10 samples from 32 pairs of downwind-upwind measurements at two near-road locations (I-5 in Anaheim and I-710 in Long Beach). These samples were characterized for inorganic and organic markers and, along with locally-developed brake wear, tire wear, and road dust source profiles, subject to source apportionment using the effective-variance chemical mass balance (EV-CMB) model. Model results highlighted the dominance of resuspended dust in both PM2.5 (23–33%) and PM10 (32–53%). Brake and tire wear contributed more to PM2.5 than tailpipe exhausts (diesel + gasoline) for I-5 (29–30% vs. 19–21%) while they were comparable for I-710 (15–17% vs. 15–19%). For PM10, the brake and tire wear contributions were 2–3 times the exhaust contributions. Different fleet compositions on and near I-5 and I-710 appeared to influence the relative importance of NTP and exhaust sources. The downwind-upwind differences in source contributions were often insignificant, consistent with small and/or nearly equal impacts of adjacent highway traffic emissions on the downwind and upwind sites. The utility of sole markers, such as barium and zinc, to predict brake and tire wear abundances in ambient PM is evaluated.more » « less
- 
            In recent years, the utilization of machine learning algorithms and advancements in unmanned aerial vehicle (UAV) technology have caused significant shifts in remote sensing practices. In particular, the integration of machine learning with physical models and their application in UAV–satellite data fusion have emerged as two prominent approaches for the estimation of vegetation biochemistry. This study evaluates the performance of five machine learning regression algorithms (MLRAs) for the mapping of crop canopy chlorophyll at the Kellogg Biological Station (KBS) in Michigan, USA, across three scenarios: (1) application to Landsat 7, RapidEye, and PlanetScope satellite images; (2) application to UAV–satellite data fusion; and (3) integration with the PROSAIL radiative transfer model (hybrid methods PROSAIL + MLRAs). The results indicate that the majority of the five MLRAs utilized in UAV–satellite data fusion perform better than the five PROSAIL + MLRAs. The general trend suggests that the integration of satellite data with UAV-derived information, including the normalized difference red-edge index (NDRE), canopy height model, and leaf area index (LAI), significantly enhances the performance of MLRAs. The UAV–RapidEye dataset exhibits the highest coefficient of determination (R2) and the lowest root mean square errors (RMSE) when employing kernel ridge regression (KRR) and Gaussian process regression (GPR) (R2 = 0.89 and 0.89 and RMSE = 8.99 µg/cm2 and 9.65 µg/cm2, respectively). Similar performance is observed for the UAV–Landsat and UAV–PlanetScope datasets (R2 = 0.86 and 0.87 for KRR, respectively). For the hybrid models, the maximum performance is attained with the Landsat data using KRR and GPR (R2 = 0.77 and 0.51 and RMSE = 33.10 µg/cm2 and 42.91 µg/cm2, respectively), followed by R2 = 0.75 and RMSE = 39.78 µg/cm2 for the PlanetScope data upon integrating partial least squares regression (PLSR) into the hybrid model. Across all hybrid models, the RapidEye data yield the most stable performance, with the R2 ranging from 0.45 to 0.71 and RMSE ranging from 19.16 µg/cm2 to 33.07 µg/cm2. The study highlights the importance of synergizing UAV and satellite data, which enables the effective monitoring of canopy chlorophyll in small agricultural lands.more » « less
 An official website of the United States government
An official website of the United States government 
				
			 
					 
					
 
                                    