NSF PAR Search | NSF Public Access Repository

Climate smart computing: A perspective

https://doi.org/10.1016/j.pmcj.2025.102019

Yang, Mingzhou; Jayaprakash, Bharat; Ghosh, Subhankar; Jung, Hyeonjung Tari; Eagon, Matthew; Northrop, William F; Shekhar, Shashi (March 2025, Pervasive and Mobile Computing)

Full Text Available
Towards Pareto-optimality with Multi-level Bi-objective Routing: A Summary of Results

https://doi.org/10.1145/3681772.3698215

Yang, Mingzhou; Zeng, Ruolei; Sharma, Arun; Sawamura, Shunichi; Northrop, William F; Shekhar, Shashi (October 2024, ACM)

Full Text Available
De-Snowing Algorithm for Long-Wavelength LiDAR

https://doi.org/10.1109/IV55156.2024.10588657

Jayaprakash, Bharat; Eagon, Matthew; Zhan, Lu; Northrop, William F (June 2024, IEEE)

Full Text Available
Towards Statistically Significant Taxonomy Aware Co-Location Pattern Detection (Short Paper)

https://doi.org/10.4230/LIPIcs.COSIT.2024.25

Ghosh, Subhankar; Sharma, Arun; Gupta, Jayant; Shekhar, Shashi (January 2024, Schloss Dagstuhl – Leibniz-Zentrum für Informatik)
Adams, Benjamin; Griffin, Amy L; Scheider, Simon; McKenzie, Grant (Ed.)
Given a collection of Boolean spatial feature types, their instances, a neighborhood relation (e.g., proximity), and a hierarchical taxonomy of the feature types, the goal is to find the subsets of feature types or their parents whose spatial interaction is statistically significant. This problem is for taxonomy-reliant applications such as ecology (e.g., finding new symbiotic relationships across the food chain), spatial pathology (e.g., immunotherapy for cancer), retail, etc. The problem is computationally challenging due to the exponential number of candidate co-location patterns generated by the taxonomy. Most approaches for co-location pattern detection overlook the hierarchical relationships among spatial features, and the statistical significance of the detected patterns is not always considered, leading to potential false discoveries. This paper introduces two methods for incorporating taxonomies and assessing the statistical significance of co-location patterns. The baseline approach iteratively checks the significance of co-locations between leaf nodes or their ancestors in the taxonomy. Using the Benjamini-Hochberg procedure, an advanced approach is proposed to control the false discovery rate. This approach effectively reduces the risk of false discoveries while maintaining the power to detect true co-location patterns. Experimental evaluation and case study results show the effectiveness of the approach.
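The Benjamini-Hochberg procedure named in the abstract above can be illustrated with a minimal Python sketch. This is the generic textbook step-up rule, not the authors' taxonomy-aware algorithm. The function name `benjamini_hochberg`, the default `alpha`, and the example p-values are assumptions made for illustration.

```python
from typing import List


def benjamini_hochberg(p_values: List[float], alpha: float = 0.05) -> List[bool]:
    """Return a reject/keep flag per hypothesis (hypothetical helper).

    Generic Benjamini-Hochberg step-up rule: sort the p-values, find the
    largest rank k with p_(k) <= (k / m) * alpha, and reject every
    hypothesis whose rank is at most k.
    """
    m = len(p_values)
    if m == 0:
        return []

    # Indices of the hypotheses ordered by ascending p-value.
    order = sorted(range(m), key=lambda i: p_values[i])

    # Largest rank whose p-value falls under its rank-scaled threshold.
    cutoff_rank = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank / m * alpha:
            cutoff_rank = rank

    # Reject everything ranked at or below the cutoff.
    rejected = [False] * m
    for rank, idx in enumerate(order, start=1):
        if rank <= cutoff_rank:
            rejected[idx] = True
    return rejected


if __name__ == "__main__":
    # Illustrative p-values only; they are not taken from the paper.
    print(benjamini_hochberg([0.01, 0.04, 0.03, 0.5]))
```

Because the rule is step-up, a hypothesis that misses its own threshold can still be rejected when a higher-ranked one passes, which is what keeps the procedure more powerful than a per-test cutoff.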
Full Text Available
Towards Carbon-Aware Spatial Computing: Challenges and Opportunities

Jayaprakash, Bharat; Eagon, Matthew; Yang, Mingzhou; Northrop, William; Shekhar, Shashi (October 2023, I-GUIDE FORUM 2023 - HARNESSING THE GEOSPATIAL DATA REVOLUTION FOR SUSTAINABILITY SOLUTIONS)

Full Text Available
Initial Development of a Physics-Aware Machine Learning Framework for Soot Mass Prediction in Gasoline Direct Injection Engines

https://doi.org/10.4271/2023-24-0174

Jayaprakash, Bharat; Wilmer, Brady; Northrop, William F (August 2023, 16th International Conference on Engines & Vehicles)

Calibration of automotive engines to ensure compliance with emission regulations is a critical phase in product development. Control of engine-out particulate emissions, which directly impact the environment and public health, is particularly important. Detailed physics-based models are typically used to gain a rich understanding of the complex physical phenomena that drive the soot particle formation in an engine cylinder. However, such models often fail to correctly represent the highly dynamic nature of the underlying mechanisms under transient combustion conditions. Moreover, most physics-based models were initially developed for diesel engine applications and their applicability to gasoline engines remains questionable due to differences in flame structure and fuel-wall interactions. Black-box models have been previously proposed to predict engine-out soot emissions, but their lack of physical interpretability is an unsolved drawback. To address these limitations, we present a physics-aware twin-model machine learning framework to predict and analyze engine-out soot mass from a gasoline direct injection (GDI) engine. The framework combines a physics-based model with a bagging-type ensemble learning model that both maintains high accuracy and allows physical interpretation of results without using computationally intensive high-fidelity models. This work shows why a one-model-fits-all approach fails in the case of predicting soot emissions due to clustered co-occurrences of operating conditions that cause non-compliant behavior. We compare the performance of the proposed framework with that of the standalone baseline model and a feed-forward deep neural network. Using WLTP data from a 2.0L naturally aspirated GDI engine, the proposed framework predicts engine-out soot mass with an improvement of 29% in the R² value and 21% in the root mean squared error from the baseline physics-based model, without compromising physical interpretability. 
These improvements are significant enough to warrant further framework development with additional engine datasets.
Full Text Available
Statistically-Robust Clustering Techniques for Mapping Spatial Hotspots: A Survey

https://doi.org/10.1145/3487893

Xie, Yiqun; Shekhar, Shashi; Li, Yan (March 2023, ACM Computing Surveys)

Mapping of spatial hotspots, i.e., regions with significantly higher rates of generating cases of certain events (e.g., disease or crime cases), is an important task in diverse societal domains, including public health, public safety, transportation, agriculture, environmental science, and so on. Clustering techniques required by these domains differ from traditional clustering methods due to the high economic and social costs of spurious results (e.g., false alarms of crime clusters). As a result, statistical rigor is needed explicitly to control the rate of spurious detections. To address this challenge, techniques for statistically-robust clustering (e.g., scan statistics) have been extensively studied by the data mining and statistics communities. In this survey, we present an up-to-date and detailed review of the models and algorithms developed by this field. We first present a general taxonomy for statistically-robust clustering, covering key steps of data and statistical modeling, region enumeration and maximization, and significance testing. We further discuss different paradigms and methods within each of the key steps. Finally, we highlight research gaps and potential future directions, which may serve as a stepping stone in generating new ideas and thoughts in this growing field and beyond.
Full Text Available
Reducing False Discoveries in Statistically-Significant Regional-Colocation Mining: A Summary of Results

https://doi.org/10.4230/LIPIcs.GIScience.2023.3

Ghosh, Subhankar; Gupta, Jayant; Sharma, Arun; An, Shuai; Shekhar, Shashi (January 2023, Schloss Dagstuhl – Leibniz-Zentrum für Informatik)
Beecham, Roger; Long, Jed A; Smith, Dianna; Zhao, Qunshan; Wise, Sarah (Ed.)
Given a set S of spatial feature types, its feature instances, a study area, and a neighbor relationship, the goal is to find pairs <C, r_{g}> such that C is a statistically significant regional-colocation pattern in r_{g}. This problem is important for applications in various domains including ecology, economics, and sociology. The problem is computationally challenging due to the exponential number of regional colocation patterns and candidate regions. Previously, we proposed a miner [Subhankar et al., 2022] that finds statistically significant regional colocation patterns. However, the numerous simultaneous statistical inferences raise the risk of false discoveries (also known as the multiple comparisons problem) and carry a high computational cost. We propose a novel algorithm, namely, multiple comparisons regional colocation miner (MultComp-RCM), which uses a Bonferroni correction. Theoretical analysis, experimental evaluation, and case study results show that the proposed method reduces both the false discovery rate and computational cost.
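For readers unfamiliar with the Bonferroni correction mentioned in the abstract above, a minimal Python sketch of the generic correction follows. It does not reproduce MultComp-RCM. The function name `bonferroni_reject`, the default `alpha`, and the example p-values are assumptions made for illustration.

```python
from typing import List


def bonferroni_reject(p_values: List[float], alpha: float = 0.05) -> List[bool]:
    """Return a reject/keep flag per hypothesis (hypothetical helper).

    Generic Bonferroni correction: with m simultaneous tests, each test
    is compared against alpha / m instead of alpha, which bounds the
    chance of any false discovery by alpha.
    """
    m = len(p_values)
    if m == 0:
        return []
    threshold = alpha / m
    return [p <= threshold for p in p_values]


if __name__ == "__main__":
    # Illustrative p-values only; they are not taken from the paper.
    print(bonferroni_reject([0.001, 0.02, 0.04, 0.2]))
```

The per-test threshold shrinks as the number of candidate patterns grows, so the correction is conservative when many candidates are tested at once.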
Full Text Available
Eco-PiNN: A Physics-informed Neural Network for Eco-toll Estimation

https://doi.org/10.1137/1.9781611977653.ch94

Li, Yan; Yang, Mingzhou; Eagon, Matthew; Farhadloo, Majid; Xie, Yiqun; Northrop, William; Shekhar, Shashi (January 2023, Proceedings of the SIAM International Conference on Data Mining)

The eco-toll estimation problem quantifies the expected environmental cost (e.g., energy consumption, exhaust emissions) for a vehicle to travel along a path. This problem is important for societal applications such as eco-routing, which aims to find paths with the lowest exhaust emissions or energy need. The challenges of this problem are threefold: (1) the dependence of a vehicle's eco-toll on its physical parameters; (2) the lack of access to data with eco-toll information; and (3) the influence of contextual information (i.e., the connections of adjacent segments in the path) on the eco-toll of road segments. Prior work on eco-toll estimation has mostly relied on purely data-driven approaches and has high estimation errors given the limited training data. To address these limitations, we propose a novel Eco-toll estimation Physics-informed Neural Network framework (Eco-PiNN) using three novel ideas, namely, (1) a physics-informed decoder that integrates the physical laws governing vehicle dynamics into the network, (2) an attention-based contextual information encoder, and (3) a physics-informed regularization to reduce overfitting. Experiments on real-world heavy-duty truck data show that the proposed method can greatly improve the accuracy of eco-toll estimation compared with state-of-the-art methods. *The full version of the paper can be accessed at https://arxiv.org/abs/2301.05739
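The abstract above does not specify the form of the physics-informed regularization. As a generic sketch only, a physics-informed regularizer is commonly written as a data-fitting loss plus a weighted penalty on violations of a physical constraint. Every symbol below (the parameters θ, the prediction ŷ_i, the residual r_i, and the weight λ) is an assumption for illustration and is not taken from the paper.

```latex
% Generic physics-regularized training objective (illustrative, not the Eco-PiNN loss).
% \hat{y}_i(\theta): model prediction for sample i; y_i: observed value;
% r_i(\theta): residual of an assumed physical constraint evaluated on sample i;
% \lambda \ge 0: assumed weight trading data fit against physical consistency.
\mathcal{L}(\theta)
  = \frac{1}{N} \sum_{i=1}^{N} \bigl( \hat{y}_i(\theta) - y_i \bigr)^2
  + \lambda \, \frac{1}{N} \sum_{i=1}^{N} r_i(\theta)^2
```

With λ = 0 the objective reduces to ordinary mean squared error; larger values penalize predictions that violate the assumed constraint, which is one way such a term can limit overfitting when training data are scarce.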
Full Text Available
Data Mining Challenges and Opportunities to Achieve Net Zero Carbon Emissions: Focus on Electrified Vehicles

https://doi.org/10.1137/1.9781611977653.ch108

Yang, Mingzhou; Jayaprakash, Bharat; Eagon, Matthew; Jung, Hyeonjung; Northrop, William; Shekhar, Shashi (January 2023, Proceedings of the SIAM International Conference on Data Mining)

Full Text Available
