NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Poly2Vec: Polymorphic Fourier-Based Encoding of Geospatial Objects for GeoAI Applications

Siampoi, Maria Despoina; Li, Jialiang; Krumm, John; Shahabi, Cyrus (July 2025, Inernational Conference on Machine Learning)

Encoding geospatial objects is fundamental for geospatial artificial intelligence (GeoAI) applications, which leverage machine learning (ML) models to analyze spatial information. Common approaches transform each object into known formats, like image and text, for compatibility with ML models. However, this process often discards crucial spatial information, such as the object’s position relative to the entire space, reducing downstream task effectiveness. Alternative encoding methods that preserve some spatial properties are often devised for specific data objects (e.g., point encoders), making them unsuitable for tasks that involve different data types (i.e., points, polylines, and polygons). To this end, we propose POLY2VEC, a polymorphic Fourier-based encoding approach that unifies the representation of geospatial objects, while preserving the essential spatial properties. POLY2VEC incorporates a learned fusion module that adaptively integrates the magnitude and phase of the Fourier transform for different tasks and geometries. We evaluate POLY2VEC on five diverse tasks, organized into two categories. The first empirically demonstrates that POLY2VEC consistently outperforms objectspecific baselines in preserving three key spatial relationships: topology, direction, and distance. The second shows that integrating POLY2VEC into a state-of-the-art GeoAI workflow improves the performance in two popular tasks: population prediction and land use inference.
more » « less
Free, publicly-accessible full text available July 13, 2026
Differentially Private Publication of Smart Electricity Grid Data

https://doi.org/10.48786/EDBT.2025.17

Shaham, Sina; Ghinita, Gabriel; Krishnamachari, Bhaskar; Shahabi, Cyrus (March 2025, Proceedings 28th International Conference on Extending Database Technology (EDBT 2025))
Legally-Compliant Spatial Fairness Framework: Advancing Beyond Spatial Fairness

https://doi.org/10.48786/edbt.2025.68

Saxena, Nripsuta Ani; Mathur, Ronit; Shahabi, Cyrus (January 2025, OpenProceedings.org)
TrajGPT: Controlled Synthetic Trajectory Generation Using a Multitask Transformer-Based Spatiotemporal Model

Hsu, Shang-Ling; Tung, Emmanuel; Krumm, John; Shahabi, Cyrus; Shafique, Khurram (October 2024, ACM Digital Library)

Human mobility modeling from GPS-trajectories and synthetic trajectory generation are crucial for various applications, such as urban planning, disaster management and epidemiology. Both of these tasks often require filling gaps in a partially specified sequence of visits, – a new problem that we call “controlled” synthetic trajectory generation. Existing methods for next-location prediction or synthetic trajectory generation cannot solve this problem as they lack the mechanisms needed to constrain the generated sequences of visits. Moreover, existing approaches (1) frequently treat space and time as independent factors, an assumption that fails to hold true in real-world scenarios, and (2) suffer from challenges in accuracy of temporal prediction as they fail to deal with mixed distributions and the inter-relationships of different modes with latent variables (e.g., day-of-the-week). These limitations become even more pronounced when the task involves filling gaps within sequences instead of solely predicting the next visit. We introduce TrajGPT, a transformer-based, multi-task, joint spatiotemporal generative model to address these issues. Taking inspiration from large language models, TrajGPT poses the problem of controlled trajectory generation as that of text infilling in natural language. TrajGPT integrates the spatial and temporal models in a transformer architecture through a Bayesian probability model that ensures that the gaps in a visit sequence are filled in a spatiotemporally consistent manner. Our experiments on public and private datasets demonstrate that TrajGPT not only excels in controlled synthetic visit generation but also outperforms competing models in next-location prediction tasks–Relatively, TrajGPT achieves a 26-fold improvement in temporal accuracy while retaining more than 98% of spatial accuracy on average.
more » « less
Full Text Available
Theoretical Analysis of Learned Database Operations under Distribution Shift through Distribution Learnability

Zeighami, Sepanta; Shahabi, Cyrus (July 2024, Forty-first International Conference on Machine Learning (ICML 2024) -- Oral Presentation)

Full Text Available
BiasBuster: a Neural Approach for Accurate Estimation of Population Statistics using Biased Location Data

https://doi.org/10.1109/MDM61037.2024.00022

Zeighami, Sepanta; Shahabi, Cyrus (June 2024, 25th IEEE International Conference on Mobile Data Management (MDM 2024))

Full Text Available
A Neural Database for Answering Aggregate Queries on Incomplete Relational Data

https://doi.org/10.1109/TKDE.2023.3310914

Zeighami, Sepanta; Seshadri, Raghav; Shahabi, Cyrus (July 2024, IEEE Transactions on Knowledge and Data Engineering)

Full Text Available
Fair Spatial Indexing: A paradigm for Group Spatial Fairness

Shaham, Sina; Ghinita, Gabriel; Shahabi, Cyrus (March 2024, EDBT: Extending Database Technology)

Machine learning (ML) is playing an increasing role in decision-making tasks that directly affect individuals, e.g., loan approvals, or job applicant screening. Significant concerns arise that, without special provisions, individuals from under-privileged backgrounds may not get equitable access to services and opportunities. Existing research studies {\em fairness} with respect to protected attributes such as gender, race or income, but the impact of location data on fairness has been largely overlooked. With the widespread adoption of mobile apps, geospatial attributes are increasingly used in ML, and their potential to introduce unfair bias is significant, given their high correlation with protected attributes. We propose techniques to mitigate location bias in machine learning. Specifically, we consider the issue of miscalibration when dealing with geospatial attributes. We focus on {\em spatial group fairness} and we propose a spatial indexing algorithm that accounts for fairness. Our KD-tree inspired approach significantly improves fairness while maintaining high learning accuracy, as shown by extensive experimental results on real data.
more » « less
Full Text Available
Unified Modeling and Clustering of Mobility Trajectories with Spatiotemporal Point Processes

Lin, Haowen; Chiang, Yao-Yi; Xiong, Li; Shahabi, Cyrus (April 2024, SIAM)

Full Text Available
Fair Spatial Indexing: A paradigm for Group Spatial Fairness

https://doi.org/10.48786/edbt.2024.14

Shaham, Sina; Ghinita, Gabriel; Shahabi, Cyrus (January 2024, OpenProceedings.org)

{}
more » « less

« Prev Next »

Search for: All records