NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Differentially Private Publication of Smart Electricity Grid Data

https://doi.org/10.48786/EDBT.2025.17

Shaham, Sina; Ghinita, Gabriel; Krishnamachari, Bhaskar; Shahabi, Cyrus (March 2025, Proceedings 28th International Conference on Extending Database Technology (EDBT 2025))
Theoretical Analysis of Learned Database Operations under Distribution Shift through Distribution Learnability

Zeighami, Sepanta; Shahabi, Cyrus (July 2024, Forty-first International Conference on Machine Learning (ICML 2024) -- Oral Presentation)

Full Text Available
A Neural Database for Answering Aggregate Queries on Incomplete Relational Data

https://doi.org/10.1109/TKDE.2023.3310914

Zeighami, Sepanta; Seshadri, Raghav; Shahabi, Cyrus (July 2024, IEEE Transactions on Knowledge and Data Engineering)

Full Text Available
BiasBuster: a Neural Approach for Accurate Estimation of Population Statistics using Biased Location Data

https://doi.org/10.1109/MDM61037.2024.00022

Zeighami, Sepanta; Shahabi, Cyrus (June 2024, 25th IEEE International Conference on Mobile Data Management (MDM 2024))

Full Text Available
Learning Dynamic Graphs from All Contextual Information for Accurate Point-of-Interest Visit Forecasting

https://doi.org/10.1145/3589132.3625567

Hajisafi, Arash; Lin, Haowen; Shaham, Sina; Hu, Haoji; Siampou, Maria Despoina; Chiang, Yao-Yi; Shahabi, Cyrus (November 2023, ACM)

Full Text Available
Generating Realistic and Representative Trajectories with Mobility Behavior Clustering

https://doi.org/10.1145/3589132.3625657

Lin, Haowen; Shaham, Sina; Chiang, Yao-Yi; Shahabi, Cyrus (November 2023, ACM)

Full Text Available
OnDistribution Dependent Sub-Logarithmic Query Time of Learned Indexing

Zeighami, Speanta; Shahabi, Cyrus (July 2023, ICML 2023)

Full Text Available
CSGAN: Modality-Aware Trajectory Generation via Clustering-based Sequence GAN

https://doi.org/10.1109/MDM58254.2023.00032

Zhang, Minxing; Lin, Haowen; Takagi, Shun; Cao, Yang; Shahabi, Cyrus; Xiong, Li (July 2023, IEEE International Conference on Mobile Data Management)

Full Text Available
NeuroSketch: Fast and Approximate Evaluation of Range Aggregate Queries with Neural Networks

https://doi.org/10.1145/3588954

Zeighami, Sepanta; Shahabi, Cyrus; Sharan, Vatsal (May 2023, Proceedings of the ACM on Management of Data)

Range aggregate queries (RAQs) are an integral part of many real-world applications, where, often, fast and approximate answers for the queries are desired. Recent work has studied answering RAQs using machine learning (ML) models, where a model of the data is learned to answer the queries. However, there is no theoretical understanding of why and when the ML based approaches perform well. Furthermore, since the ML approaches model the data, they fail to capitalize on any query specific information to improve performance in practice. In this paper, we focus on modeling "queries" rather than data and train neural networks to learn the query answers. This change of focus allows us to theoretically study our ML approach to provide a distribution and query dependent error bound for neural networks when answering RAQs. We confirm our theoretical results by developing NeuroSketch, a neural network framework to answer RAQs in practice. Extensive experimental study on real-world, TPC-benchmark and synthetic datasets show that NeuroSketch answers RAQs multiple orders of magnitude faster than state-of-the-art and with better accuracy.
more » « less
Full Text Available
A Neural Approach to Spatio-Temporal Data Release with User-Level Differential Privacy

https://doi.org/10.1145/3588701

Ahuja, Ritesh; Zeighami, Sepanta; Ghinita, Gabriel; Shahabi, Cyrus (May 2023, Proceedings of the ACM on Management of Data)

Several "data-for-good" projects [1, 5, 12] initiated by major companies (e.g., Meta, Google) release to the public spatio-temporal datasets to benefit COVID-19 spread modeling [17, 47, 64] and understand human mobility [14, 24]. Most often, spatio-temporal data are provided in the form of snapshot high resolution population density information, where the released statistics capture population counts in small areas for short time periods. Since high resolution is required for utility (e.g., in modeling COVID hotspots) privacy risks are elevated. To prevent malicious actors from using the data to infer sensitive details about individuals, the released datasets must be first sanitized. Typically, [1, 5, 7, 12], differential privacy (DP) is employed as protection model, due to its formal protection guarantees that prevent an adversary to learn whether a particular individual's data has been included in the release or not.
more » « less
Full Text Available

« Prev Next »

Search for: All records