

Search for: All records

Creators/Authors contains: "Yang, Jun"


  1. Salakhutdinov, Ruslan; Kolter, Zico; Heller, Katherine; Weller, Adrian; Oliver, Nuria; Scarlett, Jonathan; Berkenkamp, Felix (Eds.)
    To mitigate the classical reinforcement learning (RL) framework's heavy reliance on identical training and test environments, Distributionally Robust RL (DRRL) has been proposed to enhance performance across a range of environments, possibly including unknown test environments. As the price for this robustness gain, DRRL involves optimizing over a set of distributions, which is inherently more challenging than optimizing over a fixed distribution in the non-robust case. Existing DRRL algorithms are either model-based or fail to learn from a single sample trajectory. In this paper, we design the first fully model-free DRRL algorithm, called distributionally robust Q-learning with single trajectory (DRQ). We carefully design a multi-timescale framework to fully utilize each incrementally arriving sample and directly learn the optimal distributionally robust policy without modeling the environment, so the algorithm can be trained along a single trajectory in a model-free fashion. Despite the algorithm's complexity, we provide asymptotic convergence guarantees by generalizing classical stochastic approximation tools. Comprehensive experimental results demonstrate the superior robustness and sample complexity of our proposed algorithm compared to non-robust methods and other robust RL algorithms. A minimal illustrative sketch of the single-trajectory update follows this record.
    Free, publicly-accessible full text available August 1, 2025
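    The record above describes DRQ only in words. As a concrete illustration, the sketch below shows one way a single-trajectory, multi-timescale robust Q-learning loop can be organized: a fast timescale tracks a distributional statistic of the observed transitions, and a slow timescale updates Q toward a robust Bellman target built from that statistic. Everything here is an assumption made for illustration, not the authors' implementation: a tabular setting, a KL uncertainty set handled through its standard dual form with a fixed dual temperature beta (the paper's divergence choice and its treatment of the dual variable may differ), polynomial step sizes, and a hypothetical gym-like env with reset() and step().

        import numpy as np

        def drq_sketch(env, n_states, n_actions, delta=0.1, beta=1.0,
                       n_steps=100_000, gamma=0.99, eps=0.1, seed=0):
            """Illustrative single-trajectory robust Q-learning (KL ball of radius delta).

            Robust next-state value via the KL dual, with the dual temperature
            held fixed at beta for simplicity:
                inf_{P: KL(P||P0) <= delta} E_P[V]
                    = sup_{b > 0} { -b * log E_{P0}[exp(-V / b)] - b * delta }.
            """
            rng = np.random.default_rng(seed)
            Q = np.zeros((n_states, n_actions))
            Z = np.ones((n_states, n_actions))   # estimate of E_{P0}[exp(-V(s')/beta)]
            visits = np.zeros((n_states, n_actions), dtype=int)
            s = env.reset()
            for _ in range(n_steps):
                a = rng.integers(n_actions) if rng.random() < eps else int(np.argmax(Q[s]))
                s_next, r, done = env.step(a)            # assumed gym-like interface
                v_next = 0.0 if done else float(np.max(Q[s_next]))
                visits[s, a] += 1
                k = visits[s, a]
                # fast timescale: track the exponential moment under the nominal dynamics
                Z[s, a] += k ** -0.6 * (np.exp(-v_next / beta) - Z[s, a])
                # robust value of the next state from the KL dual (fixed beta)
                robust_v = -beta * np.log(Z[s, a]) - beta * delta
                # slow timescale: Q-learning step toward the robust Bellman target
                Q[s, a] += k ** -0.9 * (r + gamma * robust_v - Q[s, a])
                s = env.reset() if done else s_next
            return Q

    The two step-size exponents only illustrate the timescale separation (the fast estimate must equilibrate between slow Q updates); the paper's convergence analysis is what dictates admissible schedules.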
  2. Single-Trajectory Distributionally Robust Reinforcement Learning (Forty-first International Conference on Machine Learning, 2024); abstract identical to record 1 above.

     @inproceedings{liang2024singletrajectory,
       title={Single-Trajectory Distributionally Robust Reinforcement Learning},
       author={Zhipeng Liang and Xiaoteng Ma and Jose Blanchet and Jun Yang and Jiheng Zhang and Zhengyuan Zhou},
       booktitle={Forty-first International Conference on Machine Learning},
       year={2024},
       url={https://openreview.net/forum?id=3B6vmW2L80}
     }
    Free, publicly-accessible full text available May 1, 2025
  3. Salakhutdinov, Ruslan; Kolter, Zico; Heller, Katherine; Weller, Adrian; Oliver, Nuria; Scarlett, Jonathan; Berkenkamp, Felix (Eds.)
    Abstract identical to record 1 above.
    Free, publicly-accessible full text available May 1, 2025
  4. Free, publicly-accessible full text available March 2, 2025
  5. Free, publicly-accessible full text available April 1, 2025
  6. Free, publicly-accessible full text available November 30, 2024
  7. Abstract

    Premise

    Dioecy (separate sexes) has independently evolved numerous times across the angiosperm phylogeny and is recently derived in many lineages. However, our understanding of the evolutionary mechanisms that drive the origins of dioecy in plants remains limited. The recent and repeated evolution of dioecy across angiosperms offers an opportunity to make strong inferences about the ecological, developmental, and molecular factors influencing the evolution of dioecy, and thus of sex chromosomes. The genus Asparagus (Asparagaceae) is an emerging model taxon for studying dioecy and sex chromosome evolution, yet estimates of the age and origin of dioecy in the genus are lacking.

    Methods

    We use plastome sequences and fossil time calibrations in phylogenetic analyses to investigate the age and origin of dioecy in the genus Asparagus (a small illustration of reading a clade age off a dated tree follows this record). We also review the diversity of sexual systems across the genus to address contradictory reports in the literature.

    Results

    We estimate that dioecy evolved once or twice approximately 2.78–3.78 million years ago in Asparagus, a genus in which roughly 27% of species are dioecious and the remainder are hermaphroditic with monoclinous (bisexual) flowers.

    Conclusions

    Our findings support previous work implicating a young age and the possibility of two origins of dioecy in Asparagus, which appear to be associated with rapid radiations and range expansion out of Africa. Lastly, we speculate that paleoclimatic oscillations throughout northern Africa may have helped set the stage for the origin(s) of dioecy in Asparagus approximately 2.78–3.78 million years ago.

     
    Free, publicly-accessible full text available February 1, 2025
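    The Methods above compress a standard divergence-dating workflow; as a small illustration of its last step, the sketch below shows how a clade age such as the 2.78–3.78 Myr estimate is read off a fossil-calibrated, ultrametric tree once it has been inferred: on such a tree, a node's age is the total tree height minus the node's depth from the root. This is a hypothetical sketch using Biopython; the file name and taxon labels are placeholders, not data from this study.

        # On an ultrametric (time-calibrated) tree, node age = tree height minus
        # the node's depth from the root, in the tree's time units (here Myr).
        from Bio import Phylo

        tree = Phylo.read("asparagus_plastome_dated.nwk", "newick")  # placeholder file
        depths = tree.depths()            # clade -> branch-length distance from root
        height = max(depths.values())     # total height = root age

        # Age of the most recent common ancestor of two placeholder dioecious taxa;
        # under a single origin of dioecy, this node dates that origin.
        mrca = tree.common_ancestor({"name": "Asparagus_officinalis"},
                                    {"name": "Asparagus_schoberioides"})
        print(f"Inferred age of the dioecious clade: {height - depths[mrca]:.2f} Myr")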
  8. Free, publicly-accessible full text available October 28, 2024
  9. Abstract

    The construction of a better exchange-correlation potential in time-dependent density functional theory (TDDFT) can improve the accuracy of TDDFT calculations and provide more accurate predictions of the properties of many-electron systems. Here, we propose a machine learning method to develop the energy functional and the Kohn–Sham potential of a time-dependent Kohn–Sham (TDKS) system. The method is based on the dynamics of the Kohn–Sham system and does not require any data on the exact Kohn–Sham potential for training the model. We demonstrate the results of our method with a 1D harmonic oscillator example and a 1D two-electron example. We show that the machine-learned Kohn–Sham potential matches the exact Kohn–Sham potential in the absence of memory effects, and that our method can still capture the dynamics of the Kohn–Sham system in the presence of memory effects. The machine learning method developed in this article provides insight into making better approximations of the energy functional and the Kohn–Sham potential in the TDKS system. A toy sketch of this training loop follows this record.

     
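    To make the training idea concrete, here is a toy, self-contained sketch in the spirit of the method described above, not the authors' code: a parameterized potential is learned purely by matching the density history that time-dependent propagation generates, so no data on the exact Kohn–Sham potential is ever used. The 1D one-electron grid, the Crank–Nicolson propagator, the three-parameter potential ansatz (standing in for the paper's machine-learned functional), and all names are assumptions of the sketch.

        import jax
        import jax.numpy as jnp

        n_x, n_t, dt, L = 128, 200, 0.01, 10.0
        x = jnp.linspace(-L / 2, L / 2, n_x)
        dx = x[1] - x[0]
        lap = (jnp.diag(jnp.ones(n_x - 1), 1) + jnp.diag(jnp.ones(n_x - 1), -1)
               - 2.0 * jnp.eye(n_x)) / dx**2        # finite-difference Laplacian

        def propagate(theta, psi0):
            """Crank-Nicolson stepping under v_ext plus a learned potential."""
            v = 0.5 * x**2 + theta[0] + theta[1] * x + theta[2] * x**2  # toy ansatz
            H = -0.5 * lap + jnp.diag(v)
            A = jnp.eye(n_x) + 0.5j * dt * H
            B = jnp.eye(n_x) - 0.5j * dt * H
            step = jnp.linalg.solve(A, B)           # one-step propagator
            def body(psi, _):
                psi = step @ psi
                return psi, jnp.abs(psi) ** 2       # carry psi, record the density
            _, densities = jax.lax.scan(body, psi0, None, length=n_t)
            return densities

        def loss(theta, psi0, n_ref):
            # Density-matching objective: training never sees the exact potential.
            return jnp.mean((propagate(theta, psi0) - n_ref) ** 2)

        # Usage: generate a "reference" density history with a hidden potential,
        # then recover it by gradient descent through the propagator itself.
        psi0 = jnp.exp(-0.5 * x**2)
        psi0 = (psi0 / jnp.sqrt(jnp.sum(psi0**2) * dx)).astype(jnp.complex64)
        n_ref = propagate(jnp.array([0.0, 0.3, 0.1]), psi0)  # hidden target dynamics
        theta, grad = jnp.zeros(3), jax.grad(loss)
        for _ in range(200):
            theta = theta - 0.1 * grad(theta, psi0, n_ref)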