Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
Some links on this page may take you to non-federal websites. Their policies may differ from those of this site.
- Free, publicly-accessible full text available September 1, 2026
- Free, publicly-accessible full text available April 28, 2026
- Multi-head self-attention (MHSA) mechanisms achieve state-of-the-art (SOTA) performance across natural language processing and vision tasks. However, their quadratic dependence on sequence length has bottlenecked inference speeds. To circumvent this bottleneck, researchers have proposed various sparse-MHSA models, where a subset of full attention is computed. Despite their promise, current sparse libraries and compilers do not support high-performance implementations for diverse sparse-MHSA patterns due to the underlying sparse formats they operate on. On one end, sparse libraries operate on general sparse formats, which target extreme amounts of random sparsity (<10% non-zero values) and have high metadata overhead, in O(nnz). On the other end, hand-written kernels operate on custom sparse formats, which target specific sparse-MHSA patterns. However, the sparsity patterns in sparse-MHSA are moderately sparse (10-50% non-zero values) and varied, so general sparse formats incur high metadata overhead while custom sparse formats cover few sparse-MHSA patterns, trading off generality for performance. We bridge this gap, achieving both generality and performance, by proposing a novel sparse format, affine-compressed-sparse-row (ACSR), and a supporting code-generation scheme, SPLAT, that generates high-performance implementations for diverse sparse-MHSA patterns on GPUs. Core to our proposed format and code-generation algorithm is the observation that common sparse-MHSA patterns have uniquely regular geometric properties. These properties, which can be analyzed just-in-time, expose novel optimizations and tiling strategies that SPLAT exploits to generate high-performance implementations for diverse patterns. To demonstrate SPLAT’s efficacy, we use it to generate code for various sparse-MHSA models, achieving speedups of up to 2.05x and 4.05x over hand-written kernels written in Triton and TVM, respectively, on A100 GPUs in single precision. (An illustrative sketch of the affine-row idea appears after this list.)
  Free, publicly-accessible full text available April 9, 2026
- Zhu, Shanfeng (Ed.). We present TIPP3 and TIPP3-fast, new tools for abundance profiling in metagenomic datasets. Like its predecessor, TIPP2, the TIPP3 pipeline uses a maximum likelihood approach to place reads into labeled taxonomies using marker genes, but it achieves superior accuracy to TIPP2 by enabling the use of much larger taxonomies through improved algorithmic techniques. We show that TIPP3 is generally more accurate than leading methods for abundance profiling in two important contexts: when reads come from genomes not already in a public database (i.e., novel genomes) and when reads contain sequencing errors. We also show that TIPP3-fast has slightly lower accuracy than TIPP3, but is also generally more accurate than other leading methods and uses a small fraction of TIPP3’s runtime. Additionally, we highlight the potential benefits of restricting abundance profiling methods to those reads that map to marker genes (i.e., using a filtered marker-gene based analysis), which we show typically improves accuracy. (A toy sketch of this filtering step follows the list.) TIPP3 is freely available at https://github.com/c5shen/TIPP3.
  Free, publicly-accessible full text available April 4, 2026
- Free, publicly-accessible full text available January 1, 2026
- Contrastive learning is an effective unsupervised method in graph representation learning. The key component of contrastive learning lies in the construction of positive and negative samples. Previous methods usually utilize the proximity of nodes in the graph as the principle. Recently, the data-augmentation-based contrastive learning method has advanced to show great power in the visual domain, and some works have extended this method from images to graphs. However, unlike data augmentation on images, data augmentation on graphs is far less intuitive and it is much harder to provide high-quality contrastive samples, which leaves much space for improvement. In this work, by introducing an adversarial graph view for data augmentation, we propose a simple but effective method, Adversarial Graph Contrastive Learning (ArieL), to extract informative contrastive samples within reasonable constraints. We develop a new technique called information regularization for stable training and use subgraph sampling for scalability. We generalize our method from node-level contrastive learning to the graph level by treating each graph instance as a super-node. ArieL consistently outperforms current graph contrastive learning methods for both node-level and graph-level classification tasks on real-world datasets. We further demonstrate that ArieL is more robust in the face of adversarial attacks. (A minimal sketch of the adversarial-view idea follows this list.)
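For the SPLAT entry above: the abstract describes ACSR and the code generator only at a high level, so the following is a minimal sketch of the underlying intuition, assuming a sliding-window attention pattern. The function names and data layout are hypothetical illustrations, not SPLAT's actual API. The point is that for geometrically regular masks, each row's non-zero columns form a contiguous range whose start and length are (piecewise-)affine functions of the row index, so per-row metadata is O(1) rather than a list of explicit column indices.

```python
import numpy as np

def acsr_like_rows(n, w):
    """Hypothetical affine row descriptors for a sliding-window mask of
    half-width w: row i attends to the contiguous column range
    [start(i), start(i) + count(i)), where start and count are
    piecewise-affine in i. Per-row metadata stays O(1), unlike a general
    CSR format that stores every non-zero column index."""
    rows = np.arange(n)
    starts = np.maximum(rows - w, 0)
    counts = np.minimum(rows + w + 1, n) - starts
    return starts, counts

def sparse_attention(Q, K, V, starts, counts):
    """Reference (deliberately unoptimized) sparse attention that only
    touches the columns named by the affine row descriptors."""
    n, d = Q.shape
    out = np.zeros_like(V)
    for i in range(n):
        s, c = starts[i], counts[i]
        scores = Q[i] @ K[s:s + c].T / np.sqrt(d)
        p = np.exp(scores - scores.max())  # numerically stable softmax
        out[i] = (p / p.sum()) @ V[s:s + c]
    return out

# Toy usage: 8 tokens, window half-width 2, head dimension 4.
rng = np.random.default_rng(0)
Q, K, V = (rng.standard_normal((8, 4)) for _ in range(3))
starts, counts = acsr_like_rows(8, 2)
out = sparse_attention(Q, K, V, starts, counts)
```

A real kernel would tile these per-row ranges on the GPU; the sketch only shows why affine metadata suffices to recover the non-zero columns without O(nnz) index storage.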
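For the TIPP3 entry above: the maximum likelihood placement machinery is beyond a short example, but the filtered marker-gene idea the abstract highlights can be illustrated. This is a minimal sketch assuming reads have already been assigned taxon labels; the function name and data layout are hypothetical, not TIPP3's interface.

```python
from collections import Counter

def abundance_profile(read_assignments, marker_hits):
    """Hypothetical filtered marker-gene profiling: keep only reads that
    mapped to a marker gene, then normalize taxon counts into relative
    abundances.

    read_assignments: dict of read_id -> taxon label
    marker_hits: set of read_ids that aligned to a marker gene
    """
    counts = Counter(
        taxon for read, taxon in read_assignments.items()
        if read in marker_hits
    )
    total = sum(counts.values())
    return {taxon: c / total for taxon, c in counts.items()} if total else {}

# Example: r2 did not hit a marker gene, so it is excluded from the profile.
profile = abundance_profile(
    {"r1": "E. coli", "r2": "B. subtilis", "r3": "E. coli"},
    marker_hits={"r1", "r3"},
)
# -> {"E. coli": 1.0}
```

The filtering step discards reads whose placement is least reliable, which is consistent with the entry's observation that a filtered marker-gene based analysis typically improves accuracy.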
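For the ArieL entry above: a minimal sketch of an adversarial graph view used as a contrastive sample, written against a generic PyTorch encoder with a PyG-style `encoder(x, edge_index)` signature. This is an assumption-laden illustration of the general idea (a PGD-style feature perturbation that increases an InfoNCE loss), not the authors' implementation, and it omits ArieL's information regularization and subgraph sampling.

```python
import torch
import torch.nn.functional as F

def info_nce(z1, z2, tau=0.5):
    """InfoNCE loss: row i of z1 and row i of z2 are positives,
    all other rows serve as negatives."""
    z1, z2 = F.normalize(z1, dim=1), F.normalize(z2, dim=1)
    logits = z1 @ z2.t() / tau
    labels = torch.arange(z1.size(0), device=z1.device)
    return F.cross_entropy(logits, labels)

def adversarial_view(encoder, x, edge_index, z_anchor, eps=0.01, steps=3):
    """Perturb node features within an L-inf ball of radius eps so that the
    contrastive loss against the anchor embedding *increases* (a PGD-style
    inner maximization), yielding a harder contrastive view."""
    z_anchor = z_anchor.detach()
    delta = torch.zeros_like(x, requires_grad=True)
    for _ in range(steps):
        loss = info_nce(encoder(x + delta, edge_index), z_anchor)
        grad, = torch.autograd.grad(loss, delta)
        with torch.no_grad():
            delta = (delta + eps * grad.sign()).clamp(-eps, eps)
        delta.requires_grad_(True)
    return (x + delta).detach()
```

Training would then minimize something like `info_nce(z1, z2) + info_nce(z1, encoder(x_adv, edge_index))` over the encoder's parameters, where `z1` and `z2` come from two standard augmented views and `x_adv` is the adversarial view above.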