NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Efficiently Constructing Sparse Navigable Graphs

Conway, Alex; Dhulipala, Laxman; Farach-Colton, Martin; Johnson, Rob; Landrum, Ben; Musco, Christopher; Shechter, Yarin; Suel, Torsten; Wen, Richard (January 2026, ACM-SIAM Symposium on Discrete Algorithms (SODA).)

Free, publicly-accessible full text available January 11, 2027
AutoDDG: Automated Dataset Description Generation using Large Language Models

Zhang, Haoxiang; Liu, Yurong; Santos, Aecio; Hung, Wei-Lun; Freire, Juliana (December 2025, https://arxiv.org/pdf/2502.01050)

Free, publicly-accessible full text available December 18, 2026
Distance Adaptive Beam Search for Provably Accurate Graph-Based Nearest Neighbor Search

Al-Jazzazi, Yousef; Diwan, Haya; Gou, Jinrui; Musco, Cameron; Musco, Christopher; Suel, Torsten (December 2025, Neural Information Processing Systems (NeurIPS))

Free, publicly-accessible full text available December 11, 2026
HILTS: Human-LLM collaboration for effective data labeling

https://doi.org/10.1016/j.is.2025.102660

Barbosa, Juliana; Alencar, Eduarda; Fan, Grace; Santos, Aécio; Freire, Juliana (December 2025, Information Systems)

Free, publicly-accessible full text available December 1, 2026
Hierarchical Table Semantics for Exploratory Table Discovery

https://doi.org/10.1145/3736733.3736746

Fan, Grace; Freire, Juliana (July 2025, ACM)

Free, publicly-accessible full text available July 8, 2026
Coupling without Communication and Drafter-Invariant Speculative Decoding

Daliri, Majid; Musco, Christopher; Suresh, Ananda Theertha (June 2025, IEEE International Symposium on Information Theory (ISIT))

Free, publicly-accessible full text available June 22, 2026
A Cost-Effective LLM-based Approach to Identify Wildlife Trafficking in Online Marketplaces

https://doi.org/10.1145/3725256

Barbosa, Juliana Silva; Gondhali, Ulhas; Petrossian, Gohar; Sharma, Kinshuk; Chakraborty, Sunandan; Jacquet, Jennifer; Freire, Juliana (June 2025, Proceedings of the ACM on Management of Data)

Wildlife trafficking remains a critical global issue, significantly impacting biodiversity, ecological stability, and public health. Despite efforts to combat this illicit trade, the rise of e-commerce platforms has made it easier to sell wildlife products, putting new pressure on wild populations of endangered and threatened species. The use of these platforms also opens a new opportunity: as criminals sell wildlife products online, they leave digital traces of their activity that can provide insights into trafficking activities as well as how they can be disrupted. The challenge lies in finding these traces. Online marketplaces publish ads for a plethora of products, and identifying ads for wildlife-related products is like finding a needle in a haystack. Learning classifiers can automate ad identification, but creating them requires costly, time-consuming data labeling that hinders support for diverse ads and research questions. This paper addresses a critical challenge in the data science pipeline for wildlife trafficking analytics: generating quality labeled data for classifiers that select relevant data. While large language models (LLMs) can directly label advertisements, doing so at scale is prohibitively expensive. We propose a cost-effective strategy that leverages LLMs to generate pseudo labels for a small sample of the data and uses these labels to create specialized classification models. Our novel method automatically gathers diverse and representative samples to be labeled while minimizing the labeling costs. Our experimental evaluation shows that our classifiers achieve up to 95% F1 score, outperforming LLMs at a lower cost. We present real use cases that demonstrate the effectiveness of our approach in enabling analyses of different aspects of wildlife trafficking.
more » « less
Free, publicly-accessible full text available June 17, 2026
Matrix Product Sketching via Coordinated Sampling

Daliri, Majid; Freire, Juliana; Li, Danrong; Musco, Christopher (April 2025, International Conference on Learning Representations (ICLR))

Free, publicly-accessible full text available April 24, 2026
Matrix Product Sketching via Coordinated Sampling

Daliri, Majid; Friere, Juliana; Li, Danrong; Musco, Christopher (April 2025, International Conference on Learning Representations (ICLR))

Free, publicly-accessible full text available April 24, 2026
QJL: 1-Bit Quantized JL Transform for KV Cache Quantization with Zero Overhead

https://doi.org/10.1609/aaai.v39i24.34773

Zandieh, Amir; Daliri, Majid; Han, Insu (April 2025, Proceedings of the AAAI Conference on Artificial Intelligence)

Free, publicly-accessible full text available April 11, 2026

« Prev Next »

Search for: All records