NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

AID: Adaptive Integration of Detectors for Safe AI with Language Models

Wang, Xinran; Diao, Enmao; Le, Qi; Ding, Jie; Anwar, Ali (April 2025, Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics)

Free, publicly-accessible full text available April 29, 2026
AID: Adaptive Integration of Detectors for Safe AI with Language Models

Wang, Xinran; Diao, Enmao; Le, Qi; Ding, Jie; Anwar, Ali (April 2025, Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics)

Free, publicly-accessible full text available April 29, 2026
Probe Pruning: Accelerating LLMs through Dynamic Pruning via Model-Probing

Le, Qi; Diao, Enmao; Wang, Ziyan; Wang, Xinran; Ding, Jie; Yang, Li; Anwar, Ali (April 2025, International Conference on Learning Representations)

Free, publicly-accessible full text available April 25, 2026
Probe Pruning: Accelerating LLMs through Dynamic Pruning via Model-Probing

Le, Qi; Diao, Enmao; Wang, Ziyan; Wang, Xinran; Ding, Jie; Yang, Li; Anwar, Ali (April 2025, International Conference on Learning Representations)

Free, publicly-accessible full text available April 25, 2026
MAP: Multi-Human-Value Alignment Palette

Wang, Xinran; Le, Qi; Ahmed, Ammar; Diao, Enmao; Zhou, Yi; Baracaldo, Nathalie; Ding, Jie; Anwar, Ali (April 2025, International Conference on Learning Representations)

Free, publicly-accessible full text available April 25, 2026
IP-FL: Incentive-driven Personalization in Federated Learning.

Khan, Ahmad_Faraz; Wang, Xinran; Le, Qi; Abdeen, Zain ul; Khan, Azal Ahmad; Ali, Haider; Jin, Ming; Ding, Jie; Butt, Ali R; Anwar, Ali (June 2025, Proceedings of the 39th IEEE International Parallel & Distributed Processing Symposium (IPDPS 2025))

Free, publicly-accessible full text available June 3, 2026
Probe Pruning: Accelerating LLMs through Dynamic Pruning via Model-Probing

Qi, Le; Diao, Enmao; Wang, Ziyan; Wang, Xinran; Ding, Jie; Yang, Li; Anwar, Ali (February 2025, The Thirteenth International Conference on Learning Representations (ICLR 2025))

Free, publicly-accessible full text available February 21, 2026
ICL: An Incentivized Collaborative Learning Framework

Wang, Xinran; Le, Qi; Khan, Ahmad_Faraz; Ding, Jie; Anwar, Ali (October 2024, IEEE International Conference on Big Data)

Full Text Available
ICL: An Incentivized Collaborative Learning Framework

Wang, Xinran; Le, Qi; Khan, Ahmad_Faraz; Ding, Jie; Anwar, Ali (October 2024, IEEE International Conference on Big Data)

Full Text Available
Balancing Costs and Durability for Serverless Data

Merenstein, Alex; Wang, Xinran; Tarasov, Vasily; Agarwal, Prajjawal; Guthridge, Scott; Thakkar, Kapil; Wu, Katherine; Anwar, Ali; Zadok, Erez (June 2024, IEEE)

Durability features such as replication or erasure coding serve an important role in storage systems, enabling users to store data without fear of loss due to device failures. However, these durability features come with a cost, in terms of storage, network traffic, and computational overheads. For most data, loss is a catastrophic event and so these overheads are acceptable. However, some data tolerates low durability and does not need the high level of durability that most storage systems provide. Identifying the proper level of durability for a piece of data is difficult, especially since it is often not clear how to determine the cost of loss. For some data used in serverless applications, however, this cost is relatively straightforward to calculate: serverless functions are often required to be idempotent, meaning that the data produced by them can be re-created by re-running the function. The cost of losing a piece of data then is merely the cost of re-running the function that originally created the data. In this paper, we explore the tradeoff between the cost of storing data durably and the cost to re-create data. We focus on serverless data because its ability to be recreated makes it possible to assign a cost to its loss. We develop a mathematical model that relates compute costs, storage costs, and application-specific parameters to calculate the cost-optimal placement of data. We also develop an execution framework capable of handling lost data transparently, enabling applications to use lower-durability storage with no additional burden on the developer. Next, we show how different factors such as failure rate and compute costs affect the placement decision. We find that thanks to the relatively short lifetime of serverless data, the probability of data loss even on low-durability storage is fairly low. Finally, we use the model to place data for several applications, including a video-transcoding application and an image-assembly application. We show that our model can predict execution costs within 7% of actual execution costs, and can reduce storage costs by up to 3x while never exceeding baseline costs.
more » « less
Full Text Available

« Prev Next »

Search for: All records