NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Soft prompt recovers compressed LLMs, transferably

Xu, Zhaozhuo; Liu, Ziru; Chen, Beidi; Zhong, Shaochen; Tang, Yuxin; Wang, Jue; Zhou, Kaixiong; Hu, Xia; Shrivastava, Anshumali (July 2024, JMLR.org)

Full Text Available
Federated Learning Over Images: Vertical Decompositions and Pre-Trained Backbones Are Difficult to Beat

https://doi.org/10.1109/ICCV51070.2023.01776

Hu, Erdong; Tang, Yuxin; Kyrillidis, Anastasios; Jermaine, Chris (October 2023, 2023 IEEE/CVF International Conference on Computer Vision (ICCV))

We carefully evaluate a number of algorithms for learning in a federated environment, and test their utility for a variety of image classification tasks. We consider many issues that have not been adequately considered before: whether learning over data sets that do not have diverse sets of images affects the results; whether to use a pre-trained feature extraction "backbone"; how to evaluate learner performance (we argue that classification accuracy is not enough), among others. Overall, across a wide variety of settings, we find that vertically decomposing a neural network seems to give the best results, and outperforms more standard reconciliation-used methods.
more » « less
Full Text Available
Auto-Differentiation of Relational Computations for Very Large Scale Machine Learning

Tang, Yuxin; Ding, Zhimin; Jankov, Dimitrije; Yuan, Binhang; Bourgeois, Daniel; Jermaine, Chris (July 2023, Proceedings of Machine Learning Research (ICML))

The relational data model was designed to facilitate large-scale data management and analytics. We consider the problem of how to differentiate computations expressed relationally. We show experimentally that a relational engine running an auto-differentiated relational algorithm can easily scale to very large datasets, and is competitive with state-of-the-art, special-purpose systems for large-scale distributed machine learning.
more » « less
Full Text Available
Auto-Differentiation of Relational Computations for Very Large Scale Machine Learning

Tang, Yuxin; Ding, Zhimin; Jankov, Dimitrije; Yuan, Binhang; Bourgeois, Daniel; Jermaine, Chris (July 2023, Proceedings of Machine Learning Research)

Full Text Available
Auto-Differentiation of Relational Computations for Very Large Scale Machine Learning

Tang, Yuxin; Ding, Zhimin; Jankov, Dimitrije; Yuan, Binhang; Bourgeois, Daniel; Jermaine, Chris (July 2023, International Conference on Machine Learning)
Distributed learning of fully connected neural networks using independent subnet training

https://doi.org/10.14778/3529337.3529343

Yuan, Binhang; Wolfe, Cameron R.; Dun, Chen; Tang, Yuxin; Kyrillidis, Anastasios; Jermaine, Chris (April 2022, Proceedings of the VLDB Endowment)

Full Text Available
Tensor relational algebra for distributed machine learning system design

https://doi.org/10.14778/3457390.3457399

Yuan, Binhang; Jankov, Dimitrije; Zou, Jia; Tang, Yuxin; Bourgeois, Daniel; Jermaine, Chris (April 2021, Proceedings of the VLDB Endowment)

We consider the question: what is the abstraction that should be implemented by the computational engine of a machine learning system? Current machine learning systems typically push whole tensors through a series of compute kernels such as matrix multiplications or activation functions, where each kernel runs on an AI accelerator (ASIC) such as a GPU. This implementation abstraction provides little built-in support for ML systems to scale past a single machine, or for handling large models with matrices or tensors that do not easily fit into the RAM of an ASIC. In this paper, we present an alternative implementation abstraction called the tensor relational algebra (TRA). The TRA is a set-based algebra based on the relational algebra. Expressions in the TRA operate over binary tensor relations, where keys are multi-dimensional arrays and values are tensors. The TRA is easily executed with high efficiency in a parallel or distributed environment, and amenable to automatic optimization. Our empirical study shows that the optimized TRA-based back-end can significantly outperform alternatives for running ML workflows in distributed clusters.
more » « less
Full Text Available
Programmable In-Network Security for Context-aware BYOD Policies

Kang, Qiao; Xue, Lei; Morrison, Adam; Tang, Yuxin; Chen, Ang; Luo, Xiapu (January 2020, USENIX Security)
null (Ed.)
Full Text Available
Exploring Simulation of Software-Defined Underwater Wireless Networks

https://doi.org/10.1145/3148675.3148720

Wei, Li; Tang, Yuxin; Cao, Yuching; Wang, Zhaohui; Gerla, Mario (November 2017, The ACM International Workshop on Underwater Networks (WUWNet))

Full Text Available

Search for: All records