NSF PAR Search | NSF Public Access Repository

Statistical Minimax Lower Bounds for Transfer Learning in Linear Binary Classification

https://doi.org/10.1109/ISIT50566.2022.9834760

Mousavi Kalan, Seyed Mohammadreza; Soltanolkotabi, Mahdi; Avestimehr, A. Salman (June 2022, IEEE)

Modern machine learning models require a large amount of labeled data for training to perform well. A recently emerging paradigm for reducing the reliance of large model training on massive labeled data is to take advantage of abundantly available labeled data from a related source task to boost the performance of the model on a desired target task where little data may be available. This approach, called transfer learning, has been applied successfully in many application domains. However, although many transfer learning algorithms have been developed, the fundamental understanding of "when" and "to what extent" transfer learning can reduce sample complexity is still limited. In this work, we take a step towards a foundational understanding of transfer learning by focusing on binary classification with linear models and Gaussian features, and we develop statistical minimax lower bounds in terms of the number of source and target samples and an appropriate notion of similarity between the source and target tasks. To derive this bound, we reduce the transfer learning problem to hypothesis testing by constructing a packing set of source and target parameters using the Gilbert-Varshamov bound, which in turn leads to a lower bound on sample complexity. We also evaluate our theoretical results with experiments on real data sets.
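For orientation, the two standard tools named in the abstract can be written in their generic textbook form. This is only a sketch of the general technique, not the bound derived in the paper; the symbols m, M, \omega, \theta, d, \delta, J and X are notation assumed here and do not come from the abstract.

```latex
% Gilbert-Varshamov bound (textbook form): for m >= 8 there exist binary
% vectors \omega^{(0)}, ..., \omega^{(M)} in {0,1}^m that are pairwise far
% apart in Hamming distance d_H, and there are exponentially many of them.
\[
  d_H\bigl(\omega^{(j)}, \omega^{(k)}\bigr) \ge \frac{m}{8}
  \quad \text{for all } 0 \le j < k \le M,
  \qquad M \ge 2^{m/8}.
\]
% Reduction to hypothesis testing (Fano's method): if the parameters
% \theta_1, ..., \theta_M built from such a packing are 2\delta-separated in
% a metric d, J is uniform on {1, ..., M}, and X is drawn from P_{\theta_J},
% then every estimator \hat\theta satisfies
\[
  \inf_{\hat\theta} \max_{1 \le j \le M}
  \mathbb{E}_{\theta_j}\bigl[ d(\hat\theta, \theta_j) \bigr]
  \;\ge\;
  \delta \left( 1 - \frac{I(J; X) + \log 2}{\log M} \right).
\]
```

In this generic template, a larger packing (larger M) and less informative samples (smaller mutual information I(J; X)) both push the lower bound up.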
Full Text Available
ApproxIFER: A Model-Agnostic Approach to Resilient and Robust Prediction Serving Systems

https://doi.org/10.1609/aaai.v36i8.20809

Soleymani, Mahdi; Ali, Ramy E.; Mahdavifar, Hessam; Avestimehr, A. Salman (June 2022, Proceedings of the AAAI Conference on Artificial Intelligence)

Due to the surge of cloud-assisted AI services, the problem of designing resilient prediction serving systems that can effectively cope with stragglers and minimize response delays has attracted much interest. The common approach to this problem is replication, which assigns the same prediction task to multiple workers. This approach, however, is inefficient and incurs significant resource overheads. A learning-based approach known as parity model (ParM) has therefore been proposed recently; it learns models that generate "parities" for a group of predictions in order to reconstruct the predictions of slow or failed workers. While this learning-based approach is more resource-efficient than replication, it is tailored to the specific model hosted by the cloud, and it is suitable mainly for a small number of queries (typically fewer than four) and for tolerating very few stragglers (mostly one). Moreover, ParM does not handle Byzantine adversarial workers. We propose a different approach, named Approximate Coded Inference (ApproxIFER), that does not require training any parity models; hence it is agnostic to the model hosted by the cloud and can be readily applied to different data domains and model architectures. Compared with earlier works, ApproxIFER can handle a general number of stragglers and scales significantly better with the number of queries. Furthermore, ApproxIFER is robust against Byzantine workers. Our extensive experiments on a large number of datasets and model architectures show that ApproxIFER improves degraded-mode accuracy by up to 58% over ParM.
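The replication baseline mentioned in the abstract can be illustrated with a minimal sketch. It models only plain replication (first reply wins), not ParM or ApproxIFER; the function names and the latency figures are hypothetical and chosen purely for illustration.

```python
def replicated_latency(latencies_per_query):
    """Return the effective latency of each query under replication.

    Each inner list holds the (hypothetical) response times of the workers
    that received a copy of the same query; the earliest reply is used.
    """
    return [min(replicas) for replicas in latencies_per_query]


def workers_needed(num_queries, stragglers_tolerated):
    """Count workers used when every query is copied enough times to
    survive `stragglers_tolerated` slow or failed workers."""
    # Tolerating s stragglers for one query takes s + 1 copies of it.
    return num_queries * (stragglers_tolerated + 1)


if __name__ == "__main__":
    # Hypothetical latencies in seconds: 3 queries, 2 replicas each.
    latencies = [[0.12, 0.95], [0.80, 0.10], [0.11, 0.13]]
    print(replicated_latency(latencies))
    print(workers_needed(num_queries=3, stragglers_tolerated=1))
```

The worker count grows as the product of the number of queries and the number of copies, which is the resource overhead the abstract refers to.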
Full Text Available
Analog Secret Sharing With Applications to Private Distributed Learning

https://doi.org/10.1109/TIFS.2022.3173417

Soleymani, Mahdi; Mahdavifar, Hessam; Avestimehr, A. Salman (January 2022, IEEE Transactions on Information Forensics and Security)

Full Text Available
Analog Privacy-Preserving Coded Computing

https://doi.org/10.1109/ISIT45174.2021.9517715

Soleymani, Mahdi; Mahdavifar, Hessam; Avestimehr, A. Salman (July 2021, 2021 IEEE International Symposium on Information Theory (ISIT))

Full Text Available
List-Decodable Coded Computing: Breaking the Adversarial Toleration Barrier

https://doi.org/10.1109/JSAIT.2021.3102956

Soleymani, Mahdi; Ali, Ramy E.; Mahdavifar, Hessam; Avestimehr, A. Salman (September 2021, IEEE Journal on Selected Areas in Information Theory)

Full Text Available
Edge Computing in the Dark: Leveraging Contextual-Combinatorial Bandit and Coded Computing

https://doi.org/10.1109/TNET.2021.3058685

Yang, Chien-Sheng; Pedarsani, Ramtin; Avestimehr, A. Salman (June 2021, IEEE/ACM Transactions on Networking)
Full Text Available
Coded Computing for Secure Boolean Computations

https://doi.org/10.1109/JSAIT.2021.3055341

Yang, Chien-Sheng; Avestimehr, A. Salman (March 2021, IEEE Journal on Selected Areas in Information Theory)
Full Text Available
CodedPrivateML: A Fast and Privacy-Preserving Framework for Distributed Machine Learning

https://doi.org/10.1109/JSAIT.2021.3053220

So, Jinhyun; Guler, Basak; Avestimehr, A. Salman (March 2021, IEEE Journal on Selected Areas in Information Theory)
Full Text Available
Analog Lagrange Coded Computing

https://doi.org/10.1109/JSAIT.2021.3056377

Soleymani, Mahdi; Mahdavifar, Hessam; Avestimehr, A. Salman (March 2021, IEEE Journal on Selected Areas in Information Theory)
Full Text Available
Turbo-Aggregate: Breaking the Quadratic Aggregation Barrier in Secure Federated Learning

https://doi.org/10.1109/JSAIT.2021.3054610

So, Jinhyun; Guler, Basak; Avestimehr, A. Salman (March 2021, IEEE Journal on Selected Areas in Information Theory)
Full Text Available
