Search for: All records

Creators/Authors contains: "Data, Deepesh"


  1. Traditionally, an item-level differential privacy framework has been studied for applications in distributed learning. However, when a client holds multiple data samples and may also want to hide its potential participation, the more appropriate notion is user-level privacy [1]. In this paper, we develop a distributed private optimization framework that studies the trade-off between user-level local differential privacy guarantees and performance. This is enabled by a novel distributed user-level private mean estimation algorithm based on distributed private heavy-hitter estimation. We use this result to develop the privacy-performance trade-off for distributed optimization. (A toy sketch of user-level LDP mean estimation follows this list.)
  2. This paper addresses the problem of decentralized learning in the presence of data poisoning attacks. We consider a collection of nodes connected through a network, each equipped with a local function. The objective is to compute the global minimizer of the aggregated local functions in a decentralized manner, i.e., each node can use only its local function and data exchanged with the nodes it is connected to. Moreover, every node must agree on this minimizer despite an adversary that can arbitrarily change the local functions of a fraction of the nodes. This setting has applications in robust learning, where the nodes of a network collectively train a model that minimizes the empirical loss over possibly attacked local datasets. We propose a novel decentralized learning algorithm that enables all nodes to reach consensus on the optimal model in the absence of attacks, and approximate consensus in the presence of data poisoning attacks. (A decentralized robust-aggregation sketch follows this list.)
  3. We study distributed stochastic gradient descent (SGD) in the master-worker architecture under Byzantine attacks. We consider the heterogeneous data model, where different workers may have different local datasets, and we do not make any probabilistic assumptions on data generation. At the core of our algorithm, we use the polynomial-time outlier-filtering procedure for robust mean estimation proposed by Steinhardt et al. (ITCS 2018) to filter out corrupt gradients. In order to apply their filtering procedure in our heterogeneous data setting, where workers compute stochastic gradients, we derive a new matrix concentration result, which may be of independent interest. We provide convergence analyses for smooth strongly-convex and non-convex objectives and show that our convergence rates match those of vanilla SGD in the Byzantine-free setting. To bound the heterogeneity, we assume that the gradients at different workers have bounded deviation from each other, and we also provide concrete bounds on this deviation in the statistical heterogeneous data model. (A stand-in filtering sketch follows this list.)
  4. The central question studied in this paper is Rényi Differential Privacy (RDP) guarantees for general discrete local randomizers in the shuffle privacy model. In the shuffle model, each of the n clients randomizes its response using a local differentially private (LDP) mechanism, and the untrusted server receives only a random permutation (shuffle) of the client responses, without association to any client. The principal result of this paper is the first set of direct RDP bounds for general discrete local randomizers in the shuffle privacy model, and we develop new analysis techniques for deriving our results which could be of independent interest. In applications, such an RDP guarantee is most useful when composing several private interactions. We numerically demonstrate that, in important regimes, with composition our bound yields an improvement in the privacy guarantee by a factor of 8× over the state-of-the-art approximate Differential Privacy (DP) guarantee (with standard composition) for shuffle models. Moreover, combined with Poisson subsampling, our result leads to at least a 10× improvement over subsampled approximate DP with standard composition. (A sketch of the shuffle pipeline follows this list.)
  5. We study stochastic gradient descent (SGD) with local iterations in the presence of malicious/Byzantine clients, motivated by federated learning. The clients, instead of communicating with the central server in every iteration, maintain their local models, which they update by taking several SGD iterations based on their own datasets, and then communicate the net update to the server, thereby achieving communication efficiency. Furthermore, only a subset of clients communicates with the server, and this subset may differ across synchronization times. The Byzantine clients may collaborate and send arbitrary vectors to the server to disrupt the learning process. To combat the adversary, we employ an efficient high-dimensional robust mean estimation algorithm from Steinhardt et al. (ITCS 2018) at the server to filter out corrupt vectors; and to analyze the outlier-filtering procedure, we develop a novel matrix concentration result that may be of independent interest. We provide convergence analyses for strongly-convex and non-convex smooth objectives in the heterogeneous data setting, where different clients may have different local datasets, and we do not make any probabilistic assumptions on data generation. We believe that ours is the first Byzantine-resilient algorithm and analysis with local iterations. We derive our convergence results under minimal assumptions of bounded variance for SGD and bounded gradient dissimilarity (which captures heterogeneity among local datasets). We also extend our results to the case when clients compute full-batch gradients. (A sketch of one such round with local iterations follows this list.)
  6. This paper studies a distributed optimization problem in the federated learning (FL) framework under differential privacy constraints, whereby a set of clients with local samples are connected to an untrusted server, which wants to learn a global model while preserving the privacy of the clients' local datasets. We propose a new client-sampling scheme, called self-sampling, that reflects the random availability of clients in the FL learning process. We analyze the differential privacy of SGD with client self-sampling by composing amplification by subsampling with amplification by shuffling. Furthermore, we analyze the convergence of the proposed SGD algorithm, showing that reasonable learning performance can be achieved while preserving the privacy of clients' data, even with client self-sampling. (A sketch of client self-sampling follows this list.)
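
The following sketches are illustrative Python stand-ins for the settings described in the items above; each is a toy implementation under stated assumptions, not the papers' actual algorithms. First, for item 1: the paper's estimator is built on distributed private heavy-hitter estimation, which is not reproduced here. As a simpler baseline, each client releases a single Laplace-privatized local average; because that one clipped statistic summarizes the client's entire dataset, the guarantee holds at the user level.

```python
# Hypothetical baseline for item 1: user-level local DP mean estimation.
# All names and parameters are illustrative.
import numpy as np

def user_level_ldp_mean(samples, eps, lo=0.0, hi=1.0, rng=None):
    """Release one eps-LDP estimate of a client's local mean.

    Swapping the client's entire local dataset changes the clipped
    average by at most (hi - lo), so Laplace noise with scale
    (hi - lo) / eps yields a user-level eps-LDP report.
    """
    rng = rng or np.random.default_rng()
    avg = np.clip(np.mean(samples), lo, hi)
    return avg + rng.laplace(scale=(hi - lo) / eps)

# Server-side aggregation: average the clients' private reports.
rng = np.random.default_rng(0)
clients = [rng.uniform(size=50) for _ in range(1000)]  # toy local datasets
reports = [user_level_ldp_mean(x, eps=1.0, rng=rng) for x in clients]
print("private estimate:", np.mean(reports))           # close to 0.5
```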
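
For item 2, a toy sketch of decentralized optimization under data poisoning. The aggregation rule here is a trimmed mean over each node's neighborhood, a common robust stand-in that may differ from the paper's actual rule; the ring graph and scalar quadratic local functions are likewise illustrative.

```python
# Hypothetical sketch for item 2: robust decentralized gradient descent.
# Local functions are toy quadratics f_i(x) = (x - a_i)^2.
import numpy as np

def trimmed_mean(values, trim):
    v = np.sort(values)
    return v[trim:len(v) - trim].mean() if trim else v.mean()

def decentralized_step(x, targets, neighbors, byzantine, lr=0.1, trim=1):
    new_x = x.copy()
    for i in range(len(x)):
        # Byzantine nodes broadcast arbitrary values; honest nodes
        # aggregate robustly, then take a local gradient step.
        recv = [1e3 if j in byzantine else x[j] for j in neighbors[i]] + [x[i]]
        agg = trimmed_mean(np.array(recv), trim)
        new_x[i] = agg - lr * 2.0 * (agg - targets[i])  # d/dx (x - a_i)^2
    return new_x

n = 6
neighbors = {i: [(i - 1) % n, (i + 1) % n] for i in range(n)}  # ring graph
targets = np.linspace(0.0, 1.0, n)
x = np.zeros(n)
for _ in range(200):
    x = decentralized_step(x, targets, neighbors, byzantine={0})
print(x)  # honest nodes reach approximate consensus despite the attacker
```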
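
For item 3, a sketch of the server-side aggregation step only. The paper applies the polynomial-time spectral outlier filter of Steinhardt et al. (ITCS 2018); as a much simpler stand-in, this sketch drops the gradients farthest from the coordinate-wise median before averaging.

```python
# Hypothetical stand-in for item 3: filter-then-average aggregation.
import numpy as np

def filtered_mean(grads, n_drop):
    """grads: (workers, dim) stacked gradients; drop the n_drop rows
    farthest from the coordinate-wise median, then average the rest."""
    med = np.median(grads, axis=0)
    dist = np.linalg.norm(grads - med, axis=1)
    keep = np.argsort(dist)[: len(grads) - n_drop]
    return grads[keep].mean(axis=0)

rng = np.random.default_rng(1)
honest = rng.normal(loc=1.0, scale=0.1, size=(8, 4))  # heterogeneous workers
corrupt = np.full((2, 4), -50.0)                      # Byzantine gradients
grads = np.vstack([honest, corrupt])
print(filtered_mean(grads, n_drop=2))  # close to the honest mean (~1.0)
```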
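
For item 4, a minimal sketch of the shuffle pipeline: each client randomizes its discrete response with k-ary randomized response (one standard eps0-LDP randomizer; the paper covers general discrete randomizers), and the untrusted server receives only a random permutation of the reports. The RDP amplification bounds themselves are analytical and not reproduced here.

```python
# Hypothetical sketch for item 4: k-ary randomized response + shuffling.
import numpy as np

def k_rr(value, k, eps0, rng):
    """Keep the true value w.p. e^eps0 / (e^eps0 + k - 1); otherwise
    report a uniformly random *other* value. This is eps0-LDP."""
    p_true = np.exp(eps0) / (np.exp(eps0) + k - 1)
    if rng.random() < p_true:
        return value
    return rng.choice([v for v in range(k) if v != value])

def shuffle_mechanism(data, k, eps0, seed=0):
    rng = np.random.default_rng(seed)
    reports = np.array([k_rr(x, k, eps0, rng) for x in data])
    rng.shuffle(reports)  # server sees reports with no client association
    return reports

data = np.random.default_rng(2).integers(0, 4, size=1000)
hist = np.bincount(shuffle_mechanism(data, k=4, eps0=1.0), minlength=4)
print(hist)  # a noisy version of the true histogram
```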
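
For item 5, a sketch of one communication round with local iterations: each client runs several local SGD steps on a toy quadratic and sends its net update, and the server filters outliers before averaging. The median-distance filter is again a stand-in for the filtering procedure the paper actually uses.

```python
# Hypothetical sketch for item 5: local SGD + robust server aggregation.
import numpy as np

def local_update(model, data, steps=5, lr=0.1):
    w = model.copy()
    for _ in range(steps):            # local SGD on f_i(w) = ||w - data_i||^2
        w -= lr * 2.0 * (w - data)
    return w - model                  # net update sent to the server

def robust_round(model, client_data, byzantine, n_drop, rng):
    updates = []
    for i, d in enumerate(client_data):
        if i in byzantine:            # attackers send arbitrary vectors
            updates.append(rng.normal(scale=100.0, size=model.shape))
        else:
            updates.append(local_update(model, d))
    updates = np.stack(updates)
    med = np.median(updates, axis=0)  # filter by distance to the median
    dist = np.linalg.norm(updates - med, axis=1)
    keep = np.argsort(dist)[: len(updates) - n_drop]
    return model + updates[keep].mean(axis=0)

rng = np.random.default_rng(3)
client_data = rng.normal(loc=1.0, scale=0.2, size=(10, 4))
model = np.zeros(4)
for _ in range(20):
    model = robust_round(model, client_data, byzantine={0, 1}, n_drop=2, rng=rng)
print(model)  # approximately the honest clients' data mean (~1.0)
```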
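
For item 6, a sketch of client self-sampling in private SGD: each client independently decides to participate in a round with probability q (the server does not control availability), clips its gradient, and adds Gaussian noise locally before sending, since the server is untrusted. The objective, the noise calibration, and all parameters are illustrative; the amplification-by-subsampling and shuffling analysis is not reproduced.

```python
# Hypothetical sketch for item 6: DP-SGD with client self-sampling.
import numpy as np

def self_sampled_round(model, client_data, q, clip, sigma, lr, rng):
    reports = []
    for d in client_data:
        if rng.random() >= q:         # client self-samples out of this round
            continue
        g = 2.0 * (model - d)         # gradient of ||model - d||^2
        g *= min(1.0, clip / (np.linalg.norm(g) + 1e-12))  # clip
        # Noise is added locally because the server is untrusted.
        reports.append(g + rng.normal(scale=sigma * clip, size=g.shape))
    if not reports:                   # nobody showed up this round
        return model
    return model - lr * np.mean(reports, axis=0)

rng = np.random.default_rng(4)
client_data = rng.normal(loc=1.0, scale=0.3, size=(200, 4))
model = np.zeros(4)
for _ in range(300):
    model = self_sampled_round(model, client_data, q=0.1,
                               clip=1.0, sigma=1.0, lr=0.2, rng=rng)
print(model)  # noisy, but close to the population mean (~1.0)
```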