NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

NysAct: A Scalable Preconditioned Gradient Descent using Nyström Approximation

https://doi.org/10.1109/BigData62323.2024.10825352

Seung, Hyunseok; Lee, Jaewoo; Ko, Hyunsuk (December 2024, IEEE)

Adaptive gradient methods are computationally efficient and converge quickly, but they often suffer from poor generalization. In contrast, second-order methods enhance convergence and generalization but typically incur high computational and memory costs. In this work, we introduce NYSACT, a scalable first-order gradient preconditioning method that strikes a balance between state-of-the-art first-order and second-order optimization methods. NYSACT leverages an eigenvalue-shifted Nyström method to approximate the activation covariance matrix, which is used as a preconditioning matrix, significantly reducing time and memory complexities with minimal impact on test accuracy. Our experiments show that NYSACT not only achieves improved test accuracy compared to both first-order and second-order methods but also demands considerably less computational resources than existing second-order methods.
more » « less
Full Text Available
MBAG: A Scalable Mini-Block Adaptive Gradient Method for Deep Neural Networks

https://doi.org/10.1109/BigData55660.2022.10020262

Lee, Jaewoo (December 2022, IEEE)

Full Text Available
Wasserstein Adversarial Transformer for Cloud Workload Prediction

https://doi.org/10.1609/aaai.v36i11.21509

Arbat, Shivani; Jayakumar, Vinodh Kumaran; Lee, Jaewoo; Wang, Wei; Kim, In Kee (June 2022, Proceedings of the AAAI Conference on Artificial Intelligence)

Predictive VM (Virtual Machine) auto-scaling is a promising technique to optimize cloud applications’ operating costs and performance. Understanding the job arrival rate is crucial for accurately predicting future changes in cloud workloads and proactively provisioning and de-provisioning VMs for hosting the applications. However, developing a model that accurately predicts cloud workload changes is extremely challenging due to the dynamic nature of cloud workloads. Long- Short-Term-Memory (LSTM) models have been developed for cloud workload prediction. Unfortunately, the state-of-the-art LSTM model leverages recurrences to predict, which naturally adds complexity and increases the inference overhead as input sequences grow longer. To develop a cloud workload prediction model with high accuracy and low inference overhead, this work presents a novel time-series forecasting model called WGAN-gp Transformer, inspired by the Transformer network and improved Wasserstein-GANs. The proposed method adopts a Transformer network as a generator and a multi-layer perceptron as a critic. The extensive evaluations with real-world workload traces show WGAN- gp Transformer achieves 5× faster inference time with up to 5.1% higher prediction accuracy against the state-of-the-art. We also apply WGAN-gp Transformer to auto-scaling mechanisms on Google cloud platforms, and the WGAN-gp Transformer-based auto-scaling mechanism outperforms the LSTM-based mechanism by significantly reducing VM over-provisioning and under-provisioning rates.
more » « less
Full Text Available
Differentially Private Normalizing Flows for Synthetic Tabular Data Generation

https://doi.org/10.1609/aaai.v36i7.20697

Lee, Jaewoo; Kim, Minjung; Jeong, Yonghyun; Ro, Youngmin (June 2022, Proceedings of the AAAI Conference on Artificial Intelligence)

Normalizing flows have shown to be a promising approach to deep generative modeling due to their ability to exactly evaluate density --- other alternatives either implicitly model the density or use approximate surrogate density. In this work, we present a differentially private normalizing flow model for heterogeneous tabular data. Normalizing flows are in general not amenable to differentially private training because they require complex neural networks with larger depth (compared to other generative models) and use specialized architectures for which per-example gradient computation is difficult (or unknown). To reduce the parameter complexity, the proposed model introduces a conditional spline flow which simulates transformations at different stages depending on additional input and is shared among sub-flows. For privacy, we introduce two fine-grained gradient clipping strategies that provide a better signal-to-noise ratio and derive fast gradient clipping methods for layers with custom parameterization. Our empirical evaluations show that the proposed model preserves statistical properties of original dataset better than other baselines.
more » « less
Full Text Available
Performance Testing for Cloud Computing with Dependent Data Bootstrapping

https://doi.org/10.1109/ASE51524.2021.9678687

He, Sen; Liu, Tianyi; Lama, Palden; Lee, Jaewoo; Kim, In Kee; Wang, Wei (November 2021, IEEE/ACM International Conference on Automated Software Engineering, 2021)

Full Text Available
Differentially Private Goodness-of-Fit Tests for Continuous Variables

https://doi.org/10.1016/j.ecosta.2021.09.007

Kwak, Seung Woo; Ahn, Jeongyoun; Lee, Jaewoo; Park, Cheolwoo (October 2021, Econometrics and Statistics)

Full Text Available
Scaling up Differentially Private Deep Learning with Fast Per-Example Gradient Clipping

https://doi.org/10.2478/popets-2021-0008

Lee, Jaewoo; Kifer, Daniel (January 2021, Proceedings on Privacy Enhancing Technologies)
null (Ed.)
Abstract Recent work on Renyi Differential Privacy has shown the feasibility of applying differential privacy to deep learning tasks. Despite their promise, however, differentially private deep networks often lag far behind their non-private counterparts in accuracy, showing the need for more research in model architectures, optimizers, etc. One of the barriers to this expanded research is the training time — often orders of magnitude larger than training non-private networks. The reason for this slowdown is a crucial privacy-related step called “per-example gradient clipping” whose naive implementation undoes the benefits of batch training with GPUs. By analyzing the back-propagation equations we derive new methods for per-example gradient clipping that are compatible with auto-differeniation (e.g., in Py-Torch and TensorFlow) and provide better GPU utilization. Our implementation in PyTorch showed significant training speed-ups (by factors of 54x - 94x for training various models with batch sizes of 128). These techniques work for a variety of architectural choices including convolutional layers, recurrent networks, attention, residual blocks, etc.
more » « less
Full Text Available
Stochastic Adaptive Line Search for Differentially Private Optimization

Chen, Chen; Lee, Jaewoo (December 2020, 2020 IEEE International Conference on Big Data (Big Data))
null (Ed.)
The performance of private gradient-based optimization algorithms is highly dependent on the choice of step size (or learning rate) which often requires non-trivial amount of tuning. In this paper, we introduce a stochastic variant of classic backtracking line search algorithm that satisfies Renyi differential privacy. Specifically, the proposed algorithm adaptively chooses the step size satisfying the the Armijo condition (with high probability) using noisy gradients and function estimates. Furthermore, to improve the probability with which the chosen step size satisfies the condition, it adjusts per-iteration privacy budget during runtime according to the reliability of noisy gradient. A naive implementation of the backtracking search algorithm may end up using unacceptably large privacy budget as the ability of adaptive step size selection comes at the cost of extra function evaluations. The proposed algorithm avoids this problem by using the sparse vector technique combined with the recent privacy amplification lemma. We also introduce a privacy budget adaptation strategy in which the algorithm adaptively increases the budget when it detects that directions pointed by consecutive gradients are drastically different. Extensive experiments on both convex and non-convex problems show that the adaptively chosen step sizes allow the proposed algorithm to efficiently use the privacy budget and show competitive performance against existing private optimizers.
more » « less
Full Text Available
Towards a Practical Differentially Private Collaborative Phone Blacklisting System

https://doi.org/10.1145/3427228.3427239

Ucci, Daniele; Perdisci, Roberto; Lee, Jaewoo; Ahamad, Mustaque (December 2020, ACSAC 2020: Annual Computer Security Applications Conference)
null (Ed.)
Spam phone calls have been rapidly growing from nuisance to an increasingly effective scam delivery tool. To counter this increasingly successful attack vector, a number of commercial smartphone apps that promise to block spam phone calls have appeared on app stores, and are now used by hundreds of thousands or even millions of users. However, following a business model similar to some online social network services, these apps often collect call records or other potentially sensitive information from users’ phones with little or no formal privacy guarantees. In this paper, we study whether it is possible to build a practical collaborative phone blacklisting system that makes use of local differential privacy (LDP) mechanisms to provide clear privacy guarantees. We analyze the challenges and trade-offs related to using LDP, evaluate our LDP-based system on real-world user-reported call records collected by the FTC, and show that it is possible to learn a phone blacklist using a reasonable overall privacy budget and at the same time preserve users’ privacy while maintaining utility for the learned blacklist.
more » « less
Full Text Available
Rényi Differentially Private ADMM for Non-Smooth Regularized Optimization

https://doi.org/10.1145/3374664.3375733

Chen, Chen; Lee, Jaewoo (March 2020, CODASPY '20: Proceedings of the Tenth ACM Conference on Data and Application Security and Privacy)
null (Ed.)
In this paper we consider the problem of minimizing composite objective functions consisting of a convex differentiable loss function plus a non-smooth regularization term, such as $$L_1$$ norm or nuclear norm, under Rényi differential privacy (RDP). To solve the problem, we propose two stochastic alternating direction method of multipliers (ADMM) algorithms: ssADMM based on gradient perturbation and mpADMM based on output perturbation. Both algorithms decompose the original problem into sub-problems that have closed-form solutions. The first algorithm, ssADMM, applies the recent privacy amplification result for RDP to reduce the amount of noise to add. The second algorithm, mpADMM, numerically computes the sensitivity of ADMM variable updates and releases the updated parameter vector at the end of each epoch. We compare the performance of our algorithms with several baseline algorithms on both real and simulated datasets. Experimental results show that, in high privacy regimes (small ε), ssADMM and mpADMM outperform baseline algorithms in terms of classification and feature selection performance, respectively.
more » « less
Full Text Available

Search for: All records