Search for: All records

Creators/Authors contains: "Gurbuzbalaban, Mert"

« Prev Next »

Total Resources

23

Resource Type
Conference Paper

13

Conference Proceeding

0

Dataset

0

Journal Article

10

Workshop Report

0

Availability
Full Text / Resource Available

23

Citation Only

0

Save Results
Excel (limit 2000)
CSV (limit 5000)
XML (limit 5000)

Have feedback or suggestions for a way to improve these results?
!

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Algorithmic stability of heavy-tailed stochastic gradient descent on least squares

Raj, Anant ; Barsbey, Melih ; Gurbuzbalaban, Mert ; Zhu, Lingjiong ; Simsekli, Umut ( January 2023 , International Conference on Algorithmic Learning Theory)

Full Text Available
Global convergence of stochastic gradient Hamiltonian Monte Carlo for non-convex stochastic optimization: Non-asymptotic performance bounds and momentum-based acceleration

Gao, Xuefeng ; Gurbuzbalaban, Mert ; Zhu, Lingjiong ( January 2022 , Operations research)

Full Text Available
Boundary Conditions for Linear Exit Time Gradient Trajectories Around Saddle Points: Analysis and Algorithm

https://doi.org/10.1109/TIT.2022.3213607

Dixit, Rishabh ; Gurbuzbalaban, Mert ; Bajwa, Waheed U. ( January 2022 , IEEE Transactions on Information Theory)

Full Text Available
Robust Distributed Accelerated Stochastic Gradient Methods for Multi-Agent Networks

Fallah, Alireza ; Gurbuzbalaban, Mert ; Ozdaglar, Asuman ; Simsekli, Umut ; Zhu Lingjiong ( January 2022 , Journal of machine learning research)
Jain, Prateek (Ed.)
Full Text Available
Robust Distributed Accelerated Stochastic Gradient Methods for Multi-Agent Networks

Fallah, Alireza ; Gurbuzbalaban, Mert ; Ozdaglar, Asuman ; Simsekli, Umut ; Zhu Lingjiong ( January 2022 , Journal of machine learning research)

Full Text Available
Robust distributed accelerated stochastic gradient methods for multi-agent networks

Fallah, Alireza ; Gurbuzbalaban, Mert ; Ozdaglar, Asuman ; Simsekli, Umut ; Zhu, Lingjiong ( January 2022 , Journal of machine learning research)

Full Text Available
L-DQN: An Asynchronous Limited-Memory Distributed Quasi-Newton Method

https://doi.org/10.1109/CDC45484.2021.9682985

Can, Bugra ; Soori, Saeed ; Dehnavi, Maryam Mehri ; Gurbuzbalaban, Mert ( December 2021 , 2021 60th IEEE Conference on Decision and Control (CDC))

Full Text Available
HyLo: a hybrid low-rank natural gradient descent method

Mu, Baorun ; Soori, Saeed ; Can, Bugra ; Gurbuzbalaban, Mert ; Dehnavi, Maryam Mehri ( January 2022 , Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis)

This work presents a Hybrid Low-Rank Natural Gradient Descent method, called HyLo, that accelerates the training time of deep neural networks. Natural gradient descent (NGD) requires computing the inverse of the Fisher information matrix (FIM), which is typically expensive at large-scale. Kronecker factorization methods such as KFAC attempt to improve NGD's running time by approximating the FIM with Kronecker factors. However, the size of Kronecker factors increases quadratically as the model size grows. Instead, in HyLo, we use the Sherman-Morrison-Woodbury variant of NGD (SNGD) and propose a reformulation of SNGD to resolve its scalability issues. HyLo uses a computationally-efficient low-rank factorization to achieve superior timing for Fisher inverses. We evaluate HyLo on large models including ResNet-50, U-Net, and ResNet-32 on up to 64 GPUs. HyLo converges 1.4×-2.1× faster than the state-of-the-art distributed implementation of KFAC and reduces the computation and communication time up to 350× and 10.7× on ResNet-50.
more » « less
Full Text Available
Randomized Gossiping with Effective Resistance Weights: Performance Guarantees and Applications

https://doi.org/10.1109/TCNS.2022.3161201

Can, Bugra ; Gurbuzbalaban, Mert ; Aybat, Necdet Serhat ; Soori, Saeed ; Mehri Dehnavi, Maryam ( January 2022 , IEEE Transactions on Control of Network Systems)

Full Text Available
The Heavy-Tail Phenomenon in SGD

Gurbuzbalaban, Mert ; Simsekli, Umut ; Zhu, Lingjiong ( January 2021 , Proceedings of Machine Learning Research)

Full Text Available

« Prev Next »