NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Stability of SGD: Tightness Analysis and Improved Bounds

Zhang, Yikai; Zhang, Wenjia; Bald, Sammy; Pingali, Vamsi P.; Chen, Chao; Goswami, Mayank (August 2022, Uncertainty in artificial intelligence)

Stochastic Gradient Descent (SGD) based methods have been widely used for training large-scale machine learning models that also generalize well in practice. Several explanations have been offered for this generalization performance, a prominent one being algorithmic stability Hardt et al [2016]. However, there are no known examples of smooth loss functions for which the analysis can be shown to be tight. Furthermore, apart from properties of the loss function, data distribution has also been shown to be an important factor in generalization performance. This raises the question: is the stability analysis of Hardt et al [2016] tight for smooth functions, and if not, for what kind of loss functions and data distributions can the stability analysis be improved? In this paper we first settle open questions regarding tightness of bounds in the data-independent setting: we show that for general datasets, the existing analysis for convex and strongly-convex loss functions is tight, but it can be improved for non-convex loss functions. Next, we give novel and improved data-dependent bounds: we show stability upper bounds for a large class of convex regularized loss functions, with negligible regularization parameters, and improve existing data-dependent bounds in the non-convex setting. We hope that our results will initiate further efforts to better understand the data-dependent setting under non-convex loss functions, leading to an improved understanding of the generalization abilities of deep networks.
more » « less
Full Text Available
GPU Computation of the Euler Characteristic Curve for Imaging Data

Wang, Fan; Wagner, Hubert; Chen, Chao (June 2022, Proceedings of the annual ACM Symposium on Computational Geometry)

Persistent homology is perhaps the most popular and useful tool offered by topological data analysis, with point-cloud data being the most common setup. Its older cousin, the Euler characteristic curve (ECC) is less expressive, but far easier to compute. It is particularly suitable for analyzing imaging data, and is commonly used in fields ranging from astrophysics to biomedical image analysis. These fields are embracing GPU computations to handle increasingly large datasets. We therefore propose an optimized GPU implementation of ECC computation for 2D and 3D grayscale images. The goal of this paper is twofold. First, we offer a practical tool, illustrating its performance with thorough experimentation, but also explain its inherent shortcomings. Second, this simple algorithm serves as a perfect backdrop for highlighting basic GPU programming techniques that make our implementation so efficient, and some common pitfalls we avoided. This is intended as a step towards a wider usage of GPU programming in computational geometry and topology software. We find this is particularly important as geometric and topological tools are used in conjunction with modern, GPU-accelerated machine learning frameworks.
more » « less
Full Text Available
Predicting COVID-19 Lung Infiltrate Progression on Chest Radiographs Using Spatio-temporal LSTM based Encoder-Decoder Network

Konwer, Aishik; Bae, Joseph; Singh, Gagandeep; Gattu, Rishabh; Ali, Syed; Green, Jeremy; Phatak, Tej; Gupta, Amit; Chen, Chao; Saltz, Joel; et al (July 2021, Medical Imaging with Deep Learning (MIDL))

Full Text Available
TopoTxR: A Topological Biomarker for Predicting Treatment Response in Breast Cancer

https://doi.org/10.1007/978-3-030-78191-0_30

Wang, Fan; Kapse, Saarthak; Liu, Steven; Prasanna, Prateek; Chen, Chao (June 2021, International conference on Information Processing in Medical Imaging (IPMI))

Full Text Available
3D Topology-Preserving Segmentation with Compound Multi-Slice Representation

https://doi.org/10.1109/ISBI48211.2021.9433941

Yang, Jiaqi; Hu, Xiaoling; Chen, Chao; Tsai, Chialing (April 2021, IEEE International Symposium on Biomedical Imaging (ISBI))

Full Text Available
A Topological Filter for Learning with Label Noise

Wu, Pengxiang; Zheng, Songzhu; Goswami, Mayank; Metaxas, Dimitris; Chen, Chao (December 2020, The Thirty-fourth Conference on Neural Information Processing Systems (NeurIPS))

Full Text Available
Synthetic Learning: Learn From Distributed Asynchronized Discriminator GAN Without Sharing Medical Image Data

https://doi.org/10.1109/CVPR42600.2020.01387

Chang, Qi; Qu, Hui; Zhang, Yikai; Sabuncu, Mert; Chen, Chao; Zhang, Tong; Metaxas, Dimitris N. (June 2020, IEEE Conference on Computer Vision and Pattern Recognition (CVPR))
null (Ed.)
Full Text Available
Persistence Enhanced Graph Neural Network

Zhao, Qi; Ye, Ze; Wang, Yusu; Chen, Chao (April 2020, Proceedings of Machine Learning Research)
null (Ed.)
Full Text Available
Curvature Graph Network

Ye, Ze; Liu, Kin Sum; Ma, Tengfei; Gao, Jie; Chen, Chao (April 2020, 8th International Conference on Learning Representations (ICLR))
null (Ed.)
Full Text Available
Learn Distributed GAN with Temporary Discriminators

Qu, Hui; Zhang, Yikai; Chang, Qi; Yan, Zhennan; Chen, Chao; Metaxas, Dimitris N (January 2020, European Conference on Computer Vision (ECCV))
null (Ed.)
Full Text Available

« Prev Next »

Search for: All records