NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

DualHSIC: HSIC-Bottleneck and Alignment for Continual Learning

Wang, Zifeng; Zhan, Zheng; Gong, Yifan; Shao, Yucai; Ioannidis, Stratis; Wang, Yanzhi; Dy, Jennifer (July 2023, International Conference on Machine Learning (ICML))

Full Text Available
Graph transfer learning

https://doi.org/10.1007/s10115-022-01782-6

Gritsenko, Andrey; Shayestehfard, Kimia; Guo, Yuan; Moharrer, Armin; Dy, Jennifer; Ioannidis, Stratis (December 2022, Knowledge and Information Systems)

Full Text Available
Pruning Adversarially Robust Neural Networks without Adversarial Examples

https://doi.org/10.1109/ICDM54844.2022.00120

Jian, Tong; Wang, Zifeng; Wang, Yanzhi; Dy, Jennifer; Ioannidis, Stratis (November 2022, International Conference on Data Mining (ICDM))

Full Text Available
SparCL: Sparse Continual Learning on the Edge

Wang, Zifeng; Zhan, Zheng; Gong, Yifan; Yuan, Geng; Niu, Wei; Jian, Tong; Ren, Bin; Ioannidis, Stratis; Wang, Yanzhi; Dy, Jennifer (December 2022, 2022 Conference on Neural Information Processing Systems)

Existing work in continual learning (CL) focuses on mitigating catastrophic forgetting, i.e., model performance deterioration on past tasks when learning a new task. However, the training efficiency of a CL system is under-investigated, which limits the real-world application of CL systems under resource-limited scenarios. In this work, we propose a novel framework called Sparse Continual Learning(SparCL), which is the first study that leverages sparsity to enable cost-effective continual learning on edge devices. SparCL achieves both training acceleration and accuracy preservation through the synergy of three aspects: weight sparsity, data efficiency, and gradient sparsity. Specifically, we propose task-aware dynamic masking (TDM) to learn a sparse network throughout the entire CL process, dynamic data removal (DDR) to remove less informative training data, and dynamic gradient masking (DGM) to sparsify the gradient updates. Each of them not only improves efficiency, but also further mitigates catastrophic forgetting. SparCL consistently improves the training efficiency of existing state-of-the-art (SOTA) CL methods by at most 23X less training FLOPs, and, surprisingly, further improves the SOTA accuracy by at most 1.7%. SparCL also outperforms competitive baselines obtained from adapting SOTA sparse training methods to the CL setting in both efficiency and accuracy. We also evaluate the effectiveness of SparCL on a real mobile phone, further indicating the practical potential of our method.
more » « less
Full Text Available
SparCL: Sparse Continual Learning on the Edge

Wang, Zifeng; Zhan, Zheng; Gong, Yifan; Yuan, Geng; Niu, Wei; Jian, Tong; Ren, Bin; Ioannidis, Stratis; Wang, Yanzhi; Dy, Jennifer (November 2022, Neural Information Processing Systems (NeurIPS))

Full Text Available
Deep Learning on Multimodal Sensor Data at the Wireless Edge for Vehicular Network

https://doi.org/10.1109/TVT.2022.3170733

Salehi, Batool; Reus-Muns, Guillem; Roy, Debashri; Wang, Zifeng; Jian, Tong; Dy, Jennifer; Ioannidis, Stratis; Chowdhury, Kaushik (July 2022, IEEE Transactions on Vehicular Technology)

Full Text Available
Explainable Deep Learning for Insights in El Nino and River Flows

https://doi.org/10.48550/arXiv.2201.02596

Liu, Yumin; Duffy, Kate; Dy, Jennifer G.; Ganguly, Auroop R. (January 2022, ArXivorg)

The El Nino Southern Oscillation (ENSO) is a semi-periodic fluctuation in sea surface temperature (SST) over the tropical central and eastern Pacific Ocean that influences interannual variability in regional hydrology across the world through long-range dependence or teleconnections. Recent research has demonstrated the value of Deep Learning (DL) methods for improving ENSO prediction as well as Complex Networks (CN) for understanding teleconnections. However, gaps in predictive understanding of ENSO-driven river flows include the black box nature of DL, the use of simple ENSO indices to describe a complex phenomenon and translating DL-based ENSO predictions to river flow predictions. Here we show that eXplainable DL (XDL) methods, based on saliency maps, can extract interpretable predictive information contained in global SST and discover novel SST information regions and dependence structures relevant for river flows which, in tandem with climate network constructions, enable improved predictive understanding. Our results reveal additional information content in global SST beyond ENSO indices, develop new understanding of how SSTs influence river flows, and generate improved river flow predictions with uncertainties. Observations, reanalysis data, and earth system model simulations are used to demonstrate the value of the XDL-CN based methods for future interannual and decadal scale climate projections.
more » « less
Full Text Available
Deep Learning on Visual and Location Data for V2I mmWave Beamforming

https://doi.org/10.1109/MSN53354.2021.00087

Reus-Muns, Guillem; Salehi, Batool; Roy, Debashri; Jian, Tong; Wang, Zifeng; Dy, Jennifer; Ioannidis, Stratis; Chowdhury, Kaushik (December 2021, International Conference on Mobility, Sensing and Networking (MSN))

Full Text Available
A Computational Neural Model for Mapping Degenerate Neural Architectures

https://doi.org/10.1007/s12021-022-09580-9

Khan, Zulqarnain; Wang, Yiyu; Sennesh, Eli; Dy, Jennifer; Ostadabbas, Sarah; van de Meent, Jan-Willem; Hutchinson, J. Benjamin; Satpute, Ajay B. (March 2022, Neuroinformatics)

Abstract Degeneracy in biological systems refers to a many-to-one mapping between physical structures and their functional (including psychological) outcomes. Despite the ubiquity of the phenomenon, traditional analytical tools for modeling degeneracy in neuroscience are extremely limited. In this study, we generated synthetic datasets to describe three situations of degeneracy in fMRI data to demonstrate the limitations of the current univariate approach. We describe a novel computational approach for the analysis referred to as neural topographic factor analysis (NTFA). NTFA is designed to capture variations in neural activity across task conditions and participants. The advantage of this discovery-oriented approach is to reveal whether and how experimental trials and participants cluster into task conditions and participant groups. We applied NTFA on simulated data, revealing the appropriate degeneracy assumption in all three situations and demonstrating NTFA’s utility in uncovering degeneracy. Lastly, we discussed the importance of testing degeneracy in fMRI data and the implications of applying NTFA to do so.
more » « less
Revisiting Hilbert-Schmidt Information Bottleneck for Adversarial Robustness

Wang, Zifeng; Jian, Tong; Masoomi, Aria; Ioannidis, Stratis; Dy, Jennifer (January 2021, Advances in neural information processing systems)

Full Text Available

« Prev Next »

Search for: All records