-
This paper proposes an optimization-based method to learn the singular value decomposition (SVD) of a compact operator with ordered singular functions. The proposed objective function is based on Schmidt’s low-rank approximation theorem (1907), which characterizes a truncated SVD as the solution that minimizes the mean squared approximation error, together with a technique called nesting that learns the ordered structure. When the optimization space is parameterized by neural networks, we refer to the proposed method as NeuralSVD. Unlike existing approaches, the implementation does not require sophisticated optimization tricks.
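To make the idea concrete, here is a minimal, hypothetical sketch (not the authors' code) of a nested low-rank objective in PyTorch. It assumes the operator is available through kernel evaluations k(x, y) for sampled pairs, uses small MLPs for the singular functions, and implements nesting simply as a sum of Schmidt-style squared-error losses over all truncation levels; the paper's exact parameterization and nesting scheme may differ.

```python
# Hypothetical sketch of a nested low-rank (Schmidt-style) objective.
import torch
import torch.nn as nn

L = 4  # number of singular components to learn (assumed)

# Small MLPs standing in for the left/right singular functions (1-D inputs assumed).
f_net = nn.Sequential(nn.Linear(1, 64), nn.ReLU(), nn.Linear(64, L))
g_net = nn.Sequential(nn.Linear(1, 64), nn.ReLU(), nn.Linear(64, L))

def nested_lowrank_loss(x, y, kernel_vals):
    """Sum of squared errors over truncation levels 1..L.

    For each prefix length l, the rank-l approximation
    sum_{i<=l} f_i(x) * g_i(y) is pushed toward k(x, y); summing over all
    prefixes encourages the components to emerge in order.
    """
    f, g = f_net(x), g_net(y)                       # (batch, L) each
    partial = torch.cumsum(f * g, dim=1)            # rank-1 .. rank-L partial sums
    residual = kernel_vals.unsqueeze(1) - partial   # broadcast to (batch, L)
    return (residual ** 2).mean()

# Usage: minimize nested_lowrank_loss over minibatches (x, y, k(x, y))
# with a standard optimizer such as torch.optim.Adam.
```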
-
With the unprecedented performance achieved by deep learning, it is commonly believed that deep neural networks (DNNs) attempt to extract informative features for learning tasks. To formalize this intuition, we apply local information geometric analysis and establish an information-theoretic framework for feature selection, which demonstrates the information-theoretic optimality of DNN features. Moreover, we conduct a quantitative analysis to characterize the impact of network structure on the feature extraction process of DNNs. Our investigation naturally leads to a performance metric for evaluating the effectiveness of extracted features, called the H-score, which illustrates the connection between the practical training process of DNNs and the information-theoretic framework. Finally, we validate our theoretical results with experiments on synthetic data and the ImageNet dataset.
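As an illustration of the metric, the sketch below estimates one commonly used form of the H-score from a sample of features and labels, H(f) = tr(Cov(f(X))^{-1} Cov(E[f(X) | Y])); the constant factors and normalization conventions in the paper may differ, so treat this as an assumed form rather than the paper's exact definition.

```python
# Hypothetical empirical H-score estimator (assumed formula, see note above).
import numpy as np

def h_score(features, labels):
    """Empirical H-score of a feature matrix `features` (n, d) against `labels` (n,)."""
    f = features - features.mean(axis=0, keepdims=True)          # zero-mean features
    cov_f = np.atleast_2d(np.cov(f, rowvar=False))                # Cov(f(X)), (d, d)

    # Covariance of the class-conditional means E[f(X) | Y], weighted by P(Y).
    classes, counts = np.unique(labels, return_counts=True)
    probs = counts / counts.sum()
    cond_means = np.stack([f[labels == c].mean(axis=0) for c in classes])  # (k, d)
    cov_cond = (cond_means * probs[:, None]).T @ cond_means                # (d, d)

    return float(np.trace(np.linalg.pinv(cov_f) @ cov_cond))
```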
-
One primary focus in multimodal feature extraction is to find representations of individual modalities that are maximally correlated. As a well-known measure of dependence, the Hirschfeld-Gebelein-Rényi (HGR) maximal correlation becomes an appealing objective because of its operational meaning and desirable properties. However, the strict whitening constraints formalized in the HGR maximal correlation limit its application. To address this problem, this paper proposes Soft-HGR, a novel framework for extracting informative features from multiple data modalities. Specifically, our framework avoids the “hard” whitening constraints while preserving the same feature geometry as the HGR maximal correlation. The objective of Soft-HGR is straightforward, involving only two inner products, which guarantees efficiency and stability in optimization. We further generalize the framework to handle more than two modalities and missing modalities. When labels are partially available, we enhance the discriminative power of the feature representations through a semi-supervised adaptation. Empirical evaluation shows that our approach learns more informative feature mappings and is more efficient to optimize.
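For reference, here is a minimal sketch of a two-modality Soft-HGR-style objective, assuming it takes the form E[f(X)^T g(Y)] - (1/2) tr(Cov(f(X)) Cov(g(Y))) over zero-mean features; the batch shapes and the exact estimator are assumptions, not the authors' implementation.

```python
# Hypothetical two-inner-product (Soft-HGR-style) objective for two modalities.
import torch

def soft_hgr_objective(fx, gy):
    """Feature batches fx, gy of shape (n, d): a cross-correlation term
    minus a covariance-product penalty. Maximize this (or minimize its negative)."""
    fx = fx - fx.mean(dim=0, keepdim=True)    # center modality-X features
    gy = gy - gy.mean(dim=0, keepdim=True)    # center modality-Y features
    n = fx.shape[0]

    cross = (fx * gy).sum(dim=1).mean()                  # ~ E[f(X)^T g(Y)]
    cov_f = fx.T @ fx / (n - 1)                          # ~ Cov(f(X))
    cov_g = gy.T @ gy / (n - 1)                          # ~ Cov(g(Y))
    return cross - 0.5 * torch.trace(cov_f @ cov_g)
```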