

Search for: All records

Creators/Authors contains: "Wang, Yutong"


  1. Vapnik-Chervonenkis (VC) theory has so far been unable to explain the small generalization error of overparametrized neural networks. Indeed, existing applications of VC theory to large networks obtain upper bounds on VC dimension that are proportional to the number of weights, and for a large class of networks, these upper bounds are known to be tight. In this work, we focus on a subclass of partially quantized networks that we refer to as hyperplane arrangement neural networks (HANNs). Using a sample compression analysis, we show that HANNs can have VC dimension significantly smaller than the number of weights, while being highly expressive. In particular, empirical risk minimization over HANNs in the overparametrized regime achieves the minimax rate for classification with Lipschitz posterior class probability. We further demonstrate the expressivity of HANNs empirically. On a panel of 121 UCI datasets, overparametrized HANNs match the performance of state-of-the-art full-precision models.
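To make the idea concrete, here is a minimal NumPy sketch of a HANN-style predictor as the abstract describes it: the first layer is quantized to sign activations, so each input is mapped to the cell code of a hyperplane arrangement, and a classifier then acts only on that code. The shapes and the linear head are illustrative assumptions, not the authors' exact architecture.

```python
import numpy as np

# Hypothetical HANN-style forward pass: quantize the first layer to signs,
# then classify the resulting hyperplane-arrangement cell code.

def hann_forward(X, W, b, v, c):
    """X: (n, d) inputs; W, b: k hyperplanes; v, c: linear head on codes."""
    codes = np.sign(X @ W.T + b)      # (n, k) cell code of the arrangement
    scores = codes @ v + c            # classifier sees only the quantized code
    return np.where(scores >= 0, 1, -1)

rng = np.random.default_rng(0)
d, k, n = 2, 16, 200                  # k >> d: overparametrized first layer
W = rng.standard_normal((k, d))       # hyperplane normals
b = rng.standard_normal(k)            # hyperplane offsets
v = rng.standard_normal(k)            # head weights (trained in practice)
X = rng.standard_normal((n, d))
print(hann_forward(X, W, b, v, 0.0)[:10])
```

Because downstream computation sees only the sign pattern, the predictor is constant on each cell of the arrangement, which is the kind of structure a sample compression analysis can exploit.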
  2. Recent research in the theory of overparametrized learning has sought to establish generalization guarantees in the interpolating regime. Such results have been established for a few common classes of methods, but so far not for ensemble methods. We devise an ensemble classification method that simultaneously interpolates the training data and is consistent for a broad class of data distributions. To this end, we define the manifold-Hilbert kernel for data distributed on a Riemannian manifold. We prove that kernel smoothing regression and classification using the manifold-Hilbert kernel are weakly consistent in the setting of Devroye et al. [19]. For the sphere, we show that the manifold-Hilbert kernel can be realized as a weighted random partition kernel, which arises as an infinite ensemble of partition-based classifiers.
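The Hilbert kernel of Devroye et al. weights each training point by an inverse power of its distance to the query, so the weights diverge at the training points and the resulting estimate interpolates the data. Below is a minimal sketch of this kind of kernel smoothing classifier on the unit sphere, using geodesic distance as a stand-in for the Riemannian metric; the paper's exact manifold-Hilbert construction may differ in details.

```python
import numpy as np

# Hilbert-kernel-style smoothing classifier on the unit sphere S^dim.
# The weight 1/d_g^dim mirrors the Hilbert kernel of Devroye et al.;
# geodesic distance d_g(x, x_i) = arccos(<x, x_i>) replaces Euclidean distance.

def hilbert_kernel_classify(x, X_train, y_train, dim):
    cosines = np.clip(X_train @ x, -1.0, 1.0)
    dists = np.arccos(cosines)            # geodesic distances on the sphere
    if np.any(dists == 0.0):              # at a training point: interpolate
        return y_train[np.argmin(dists)]
    weights = 1.0 / dists ** dim          # weight diverges near the query
    return np.sign(weights @ y_train)     # weighted vote, labels in {-1, +1}

rng = np.random.default_rng(0)
X = rng.standard_normal((50, 3))
X /= np.linalg.norm(X, axis=1, keepdims=True)      # project onto the sphere
y = np.where(X[:, 2] >= 0, 1, -1)                  # hemisphere labels
print(hilbert_kernel_classify(X[0], X, y, dim=2))  # equals y[0]: interpolation
```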
  3. Learning from label proportions (LLP) is a weakly supervised classification problem where data points are grouped into bags, and the label proportions within each bag are observed instead of the instance-level labels. The task is to learn a classifier to predict the labels of future individual instances. Prior work on LLP for multi-class data has yet to develop a theoretically grounded algorithm. In this work, we propose an approach to LLP based on a reduction to learning with label noise, using the forward correction (FC) loss of Patrini et al. [30]. We establish an excess risk bound and generalization error analysis for our approach, while also extending the theory of the FC loss, which may be of independent interest. Our approach demonstrates improved empirical performance in deep learning scenarios across multiple datasets and architectures, compared to the leading methods.
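For orientation, here is a minimal sketch of the forward correction loss of Patrini et al. [30] that the reduction builds on: the model's clean-label posterior is pushed forward through a known noise transition matrix T before the log loss is taken. How the LLP reduction constructs T from the observed bag proportions is the paper's contribution and is not reproduced here; the transition matrix below is an assumed example.

```python
import numpy as np

# Forward correction (FC) loss for learning with label noise:
# T[i, j] = P(noisy label j | clean label i) is a known, row-stochastic
# transition matrix; the clean posterior is pushed through T before the
# cross-entropy is evaluated at the observed (noisy) label.

def softmax(z):
    z = z - z.max()                   # numerical stability
    e = np.exp(z)
    return e / e.sum()

def forward_correction_loss(logits, noisy_label, T):
    p_clean = softmax(logits)         # model's estimate of the clean posterior
    p_noisy = T.T @ p_clean           # predicted distribution of noisy labels
    return -np.log(p_noisy[noisy_label])

T = np.array([[0.8, 0.1, 0.1],        # example transition matrix (assumed)
              [0.1, 0.8, 0.1],
              [0.1, 0.1, 0.8]])
print(forward_correction_loss(np.array([2.0, 0.1, -1.0]), noisy_label=0, T=T))
```

Patrini et al. show that, for an invertible T and a proper composite base loss, the FC loss shares its minimizer with the clean-label loss, which is the property a reduction of this kind leverages.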
  4. Recent empirical evidence suggests that the Weston-Watkins support vector machine (WW-SVM) is among the best-performing multiclass extensions of the binary SVM. Current state-of-the-art solvers repeatedly solve a particular subproblem approximately using an iterative strategy. In this work, we propose an algorithm that solves the subproblem exactly using a novel reparametrization of the Weston-Watkins dual problem. For linear WW-SVMs, our solver shows significant speed-up over the state-of-the-art solver when the number of classes is large. Our exact subproblem solver also allows us to prove linear convergence of the overall solver.
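The objective being optimized is the Weston-Watkins multiclass hinge, which sums a margin penalty over every class competing with the true one; the dual reparametrization and exact subproblem solver are the paper's contribution and are not sketched here. A minimal statement of the loss:

```python
import numpy as np

# Weston-Watkins multiclass hinge loss: for class scores s and true class y,
# each competing class j != y pays max(0, 1 + s_j - s_y).

def ww_hinge_loss(scores, y):
    margins = 1.0 + scores - scores[y]   # margin violation against each class
    margins[y] = 0.0                     # the true class pays nothing
    return np.maximum(margins, 0.0).sum()

s = np.array([2.0, 0.5, 1.8])            # example scores (assumed)
print(ww_hinge_loss(s, y=0))             # 0.8: only class 2 violates the margin
```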
  5. Room-temperature skyrmions in magnetic multilayers are considered to be promising candidates for next-generation spintronic devices. Several approaches have been developed to control skyrmions, but they either cause significant heat dissipation or require ultrahigh electric fields near the breakdown threshold. Here, we demonstrate electric-field control of skyrmions through strain-mediated magnetoelectric coupling in ferromagnetic/ferroelectric multiferroic heterostructures. We show non-volatile creation of multiple skyrmions, and reversible deformation and annihilation of a single skyrmion, by performing magnetic force microscopy with in situ electric fields. Strain-induced changes in perpendicular magnetic anisotropy and interfacial Dzyaloshinskii–Moriya interaction strength are characterized experimentally. These experimental results, together with micromagnetic simulations, demonstrate that strain-mediated magnetoelectric coupling (via strain-induced changes in both the perpendicular magnetic anisotropy and the interfacial Dzyaloshinskii–Moriya interaction) is responsible for the observed electric-field control of skyrmions. Our work provides a platform to investigate electric-field control of skyrmions in multiferroic heterostructures and paves the way towards more energy-efficient skyrmion-based spintronics.
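For reference, the two strain-tunable quantities named in the abstract enter the micromagnetic energy density in the standard textbook forms below (assumed here rather than quoted from the paper), with m the unit magnetization, K_u the perpendicular anisotropy constant, and D the interfacial DMI constant; strain shifts K_u and D, which in turn reshapes the skyrmion texture.

```latex
% Standard micromagnetic energy densities (textbook forms, not from the paper):
\begin{align}
  \varepsilon_{\mathrm{PMA}} &= -K_u\, m_z^{2}, \\
  \varepsilon_{\mathrm{DMI}} &= D \left[ m_z \, (\nabla \cdot \mathbf{m})
                                 - (\mathbf{m} \cdot \nabla)\, m_z \right].
\end{align}
```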