NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Effective subgrouping enhances machine learning prediction in complex materials science phenomena: Inoue's subgrouping in discovering bulk metallic glasses

https://doi.org/10.1016/j.actamat.2023.119590

Liu, Guannan; Sohn, Sungwoo; O'Hern, Corey S; Gilbert, Anna C; Schroers, Jan (February 2024, Acta Materialia)

Full Text Available
May the force be with you

https://doi.org/10.1109/Allerton49937.2022.9929350

Zhang, Yulan; Gilbert, Anna C.; Steinerberger, Stefan (September 2022, 58th Annual Allerton Conference on Communication, Control, and Computing (Allerton))

Full Text Available
Machine learning versus human learning in predicting glass-forming ability of metallic glasses

https://doi.org/10.1016/j.actamat.2022.118497

Liu, Guannan; Sohn, Sungwoo; Kube, Sebastian A.; Raj, Arindam; Mertz, Andrew; Nawano, Aya; Gilbert, Anna; Shattuck, Mark D.; O'Hern, Corey S.; Schroers, Jan (January 2023, Acta Materialia)

Full Text Available
How can classical multidimensional scaling go wrong?

Sonthalia, Rishi; Van Buskirk, Greg; Raichel, Benjamin; Gilbert, Anna (January 2021, Advances in neural information processing systems)

Given a matrix D describing the pairwise dissimilarities of a data set, a common task is to embed the data points into Euclidean space. The classical multidimensional scaling (cMDS) algorithm is a widespread method to do this. However, theoretical analysis of the robustness of the algorithm and an in-depth analysis of its performance on non-Euclidean metrics is lacking. In this paper, we derive a formula, based on the eigenvalues of a matrix obtained from D, for the Frobenius norm of the difference between D and the metric Dcmds returned by cMDS. This error analysis leads us to the conclusion that when the derived matrix has a significant number of negative eigenvalues, then ∥D−Dcmds∥F, after initially decreasing, willeventually increase as we increase the dimension. Hence, counterintuitively, the quality of the embedding degrades as we increase the dimension. We empirically verify that the Frobenius norm increases as we increase the dimension for a variety of non-Euclidean metrics. We also show on several benchmark datasets that this degradation in the embedding results in the classification accuracy of both simple (e.g., 1-nearest neighbor) and complex (e.g., multi-layer neural nets) classifiers decreasing as we increase the embedding dimension.Finally, our analysis leads us to a new efficiently computable algorithm that returns a matrix Dl that is at least as close to the original distances as Dt (the Euclidean metric closest in ℓ2 distance). While Dl is not metric, when given as input to cMDS instead of D, it empirically results in solutions whose distance to D does not increase when we increase the dimension and the classification accuracy degrades less than the cMDS solution.
more » « less
Full Text Available
Sparse Recovery for Orthogonal Polynomial Transforms

https://doi.org/10.4230/LIPIcs.ICALP.2020.58

Gilbert, Anna; Gu, Albert; Re, Christopher; Rudra, Atri; Wootters, Mary (June 2020, Leibniz international proceedings in informatics)
null (Ed.)
In this paper we consider the following sparse recovery problem. We have query access to a vector 𝐱 ∈ ℝ^N such that x̂ = 𝐅 𝐱 is k-sparse (or nearly k-sparse) for some orthogonal transform 𝐅. The goal is to output an approximation (in an 𝓁₂ sense) to x̂ in sublinear time. This problem has been well-studied in the special case that 𝐅 is the Discrete Fourier Transform (DFT), and a long line of work has resulted in sparse Fast Fourier Transforms that run in time O(k ⋅ polylog N). However, for transforms 𝐅 other than the DFT (or closely related transforms like the Discrete Cosine Transform), the question is much less settled. In this paper we give sublinear-time algorithms - running in time poly(k log(N)) - for solving the sparse recovery problem for orthogonal transforms 𝐅 that arise from orthogonal polynomials. More precisely, our algorithm works for any 𝐅 that is an orthogonal polynomial transform derived from Jacobi polynomials. The Jacobi polynomials are a large class of classical orthogonal polynomials (and include Chebyshev and Legendre polynomials as special cases), and show up extensively in applications like numerical analysis and signal processing. One caveat of our work is that we require an assumption on the sparsity structure of the sparse vector, although we note that vectors with random support have this property with high probability. Our approach is to give a very general reduction from the k-sparse sparse recovery problem to the 1-sparse sparse recovery problem that holds for any flat orthogonal polynomial transform; then we solve this one-sparse recovery problem for transforms derived from Jacobi polynomials. Frequently, sparse FFT algorithms are described as implementing such a reduction; however, the technical details of such works are quite specific to the Fourier transform and moreover the actual implementations of these algorithms do not use the 1-sparse algorithm as a black box. In this work we give a reduction that works for a broad class of orthogonal polynomial families, and which uses any 1-sparse recovery algorithm as a black box.
more » « less
Full Text Available
Generalized Metric Repair on Graphs

https://doi.org/10.4230/LIPIcs.SWAT.2020.25

Fan, Chenglin; Gilbert, Anna; Raichel, Benjamin; Sonthalia, Rishi; Van Buskirk, Gregory (January 2020, Scandinavian Symposium and Workshops on Algorithm Theory (SWAT))

Full Text Available

Search for: All records