NSF PAR Search | NSF Public Access Repository

Rate-Optimal Online Learning for Dynamic Assortment Selection with Positioning

Luo, Yiyun; Sun, Will Wei; Liu, Yufeng (June 2025, Operations Research)

Free, publicly-accessible full text available June 23, 2026
Jointly Modeling and Clustering Tensors in High Dimensions

https://doi.org/10.1287/opre.2021.0635

Cai, Biao; Zhang, Jingfei; Sun, Will Wei (May 2025, Operations Research)

Tackling High-Dimensional Tensor Clustering. In the paper “Jointly Modeling and Clustering Tensors in High Dimensions,” Cai, Zhang, and Sun address the challenge of jointly modeling and clustering tensors by introducing a high-dimensional tensor mixture model with heterogeneous covariances. The proposed mixture model exploits the intrinsic structures of tensor data. The authors develop a computationally efficient high-dimensional expectation conditional maximization (HECM) algorithm and show that the HECM iterates, with an appropriate initialization, converge geometrically to a neighborhood that is within statistical precision of the true parameter. The theoretical analysis is nontrivial because of the dual nonconvexity arising from both the expectation-maximization-type estimation and the nonconvex objective function in the M-step. They also study the convergence rate of the algorithm when the number of clusters is overspecified and when the signal-to-noise ratio diminishes with sample size. The efficacy of the proposed method is demonstrated through numerical experiments and a real-world medical data application.
Free, publicly-accessible full text available May 1, 2026
Contextual Dynamic Pricing with Strategic Buyers

https://doi.org/10.1080/01621459.2024.2370613

Liu, Pangpang; Yang, Zhuoran; Wang, Zhaoran; Sun, Will Wei (April 2025, Journal of the American Statistical Association)

Free, publicly-accessible full text available April 3, 2026
Rate-Optimal Rank Aggregation with Private Pairwise Rankings

https://doi.org/10.1080/01621459.2025.2484843

Xu, Shirong; Sun, Will Wei; Cheng, Guang (April 2025, Journal of the American Statistical Association)

Free, publicly-accessible full text available April 3, 2026
Online Statistical Inference in Decision-Making with Matrix Context

Han, Qiyu; Sun, Will Wei; Zhang, Yichen (March 2025, Annals of Statistics)

Free, publicly-accessible full text available March 31, 2026
Joint and Individual Component Regression

https://doi.org/10.1080/10618600.2023.2284227

Wang, Peiyao; Wang, Haodong; Li, Quefeng; Shen, Dinggang; Liu, Yufeng (July 2024, Journal of Computational and Graphical Statistics)

Full Text Available
Stochastic Low-Rank Tensor Bandits for Multi-Dimensional Online Decision Making

https://doi.org/10.1080/01621459.2024.2311364

Zhou, Jie; Hao, Botao; Wen, Zheng; Zhang, Jingfei; Sun, Will Wei (March 2024, Journal of the American Statistical Association)

Full Text Available
Statistical Significance of Clustering with Multidimensional Scaling

https://doi.org/10.1080/10618600.2023.2219708

Shen, Hui; Bhamidi, Shankar; Liu, Yufeng (January 2024, Journal of Computational and Graphical Statistics)

Clustering is a fundamental tool for exploratory data analysis. One central problem in clustering is deciding if the clusters discovered by clustering methods are reliable as opposed to being artifacts of natural sampling variation. Statistical significance of clustering (SigClust) is a recently developed cluster evaluation tool for high-dimension, low-sample size data. Despite its successful application to many scientific problems, there are cases where the original SigClust may not work well. Furthermore, for specific applications, researchers may not have access to the original data and only have the dissimilarity matrix. In this case, clustering is still a valuable exploratory tool, but the original SigClust is not applicable. To address these issues, we propose a new SigClust method using multidimensional scaling (MDS). The underlying idea behind MDS-based SigClust is that one can achieve low-dimensional representations of the original data via MDS using only the dissimilarity matrix and then apply SigClust on the low-dimensional MDS space. The proposed MDS-based SigClust can circumvent the challenge of parameter estimation of the original method in high-dimensional spaces while keeping the essential clustering structure in the MDS space. Both simulations and real data applications demonstrate that the proposed method works remarkably well for assessing the statistical significance of clustering. Supplementary materials for this article are available online.
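As a rough illustration of the embedding step described in this abstract, the sketch below recovers low-dimensional coordinates from a dissimilarity matrix alone. It assumes classical (Torgerson) MDS, which the abstract does not confirm as the variant used, and it does not implement the SigClust test itself. The function name `classical_mds` and its parameters are hypothetical.

```python
import math


def classical_mds(dissimilarity, n_components=2, n_iter=500):
    """Embed n items from an n x n symmetric dissimilarity matrix.

    Assumption: classical (Torgerson) MDS; eigenpairs are found by
    power iteration with deflation to avoid third-party dependencies.
    """
    n = len(dissimilarity)
    sq = [[dissimilarity[i][j] ** 2 for j in range(n)] for i in range(n)]
    row_mean = [sum(row) / n for row in sq]
    grand_mean = sum(row_mean) / n
    # Double centering of squared dissimilarities: B = -1/2 * J D^2 J.
    b = [[-0.5 * (sq[i][j] - row_mean[i] - row_mean[j] + grand_mean)
          for j in range(n)] for i in range(n)]

    coords = [[0.0] * n_components for _ in range(n)]
    for k in range(n_components):
        # Deterministic, non-constant starting vector, normalized to length 1.
        v = [math.sin(i + k + 1.0) for i in range(n)]
        start_norm = math.sqrt(sum(x * x for x in v))
        v = [x / start_norm for x in v]
        for _ in range(n_iter):
            w = [sum(b[i][j] * v[j] for j in range(n)) for i in range(n)]
            norm = math.sqrt(sum(x * x for x in w))
            if norm < 1e-12:
                break  # nothing left to extract from the deflated matrix
            v = [x / norm for x in w]
        # Rayleigh quotient gives the signed eigenvalue for the unit vector v.
        bv = [sum(b[i][j] * v[j] for j in range(n)) for i in range(n)]
        eigenvalue = sum(v[i] * bv[i] for i in range(n))
        if eigenvalue <= 1e-12:
            break  # remaining directions carry no positive variance
        scale = math.sqrt(eigenvalue)
        for i in range(n):
            coords[i][k] = scale * v[i]
        # Deflate so the next pass finds the next eigenpair.
        for i in range(n):
            for j in range(n):
                b[i][j] -= eigenvalue * v[i] * v[j]
    return coords
```

A cluster-significance test would then run on the returned coordinates rather than on the original data. For dissimilarities that are not Euclidean distances, the centered matrix can have negative eigenvalues, and this sketch simply stops at the first non-positive one.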
Full Text Available
Multi-response Regression for Block-missing Multi-modal Data without Imputation

https://doi.org/10.5705/ss.202021.0170

Wang, Haodong; Li, Quefeng; Liu, Yufeng (January 2024, Statistica Sinica)

Multi-modal data are prevalent in many scientific fields. In this study, we consider the parameter estimation and variable selection for a multi-response regression using block-missing multi-modal data. Our method allows the dimensions of both the responses and the predictors to be large, and the responses to be incomplete and correlated, a common practical problem in high-dimensional settings. Our proposed method uses two steps to make a prediction from a multi-response linear regression model with block-missing multi-modal predictors. In the first step, without imputing missing data, we use all available data to estimate the covariance matrix of the predictors and the cross-covariance matrix between the predictors and the responses. In the second step, we use these matrices and a penalized method to simultaneously estimate the precision matrix of the response vector, given the predictors, and the sparse regression parameter matrix. Lastly, we demonstrate the effectiveness of the proposed method using theoretical studies, simulated examples, and an analysis of a multi-modal imaging data set from the Alzheimer’s Disease Neuroimaging Initiative.
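To make the first step in this abstract concrete, the sketch below estimates a covariance matrix from incomplete rows without imputing anything. It assumes a pairwise-complete (available-case) sample covariance, which is one common reading of "use all available data"; the abstract does not state the exact estimator. The function name `available_case_covariance` and the use of `None` for missing entries are hypothetical choices.

```python
def available_case_covariance(rows):
    """Covariance matrix from rows that may contain None for missing values.

    Assumption: entry (j, k) is the sample covariance over only those rows
    in which both variable j and variable k are observed.
    """
    p = len(rows[0])
    cov = [[None] * p for _ in range(p)]
    for j in range(p):
        for k in range(j, p):
            pairs = [(r[j], r[k]) for r in rows
                     if r[j] is not None and r[k] is not None]
            if len(pairs) < 2:
                continue  # too few jointly observed rows; leave entry as None
            mean_j = sum(a for a, _ in pairs) / len(pairs)
            mean_k = sum(c for _, c in pairs) / len(pairs)
            value = sum((a - mean_j) * (c - mean_k)
                        for a, c in pairs) / (len(pairs) - 1)
            cov[j][k] = cov[k][j] = value
    return cov
```

With block-missing data, whole groups of columns are absent for some rows, so different entries of the matrix are computed from different subsets of rows. A matrix assembled this way is not guaranteed to be positive semidefinite.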
Full Text Available
Online Regularization toward Always-Valid High-Dimensional Dynamic Pricing

https://doi.org/10.1080/01621459.2023.2284979

Wang, Chi-Hua; Wang, Zhanyu; Sun, Will Wei; Cheng, Guang (December 2023, Journal of the American Statistical Association)

Full Text Available
