NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Estimating Individualized Treatment Rules for Ordinal Treatments

https://doi.org/10.1111/biom.12865

Chen, Jingxiang; Fu, Haoda; He, Xuanyao; Kosorok, Michael R.; Liu, Yufeng (March 2018, Biometrics)

Summary Precision medicine is an emerging scientific topic for disease treatment and prevention that takes into account individual patient characteristics. It is an important direction for clinical research, and many statistical methods have been proposed recently. One of the primary goals of precision medicine is to obtain an optimal individual treatment rule (ITR), which can help make decisions on treatment selection according to each patient's specific characteristics. Recently, outcome weighted learning (OWL) has been proposed to estimate such an optimal ITR in a binary treatment setting by maximizing the expected clinical outcome. However, for ordinal treatment settings, such as individualized dose finding, it is unclear how to use OWL. In this article, we propose a new technique for estimating ITR with ordinal treatments. In particular, we propose a data duplication technique with a piecewise convex loss function. We establish Fisher consistency for the resulting estimated ITR under certain conditions, and obtain the convergence and risk bound properties. Simulated examples and an application to a dataset from a type 2 diabetes mellitus observational study demonstrate the highly competitive performance of the proposed method compared to existing alternatives.
more » « less
Statistical Significance for Hierarchical Clustering

https://doi.org/10.1111/biom.12647

Kimes, Patrick K.; Liu, Yufeng; Hayes, David Neil; Marron, James Stephen (January 2017, Biometrics)

Summary Cluster analysis has proved to be an invaluable tool for the exploratory and unsupervised analysis of high-dimensional datasets. Among methods for clustering, hierarchical approaches have enjoyed substantial popularity in genomics and other fields for their ability to simultaneously uncover multiple layers of clustering structure. A critical and challenging question in cluster analysis is whether the identified clusters represent important underlying structure or are artifacts of natural sampling variation. Few approaches have been proposed for addressing this problem in the context of hierarchical clustering, for which the problem is further complicated by the natural tree structure of the partition, and the multiplicity of tests required to parse the layers of nested clusters. In this article, we propose a Monte Carlo based approach for testing statistical significance in hierarchical clustering which addresses these issues. The approach is implemented as a sequential testing procedure guaranteeing control of the family-wise error rate. Theoretical justification is provided for our approach, and its power to detect true clustering structure is illustrated through several simulation studies and applications to two cancer gene expression datasets.
more » « less
Ensemble estimation and variable selection with semiparametric regression models

https://doi.org/10.1093/biomet/asaa012

Shin, Sunyoung; Liu, Yufeng; Cole, Stephen R; Fine, Jason P (April 2020, Biometrika)

Summary We consider scenarios in which the likelihood function for a semiparametric regression model factors into separate components, with an efficient estimator of the regression parameter available for each component. An optimal weighted combination of the component estimators, named an ensemble estimator, may be employed as an overall estimate of the regression parameter, and may be fully efficient under uncorrelatedness conditions. This approach is useful when the full likelihood function may be difficult to maximize, but the components are easy to maximize. It covers settings where the nuisance parameter may be estimated at different rates in the component likelihoods. As a motivating example we consider proportional hazards regression with prospective doubly censored data, in which the likelihood factors into a current status data likelihood and a left-truncated right-censored data likelihood. Variable selection is important in such regression modelling, but the applicability of existing techniques is unclear in the ensemble approach. We propose ensemble variable selection using the least squares approximation technique on the unpenalized ensemble estimator, followed by ensemble re-estimation under the selected model. The resulting estimator has the oracle property such that the set of nonzero parameters is successfully recovered and the semiparametric efficiency bound is achieved for this parameter set. Simulations show that the proposed method performs well relative to alternative approaches. Analysis of an AIDS cohort study illustrates the practical utility of the method.
more » « less
Full Text Available
Robust multicategory support matrix machines

https://doi.org/10.1007/s10107-019-01386-z

Qian, Chengde; Tran-Dinh, Quoc; Fu, Sheng; Zou, Changliang; Liu, Yufeng (July 2019, Mathematical Programming)

Full Text Available
Graph-based sparse linear discriminant analysis for high-dimensional classification

https://doi.org/10.1016/j.jmva.2018.12.007

Liu, Jianyu; Yu, Guan; Liu, Yufeng (May 2019, Journal of Multivariate Analysis)

Full Text Available
Convex Bidirectional Large Margin Classifiers

https://doi.org/10.1080/00401706.2018.1497544

Qi, Zhengling; Liu, Yufeng (April 2019, Technometrics)

Full Text Available
Assessing robustness of classification using an angular breakdown point

https://doi.org/10.1214/17-AOS1661

Zhao, Junlong; Yu, Guan; Liu, Yufeng (December 2018, The Annals of Statistics)

Full Text Available
Adaptively weighted large-margin angle-based classifiers

https://doi.org/10.1016/j.jmva.2018.03.004

Fu, Sheng; Zhang, Sanguo; Liu, Yufeng (July 2018, Journal of Multivariate Analysis)

Full Text Available
Efficient test-based variable selection for high-dimensional linear models

https://doi.org/10.1016/j.jmva.2018.01.003

Gong, Siliang; Zhang, Kai; Liu, Yufeng (July 2018, Journal of Multivariate Analysis)

Full Text Available
Robust multicategory support vector machines using difference convex algorithm

https://doi.org/10.1007/s10107-017-1209-5

Zhang, Chong; Pham, Minh; Fu, Sheng; Liu, Yufeng (May 2018, Mathematical Programming)

Full Text Available

« Prev Next »

Search for: All records