NSF PAR Search | NSF Public Access Repository


modSaRa: a computationally efficient R package for CNV identification

https://doi.org/10.1093/bioinformatics/btx212

Xiao, Feifei; Niu, Yue; Hao, Ning; Xu, Yanxun; Jin, Zhilin; Zhang, Heping (August 2017, Bioinformatics)
Hancock, John (Ed.)
Abstract. Summary: Chromosomal copy number variation (CNV) refers to a polymorphism that a DNA segment presents deletion or duplication in the population. The computational algorithms developed to identify this type of variation are usually of high computational complexity. Here we present a user-friendly R package, modSaRa, designed to perform copy number variants identification. The package is developed based on a change-point based method with optimal computational complexity and desirable accuracy. The current version of modSaRa package is a comprehensive tool with integration of preprocessing steps and main CNV calling steps. Availability and Implementation: modSaRa is an R package written in R, C++ and Rcpp and is now freely available for download at http://c2s2.yale.edu/software/modSaRa. Supplementary information: Supplementary data are available at Bioinformatics online.
Full Text Available
Sparsifying the Fisher Linear Discriminant by Rotation

https://doi.org/10.1111/rssb.12092

Hao, Ning; Dong, Bin; Fan, Jianqing (November 2014, Journal of the Royal Statistical Society Series B: Statistical Methodology)

Summary: Many high dimensional classification techniques have been proposed in the literature based on sparse linear discriminant analysis. To use them efficiently, sparsity of linear classifiers is a prerequisite. However, this might not be readily available in many applications, and rotations of data are required to create the sparsity needed. We propose a family of rotations to create the sparsity required. The basic idea is to use the principal components of the sample covariance matrix of the pooled samples and its variants to rotate the data first and then to apply an existing high dimensional classifier. This rotate-and-solve procedure can be combined with any existing classifiers and is robust against the level of sparsity of the true model. We show that these rotations do create the sparsity that is needed for high dimensional classifications and we provide theoretical understanding why such a rotation works empirically. The effectiveness of the method proposed is demonstrated by several simulated and real data examples, and the improvements of our method over some popular high dimensional classification rules are clearly shown.
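
The rotate-and-solve idea described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that "pooled samples" means all observations stacked together regardless of class, and it uses a plain diagonal linear discriminant rule as a stand-in for "an existing high dimensional classifier". The function names `pca_rotation`, `fit_diagonal_lda`, and `rotate_and_solve` are hypothetical.

```python
import numpy as np


def pca_rotation(X):
    """Return a p x p orthogonal matrix whose columns are the principal
    directions of the sample covariance of the pooled (unlabeled) rows of X."""
    Xc = X - X.mean(axis=0)
    # Right singular vectors of the centered data are the eigenvectors
    # of its sample covariance matrix.
    _, _, Vt = np.linalg.svd(Xc, full_matrices=True)
    return Vt.T


def fit_diagonal_lda(Z, y):
    """Stand-in classifier (an assumption, not from the paper): a linear
    rule that ignores correlations between coordinates. Labels are 0/1."""
    z0, z1 = Z[y == 0], Z[y == 1]
    mu0, mu1 = z0.mean(axis=0), z1.mean(axis=0)
    # Pooled within-class variance of each coordinate, with a small ridge
    # so that degenerate coordinates do not cause division by zero.
    resid = np.vstack([z0 - mu0, z1 - mu1])
    var = resid.var(axis=0) + 1e-8
    w = (mu1 - mu0) / var
    b = -0.5 * w @ (mu0 + mu1)
    return w, b


def rotate_and_solve(X, y):
    """Rotate the data with the PCA basis, then fit the stand-in classifier
    in the rotated coordinates. Returns the rotation and a predict function."""
    center = X.mean(axis=0)
    R = pca_rotation(X)
    w, b = fit_diagonal_lda((X - center) @ R, y)

    def predict(X_new):
        scores = ((X_new - center) @ R) @ w + b
        return (scores > 0).astype(int)

    return R, predict
```

A call such as `R, predict = rotate_and_solve(X, y)` with an `n x p` array `X` and a 0/1 label vector `y` returns the rotation and a prediction function; any other classifier could be substituted for `fit_diagonal_lda` in the second step.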
Model Selection for High-Dimensional Quadratic Regression via Regularization

https://doi.org/10.1080/01621459.2016.1264956

Hao, Ning; Feng, Yang; Zhang, Hao Helen (April 2018, Journal of the American Statistical Association)

Full Text Available
A New Reduced-Rank Linear Discriminant Analysis Method and Its Applications

https://doi.org/10.5705/ss.202015.0387

Niu, Yue Selena; Hao, Ning; Dong, Bin (January 2018, Statistica Sinica)

Full Text Available
Interaction screening by partial correlation

https://doi.org/10.4310/SII.2018.v11.n2.a9

Niu, Yue Selena; Hao, Ning; Zhang, Hao Helen (January 2018, Statistics and Its Interface)

Full Text Available
A Note on High-Dimensional Linear Regression With Interactions

https://doi.org/10.1080/00031305.2016.1264311

Hao, Ning; Zhang, Hao Helen (October 2017, The American Statistician)

Full Text Available
Oracle P-values and variable screening

https://doi.org/10.1214/17-EJS1284

Hao, Ning; Zhang, Hao Helen (January 2017, Electronic Journal of Statistics)

Full Text Available
Multiple Change-Point Detection: A Selective Overview

https://doi.org/10.1214/16-STS587

Niu, Yue S; Hao, Ning; Zhang, Heping (November 2016, Statistical Science)

Full Text Available
Interaction Screening for Ultrahigh-Dimensional Data

https://doi.org/10.1080/01621459.2014.881741

Hao, Ning; Zhang, Hao Helen (July 2014, Journal of the American Statistical Association)

Full Text Available
