Bag of little bootstraps for massive and distributed longitudinal data

Zhou, Xinkai; Zhou, Jin J.; Zhou, Hua (ORCID: 0000-0003-1320-7118)

doi:10.1002/sam.11563

Abstract: Linear mixed models are widely used for analyzing longitudinal datasets, and inference for variance component parameters relies on the bootstrap method. However, health systems and technology companies routinely generate massive longitudinal datasets that make the traditional bootstrap method infeasible. To solve this problem, we extend the highly scalable bag of little bootstraps method for independent data to longitudinal data and develop a highly efficient Julia package, MixedModelsBLB.jl. Simulation experiments and real data analysis demonstrate the favorable statistical performance and computational advantages of our method compared to the traditional bootstrap method. For the statistical inference of variance components, it achieves a 200-fold speedup on the scale of 1 million subjects (20 million total observations), and is the only currently available tool that can handle more than 10 million subjects (200 million total observations) using desktop computers.

Award ID(s): 2054253, 2205441

PAR ID: 10446128

Author(s) / Creator(s): Zhou, Xinkai; Zhou, Jin J.; Zhou, Hua

Publisher / Repository: Wiley Blackwell (John Wiley & Sons)

Date Published: 2021-11-22

Journal Name: Statistical Analysis and Data Mining: The ASA Data Science Journal

Volume: 15

Issue: 3

ISSN: 1932-1864

Page Range / eLocation ID: p. 314-321

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Journal Article:
https://doi.org/10.1002/sam.11563
