skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: LIMIT THEOREMS FOR FACTOR MODELS
This paper establishes central limit theorems (CLTs) and proposes how to perform valid inference in factor models. We consider a setting where many counties/regions/assets are observed for many time periods, and when estimation of a global parameter includes aggregation of a cross-section of heterogeneous microparameters estimated separately for each entity. The CLT applies for quantities involving both cross-sectional and time series aggregation, as well as for quadratic forms in time-aggregated errors. This paper studies the conditions when one can consistently estimate the asymptotic variance, and proposes a bootstrap scheme for cases when one cannot. A small simulation study illustrates performance of the asymptotic and bootstrap procedures. The results are useful for making inferences in two-step estimation procedures related to factor models, as well as in other related contexts. Our treatment avoids structural modeling of cross-sectional dependence but imposes time-series independence.  more » « less
Award ID(s):
1757199
PAR ID:
10339705
Author(s) / Creator(s):
;
Date Published:
Journal Name:
Econometric Theory
Volume:
37
Issue:
5
ISSN:
0266-4666
Page Range / eLocation ID:
1034 to 1074
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Information from frequency bands in biomedical time series provides useful summaries of the observed signal. Many existing methods consider summaries of the time series obtained over a few well-known, pre-defined frequency bands of interest. However, there is a dearth of data-driven methods for identifying frequency bands that optimally summarize frequency-domain information in the time series. A new method to identify partition points in the frequency space of a multivariate locally stationary time series is proposed. These partition points signify changes across frequencies in the time-varying behavior of the signal and provide frequency band summary measures that best preserve nonstationary dynamics of the observed series. An $$L_2$$-norm based discrepancy measure that finds differences in the time-varying spectral density matrix is constructed, and its asymptotic properties are derived. New nonparametric bootstrap tests are also provided to identify significant frequency partition points and to identify components and cross-components of the spectral matrix exhibiting changes over frequencies. Finite-sample performance of the proposed method is illustrated via simulations. The proposed method is used to develop optimal frequency band summary measures for characterizing time-varying behavior in resting-state electroencephalography time series, as well as identifying components and cross-components associated with each frequency partition point. 
    more » « less
  2. Estimations and applications of factor models often rely on the crucial condition that the number of latent factors is consistently estimated, which in turn also requires that factors be relatively strong, data are stationary and weakly serially dependent, and the sample size be fairly large, although in practical applications, one or several of these conditions may fail. In these cases, it is difficult to analyze the eigenvectors of the data matrix. To address this issue, we propose simple estimators of the latent factors using cross-sectional projections of the panel data, by weighted averages with predetermined weights. These weights are chosen to diversify away the idiosyncratic components, resulting in “diversified factors.” Because the projections are conducted cross-sectionally, they are robust to serial conditions, easy to analyze and work even for finite length of time series. We formally prove that this procedure is robust to over-estimating the number of factors, and illustrate it in several applications, including post-selection inference, big data forecasts, large covariance estimation, and factor specification tests. We also recommend several choices for the diversified weights. 
    more » « less
  3. Summary This paper develops a functional hybrid factor regression modelling framework to handle the heterogeneity of many large-scale imaging studies, such as the Alzheimer’s disease neuroimaging initiative study. Despite the numerous successes of those imaging studies, such heterogeneity may be caused by the differences in study environment, population, design, protocols or other hidden factors, and it has posed major challenges in integrative analysis of imaging data collected from multicentres or multistudies. We propose both estimation and inference procedures for estimating unknown parameters and detecting unknown factors under our new model. The asymptotic properties of both estimation and inference procedures are systematically investigated. The finite-sample performance of our proposed procedures is assessed by using Monte Carlo simulations and a real data example on hippocampal surface data from the Alzheimer’s disease study. 
    more » « less
  4. Abstract Muscles spanning multiple joints play important functional roles in a wide range of systems across tetrapods; however, their fundamental mechanics are poorly understood, particularly the consequences of anatomical position on mechanical advantage. Snakes provide an excellent study system for advancing this topic. They rely on the axial muscles for many activities, including striking, constriction, defensive displays, and locomotion. Moreover, those muscles span from one or a few vertebrae to over 30, and anatomy varies among muscles and among species. We characterized the anatomy of major epaxial muscles in a size series of corn snakes (Pantherophis guttatus) using diceCT scans, and then took several approaches to calculating contributions of each muscle to force and motion generated during body bending, starting from a highly simplistic model and moving to increasingly complex and realistic models. Only the most realistic model yielded equations that included the consequence of muscle span on torque‐displacement trade‐offs, as well as resolving ambiguities that arose from simpler models. We also tested whether muscle cross‐sectional areas or lever arms (total magnitude or pitch/yaw/roll components) were related to snake mass, longitudinal body region (anterior, middle, posterior), and/or muscle group (semispinalis‐spinalis, multifidus, longissimus dorsi, iliocostalis, and levator costae). Muscle cross‐sectional areas generally scaled with positive allometry, and most lever arms did not depart significantly from geometric similarity (isometry). The levator costae had lower cross‐sectional area than the four epaxial muscles, which did not differ significantly from each other in cross‐sectional area. Lever arm total magnitudes and components differed among muscles. We found some evidence for regional variation, indicating that functional regionalization merits further investigation. Our results contribute to knowledge of snake muscles specifically and multiarticular muscle systems generally, providing a foundation for future comparisons across species and bioinspired multiarticular systems. 
    more » « less
  5. Abstract Observations in various applications are frequently represented as a time series of multidimensional arrays, called tensor time series, preserving the inherent multidimensional structure. In this paper, we present a factor model approach, in a form similar to tensor CANDECOMP/PARAFAC (CP) decomposition, to the analysis of high-dimensional dynamic tensor time series. As the loading vectors are uniquely defined but not necessarily orthogonal, it is significantly different from the existing tensor factor models based on Tucker-type tensor decomposition. The model structure allows for a set of uncorrelated one-dimensional latent dynamic factor processes, making it much more convenient to study the underlying dynamics of the time series. A new high-order projection estimator is proposed for such a factor model, utilizing the special structure and the idea of the higher order orthogonal iteration procedures commonly used in Tucker-type tensor factor model and general tensor CP decomposition procedures. Theoretical investigation provides statistical error bounds for the proposed methods, which shows the significant advantage of utilizing the special model structure. Simulation study is conducted to further demonstrate the finite sample properties of the estimators. Real data application is used to illustrate the model and its interpretations. 
    more » « less