NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Obtaining personalized predictions from a randomized controlled trial on Alzheimer’s disease

https://doi.org/10.1038/s41598-024-84687-4

Shen, Dennis; Agarwal, Anish; Misra, Vishal; Schelter, Bjoern; Shah, Devavrat; Shiells, Helen; Wischik, Claude (December 2025, Scientific Reports)

Free, publicly-accessible full text available December 1, 2026
Same Root Different Leaves: Time Series and Cross‐Sectional Methods in Panel Data

https://doi.org/10.3982/ECTA21248

Shen, Dennis; Ding, Peng; Sekhon, Jasjeet; Yu, Bin (January 2023, Econometrica)

One dominant approach to evaluate the causal effect of a treatment is through panel data analysis, whereby the behaviors of multiple units are observed over time. The information across time and units motivates two general approaches: (i) horizontal regression (i.e., unconfoundedness), which exploits time series patterns, and (ii) vertical regression (e.g., synthetic controls), which exploits cross‐sectional patterns. Conventional wisdom often considers the two approaches to be different. We establish this position to be partly false for estimation but generally true for inference. In the absence of any assumptions, we show that both approaches yield algebraically equivalent point estimates for several standard estimators. However, the source of randomness assumed by each approach leads to a distinct estimand and quantification of uncertainty even for the same point estimate. This emphasizes that researchers should carefully consider where the randomness stems from in their data, as it has direct implications for the accuracy of inference.
more » « less
Full Text Available
Same Root Different Leaves: Time Series and Cross-Sectional Methods in Panel Data

Shen, Dennis; Ding, Peng; Sekhon, Jasjeet; Yu, Bin (October 2022, arXivorg)

Full Text Available
mRSC: Multidimensional Robust Synthetic Control

https://doi.org/10.1145/3309697.3331507

Amjad, Muhammad Jehangir; Misra, Vishal; Shah, Devavrat; Shen, Dennis (July 2019, ACM SIGMETRICS)

We propose an algorithm to impute and forecast a time series by transforming the observed time series into a matrix, utilizing matrix estimation to recover missing values and de-noise observed entries, and performing linear regression to make predictions. At the core of our analysis is a representation result, which states that for a large class of models, the transformed time series matrix is (approximately) low-rank. In effect, this generalizes the widely used Singular Spectrum Analysis (SSA) in the time series literature, and allows us to establish a rigorous link between time series analysis and matrix estimation. The key to establishing this link is constructing a Page matrix with non-overlapping entries rather than a Hankel matrix as is commonly done in the literature (e.g., SSA). This particular matrix structure allows us to provide finite sample analysis for imputation and prediction, and prove the asymptotic consistency of our method. Another salient feature of our algorithm is that it is model agnostic with respect to both the underlying time dynamics and the noise distribution in the observations. The noise agnostic property of our approach allows us to recover the latent states when only given access to noisy and partial observations a la a Hidden Markov Model; e.g., recovering the time-varying parameter of a Poisson process without knowing that the underlying process is Poisson. Furthermore, since our forecasting algorithm requires regression with noisy features, our approach suggests a matrix estimation based method-coupled with a novel, non-standard matrix estimation error metric-to solve the error-in-variable regression problem, which could be of interest in its own right. Through synthetic and real-world datasets, we demonstrate that our algorithm outperforms standard software packages (including R libraries) in the presence of missing data as well as high levels of noise.
more » « less
Full Text Available
Model Agnostic Time Series Analysis via Matrix Estimation

https://doi.org/10.1145/3287319

Agarwal, Anish; Amjad, Muhammad Jehangir; Shah, Devavrat; Shen, Dennis (December 2018, Proceedings of the ACM on Measurement and Analysis of Computing Systems)

Full Text Available
Model Agnostic Time Series Analysis via Matrix Estimation

Agarwal, Anish; Amjad, Muhammad Jehangir; Shah, Devavrat; Shen, Dennis (December 2018, ACM SIGMETRICS performance evaluation review)

We propose an algorithm to impute and forecast a time series by transforming the observed time series into a matrix, utilizing matrix estimation to recover missing values and de-noise observed entries, and performing linear regression to make predictions. At the core of our analysis is a representation result, which states that for a large class of models, the transformed time series matrix is (approximately) low-rank. In effect, this generalizes the widely used Singular Spectrum Analysis (SSA) in the time series literature, and allows us to establish a rigorous link between time series analysis and matrix estimation. The key to establishing this link is constructing a Page matrix with non-overlapping entries rather than a Hankel matrix as is commonly done in the literature (e.g., SSA). This particular matrix structure allows us to provide finite sample analysis for imputation and prediction, and prove the asymptotic consistency of our method. Another salient feature of our algorithm is that it is model agnostic with respect to both the underlying time dynamics and the noise distribution in the observations. The noise agnostic property of our approach allows us to recover the latent states when only given access to noisy and partial observations a la a Hidden Markov Model; e.g., recovering the time-varying parameter of a Poisson process without knowing that the underlying process is Poisson. Furthermore, since our forecasting algorithm requires regression with noisy features, our approach suggests a matrix estimation based method—coupled with a novel, non-standard matrix estimation error metric—to solve the error-in-variable regression problem, which could be of interest in its own right. Through synthetic and real-world datasets, we demonstrate that our algorithm outperforms standard software packages (including R libraries) in the presence of missing data as well as high levels of noise.
more » « less
Full Text Available

Search for: All records