Search for: All records

Creators/Authors contains: "Shah, Devavrat"

« Prev Next »

Total Resources

18

Resource Type
Conference Paper

8

Conference Proceeding

2

Dataset

0

Journal Article

8

Workshop Report

0

Availability
Full Text / Resource Available

15

Citation Only

3

Save Results
Excel (limit 2000)
CSV (limit 5000)
XML (limit 5000)

Have feedback or suggestions for a way to improve these results?
!

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Counterfactual Identifiability of Bijective Causal Models

Nasr-Esfahany, Arash ; Alizadeh, Mohammad ; Shah, Devavrat ( July 2023 , Proceedings of the 40th International Conference on Machine Learning)

Free, publicly-accessible full text available July 24, 2024
Counterfactual Identifiability of Bijective Causal Models

Nasr-Esfahany, Arash ; Alizadeh, Mohammad ; Shah, Devavrat ( July 2023 , Proceedings of the 40th International Conference on Machine Learning)

Free, publicly-accessible full text available July 24, 2024
Robust Max Entrywise Error Bounds for Tensor Estimation From Sparse Observations via Similarity-Based Collaborative Filtering

https://doi.org/10.1109/TIT.2023.3237231

Shah, Devavrat ; Yu, Christina Lee ( May 2023 , IEEE Transactions on Information Theory)

Free, publicly-accessible full text available May 1, 2024
CausalSim: A Causal Framework for Unbiased Trace-Driven Simulation

Alomar, Abdullah ; Hamadanian, Pouya ; Nasr-Esfahany, Arash ; Agarwal, Anish ; Alizadeh, Mohammad ; Shah, Devavrat ( April 2023 , 20th USENIX Symposium on Networked Systems Design and Implementation (NSDI 23))
CausalSim: A Causal Framework for Unbiased Trace-Driven Simulation

Alomar, Abdullah ; Hamadanian, Pouya ; Nasr-Esfahany, Arash ; Agarwal, Anish ; Alizadeh, Mohammad ; Shah, Devavrat ( April 2023 , 20th USENIX Symposium on Networked Systems Design and Implementation (NSDI 23))

Full Text Available
Nonasymptotic Analysis of Monte Carlo Tree Search

https://doi.org/10.1287/opre.2021.2239

Shah, Devavrat ; Xie, Qiaomin ; Xu, Zhi ( May 2022 , Operations Research)

In this work, we consider the popular tree-based search strategy within the framework of reinforcement learning, the Monte Carlo tree search (MCTS), in the context of the infinite-horizon discounted cost Markov decision process (MDP). Although MCTS is believed to provide an approximate value function for a given state with enough simulations, the claimed proof of this property is incomplete. This is because the variant of MCTS, the upper confidence bound for trees (UCT), analyzed in prior works, uses “logarithmic” bonus term for balancing exploration and exploitation within the tree-based search, following the insights from stochastic multiarm bandit (MAB) literature. In effect, such an approach assumes that the regret of the underlying recursively dependent nonstationary MABs concentrates around their mean exponentially in the number of steps, which is unlikely to hold, even for stationary MABs. As the key contribution of this work, we establish polynomial concentration property of regret for a class of nonstationary MABs. This in turn establishes that the MCTS with appropriate polynomial rather than logarithmic bonus term in UCB has a claimed property. Interestingly enough, empirically successful approaches use a similar polynomial form of MCTS as suggested by our result. Using this as a building block, we argue that MCTS, combined with nearest neighbor supervised learning, acts as a “policy improvement” operator; that is, it iteratively improves value function approximation for all states because of combining with supervised learning, despite evaluating at only finitely many states. In effect, we establish that to learn an ε approximation of the value function with respect to [Formula: see text] norm, MCTS combined with nearest neighbor requires a sample size scaling as [Formula: see text], where d is the dimension of the state space. This is nearly optimal because of a minimax lower bound of [Formula: see text], suggesting the strength of the variant of MCTS we propose here and our resulting analysis.
more » « less
Full Text Available
Time Varying Regression with Hidden Linear Dynamics

Mania, Horia ; Jadbabaie, Ali ; Shah, Devavrat ; Sra, Suvrit ( May 2022 , PMLR)

Full Text Available
Iterative Collaborative Filtering for Sparse Matrix Estimation

https://doi.org/10.1287/opre.2021.2193

Borgs, Christian ; Chayes, Jennifer T. ; Shah, Devavrat ; Yu, Christina Lee ( December 2021 , Operations Research)

We consider sparse matrix estimation where the goal is to estimate an n-by-n matrix from noisy observations of a small subset of its entries. We analyze the estimation error of the popularly used collaborative filtering algorithm for the sparse regime. Specifically, we propose a novel iterative variant of the algorithm, adapted to handle the setting of sparse observations. We establish that as long as the number of entries observed at random scale logarithmically larger than linear in n, the estimation error with respect to the entry-wise max norm decays to zero as n goes to infinity, assuming the underlying matrix of interest has constant rank r. Our result is robust to model misspecification in that if the underlying matrix is approximately rank r, then the estimation error decays to the approximation error with respect to the [Formula: see text]-norm. In the process, we establish the algorithm’s ability to handle arbitrary bounded noise in the observations.
more » « less
Full Text Available
Quantifying Variational Approximation for Log-Partition Function

Cosson, Romain ; Shah, Devavrat ( July 2021 , Annual Conference on Learning Theory)
null (Ed.)
Variational methods, such as mean-field (MF) and tree-reweighted (TRW), provide computationally efficient approximations of the log-partition function for generic graphical models but their approximation ratio is generally not quantified. As the primary contribution of this work, we provide an approach to quantify their approximation ratio for any discrete pairwise graphical model with non-negative potentials through a property of the underlying graph structure G. Specifically, we argue that (a variant of) TRW produces an estimate within factor K(G) which captures how far G is from tree structure. As a consequence, the approximation ratio is 1 for trees. The quantity K(G) is the solution of a min-max problem associated with the spanning tree polytope of G that can be evaluated in polynomial time for any graph. We provide a near linear-time variant that achieves an approximation ratio depending on the minimal (across edges) effective resistance of the graph. We connect our results to the graph partition approximation method and thus provide a unified perspective.
more » « less
Full Text Available
Inferring Sparse Preference Lists from Partial Information

https://doi.org/10.1287/stsy.2019.0060

Farias, Vivek ; Jagabathula, Srikanth ; Shah, Devavrat ( December 2020 , Stochastic Systems)
null (Ed.)
Probability distributions over rankings are crucial for the modeling and design of a wide range of practical systems. In this work, we pursue a nonparametric approach that seeks to learn a distribution over rankings (aka the ranking model) that is consistent with the observed data and has the sparsest possible support (i.e., the smallest number of rankings with nonzero probability mass). We focus on first-order marginal data, which comprise information on the probability that item i is ranked at position j, for all possible item and position pairs. The observed data may be noisy. Finding the sparsest approximation requires brute force search in the worst case. To address this issue, we restrict our search to, what we dub, the signature family, and show that the sparsest model within the signature family can be found computationally efficiently compared with the brute force approach. We then establish that the signature family provides good approximations to popular ranking model classes, such as the multinomial logit and the exponential family classes, with support size that is small relative to the dimension of the observed data. We test our methods on two data sets: the ranked election data set from the American Psychological Association and the preference ordering data on 10 different sushi varieties.
more » « less
Full Text Available

« Prev Next »