Search for: All records

Creators/Authors contains: "Valiant, Gregory"

« Prev Next »

Total Resources

37

Resource Type
Conference Paper

35

Conference Proceeding

0

Dataset

0

Journal Article

2

Workshop Report

0

Availability
Full Text / Resource Available

37

Citation Only

0

Save Results
Excel (limit 2000)
CSV (limit 5000)
XML (limit 5000)

Have feedback or suggestions for a way to improve these results?
!

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Online Pen Testing

Qiao, Mingda ; Valiant, Gregory ( January 2023 , Leibniz international proceedings in informatics)

Full Text Available
One-sided Matrix Completion from Two Observations Per Row

Cao, Steven ; Liang, Percy ; Valiant, Gregory ( January 2023 , ICML)

Full Text Available
Efficient Convex Optimization Requires Superlinear Memory

Marsden, Annie ; Sharan, Vatsal ; Sidford, Aaron ; Valiant, Gregory ( January 2022 , Conference on Learning Theory (COLT))

We show that any memory-constrained, first-order algorithm which minimizes d-dimensional, 1-Lipschitz convex functions over the unit ball to 1/ poly(d) accuracy using at most d^(1.25-delta) bits of memory must make at least d^(1+ 4 delta / 3) first-order queries (for any constant delta in (0,1/4). Consequently, the performance of such memory-constrained algorithms are a polynomial factor worse than the optimal O(d polylog d) query bound for this problem obtained by cutting plane methods that use >d^2 memory. This resolves one of the open problems in the COLT 2019 open problem publication of Woodworth and Srebro.
more » « less
Full Text Available
Efficient Convex Optimization Requires Superlinear Memory

Marsden, Annie ; Sharan, Vatsal ; Sidford, Aaron ; Valiant, Gregory ( January 2022 , Conference on Learning Theory (COLT))

Full Text Available
What Can Transformers Learn In-Context? A Case Study of Simple Function Classes

Garg, Shivam ; Tsipras, Dimitris ; Liang, Percy ; Valiant, Gregory ( January 2022 , Advances in Neural Information Processing Systems 35 (NeurIPS 2022))

Full Text Available
Big-Step-Little-Step: Gradient Methods for Objectives with Multiple Scales

Kelner, Jonathan ; Marsden, Annie ; Sharan, Vatsal ; Sidford, Aaron ; Valiant, Gregory ; Yuan, Honglin ( January 2022 , Conference on Learning Theory (COLT))

We provide new gradient-based methods for efficiently solving a broad class of ill-conditioned optimization problems. We consider the problem of minimizing a function f : R d --> R which is implicitly decomposable as the sum of m unknown non-interacting smooth, strongly convex functions and provide a method which solves this problem with a number of gradient evaluations that scales (up to logarithmic factors) as the product of the square-root of the condition numbers of the components. This complexity bound (which we prove is nearly optimal) can improve almost exponentially on that of accelerated gradient methods, which grow as the square root of the condition number of f. Additionally, we provide efficient methods for solving stochastic, quadratic variants of this multiscale optimization problem. Rather than learn the decomposition of f (which would be prohibitively expensive), our methods apply a clean recursive “Big-Step-Little-Step” interleaving of standard methods. The resulting algorithms use O˜(dm) space, are numerically stable, and open the door to a more fine-grained understanding of the complexity of convex optimization beyond condition number.
more » « less
Full Text Available
Big-Step-Little-Step: Efficient Gradient Methods for Objectives with Multiple Scales

Kelner, Jonathan ; Marsden, Annie ; Sharan, Vatsal ; Sidford, Aaron ; Valiant, Gregory ; Yuan, Honglin ( January 2022 , Conference on Learning Theory)

Full Text Available
Big-Step-Little-Step: Efficient Gradient Methods for Objectives with Multiple Scales

Kelner, Jonathan A. ; Marsden, Annie ; Sharan, Vatsal ; Sidford, Aaron ; Valiant, Gregory ; Yuan, Honglin ( January 2022 , Conference on Learning Theory (COLT))

Full Text Available
Exponential Weights Algorithms for Selective Learning

Qiao, Mingda ; Valiant, Gregory ( January 2021 , Conference on Learning Theory (COLT))
null (Ed.)
Full Text Available
Stronger Calibration Lower Bounds via Sidestepping

Qiao, Mingda ; Valiant, Gregory ( January 2021 , Proceedings of the Annual ACM Symposium on Theory of Computing)
null (Ed.)
We consider an online binary prediction setting where a forecaster observes a sequence of T bits one by one. Before each bit is revealed, the forecaster predicts the probability that the bit is 1. The forecaster is called well-calibrated if for each p in [0,1], among the n_p bits for which the forecaster predicts probability p, the actual number of ones, m_p, is indeed equal to p*n_p. The calibration error, defined as \sum_p |m_p - p n_p|, quantifies the extent to which the forecaster deviates from being well-calibrated. It has long been known that an O(T^(2/3)) calibration error is achievable even when the bits are chosen adversarially, and possibly based on the previous predictions. However, little is known on the lower bound side, except an sqrt(T) bound that follows from the trivial example of independent fair coin flips. In this paper, we prove an T^(0.528) bound on the calibration error, which is the first bound above the trivial sqrt(T) lowerbound for this setting. The technical contributions of our work include two lower bound techniques, early stopping and sidestepping, which circumvent the obstacles that have previously hindered strong calibration lower bounds. We also propose an abstraction of the prediction setting, termed the Sign-Preservation game, which may be of independent interest. This game has a much smaller state space than the full prediction setting and allows simpler analyses. The T^0.528 lower bound follows from a general reduction theorem that translates lower bounds on the game value of Sign-Preservation into lower bounds on the calibration error.
more » « less
Full Text Available

« Prev Next »