NSF PAR Search | NSF Public Access Repository

Better Regularization for Sequential Decision Spaces: Fast Convergence Rates for Nash, Correlated, and Team Equilibria

https://doi.org/10.1287/opre.2021.0633

Farina, Gabriele; Kroer, Christian; Sandholm, Tuomas (September 2025, Operations Research)

The paper studies the application of first-order methods to the problem of computing equilibria of large-scale extensive-form games. It introduces a new weighted entropy-based distance-generating function for instantiating first-order methods. The new function achieves significantly better strong-convexity properties than existing weight schemes for the dilated entropy while maintaining the same easily implemented closed-form proximal mapping as the prior state of the art. The paper then generalizes the new entropy distance function, as well as the whole class of dilated distance functions, to the scaled extension operator. This yields the first efficiently computable distance-generating function for the decision polytopes capturing correlated and team solution concepts for extensive-form games. Instantiating first-order methods with these regularizers yields several new results, such as the first method for computing ex ante correlated team equilibria with a guaranteed 1/T rate of convergence and efficient proximal updates.
Free, publicly-accessible full text available September 1, 2026
Expected Variational Inequalities

Zhang, Brian Hu; Anagnostides, Ioannis; Tewolde, Emanuel; Berker, Ratip Emin; Farina, Gabriele; Conitzer, Vincent; Sandholm, Tuomas (July 2025, ICML25)

Free, publicly-accessible full text available July 13, 2026
Increasing Revenue in Efficient Combinatorial Auctions by Learning to Generate Artificial Competition

https://doi.org/10.1609/aaai.v39i13.33482

Balcan, Maria-Florina; Prasad, Siddharth; Sandholm, Tuomas (April 2025, Proceedings of the AAAI Conference on Artificial Intelligence)

The design of multi-item, multi-bidder auctions involves a delicate balancing act of economic objectives, bidder incentives, and real-world complexities. Efficient auctions, that is, auctions that allocate items to maximize total bidder value, are practically desirable since they promote the most economically beneficial use of resources. Arguably the biggest drawback of efficient auctions, however, is their potential to generate very low revenue. In this work, we show how the auction designer can artificially inject competition into the auction to boost revenue while striving to maintain efficiency. First, we invent a new auction family that enables the auction designer to specify competition in a precise, expressive, and interpretable way. We then introduce a new model of bidder behavior and individual rationality to understand how bidders act when prices are too competitive. Next, under our bidder behavior model, we use our new competitive auction class to derive the globally revenue-optimal efficient auction under two different knowledge models for the auction designer: knowledge of full bidder value distributions and knowledge of bidder value quantiles. Finally, we study a third knowledge model for the auction designer: knowledge of historical bidder valuation data. In this setting we present sample and computationally efficient learning algorithms that find high-revenue probably-efficient competitive auctions from bidder data. Our learning algorithms are instance adaptive and can be run in parallel across bidders, unlike most prior approaches to data-driven auction design.
Free, publicly-accessible full text available April 11, 2026
A Multiagent Path Search Algorithm for Large-Scale Coalition Structure Generation

https://doi.org/10.1609/aaai.v39i22.34498

Taguelmimt, Redha; Aknine, Samir; Boukredera, Djamila; Changder, Narayan; Sandholm, Tuomas (April 2025, Proceedings of the AAAI Conference on Artificial Intelligence)

Coalition structure generation (CSG), i.e., the problem of optimally partitioning a set of agents into coalitions to maximize social welfare, is a fundamental computational problem in multiagent systems. This problem is important for many applications where small run times are necessary, including transportation and disaster response. In this paper, we develop SALDAE, a multiagent path finding algorithm for CSG that operates on a graph of coalition structures. Our algorithm uses a variety of heuristics and strategies to perform and guide the search. It is an anytime algorithm that can handle large problems with hundreds or thousands of agents. We show empirically on nine standard value distributions, including disaster response and electric vehicle allocation benchmarks, that our algorithm rapidly finds high-quality solutions and compares favorably with other state-of-the-art methods.
Free, publicly-accessible full text available April 11, 2026
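To make the CSG problem statement in the abstract above concrete, here is a minimal exhaustive-search sketch. It is not SALDAE and is only feasible for a handful of agents, since the number of partitions grows as the Bell numbers. The characteristic function `value_by_size` and the helper names are hypothetical, chosen only for illustration.

```python
def partitions(agents):
    """Yield every partition of the list `agents` into non-empty coalitions."""
    if not agents:
        yield []
        return
    first, rest = agents[0], agents[1:]
    for part in partitions(rest):
        # Put `first` into each existing coalition in turn...
        for k in range(len(part)):
            yield part[:k] + [[first] + part[k]] + part[k + 1:]
        # ...or let it form a coalition of its own.
        yield [[first]] + part


def welfare(structure, value):
    """Social welfare: the sum of the coalition values in the structure."""
    return sum(value(frozenset(c)) for c in structure)


def best_structure(agents, value):
    """Exhaustively pick a welfare-maximizing coalition structure."""
    return max(partitions(agents), key=lambda cs: welfare(cs, value))


# Hypothetical characteristic function that depends only on coalition size.
value_by_size = {1: 1.0, 2: 3.0, 3: 3.5, 4: 4.0}


def value(coalition):
    return value_by_size[len(coalition)]


agents = [1, 2, 3, 4]
best = best_structure(agents, value)
best_welfare = welfare(best, value)
```

With these illustrative values, two pairs beat every other structure; which of the three tied pairings is returned depends on enumeration order.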
Convergence of log(1/ϵ) for Gradient-Based Algorithms in Zero-Sum Games without the Condition Number: A Smoothed Analysis

Anagnostides, Ioannis; Sandholm, Tuomas (December 2024, NeurIPS24)

Full Text Available
Computing Game Symmetries and Equilibria That Respect Them

Tewolde, Emanuel; Zhang, Brian Hu; Oesterheld, Caspar; Sandholm, Tuomas; Conitzer, Vincent (February 2025, AAAI25)

Free, publicly-accessible full text available February 25, 2026
The Value of Recall in Extensive-Form Games

Berker, Ratip Emin; Tewolde, Emanuel; Anagnostides, Ioannis; Sandholm, Tuomas; Conitzer, Vincent (February 2025, AAAI25)

Free, publicly-accessible full text available February 25, 2026
Efficient Φ-Regret Minimization with Low-Degree Swap Deviations in Extensive-Form Games

Zhang, Brian Hu; Anagnostides, Ioannis; Farina, Gabriele; Sandholm, Tuomas (December 2024, NeurIPS24)

Full Text Available
Exponential Lower Bounds on the Double Oracle Algorithm in Zero-Sum Games

https://doi.org/10.24963/ijcai.2024/336

Zhang, Brian Hu; Sandholm, Tuomas (August 2024, International Joint Conferences on Artificial Intelligence Organization)

The double oracle algorithm is a popular method of solving games because it reduces computing equilibria to computing a series of best responses. However, its theoretical properties are not well understood. In this paper, we provide exponential lower bounds on the performance of the double oracle algorithm in both partially-observable stochastic games (POSGs) and extensive-form games (EFGs). Our results depend on what is assumed about the tiebreaking scheme, that is, which meta-Nash equilibrium or best response is chosen when there are several to pick from. In particular, for EFGs, our lower bounds require adversarial tiebreaking, whereas for POSGs, our lower bounds apply regardless of how ties are broken.
Full Text Available
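As a rough illustration of the loop described in the abstract above (solve a restricted game, add best responses, repeat), here is a minimal double oracle sketch for a zero-sum matrix game. It rests on several assumptions that are not from the paper: the restricted game is solved only approximately by regret-matching self-play, ties are broken toward the lowest index, and the function names and the rock-paper-scissors example are illustrative.

```python
def regret_matching(regrets):
    """Play proportionally to positive regret; uniform if none is positive."""
    positive = [max(r, 0.0) for r in regrets]
    total = sum(positive)
    if total > 0:
        return [p / total for p in positive]
    return [1.0 / len(regrets)] * len(regrets)


def solve_restricted(A, rows, cols, iters=5000):
    """Approximate equilibrium of the subgame on rows x cols (assumed solver)."""
    r_reg, c_reg = [0.0] * len(rows), [0.0] * len(cols)
    x_sum, y_sum = [0.0] * len(rows), [0.0] * len(cols)
    for _ in range(iters):
        x, y = regret_matching(r_reg), regret_matching(c_reg)
        u_row = [sum(A[i][j] * y[b] for b, j in enumerate(cols)) for i in rows]
        u_col = [sum(A[i][j] * x[a] for a, i in enumerate(rows)) for j in cols]
        v = sum(xa * ua for xa, ua in zip(x, u_row))
        r_reg = [r + u - v for r, u in zip(r_reg, u_row)]  # row player maximizes
        c_reg = [r + v - u for r, u in zip(c_reg, u_col)]  # column player minimizes
        x_sum = [s + p for s, p in zip(x_sum, x)]
        y_sum = [s + p for s, p in zip(y_sum, y)]
    return [s / iters for s in x_sum], [s / iters for s in y_sum]


def double_oracle(A, max_rounds=100):
    """Grow restricted strategy sets with best responses until neither set grows."""
    n, m = len(A), len(A[0])
    rows, cols = [0], [0]
    for _ in range(max_rounds):
        x, y = solve_restricted(A, rows, cols)
        # Embed the restricted strategies into the full strategy spaces.
        full_x, full_y = [0.0] * n, [0.0] * m
        for a, i in enumerate(rows):
            full_x[i] = x[a]
        for b, j in enumerate(cols):
            full_y[j] = y[b]
        # Best-response oracles over the full game (lowest index wins ties).
        row_vals = [sum(A[i][j] * full_y[j] for j in range(m)) for i in range(n)]
        col_vals = [sum(A[i][j] * full_x[i] for i in range(n)) for j in range(m)]
        br_row = max(range(n), key=lambda i: row_vals[i])
        br_col = min(range(m), key=lambda j: col_vals[j])
        gap = row_vals[br_row] - col_vals[br_col]  # exploitability of (x, y)
        grew = False
        if br_row not in rows:
            rows.append(br_row)
            grew = True
        if br_col not in cols:
            cols.append(br_col)
            grew = True
        if not grew:
            break
    return full_x, full_y, gap


# Rock-paper-scissors payoffs for the row player.
RPS = [[0, -1, 1], [1, 0, -1], [-1, 1, 0]]
x, y, gap = double_oracle(RPS)
```

The tiebreaking choice inside `max` and `min` is exactly the kind of detail the abstract says the lower bounds depend on; this sketch fixes one arbitrary rule.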
How Much Data Is Sufficient to Learn High-Performing Algorithms?

https://doi.org/10.1145/3676278

Balcan, Maria-Florina; Deblasio, Dan; Dick, Travis; Kingsford, Carl; Sandholm, Tuomas; Vitercik, Ellen (October 2024, Journal of the ACM)

Algorithms often have tunable parameters that impact performance metrics such as runtime and solution quality. For many algorithms used in practice, no parameter settings admit meaningful worst-case bounds, so the parameters are made available for the user to tune. Alternatively, parameters may be tuned implicitly within the proof of a worst-case approximation ratio or runtime bound. Worst-case instances, however, may be rare or nonexistent in practice. A growing body of research has demonstrated that a data-driven approach to parameter tuning can lead to significant improvements in performance. This approach uses a training set of problem instances sampled from an unknown, application-specific distribution and returns a parameter setting with strong average performance on the training set. We provide techniques for deriving generalization guarantees that bound the difference between the algorithm’s average performance over the training set and its expected performance on the unknown distribution. Our results apply no matter how the parameters are tuned, be it via an automated or manual approach. The challenge is that for many types of algorithms, performance is a volatile function of the parameters: slightly perturbing the parameters can cause a large change in behavior. Prior research [e.g., 12, 16, 20, 62] has proved generalization bounds by employing case-by-case analyses of greedy algorithms, clustering algorithms, integer programming algorithms, and selling mechanisms. We streamline these analyses with a general theorem that applies whenever an algorithm’s performance is a piecewise-constant, piecewise-linear, or, more generally, piecewise-structured function of its parameters. Our results, which are tight up to logarithmic factors in the worst case, also imply novel bounds for configuring dynamic programming algorithms from computational biology.
Full Text Available
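The data-driven tuning setup in the abstract above can be sketched as follows. Everything concrete here is an assumption made for illustration: the parameterized greedy knapsack rule (score items by value / weight**rho), the synthetic instance distribution, the parameter grid, and the use of a held-out sample as a stand-in for the unknown distribution. The measured gap is an empirical quantity, not one of the paper's bounds.

```python
import random


def greedy_knapsack(items, capacity, rho):
    """Greedy packing by score value / weight**rho; returns the packed value."""
    order = sorted(items, key=lambda it: it[0] / it[1] ** rho, reverse=True)
    total_value, remaining = 0.0, capacity
    for value, weight in order:
        if weight <= remaining:
            total_value += value
            remaining -= weight
    return total_value


def sample_instance(rng, n_items=12, capacity=20.0):
    """Draw one instance from a hypothetical application-specific distribution."""
    items = [(rng.uniform(1, 10), rng.uniform(1, 10)) for _ in range(n_items)]
    return items, capacity


def average_value(instances, rho):
    """Average performance of the algorithm with parameter rho on a sample."""
    return sum(greedy_knapsack(items, cap, rho) for items, cap in instances) / len(instances)


def tune(training_set, grid):
    """Return the grid parameter with the best average training performance."""
    return max(grid, key=lambda rho: average_value(training_set, rho))


rng = random.Random(0)
train = [sample_instance(rng) for _ in range(200)]
held_out = [sample_instance(rng) for _ in range(200)]  # proxy for the distribution
grid = [k / 10 for k in range(21)]  # rho in {0.0, 0.1, ..., 2.0}

best_rho = tune(train, grid)
train_avg = average_value(train, best_rho)
held_out_avg = average_value(held_out, best_rho)
generalization_gap = abs(train_avg - held_out_avg)
```

Because the greedy ordering changes only when rho crosses a point where two items swap rank, the packed value is a piecewise-constant function of rho on each instance, which is the kind of structure the abstract refers to.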
