An Explore-then-Commit Algorithm for Submodular Maximization Under Full-bandit Feedback

Guanyu Nie, Mridul Agarwal

Citation Details

We investigate the problem of combinatorial multi-armed bandits with stochastic submodular (in expectation) rewards and full-bandit feedback, where no extra information other than the reward of selected action at each time step is observed. We propose a simple algorithm, Explore-Then-Commit Greedy (ETCG) and prove that it achieves a -regret upper bound of for a horizon , number of base elements , and cardinality constraint . We also show in experiments with synthetic and real-world data that the ETCG empirically outperforms other full-bandit methods. more »

Award ID(s):: 2149588

PAR ID:: 10400787

Author(s) / Creator(s):: Guanyu Nie, Mridul Agarwal

Date Published:: 2022-01-01

Journal Name:: Uncertainty in artificial intelligence

Volume:: 180

ISSN:: 1525-3384

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this