Title: Structured Reinforcement Learning for Incentivized Stochastic Covert Optimization
This letter studies how a stochastic gradient algorithm (SG) can be controlled to hide the estimate of the local stationary point from an eavesdropper. Such problems are of significant interest in distributed optimization settings like federated learning and inventory management. A learner queries a stochastic oracle and incentivizes the oracle to obtain noisy gradient measurements and perform SG. The oracle probabilistically returns either a noisy gradient of the function or a non-informative measurement, depending on the oracle state and incentive. The learner’s query and incentive are visible to an eavesdropper who wishes to estimate the stationary point. This letter formulates the problem of the learner performing covert optimization by dynamically incentivizing the stochastic oracle and obfuscating the eavesdropper as a finite-horizon Markov decision process (MDP). Using conditions for interval dominance on the cost and transition probability structure, we show that the optimal policy for the MDP has a monotone threshold structure. We propose searching for the optimal stationary policy with the threshold structure using a stochastic approximation algorithm and a multi-armed bandit approach. The effectiveness of our methods is numerically demonstrated on a covert federated learning hate-speech classification task.
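As a rough illustration of the setup in the abstract, the sketch below simulates a learner running stochastic gradient steps on a toy quadratic through an incentivized oracle, with a monotone threshold policy on the oracle state deciding when to pay for an informative learning query versus a cheap obfuscating one. The oracle model, state dynamics, and all parameters are illustrative assumptions, not the paper's formulation.

```python
# Minimal sketch (not the paper's implementation) of covert stochastic gradient
# optimization through an incentivized oracle. All names and dynamics below are
# illustrative assumptions.
import numpy as np

rng = np.random.default_rng(0)

def noisy_grad(x):
    """Noisy gradient of a toy quadratic f(x) = ||x||^2 / 2."""
    return x + 0.1 * rng.standard_normal(x.shape)

def oracle(x, incentive, oracle_state):
    """Return a noisy gradient with probability increasing in the incentive and
    the oracle state; otherwise return a non-informative measurement."""
    p_informative = min(1.0, 0.2 + 0.6 * incentive * oracle_state)
    if rng.random() < p_informative:
        return noisy_grad(x), True
    return rng.standard_normal(x.shape), False   # non-informative measurement

def threshold_policy(oracle_state, remaining_learning_queries, threshold):
    """Monotone threshold policy: pay a high incentive (learning query) only when
    the oracle state exceeds the threshold and learning queries are still needed."""
    if remaining_learning_queries > 0 and oracle_state >= threshold:
        return 1.0   # high incentive: genuine learning query
    return 0.0       # low incentive: obfuscating query

x = rng.standard_normal(5)
needed, step, threshold = 20, 0.1, 0.5
for t in range(100):
    oracle_state = rng.uniform(0.0, 1.0)            # i.i.d. stand-in for a Markov oracle state
    incentive = threshold_policy(oracle_state, needed, threshold)
    g, informative = oracle(x, incentive, oracle_state)
    if incentive > 0 and informative:
        x = x - step * g                            # SG step on an informative query
        needed -= 1
print("final estimate norm:", np.linalg.norm(x))
```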
Award ID(s):
2312198
PAR ID:
10518928
Author(s) / Creator(s):
;
Publisher / Repository:
IEEE
Date Published:
Journal Name:
IEEE Control Systems Letters
Volume:
8
ISSN:
2475-1456
Page Range / eLocation ID:
1295 to 1300
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. A learner aims to minimize a function f by repeatedly querying a distributed oracle that provides noisy gradient evaluations. At the same time, the learner seeks to hide arg min f from a malicious eavesdropper that observes the learner’s queries. This paper considers the problem of covert or learner-private optimization, where the learner has to dynamically choose between learning and obfuscation by exploiting the stochasticity. The problem of controlling the stochastic gradient algorithm for covert optimization is modeled as a Markov decision process, and we show that the dynamic programming operator has a supermodular structure, implying that the optimal policy has a monotone threshold structure. A computationally efficient policy gradient algorithm is proposed to search for the optimal querying policy without knowledge of the transition probabilities. As a practical application, our methods are demonstrated on a hate-speech classification task in a federated setting, where an eavesdropper can use the optimal weights to generate toxic content that is more easily misclassified. Numerical results show that when the learner uses the optimal policy, an eavesdropper can only achieve a validation accuracy of 52% with no information and 69% when it has a public dataset with 10% positive samples, compared to 83% when the learner employs a greedy policy.
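As a sketch of how such a policy search might look without transition-probability knowledge, the snippet below tunes a single threshold parameter by simultaneous perturbation stochastic approximation (SPSA), a standard policy-gradient-style stochastic approximation scheme; the episode simulator, per-step costs, and step-size constants are illustrative assumptions rather than the paper's construction.

```python
# Hedged sketch: SPSA search over a threshold querying policy using only
# simulated episode costs (no transition probabilities). Toy costs only.
import numpy as np

rng = np.random.default_rng(1)

def episode_cost(threshold, horizon=50):
    """Simulate one episode: learning is cheap when the oracle state is good,
    obfuscating queries carry a fixed cost (all numbers are toy choices)."""
    cost = 0.0
    for _ in range(horizon):
        state = rng.uniform()
        learn = state >= threshold
        cost += (1.0 - state) if learn else 0.3
    return cost / horizon

theta = 0.5                                          # threshold parameter
for k in range(200):
    a_k, c_k = 0.05 / (k + 1) ** 0.602, 0.1 / (k + 1) ** 0.101
    delta = rng.choice([-1.0, 1.0])
    g_hat = (episode_cost(theta + c_k * delta) -
             episode_cost(theta - c_k * delta)) / (2 * c_k * delta)
    theta = np.clip(theta - a_k * g_hat, 0.0, 1.0)   # stochastic approximation step
print("estimated threshold:", theta)                 # toy optimum here is 0.7
```

In this stand-in simulator the expected per-step cost is minimized at a threshold of 0.7, so the recursion should drift toward that value.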
  2. Poupart, Pascal (Ed.)
    Incentive design, also known as model design or environment design for Markov decision processes (MDPs), refers to a class of problems in which a leader can incentivize his follower by modifying the follower's reward function, in anticipation that the follower's optimal policy in the resulting MDP can be desirable for the leader's objective. In this work, we propose gradient-ascent algorithms to compute the leader's optimal incentive design, despite the lack of knowledge about the follower's reward function. First, we formulate the incentive design problem as a bi-level optimization problem and demonstrate that, by the softmax temporal consistency between the follower's policy and value function, the bi-level optimization problem can be reduced to a single-level optimization problem, for which a gradient-based algorithm can be developed to optimize the leader's objective. We establish several key properties of incentive design in MDPs and prove the convergence of the proposed gradient-based method. Next, we show that the gradient terms can be estimated from observations of the follower's best-response policy, enabling the use of a stochastic gradient-ascent algorithm to compute a locally optimal incentive design without knowing or learning the follower's reward function. Finally, we analyze the conditions under which an incentive design remains optimal for two different reward functions that are policy invariant. The effectiveness of the proposed algorithm is demonstrated using a small probabilistic transition system and a stochastic gridworld.
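A small self-contained sketch of this pipeline, under assumed toy dynamics: the leader adds an incentive to the follower's reward, the follower's softmax (entropy-regularized) policy is recovered by soft value iteration, and the leader ascends a finite-difference estimate of its objective in place of the paper's analytic gradient. The random MDP, rewards, temperature, and step sizes below are all illustrative assumptions.

```python
# Hedged sketch of incentive design via gradient ascent on the leader's objective.
import numpy as np

nS, nA, gamma, tau = 3, 2, 0.9, 0.5
rng = np.random.default_rng(2)
P = rng.dirichlet(np.ones(nS), size=(nS, nA))        # P[s, a] is a distribution over next states
r_follower = rng.uniform(size=(nS, nA))              # follower's intrinsic reward
r_leader = rng.uniform(size=(nS, nA))                # leader's reward

def soft_policy(r):
    """Softmax-consistent (entropy-regularized) policy via soft value iteration."""
    V = np.zeros(nS)
    for _ in range(500):
        Q = r + gamma * P @ V
        V = tau * np.log(np.exp(Q / tau).sum(axis=1))
    Q = r + gamma * P @ V
    Z = np.exp((Q - Q.max(axis=1, keepdims=True)) / tau)
    return Z / Z.sum(axis=1, keepdims=True)

def leader_value(incentive):
    """Leader's discounted return, net of incentives paid, under the follower's best response."""
    pi = soft_policy(r_follower + incentive)
    P_pi = np.einsum('sap,sa->sp', P, pi)                                   # transition kernel under pi
    occ = np.linalg.solve(np.eye(nS) - gamma * P_pi.T, np.ones(nS) / nS)    # discounted occupancy
    return float(occ @ (pi * (r_leader - incentive)).sum(axis=1))

incentive, eps, lr = np.zeros((nS, nA)), 1e-3, 0.5
for _ in range(100):
    grad = np.zeros_like(incentive)
    for idx in np.ndindex(*incentive.shape):          # finite-difference gradient estimate
        e = np.zeros_like(incentive)
        e[idx] = eps
        grad[idx] = (leader_value(incentive + e) - leader_value(incentive - e)) / (2 * eps)
    incentive += lr * grad                            # gradient ascent on the leader's objective
print("leader value after incentive design:", round(leader_value(incentive), 3))
```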
  3. This paper provides a finite-sample analysis of a passive stochastic gradient Langevin dynamics (PSGLD) algorithm. This algorithm is designed to achieve adaptive inverse reinforcement learning (IRL). Adaptive IRL aims to estimate the cost function of a forward learner performing a stochastic gradient algorithm (e.g., policy gradient reinforcement learning) by observing their estimates in real time. The PSGLD algorithm is considered passive because it incorporates noisy gradients provided by an external stochastic gradient algorithm (the forward learner), over which it has no control. The PSGLD algorithm acts as a randomized sampler to achieve adaptive IRL by reconstructing the forward learner’s cost function nonparametrically from the stationary measure of a Langevin diffusion. This paper analyzes the non-asymptotic (finite-sample) performance: we provide explicit bounds on the 2-Wasserstein distance between the PSGLD algorithm’s sample measure and the stationary measure encoding the cost function, and provide guarantees for a kernel density estimation scheme that reconstructs the cost function from empirical samples. Our analysis uses tools from the study of Markov diffusion operators. The derived bounds have both practical and theoretical significance: they provide finite-time guarantees for an adaptive IRL mechanism and substantially generalize the analytical framework of a line of research in passive stochastic gradient algorithms.
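To illustrate the reconstruction principle only (not the passive mechanism, which reuses gradients generated at another learner's iterates), here is a hedged sketch: Langevin samples concentrate on a stationary measure proportional to exp(-beta * J), so a kernel density estimate of those samples recovers the cost J up to an additive constant. The toy cost, inverse temperature, step size, and bandwidth are illustrative assumptions.

```python
# Hedged sketch of cost reconstruction from the stationary measure of a
# Langevin recursion; gradients here come from a toy cost, whereas the paper's
# algorithm receives them passively from an external forward learner.
import numpy as np

rng = np.random.default_rng(3)
beta, step = 4.0, 0.01                               # inverse temperature and step size

def noisy_grad(y):
    """Noisy gradient of the toy cost J(y) = y^2 / 2."""
    return y + 0.1 * rng.standard_normal()

# Langevin recursion: gradient step plus calibrated Gaussian noise.
y, samples = 0.0, []
for _ in range(50000):
    y = y - step * noisy_grad(y) + np.sqrt(2 * step / beta) * rng.standard_normal()
    samples.append(y)
samples = np.array(samples[5000:])                   # discard burn-in

# Gaussian kernel density estimate on a grid, then J_hat = -log(pi_hat) / beta.
grid, h = np.linspace(-1.5, 1.5, 7), 0.15
pi_hat = np.array([np.mean(np.exp(-0.5 * ((g - samples) / h) ** 2))
                   for g in grid]) / (h * np.sqrt(2 * np.pi))
J_hat = -np.log(pi_hat) / beta
print(np.round(J_hat - J_hat.min(), 3))              # should roughly track grid**2 / 2
```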
  4.
    We consider the framework of non-stationary stochastic optimization (Besbes et al., 2015) with squared error losses and noisy gradient feedback, where the dynamic regret of an online learner against a time-varying comparator sequence is studied. Motivated by the theory of non-parametric regression, we introduce a new variational constraint that enforces the comparator sequence to belong to a discrete k-th order total variation ball of radius C_n. This variational constraint models comparators that have piecewise polynomial structure, which has many relevant practical applications (Tibshirani, 2014). By establishing connections to the theory of wavelet-based non-parametric regression, we design a polynomial-time algorithm that achieves the nearly optimal dynamic regret of Õ(n^{1/(2k+3)} C_n^{2/(2k+3)}). The proposed policy is adaptive to the unknown radius C_n. Further, we show that the same policy is minimax optimal for several other non-parametric families of interest.
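As a concrete reading of the variational constraint, the snippet below computes the discrete k-th order total variation of a comparator sequence via repeated differencing; the n^k scaling follows the trend-filtering convention and is an assumption, since normalizations differ across papers.

```python
# Hedged sketch: discrete k-th order total variation of a comparator sequence.
import numpy as np

def total_variation(theta, k):
    """Discrete k-th order TV: n^k times the l1 norm of the (k+1)-th order differences."""
    n = len(theta)
    return n ** k * np.abs(np.diff(theta, n=k + 1)).sum()

t = np.linspace(0, 1, 100)
piecewise_linear = np.where(t < 0.5, t, 1.0 - t)     # ramp up, then ramp down (one kink)
print(total_variation(piecewise_linear, k=0))         # order-0 TV: total movement of the sequence
print(total_variation(piecewise_linear, k=1))         # order-1 TV: contributions only at the kink
```

A sequence with small order-k total variation is well approximated by a piecewise degree-k polynomial, which is exactly the comparator structure the regret bound adapts to.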
  5. We give nearly matching upper and lower bounds on the oracle complexity of finding ϵ-stationary points (∥∇F(x)∥ ≤ ϵ) in stochastic convex optimization. We jointly analyze the oracle complexity in both the local stochastic oracle model and the global oracle (or, statistical learning) model. This allows us to decompose the complexity of finding near-stationary points into optimization complexity and sample complexity, and reveals some surprising differences between the complexity of stochastic optimization versus learning. Notably, we show that in the global oracle/statistical learning model, only logarithmic dependence on smoothness is required to find a near-stationary point, whereas polynomial dependence on smoothness is necessary in the local stochastic oracle model. In other words, the separation in complexity between the two models can be exponential, and the folklore understanding that smoothness is required to find stationary points is only weakly true for statistical learning. Our upper bounds are based on extensions of a recent “recursive regularization” technique proposed by Allen-Zhu (2018). We show how to extend the technique to achieve near-optimal rates, and in particular show how to leverage the extra information available in the global oracle model. Our algorithm for the global model can be implemented efficiently through finite sum methods, and suggests an interesting new computational-statistical tradeoff.
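As a minimal illustration of the local stochastic first-order oracle model, assuming a toy quadratic F and Gaussian oracle noise, the sketch below certifies an ϵ-stationary point by averaging repeated oracle calls to estimate ∥∇F(x)∥; the test function, noise level, and sample size are illustrative assumptions and not tied to the paper's constructions.

```python
# Hedged sketch: testing epsilon-stationarity with a local stochastic oracle.
import numpy as np

rng = np.random.default_rng(5)
sigma = 0.5                                          # oracle noise level (assumed)

def stochastic_grad(x):
    """Local stochastic oracle for F(x) = ||x||^2 / 2: unbiased, bounded variance."""
    return x + sigma * rng.standard_normal(x.shape)

def is_eps_stationary(x, eps, n_calls=2000):
    """Average n_calls oracle outputs and test whether ||mean gradient|| <= eps."""
    g_bar = np.mean([stochastic_grad(x) for _ in range(n_calls)], axis=0)
    return np.linalg.norm(g_bar) <= eps

print(is_eps_stationary(np.full(3, 0.01), eps=0.1))  # near the minimizer: True with high probability
print(is_eps_stationary(np.ones(3), eps=0.1))        # far from the minimizer: False
```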