Scalable Exploration for Neural Online Learning to Rank with Perturbed Feedback

Jia, Yiling; Wang, Hongning

doi:10.1145/3477495.3532057

Deep neural networks (DNNs) demonstrate significant advantages in improving ranking performance in retrieval tasks. Driven by recent developments in the optimization and generalization of DNNs, learning a neural ranking model online from its interactions with users has become possible. However, the exploration required for model learning has to be performed in the entire neural network parameter space, which is prohibitively expensive and limits the application of such online solutions in practice. In this work, we propose an efficient exploration strategy for online interactive neural ranker learning based on bootstrapping. Our solution is based on an ensemble of ranking models trained with perturbed user click feedback. The proposed method eliminates explicit confidence set construction and the associated computational overhead, which enables online neural rankers to be trained efficiently in practice with theoretical guarantees. Extensive comparisons with an array of state-of-the-art online learning to rank (OL2R) algorithms on two public learning to rank benchmark datasets demonstrate the effectiveness and computational efficiency of our proposed neural OL2R solution.
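The idea of an ensemble trained on perturbed click feedback can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes linear scorers in place of neural rankers, Gaussian noise as the perturbation, and a "rank with one randomly chosen member" rule for exploration, none of which is specified in the abstract. The function names `train_perturbed_ensemble` and `rank_with_ensemble` and all parameters are hypothetical.

```python
import random


def train_perturbed_ensemble(features, clicks, n_models=5, noise_std=0.5,
                             lr=0.1, epochs=200, seed=0):
    """Fit n_models linear scorers, each on its own noisy copy of the clicks.

    Assumption: Gaussian label noise and squared-error SGD stand in for the
    perturbation scheme and neural training described in the abstract.
    """
    rng = random.Random(seed)
    dim = len(features[0])
    ensemble = []
    for _ in range(n_models):
        # Perturb the click labels independently for this ensemble member.
        noisy = [c + rng.gauss(0.0, noise_std) for c in clicks]
        w = [0.0] * dim
        for _ in range(epochs):
            for x, y in zip(features, noisy):
                pred = sum(wi * xi for wi, xi in zip(w, x))
                err = pred - y
                # One SGD step on the squared error for this example.
                w = [wi - lr * err * xi for wi, xi in zip(w, x)]
        ensemble.append(w)
    return ensemble


def score(w, x):
    """Linear relevance score of one candidate document."""
    return sum(wi * xi for wi, xi in zip(w, x))


def rank_with_ensemble(ensemble, candidates, rng):
    """Rank candidates by the scores of one randomly drawn ensemble member.

    Assumption: drawing a member at random is one simple way to explore;
    no confidence set is constructed.
    """
    w = rng.choice(ensemble)
    return sorted(range(len(candidates)), key=lambda i: -score(w, candidates[i]))
```

In this sketch, disagreement among ensemble members plays the role that an explicit confidence set would otherwise play: where the perturbed models differ, different draws produce different rankings, and where they agree, the ranking is stable.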

Award ID(s): 2128019, 1718216, 1553568

NSF-PAR ID: 10381226

Author(s) / Creator(s): Jia, Yiling; Wang, Hongning

Date Published: 2022-07-06

Journal Name: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval

Page Range / eLocation ID: 533 to 545

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Conference Paper: https://doi.org/10.1145/3477495.3532057
