Rethink training of BERT rerankers in multi-stage retrieval pipeline

Gao, Luyu; Dai, Zhuyun; Callan, Jamie

doi:10.1007/978-3-030-72240-1_26

Citation Details

Rethink training of BERT rerankers in multi-stage retrieval pipeline

Pre-trained deep language models (LM) have advanced the state-of-the-art of text retrieval. Rerankers fine-tuned from deep LM estimates candidate relevance based on rich contextualized matching signals. Meanwhile, deep LMs can also be leveraged to improve search index, building retrievers with better recall. One would expect a straightforward combination of both in a pipeline to have additive performance gain. In this paper, we discover otherwise and that popular reranker cannot fully exploit the improved retrieval result. We, therefore, propose a Localized Contrastive Estimation (LCE) for training rerankers and demonstrate it significantly improves deep two-stage models (Our codes are open sourced at https://github.com/luyug/Reranker.). more »

Award ID(s):: 1815528

PAR ID:: 10273588

Author(s) / Creator(s):: Gao, Luyu; Dai, Zhuyun; Callan, Jamie

Date Published:: 2021-03-28

Journal Name:: Advances in Information Retrieval – 43rd European Conference on IR Research

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1007/978-3-030-72240-1_26

More Like this