Proceedings of the 14th Annual Meeting of the Forum for Information Retrieval Evaluation

Kashyapi, Sumanta; Dietz, Laura

doi:10.1145/3574318

Citation Details

Proceedings of the 14th Annual Meeting of the Forum for Information Retrieval Evaluation

For most queries, the set of relevant documents spans multiple subtopics. Inspired by the neural ranking models and query-specific neural clustering models, we develop Topic-Mono-BERT which performs both tasks jointly. Based on text embeddings of BERT, our model learns a shared embedding that is optimized for both tasks. The clustering hypothesis would suggest that embeddings which place topically similar text in close proximity will also perform better on ranking tasks. Our model is trained with the Wikimarks approach to obtain training signals for relevance and subtopics on the same queries. Our task is to identify overview passages that can be used to construct a succinct answer to the query. Our empirical evaluation on two publicly available passage retrieval datasets suggests that including the clustering supervision in the ranking model leads to about 16% improvement in identifying text passages that summarize different subtopics within a query. more »

Award ID(s):: 1846017

PAR ID:: 10473540

Author(s) / Creator(s):: Kashyapi, Sumanta; Dietz, Laura

Editor(s):: Ganguly, Debasis; Gangopadhyay, Surupendu; Mitra, Mandar; Majumder, Prasenjit

Publisher / Repository:: ACM

Date Published:: 2022-12-09

ISBN:: 9798400700231

Format(s):: Medium: X

Location:: Kolkata India

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Proceeding:
https://doi.org/10.1145/3574318

More Like this