NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

COIL: Revisit exact lexical match in information retrieval with contextualized inverted list

Gao, Luyu; Dai, Zhuyun; Callan, Jamie (June 2021, Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies)
null (Ed.)
Classical information retrieval systems such asBM25 rely on exact lexical match and carryout search efficiently with inverted list index. Recent neural IR models shifts towards soft semantic matching all query document terms,but they lose the computation efficiency of exact match systems.This paper presents COIL, a contextualized exact match retrieval architecture that brings semantic lexical matching. COIL scoring is based on overlapping query document tokens’ contextualized representations. The new architecture stores contextualized token representations in inverted lists, bringing together the efficiency of exact match and the representation power of deep language models. Our experimental results show COIL outperforms classical lexical retrievers and state-of-the-art deep LM retrievers with similar or smaller latency.
more » « less
Full Text Available
PGT: Pseudo relevance feedback using a graph-based transforme

Yu, HongChien; Dai, Zhuyun; Callan, Jamie (March 2021, Advances in Information Retrieval – 43rd European Conference on IR Research)
null (Ed.)
Most research on pseudo relevance feedback (PRF) has been done in vector space and probabilistic retrieval models. This paper shows that Transformer-based rerankers can also benefit from the extra context that PRF provides. It presents PGT, a graph-based Transformer that sparsifies attention between graph nodes to enable PRF while avoiding the high computational complexity of most Transformer architectures. Experiments show that PGT improves upon non-PRF Transformer reranker, and it is at least as accurate as Transformer PRF models that use full attention, but with lower computational costs.
more » « less
Full Text Available
Rethink training of BERT rerankers in multi-stage retrieval pipeline

https://doi.org/10.1007/978-3-030-72240-1_26

Gao, Luyu; Dai, Zhuyun; Callan, Jamie (March 2021, Advances in Information Retrieval – 43rd European Conference on IR Research)
null (Ed.)
Pre-trained deep language models (LM) have advanced the state-of-the-art of text retrieval. Rerankers fine-tuned from deep LM estimates candidate relevance based on rich contextualized matching signals. Meanwhile, deep LMs can also be leveraged to improve search index, building retrievers with better recall. One would expect a straightforward combination of both in a pipeline to have additive performance gain. In this paper, we discover otherwise and that popular reranker cannot fully exploit the improved retrieval result. We, therefore, propose a Localized Contrastive Estimation (LCE) for training rerankers and demonstrate it significantly improves deep two-stage models (Our codes are open sourced at https://github.com/luyug/Reranker.).
more » « less
Full Text Available
Modularized transformer-based ranking framework

Gao, Luyu; Dai, Zhuyun (November 2020, Conference on Empirical Methods in Natural Language Processing)
null (Ed.)
Recent innovations in Transformer-based ranking models have advanced the state-of-the-art in information retrieval. However, these Transformers are computationally expensive, and their opaque hidden states make it hard to understand the ranking process. In this work, we modularize the Transformer ranker into separate modules for text representation and interaction. We show how this design enables substantially faster ranking using offline pre-computed representations and light-weight online interactions. The modular design is also easier to interpret and sheds light on the ranking process in Transformer rankers.
more » « less
Full Text Available
Context-aware term weighting for first-stage passage retrieval

https://doi.org/10.1145/3397271.3401204

Dai, Zhuyun; Callan, Jamie (July 2020, Proceedings of the 43nd International ACM SIGIR Conference on Research & Development in Information Retrieval)

Neural networks provide new possibilities to automatically learn complex language patterns and query-document relations. Neural IR models have achieved promising results in learning query-document relevance patterns, but few explorations have been done on understanding the text content of a query or a document. This paper studies leveraging a recently-proposed contextual neural language model, BERT, to provide deeper text understanding for IR.Experimental results demonstrate that the contextual text representations from BERT are more effective than traditional word embeddings. Compared to bag-of-words retrieval models, the contextual language model can better leverage language structures, bringing large improvements on queries written in natural languages. Combining the text understanding ability with search knowledge leads to an enhanced pre-trained BERT model that can benefit related search tasks where training data are limited.
more » « less
Full Text Available
Understanding BERT rankers under distillation

https://doi.org/10.1145/3409256.3409838

Gao, Luyu; Dai, Zhuyun; Callan, Jamie (September 2020, ACM SIGIR International Conference on the Theory of Information Retrieval)
null (Ed.)
Deep language models, such as BERT pre-trained on large corpora,have given a huge performance boost to state-of-the-art information retrieval ranking systems. Knowledge embedded in such models allows them to pick up complex matching signals between passages and queries. However, the high computation cost during inference limits their deployment in real-world search scenarios. In this paper, we study if and how the knowledge for search within BERT can be transferred to a smaller ranker through distillation.Our experiments demonstrate that it is crucial to use a proper distillation procedure, which produces up to nine times speed upwhile preserving the state-of-the-art performance.
more » « less
Full Text Available
Complement lexical retrieval model with semantic residual embeddings

Gao, Luyu; Dai, Zhuyun; Chen, Tongfei; Fan, Zhen; Van Durme, Benjamin; Callan, Jamie (March 2021, Advances in Information Retrieval – 43rd European Conference on IR Research)
null (Ed.)
This paper presents CLEAR, a retrieval model that seeks to complement classical lexical exact-match models such as BM25 with semantic matching signals from a neural embedding matching model. CLEAR explicitly trains the neural embedding to encode language structures and semantics that lexical retrieval fails to capture with a novel residual-based embedding learning method. Empirical evaluations demonstrate the advantages of CLEAR over state-of-the-art retrieval models, and that it can substantially improve the end-to-end accuracy and efficiency of reranking pipelines.
more » « less
Full Text Available
Context-Aware Document Term Weighting for Ad-Hoc Search

https://doi.org/10.1145/3366423.3380258

Dai, Zhuyun; Callan, Jamie (April 2020, Proceedings of The Web Conference 2020)

Bag-of-words document representations play a fundamental role in modern search engines, but their power is limited by the shallow frequency-based term weighting scheme. This paper proposes HDCT, a context-aware document term weighting framework for document indexing and retrieval. It first estimates the semantic importance of a term in the context of each passage. These fine-grained term weights are then aggregated into a document-level bag-of-words representation, which can be stored into a standard inverted index for efficient retrieval. This paper also proposes two approaches that enable training HDCT without relevance labels. Experiments show that an index using HDCT weights significantly improved the retrieval accuracy compared to typical term-frequency and state-of-the-art embedding-based indexes.
more » « less
Full Text Available
Summarizing and exploring tabular data in conversational search

https://doi.org/10.1145/3397271.3401205

Zhang, Shuo; Dai, Zhuyun; Balog, Krisztian; Callan, Jamie (July 2020, Proceedings of the 43nd International ACM SIGIR Conference on Research & Development in Information Retrieval)

Tabular data provide answers to a significant portion of search queries. However, reciting an entire result table is impractical in conversational search systems. We propose to generate natural language summaries as answers to describe the complex information contained in a table. Through crowdsourcing experiments, we build a new conversation-oriented, open-domain table summarization dataset. It includes annotated table summaries, which not only answer questions but also help people explore other information in the table. We utilize this dataset to develop automatic table summarization systems as SOTA baselines. Based on the experimental results, we identify challenges and point out future research directions that this resource will support.
more » « less
Full Text Available
Efficiency Implications of Term Weighting for Passage Retrieval

https://doi.org/10.1145/3397271.3401263

Mackenzie, Joel; Dai, Zhuyun; Gallagher, Luke; Callan, Jamie (July 2020, Proceedings of the 43nd International ACM SIGIR Conference on Research & Development in Information Retrieval)

Language model pre-training has spurred a great deal of attention for tasks involving natural language understanding, and has been successfully applied to many downstream tasks with impressive results. Within information retrieval, many of these solutions are too costly to stand on their own, requiring multi-stage ranking architectures. Recent work has begun to consider how to “backport” salient aspects of these computationally expensive models to previous stages of the retrieval pipeline. One such instance is DeepCT, which uses BERT to re-weight term importance in a given context at the passage level. This process, which is computed offline, results in an augmented inverted index with re-weighted term frequency values. In this work,we conduct an investigation of query processing efficiency over DeepCT indexes. Using a number of candidate generation algorithms, we reveal how term re-weighting can impact query processing latency, and explore how DeepCT can be used as a static index pruning technique to accelerate query processing without harming search effectiveness.
more » « less
Full Text Available

« Prev Next »

Search for: All records