Unsupervised Deep Keyphrase Generation

Shen, Xianjie; Wang, Yinghan; Meng, Rui; Shang, Jingbo

doi:10.1609/aaai.v36i10.21381

Citation Details

Unsupervised Deep Keyphrase Generation

Keyphrase generation aims to summarize long documents with a collection of salient phrases. Deep neural models have demonstrated remarkable success in this task, with the capability of predicting keyphrases that are even absent from a document. However, such abstractiveness is acquired at the expense of a substantial amount of annotated data. In this paper, we present a novel method for keyphrase generation, AutoKeyGen, without the supervision of any annotated doc-keyphrase pairs. Motivated by the observation that an absent keyphrase in a document may appear in other places, in whole or in part, we construct a phrase bank by pooling all phrases extracted from a corpus. With this phrase bank, we assign phrase candidates to new documents by a simple partial matching algorithm, and then we rank these candidates by their relevance to the document from both lexical and semantic perspectives. Moreover, we bootstrap a deep generative model using these top-ranked pseudo keyphrases to produce more absent candidates. Extensive experiments demonstrate that AutoKeyGen outperforms all unsupervised baselines and can even beat a strong supervised method in certain cases. more »

Award ID(s):: 2040727

PAR ID:: 10403505

Author(s) / Creator(s):: Shen, Xianjie; Wang, Yinghan; Meng, Rui; Shang, Jingbo

Date Published:: 2022-06-30

Journal Name:: Proceedings of the AAAI Conference on Artificial Intelligence

Volume:: 36

Issue:: 10

ISSN:: 2159-5399

Page Range / eLocation ID:: 11303 to 11311

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1609/aaai.v36i10.21381

More Like this