Title: SUMDocS: Surrounding-aware Unsupervised Multi-Document Summarization
Multi-document summarization, which summarizes a set of documents with a small number of phrases or sentences, provides a concise and critical essence of the documents. Existing multi-document summarization methods ignore the fact that there often exist many relevant documents that provide surrounding background knowledge, which can help generate a salient and discriminative summary for a given set of documents. In this paper, we propose a novel method, SUMDocS (Surrounding-aware Unsupervised Multi-Document Summarization), which incorporates rich surrounding (topically related) documents to help improve the quality of extractive summarization without human supervision. Specifically, we propose a joint optimization algorithm to unify global novelty (i.e., category-level frequent and discriminative), local consistency (i.e., locally frequent, co-occurring), and local saliency (i.e., salient from its surroundings) such that the obtained summary captures the characteristics of the target documents. Extensive experiments on news and scientific domains demonstrate the superior performance of our method when the unlabeled surrounding corpus is utilized.
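As a rough illustration of the surrounding-aware idea described above, the sketch below scores candidate sentences by combining three unsupervised signals and keeps the top-ranked ones. It is not the SUMDocS implementation: the TF-IDF features, centroid comparisons, and the linear weights alpha/beta/gamma are assumptions made purely for illustration.

```python
# Illustrative sketch of a surrounding-aware extractive scorer (not the SUMDocS code).
# target_sents and surrounding_sents are lists of sentence strings.
import numpy as np
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

def rank_sentences(target_sents, surrounding_sents, k=3, alpha=1.0, beta=1.0, gamma=1.0):
    vec = TfidfVectorizer(stop_words="english")
    X = vec.fit_transform(target_sents + surrounding_sents)
    X_t, X_s = X[:len(target_sents)], X[len(target_sents):]

    # Local consistency: closeness to the other target sentences (locally frequent terms).
    consistency = cosine_similarity(X_t, np.asarray(X_t.mean(axis=0))).ravel()

    # Local saliency: distance from the surrounding (background) documents.
    saliency = 1.0 - cosine_similarity(X_t, np.asarray(X_s.mean(axis=0))).ravel()

    # Global novelty: a crude stand-in using the average IDF weight of a sentence's terms.
    idf = vec.idf_.reshape(1, -1)
    novelty = np.array([X_t[i].multiply(idf).sum() / max(X_t[i].nnz, 1)
                        for i in range(X_t.shape[0])])
    novelty = novelty / (novelty.max() + 1e-9)

    score = alpha * novelty + beta * consistency + gamma * saliency
    return [target_sents[i] for i in np.argsort(-score)[:k]]
```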
Award ID(s):
1956151 1704532 1741317
PAR ID:
10311073
Author(s) / Creator(s):
Editor(s):
Carlotta Demeniconi, Ian Davidson
Date Published:
Journal Name:
Proceedings of the 2021 SIAM International Conference on Data Mining (SDM 2021)
Volume:
2021
Issue:
1
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Proc. 2023 The Web Conf. (Ed.)
    Summarizing text-rich documents has been long studied in the literature, but most of the existing efforts have been made to summarize a static and predefined multi-document set. With the rapid development of online platforms for generating and distributing text-rich documents, there arises an urgent need for continuously summarizing dynamically evolving multi-document sets where the composition of documents and sets is changing over time. This is especially challenging as the summarization should be not only effective in incorporating relevant, novel, and distinctive information from each concurrent multi-document set, but also efficient in serving online applications. In this work, we propose a new summarization problem, Evolving Multi-Document sets stream Summarization (EMDS), and introduce a novel unsupervised algorithm PDSum with the idea of prototype-driven continuous summarization. PDSum builds a lightweight prototype of each multi-document set and exploits it to adapt to new documents while preserving accumulated knowledge from previous documents. To update the summaries, the most representative sentences for each multi-document set are extracted by measuring their similarities to the prototypes. A thorough evaluation with real multi-document set streams demonstrates that PDSum outperforms state-of-the-art unsupervised multi-document summarization algorithms in EMDS in terms of relevance, novelty, and distinctiveness and is also robust to various evaluation settings.
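A minimal sketch of the prototype idea described in this abstract, assuming a sentence-embedding function is available; the exponential-moving-average update and the class interface are illustrative choices, not the PDSum implementation.

```python
# Illustrative sketch of prototype-driven continuous summarization (not the PDSum code).
# embed(sentence) -> np.ndarray is assumed to exist; the EMA decay is hypothetical.
import numpy as np

class PrototypeSummarizer:
    def __init__(self, dim, decay=0.9):
        self.prototype = np.zeros(dim)   # lightweight per-set prototype
        self.decay = decay
        self.sentences, self.embeddings = [], []

    def update(self, new_sentences, embed):
        vecs = np.stack([embed(s) for s in new_sentences])
        # Blend new evidence into the prototype while preserving accumulated knowledge.
        self.prototype = self.decay * self.prototype + (1 - self.decay) * vecs.mean(axis=0)
        self.sentences += new_sentences
        self.embeddings += list(vecs)

    def summary(self, k=3):
        E = np.stack(self.embeddings)
        # Pick the sentences most similar to the current prototype.
        sims = E @ self.prototype / (
            np.linalg.norm(E, axis=1) * np.linalg.norm(self.prototype) + 1e-9)
        return [self.sentences[i] for i in np.argsort(-sims)[:k]]
```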
  2. Extractive text summarization aims at extracting the most representative sentences from a given document as its summary. To extract a good summary from a long text document, sentence embedding plays an important role. Recent studies have leveraged graph neural networks to capture the inter-sentential relationship (e.g., the discourse graph) to learn contextual sentence embedding. However, those approaches neither consider multiple types of inter-sentential relationships (e.g., semantic similarity & natural connection), nor model intra-sentential relationships (e.g., semantic & syntactic relationships among words). To address these problems, we propose a novel Multiplex Graph Convolutional Network (Multi-GCN) to jointly model different types of relationships among sentences and words. Based on Multi-GCN, we propose a Multiplex Graph Summarization (Multi-GraS) model for extractive text summarization. Finally, we evaluate the proposed models on the CNN/DailyMail benchmark dataset to demonstrate the effectiveness of our method.
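The following is an illustrative multiplex graph-convolution layer over sentence nodes, not the Multi-GCN/Multi-GraS code; the relation-specific linear maps and the mean fusion across relation types are assumptions.

```python
# Illustrative sketch of a multiplex graph convolution over sentence nodes.
import torch
import torch.nn as nn

class MultiplexGCNLayer(nn.Module):
    def __init__(self, in_dim, out_dim, num_relations):
        super().__init__()
        # One weight matrix per relation type (e.g., semantic similarity, discourse link).
        self.weights = nn.ModuleList(
            [nn.Linear(in_dim, out_dim) for _ in range(num_relations)])

    def forward(self, x, adjs):
        # x: (num_sentences, in_dim); adjs: one row-normalized (N, N) adjacency per relation.
        outputs = [torch.relu(adj @ w(x)) for adj, w in zip(adjs, self.weights)]
        return torch.stack(outputs).mean(dim=0)   # fuse the per-relation views
```

Stacking such layers and feeding the resulting sentence embeddings to a scoring head would give a simple extractive selector in this spirit.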
  3. Social media's explosive growth has resulted in a massive influx of electronic documents influencing various facets of daily life. However, the enormous and complex nature of this content makes extracting valuable insights challenging. Long document summarization emerges as a pivotal technique in this context, serving to distill extensive texts into concise and comprehensible summaries. This paper presents a novel three-stage pipeline for effective long document summarization. The proposed approach combines unsupervised and supervised learning techniques, efficiently handling large document sets while requiring minimal computational resources. Our methodology introduces a unique process for forming semantic chunks through spectral dynamic segmentation, effectively reducing redundancy and repetitiveness in the summarization process. Contrary to previous methods, our approach aligns each semantic chunk with the entire summary paragraph, allowing the abstractive summarization model to process documents without truncation and enabling the summarization model to deduce missing information from other chunks. To enhance the summary generation, we utilize a sophisticated rewrite model based on Bidirectional and Auto-Regressive Transformers (BART), rearranging and reformulating summary constructs to improve their fluidity and coherence. Empirical studies conducted on the long documents from the Webis-TLDR-17 dataset demonstrate that our approach significantly enhances the efficiency of abstractive summarization transformers. The contributions of this paper thus offer significant advancements in the field of long document summarization, providing a novel and effective methodology for summarizing extensive texts in the context of social media.
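A minimal sketch of a chunk-then-rewrite pipeline in the spirit of this abstract; simple fixed-size word chunking stands in for spectral dynamic segmentation, and the facebook/bart-large-cnn checkpoint is only an example, not the paper's system.

```python
# Illustrative chunk-then-rewrite sketch (not the paper's pipeline).
from transformers import pipeline

summarizer = pipeline("summarization", model="facebook/bart-large-cnn")

def summarize_long(document, chunk_words=400):
    words = document.split()
    chunks = [" ".join(words[i:i + chunk_words]) for i in range(0, len(words), chunk_words)]
    # Summarize each chunk independently so no single pass needs to truncate the document.
    partials = [summarizer(c, max_length=80, min_length=20, truncation=True)[0]["summary_text"]
                for c in chunks]
    # A second pass over the concatenated partial summaries plays the role of the rewrite model.
    return summarizer(" ".join(partials), max_length=120, min_length=40,
                      truncation=True)[0]["summary_text"]
```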
  4. Long document summarization systems are critical for domains with lengthy and jargon-laden text, yet they present significant challenges to researchers and developers with limited computing resources. Existing solutions mainly focus on efficient attentions or divide-and-conquer strategies. The former reduces theoretical time complexity but is still memory-heavy. The latter methods sacrifice global context, leading to uninformative and incoherent summaries. This work aims to leverage the memory-efficient nature of divide-and-conquer methods while preserving global context. Concretely, our framework AWESOME uses two novel mechanisms: (1) External memory mechanisms track previously encoded document segments and their corresponding summaries, to enhance global document understanding and summary coherence. (2) Global salient content is further identified beforehand to augment each document segment to support its summarization. Extensive experiments on diverse genres of text, including government reports, meeting transcripts, screenplays, scientific papers, and novels, show that AWESOME produces summaries with improved informativeness, faithfulness, and coherence over competitive baselines on longer documents, while having a smaller GPU memory footprint.
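A minimal sketch of divide-and-conquer summarization with an external memory of earlier segment summaries, in the spirit of the two mechanisms described above; the summarize_segment callable, the salient-sentence list, and the memory budget are placeholders, not the AWESOME implementation.

```python
# Illustrative divide-and-conquer summarization with a running memory (not the AWESOME code).
def summarize_with_memory(segments, summarize_segment, salient_sentences, memory_budget=3):
    memory, outputs = [], []
    for seg in segments:
        # Augment each segment with globally salient content plus the most recent
        # segment summaries so every local pass still sees global context.
        context = " ".join(salient_sentences + memory[-memory_budget:])
        outputs.append(summarize_segment(context + " " + seg))
        memory.append(outputs[-1])   # track what has already been summarized
    return " ".join(outputs)
```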
  5.
    Query Biased Summarization (QBS) aims to produce a summary of the documents retrieved against a query to reduce the human effort of inspecting their full-text content. Typical summarization approaches extract a document text snippet that has term overlap with the query and show it to the searcher. While snippets show relevant information in a document, to the best of our knowledge, there does not exist a summarization system that shows which relevant concepts are missing from a document. Our study focuses on reducing user effort in finding relevant documents by exposing users to omitted relevant information. To this end, we use a classical approach, DSPApprox, to find terms or phrases relevant to a query. Then we identify which terms or phrases are missing from a document, present them in a search interface, and ask crowd workers to judge document relevance based on snippets and missing information. Experimental results show both benefits and limitations of this approach.
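A minimal sketch of surfacing query-relevant terms that a document omits; a simple TF-IDF ranking over the retrieved documents stands in for DSPApprox, and the function names and parameters are illustrative assumptions.

```python
# Illustrative sketch: report query-relevant terms that are absent from a document.
from sklearn.feature_extraction.text import TfidfVectorizer

def missing_relevant_terms(query, retrieved_docs, target_doc, top_k=10):
    vec = TfidfVectorizer(stop_words="english", ngram_range=(1, 2))
    vec.fit(retrieved_docs)
    # Crude relevance ranking: TF-IDF weight of each term in the query plus retrieved text.
    scores = vec.transform([" ".join(retrieved_docs) + " " + query]).toarray()[0]
    terms = vec.get_feature_names_out()
    relevant = [terms[i] for i in scores.argsort()[::-1][:top_k]]
    doc_lower = target_doc.lower()
    # Terms judged relevant to the query but not mentioned in this document.
    return [t for t in relevant if t not in doc_lower]
```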