NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

TELEClass: Taxonomy Enrichment and LLM-Enhanced Hierarchical Text Classification with Minimal Supervision

https://doi.org/10.1145/3696410.3714940

Zhang, Yunyi; Yang, Ruozhen; Xu, Xueqiang; Li, Rui; Xiao, Jinfeng; Shen, Jiaming; Han, Jiawei (April 2025, ACM)

Free, publicly-accessible full text available April 22, 2026
Predicting Text Preference Via Structured Comparative Reasoning

https://doi.org/10.18653/v1/2024.acl-long.541

Yan, Jing Nathan; Liu, Tianqi; Chiu, Justin; Shen, Jiaming; Qin, Zhen; Yu, Yue; Lakshmanan, Charumathi; Kurzion, Yair; Rush, Alexander; Liu, Jialu; et al (January 2024, Association for Computational Linguistics)

Full Text Available
Unsupervised Event Chain Mining from Multiple Documents

https://doi.org/10.1145/3543507.3583295

Jiao, Yizhu; Zhong, Ming; Shen, Jiaming; Zhang, Yunyi; Zhang, Chao; Han, Jiawei (April 2023, ACM)
Proc. 2023 The Web Conf. (Ed.)
Massive and fast-evolving news articles keep emerging on the web. To efectively summarize and provide concise insights into real-world events, we propose a new event knowledge extraction task Event Chain Mining in this paper. Given multiple documents abouta super event, it aims to mine a series of salient events in temporal order. For example, the event chain of super event Mexico Earthquake in 2017 is {earthquake hit Mexico, destroy houses, kill people,block roads}. This task can help readers capture the gist of textsquickly, thereby improving reading efciency and deepening text comprehension. To address this task, we regard an event as a cluster of diferent mentions of similar meanings. In this way, we can identify the diferent expressions of events, enrich their semantic knowledge and replenish relation information among them. Taking events as the basic unit, we present a novel unsupervised framework, EMiner. Specifcally, we extract event mentions from texts and merge them with similar meanings into a cluster as a single event. By jointly incorporating both content and commonsense, essential events are then selected and arranged chronologically to form an event chain. Meanwhile, we annotate a multi-document benchmark to build a comprehensive testbed for the proposed task. Extensive experiments are conducted to verify the efectiveness of EMiner in terms of both automatic and human evaluations.
more » « less
Full Text Available
Corpus-Based Relation Extraction by Identifying and Refining Relation Patterns

https://doi.org/10.1007/978-3-031-43421-1_2

Zhou, Sizhe; Ge, Suyu; Shen, Jiaming; Han, Jiawei (January 2023, Springer Nature Switzerland)

Full Text Available
Towards Understanding Chain-of-Thought Prompting: An Empirical Study of What Matters

https://doi.org/10.18653/v1/2023.acl-long.153

Wang, Boshi; Min, Sewon; Deng, Xiang; Shen, Jiaming; Wu, You; Zettlemoyer, Luke; Sun, Huan (January 2023, Association for Computational Linguistics)

Full Text Available
Unsupervised Key Event Detection from Massive Text Corpora

https://doi.org/10.1145/3534678.3539395

Zhang, Yunyi; Guo, Fang; Shen, Jiaming; Han, Jiawei (August 2022, KDD'22:The 28th {ACM} {SIGKDD} Conference on Knowledge Discovery and Data Mining, August 14-18, 2021)

Automated event detection from news corpora is a crucial task towards mining fast-evolving structured knowledge. As real-world events have different granularities, from the top-level themes to key events and then to event mentions corresponding to concrete actions, there are generally two lines of research: (1) theme detection tries to identify from a news corpus major themes (e.g., “2019 Hong Kong Protests” versus “2020 U.S. Presidential Election”) which have very distinct semantics; and (2) action extraction aims to extract from a single document mention-level actions (e.g., “the police hit the left arm of the protester”) that are often too fine-grained for comprehending the real-world event. In this paper, we propose a new task, key event detection at the intermediate level, which aims to detect from a news corpus key events (e.g., HK Airport Protest on Aug. 12-14), each happening at a particular time/location and focusing on the same topic. This task can bridge event understanding and structuring and is inherently challenging because of (1) the thematic and temporal closeness of different key events and (2) the scarcity of labeled data due to the fast-evolving nature of news articles. To address these challenges, we develop an unsupervised key event detection framework, EvMine, that (1) extracts temporally frequent peak phrases using a novel ttf-itf score, (2) merges peak phrases into event-indicative feature sets by detecting communities from our designed peak phrase graph that captures document cooccurrences, semantic similarities, and temporal closeness signals, and (3) iteratively retrieves documents related to each key event by training a classifier with automatically generated pseudo labels from the event-indicative feature sets and refining the detected key events using the retrieved documents in each iteration. Extensive experiments and case studies show EvMine outperforms all the baseline methods and its ablations on two real-world news corpora.
more » « less
Full Text Available
TaxoCom: Topic Taxonomy Completion with Hierarchical Discovery of Novel Topic Clusters

https://doi.org/10.1145/3485447.3512002

Lee, Dongha; Shen, Jiaming; Kang, Seongku; Yoon, Susik; Han, Jiawei; Yu, Hwanjo (April 2022, ACM)

Full Text Available
Phrase-aware Unsupervised Constituency Parsing

https://doi.org/10.18653/v1/2022.acl-long.444

Gu, Xiaotao; Shen, Yikang; Shen, Jiaming; Shang, Jingbo; Han, Jiawei (January 2022, Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers))

Full Text Available
Eider: Empowering Document-level Relation Extraction with Efficient Evidence Extraction and Inference-stage Fusion

https://doi.org/10.18653/v1/2022.findings-acl.23

Xie, Yiqing; Shen, Jiaming; Li, Sha; Mao, Yuning; Han, Jiawei (January 2022, Association for Computational Linguistics)

Full Text Available
Topic Taxonomy Expansion via Hierarchy-Aware Topic Phrase Generation

https://doi.org/10.18653/v1/2022.findings-emnlp.122

Lee, Dongha; Shen, Jiaming; Lee, Seonghyeon; Yoon, Susik; Yu, Hwanjo; Han, Jiawei (January 2022, Association for Computational Linguistics)

Full Text Available

« Prev Next »

Search for: All records