Are All Steps Equally Important? Benchmarking Essentiality Detection in Event Processes

Wang, Haoyu; Zhang, Hongming; Wang, Yueguan; Deng, Yuqian; Chen, Muhao; Roth, Dan

doi:10.18653/v1/2023.emnlp-main.246

Citation Details

Are All Steps Equally Important? Benchmarking Essentiality Detection in Event Processes

Natural language often describes events in different granularities, such that more coarse-grained (goal) events can often be decomposed into fine-grained sequences of (step) events. A critical but overlooked challenge in understanding an event process lies in the fact that the step events are not equally important to the central goal. In this paper, we seek to fill this gap by studying how well current models can understand the essentiality of different step events towards a goal event. As discussed by cognitive studies, such an ability enables the machine to mimic human’s commonsense reasoning about preconditions and necessary efforts of daily-life tasks. Our work contributes with a high-quality corpus of (goal, step) pairs from a community guideline website WikiHow, where the steps are manually annotated with their essentiality w.r.t. the goal. The high IAA indicates that humans have a consistent understanding of the events. Despite evaluating various statistical and massive pre-trained NLU models, we observe that existing SOTA models all perform drastically behind humans, indicating the need for future investigation of this crucial yet challenging task. more »

Award ID(s):: 2105329

NSF-PAR ID:: 10482430

Author(s) / Creator(s):: Wang, Haoyu; Zhang, Hongming; Wang, Yueguan; Deng, Yuqian; Chen, Muhao; Roth, Dan

Publisher / Repository:: Association for Computational Linguistics

Date Published:: 2023-01-01

Journal Name:: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing

Page Range / eLocation ID:: 4048 to 4056

Format(s):: Medium: X

Location:: Singapore

Sponsoring Org:: National Science Foundation

Conference Paper:
https://doi.org/10.18653/v1/2023.emnlp-main.246

More Like this