RE2: Region-Aware Relation Extraction from Visually Rich Documents

Ramu, Pritika; Wang, Sijia; Mouatadid, Lalla; Rimchala, Joy; Huang, Lifu

doi:10.18653/v1/2024.naacl-long.484

Citation Details

RE2: Region-Aware Relation Extraction from Visually Rich Documents

Current research in form understanding predominantly relies on large pre-trained language models, necessitating extensive data for pre-training. However, the importance of layout structure (i.e., the spatial relationship between the entity blocks in the visually rich document) to relation extraction has been overlooked. In this paper, we propose REgion-Aware Relation Extraction (\bf{RE^2}) that leverages region-level spatial structure among the entity blocks to improve their relation prediction. We design an edge-aware graph attention network to learn the interaction between entities while considering their spatial relationship defined by their region-level representations. We also introduce a constraint objective to regularize the model towards consistency with the inherent constraints of the relation extraction task. To support the research on relation extraction from visually rich documents and demonstrate the generalizability of \bf{RE^2}, we build a new benchmark dataset, DiverseForm, that covers a wide range of domains. Extensive experiments on DiverseForm and several public benchmark datasets demonstrate significant superiority and transferability of \bf{RE^2} across various domains and languages, with up to 18.88% absolute F-score gain over all high-performing baselines more »

Award ID(s):: 2238940

PAR ID:: 10527696

Author(s) / Creator(s):: Ramu, Pritika; Wang, Sijia; Mouatadid, Lalla; Rimchala, Joy; Huang, Lifu

Publisher / Repository:: Association for Computational Linguistics

Date Published:: 2024-06-15

ISBN:: 979-8-89176-114-8

Page Range / eLocation ID:: 8731 to 8747

Format(s):: Medium: X

Location:: Mexico City, Mexico

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.18653/v1/2024.naacl-long.484

More Like this