NSF PAR Search | NSF Public Access Repository


Uniform Meaning Representation Parsing as a Pipelined Approach

Chun, Jayeol; Xue, Nianwen (August 2024, ACL Anthology)

Full Text Available
Chinese UMR annotation: Can LLMs help?

Sun, Haibo; Xue, Nianwen; Zhao, Jin; Yue, Liulu; Sun, Yao; Xue, Keer; Wu, Jiawei (May 2024, ELRA and ICCL)
Bonial, Claire; Bonn, Julia; Hwang, Jena D (Ed.)
We explore using LLMs, GPT-4 specifically, to generate draft sentence-level Chinese Uniform Meaning Representations (UMRs) that human annotators can revise to speed up the UMR annotation process. In this study, we use few-shot learning and Think-Aloud prompting to guide GPT-4 to generate sentence-level graphs of UMR. Our experimental results show that compared with annotating UMRs from scratch, using LLMs as a preprocessing step reduces the annotation time by two thirds on average. This indicates that there is great potential for integrating LLMs into the pipeline for complicated semantic annotation tasks.
Full Text Available
Anchor and Broadcast: An Efficient Concept Alignment Approach for Evaluation of Semantic Graphs

Sun, Haibo; Xue, Nianwen (May 2024, ELRA and ICCL)
Calzolari, Nicoletta; Kan, Min-Yen; Hoste, Veronique; Lenci, Alessandro; Sakti, Sakriani; Xue, Nianwen (Ed.)
In this paper, we present AnCast, an intuitive and efficient tool for evaluating graph-based meaning representations (MR). AnCast implements evaluation metrics that are well understood in the NLP community, including concept F1, unlabeled relation F1, labeled relation F1, and weighted relation F1. The efficiency of the tool comes from a novel anchor broadcast alignment algorithm that is not subject to the pitfalls of local maxima. We show through experimental results that the AnCast score is highly correlated with the widely used Smatch score, but its computation takes only about 40% of the time.
Full Text Available
Building a Broad Infrastructure for Uniform Meaning Representations

Bonn, Julia; Buchholz, Matthew J; Chun, Jayeol; Cowell, Andrew; Croft, William; Denk, Lukas; Ge, Sijia; Hajič, Jan; Lai, Kenneth; Martin, James H; et al (May 2024, ELRA and ICCL)
Calzolari, Nicoletta; Kan, Min-Yen; Hoste, Veronique; Lenci, Alessandro; Sakti, Sakriani; Xue, Nianwen (Ed.)
This paper reports the first release of the UMR (Uniform Meaning Representation) data set. UMR is a graph-based meaning representation formalism consisting of a sentence-level graph and a document-level graph. The sentence-level graph represents predicate-argument structures, named entities, word senses, aspectuality of events, as well as person and number information for entities. The document-level graph represents coreferential, temporal, and modal relations that go beyond sentence boundaries. UMR is designed to capture the commonalities and variations across languages, and this is done through the use of a common set of abstract concepts, relations, and attributes as well as concrete concepts derived from words from individual languages. This UMR release includes annotations for six languages (Arapaho, Chinese, English, Kukama, Navajo, Sanapana) that vary greatly in terms of their linguistic properties and resource availability. We also describe ongoing efforts to enlarge this data set and extend it to other genres and modalities, and briefly describe the available infrastructure (UMR annotation guidelines and tools) that others can use to create similar data sets.
Full Text Available
Beyond Benchmarks: Building a Richer Cross-Document Event Coreference Dataset with Decontextualization

Zhao, Jin; Tu, Jingxuan; Ye, Bingyang; Hu, Xinrui; Xue, Nianwen; Pustejovsky, James (April 2024, ACL Anthology)

Full Text Available
Cross-Document Event Coreference Resolution: Instruct Humans or Instruct GPT?

https://doi.org/10.18653/v1/2023.conll-1.38

Zhao, Jin; Xue, Nianwen; Min, Bonan (December 2023, Association for Computational Linguistics)
Jiang, Jing; Reitter, David; Deng, Shumin (Ed.)
This paper explores utilizing Large Language Models (LLMs) to perform Cross-Document Event Coreference Resolution (CDEC) annotations and evaluates how they fare against human annotators with different levels of training. Specifically, we formulate CDEC as a multi-category classification problem on pairs of events that are represented as decontextualized sentences, and compare the predictions of GPT-4 with the judgment of fully trained annotators and crowdworkers on the same data set. Our study indicates that GPT-4 with zero-shot learning outperforms crowdworkers by a large margin and exhibits a level of performance comparable to trained annotators. Upon closer analysis, GPT-4 also tends to be overconfident, forcing annotation decisions even when insufficient information makes such decisions unwarranted. Our results have implications for how to perform complicated annotations such as CDEC in the age of LLMs. They suggest that the best way to acquire such annotations might be to combine the strengths of LLMs and trained human annotators in the annotation process, and that using untrained or undertrained crowdworkers is no longer a viable option for acquiring high-quality data to advance the state of the art for such problems.
Full Text Available
UMR-Writer 2.0: Incorporating a New Keyboard Interface and Workflow into UMR-Writer

https://doi.org/10.18653/v1/2023.law-1.21

Ge, Sijia; Zhao, Jin; Wright-Bettner, Kristin; Myers, Skatje; Xue, Nianwen; Palmer, Martha (July 2023, The 17th Linguistic Annotation Workshop (LAW-XVII))

Full Text Available
UMR annotation of Multiword Expressions

Bonn, Julia; Cowell, Andrew; Hajic, Jan; Palmer, Alexis; Palmer, Martha; Pustejovsky, James; Sun, Haibo; Uresova, Zdenka; Wein, Shira; Xue, Nianwen; et al (June 2023, The 4th International Workshop on Designing Meaning Representations)

Full Text Available
UMR annotation of Chinese Verb compounds and related constructions

Sun, Haibo; Zhu, Yifan; Zhao, Jin; Xue, Nianwen (March 2023, The First International Workshop on Construction Grammars and NLP (CxGs+NLP))

Full Text Available
Mapping AMR to UMR: Resources for Adapting Existing Corpora for Cross-Lingual Compatibility

Bonn, Julia; Myers, Skatje; Van Gysel, Jens E.; Denk, Lukas; Vigus, Meagan; Zhao, Jin; Cowell, Andrew; Croft, William; Hajic, Jan; Martin, James H; et al (March 2023, The 21st International Workshop on Treebanks and Linguistic Theories (TLT, GURT/SyntaxFest 2023))

Full Text Available
