NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Enhancing Knowledge Graph Consistency through Open Large Language Models: A Case Study

https://doi.org/10.1609/aaaiss.v3i1.31201

Padia, Ankur; Ferraro, Francis; Finin, Tim (May 2024, Proceedings of the AAAI Symposium Series)

High-quality knowledge graphs (KGs) play a crucial role in many applications. However, KGs created by automated information extraction systems can suffer from erroneous extractions or be inconsistent with provenance/source text. It is important to identify and correct such problems. In this paper, we study leveraging the emergent reasoning capabilities of large language models (LLMs) to detect inconsistencies between extracted facts and their provenance. With a focus on ``open'' LLMs that can be run and trained locally, we find that few-shot approaches can yield an absolute performance gain of 2.5-3.4% over the state-of-the-art method with only 9% of training data. We examine the LLM architectures' effect and show that Decoder-Only models underperform Encoder-Decoder approaches. We also explore how model size impacts performance and counterintuitively find that larger models do not result in consistent performance gains. Our detailed analyses suggest that while LLMs can improve KG consistency, the different LLM models learn different aspects of KG consistency and are sensitive to the number of entities involved.
more » « less
Full Text Available
Jointly Identifying and Fixing Inconsistent Readings from Information Extraction Systems

https://doi.org/10.18653/v1/2022.deelio-1.5

Padia, Ankur; Ferraro, Francis; Finin, Tim (May 2022, Third Deep Learning Inside Out (DeeLIO) Workshop: Knowledge Extraction and Integration for Deep Learning Architecture)

Information extraction systems analyze text to produce entities and beliefs, but their output often has errors. In this paper we analyze the reading consistency of the extracted facts with respect to the text from which they were derived and show how to detect and correct errors. We consider both the scenario when the provenance text is automatically found by an IE system and when it is curated by humans. We contrast consistency with credibility; define and explore consistency and repair tasks; and demonstrate a simple, yet effective and generalizable, model. We analyze these tasks and evaluate this approach on three datasets. Against a strong baseline model, we consistently improve both consistency and repair across three datasets using a simple MLP model with attention and lexical features.
more » « less
Full Text Available
Knowledge graph fact prediction via knowledge-enriched tensor factorization

https://doi.org/10.1016/j.websem.2019.01.004

Padia, Ankur; Kalpakis, Konstantinos; Ferraro, Francis; Finin, Tim (February 2019, Journal of Web Semantics)

Full Text Available
Team UMBC-FEVER : Claim verification using Semantic Lexical Resources

https://doi.org/10.18653/v1/W18-5527

Padia, Ankur; Ferraro, Francis; Finin, Tim (January 2018, Team UMBC-FEVER : Claim verification using Semantic Lexical Resources)

Full Text Available
UMBC at SemEval-2018 Task 8: Understanding Text about Malware

https://doi.org/10.18653/v1/S18-1142

Padia, Ankur; Roy, Arpita; Satyapanich, Taneeya; Ferraro, Francis; Pan, Shimei; Park, Youngja; Joshi, Anupam; Finin, Tim (January 2018, UMBC at SemEval-2018 Task 8: Understanding Text about Malware)

Full Text Available

Search for: All records