-
A prevalent approach of entity-oriented systems involves retrieving relevant entities by harnessing knowledge graph embeddings. These embeddings encode entity information in the context of the knowledge graph and are static in nature. Our goal is to generate entity embeddings that capture what renders them relevant for the query. This differs from entity embeddings constructed with static resources, for example, E-BERT. Previously, Dalton et al. (2014) demonstrated the benefits obtained with the Entity Context Model, a pseudo-relevance feedback approach based on entity links in relevant contexts. In this work, we reinvent the Entity Context Model (ECM) for neural graph networks and incorporate pre-trained embeddings. We introduce three entity ranking models based on the fundamental principles of ECM: (1) Graph Attention Networks, (2) Simple Graph Relevance Networks, and (3) Graph Relevance Networks. Graph Attention Networks and Graph Relevance Networks are the graph neural variants of ECM, which use an attention mechanism and the relevance information of the relevant context, respectively, to ascertain entity relevance. Our experiments demonstrate that our neural variants of the ECM model significantly outperform the state-of-the-art BERT-ER [DOI: 10.1145/3477495.3531944] by more than 14% and exceed the performance of systems that use knowledge graph embeddings by over 101%. Notably, our findings reveal that leveraging the relevance of the relevant context is more effective at identifying relevant entities than the attention mechanism. To evaluate the efficacy of the models, we conduct experiments on two standard benchmark datasets, DBpediaV2 and TREC Complex Answer Retrieval. To aid reproducibility, our code and data are available at https://github.com/TREMA-UNH/neural-entity-context-models
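
The following is a minimal sketch (not the authors' implementation) of the relevance-weighting idea described in this abstract: a candidate entity is scored from the embeddings of entities that co-occur with it in retrieved contexts, with each context contributing in proportion to its retrieval relevance score. All class and argument names here are hypothetical.

import torch
import torch.nn as nn

class GraphRelevanceScorer(nn.Module):
    """Scores a candidate entity from relevance-weighted neighbor embeddings."""

    def __init__(self, dim: int):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Linear(2 * dim, dim),
            nn.ReLU(),
            nn.Linear(dim, 1),
        )

    def forward(
        self,
        candidate_emb: torch.Tensor,      # (dim,) pre-trained embedding of the candidate
        neighbor_embs: torch.Tensor,      # (n, dim) embeddings of co-occurring entities
        context_relevance: torch.Tensor,  # (n,) relevance score of the context each neighbor came from
    ) -> torch.Tensor:
        # Relevance-weighted aggregation of the candidate's neighborhood,
        # echoing ECM's use of entity links in relevant contexts.
        weights = torch.softmax(context_relevance, dim=0)             # (n,)
        neighborhood = (weights.unsqueeze(1) * neighbor_embs).sum(0)  # (dim,)
        features = torch.cat([candidate_emb, neighborhood], dim=-1)
        return self.mlp(features).squeeze(-1)                         # scalar relevance score

# Usage with toy tensors: 300-d embeddings, 5 neighbors from feedback contexts.
scorer = GraphRelevanceScorer(dim=300)
score = scorer(torch.randn(300), torch.randn(5, 300), torch.rand(5))
print(float(score))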
-
When asked, large language models (LLMs) like ChatGPT claim that they can assist with relevance judgments, but it is not clear whether automated judgments can reliably be used in evaluations of retrieval systems. In this perspectives paper, we discuss possible ways for LLMs to support relevance judgments along with concerns and issues that arise. We devise a human–machine collaboration spectrum that allows us to categorize different relevance judgment strategies based on how much humans rely on machines. For the extreme point of 'fully automated judgments', we further include a pilot experiment on whether LLM-based relevance judgments correlate with judgments from trained human assessors. We conclude the paper by providing opposing perspectives for and against the use of LLMs for automatic relevance judgments, and a compromise perspective, informed by our analyses of the literature, our preliminary experimental evidence, and our experience as IR researchers.
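
A hedged sketch of the 'fully automated judgments' end of the spectrum described above: an LLM is prompted to grade query-passage pairs and its grades are correlated with those of trained human assessors. The prompt wording, the grading scale, and the ask_llm stand-in are illustrative assumptions, not material from the paper.

from scipy.stats import kendalltau

PROMPT = (
    "Given a query and a passage, output a relevance grade from 0 (not "
    "relevant) to 3 (perfectly relevant). Output only the number.\n"
    "Query: {query}\nPassage: {passage}\nGrade:"
)

def ask_llm(prompt: str) -> str:
    """Hypothetical LLM call; replace with a real API client or local model."""
    raise NotImplementedError

def llm_judge(query: str, passage: str) -> int:
    # Naive parse: take the first character of the reply as the grade.
    reply = ask_llm(PROMPT.format(query=query, passage=passage))
    return int(reply.strip()[0])

def correlate(llm_grades: list[int], human_grades: list[int]) -> float:
    # Rank correlation between automatic and human relevance grades.
    tau, _ = kendalltau(llm_grades, human_grades)
    return tau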
-
This tutorial will provide an overview of recent advances on neuro-symbolic approaches for information retrieval. A decade ago, knowledge graphs and semantic annotation technology led to active research on how to best leverage symbolic knowledge. At the same time, neural methods have proven to be versatile and highly effective. From a neural network perspective, the same representation approach can serve document ranking or knowledge graph reasoning. End-to-end training allows us to optimize complex methods for downstream tasks. We are at the point where both the symbolic and the neural research advances are coalescing into neuro-symbolic approaches. The underlying research questions are how to best combine symbolic and neural approaches, what kind of symbolic/neural approaches are most suitable for which use case, and how to best integrate both ideas to advance the state of the art in information retrieval. Materials are available online: https://github.com/laura-dietz/neurosymbolic-representations-for-IR
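
As a small illustration of the combination question raised in this abstract, the sketch below interpolates a neural dense-retrieval score with a symbolic feature derived from entity annotations. The interpolation weight and the Jaccard feature are illustrative assumptions, not material from the tutorial.

import numpy as np

def neural_score(query_vec: np.ndarray, doc_vec: np.ndarray) -> float:
    # Cosine similarity between learned query and document representations.
    return float(query_vec @ doc_vec / (np.linalg.norm(query_vec) * np.linalg.norm(doc_vec)))

def symbolic_score(query_entities: set[str], doc_entities: set[str]) -> float:
    # Jaccard overlap of knowledge-graph entities linked in query and document.
    if not query_entities or not doc_entities:
        return 0.0
    return len(query_entities & doc_entities) / len(query_entities | doc_entities)

def hybrid_score(q_vec, d_vec, q_ents, d_ents, alpha: float = 0.7) -> float:
    # Simple linear interpolation of neural and symbolic evidence.
    return alpha * neural_score(q_vec, d_vec) + (1 - alpha) * symbolic_score(q_ents, d_ents)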
-
This report documents the program and the outcomes of Dagstuhl Seminar 23031, Frontiers of Information Access Experimentation for Research and Education, which brought together 38 participants from 12 countries. The seminar addressed technology-enhanced information access (information retrieval, recommender systems, natural language processing) and specifically focused on developing more responsible experimental practices leading to more valid results, both for research as well as for scientific education. The seminar featured a series of long and short talks delivered by participants, who helped set a common ground and let topics of interest emerge that were explored as the main output of the seminar. This led to the definition of five groups which investigated challenges, opportunities, and next steps in the following areas: reality check, i.e., conducting real-world studies; human-machine-collaborative relevance judgment frameworks; overcoming methodological challenges in information retrieval and recommender systems through awareness and education; results-blind reviewing; and guidance for authors. Date: 15–20 January 2023. Website: https://www.dagstuhl.de/23031.
-
Knowledge Graph embeddings model semantic and structural knowledge of entities in the context of the Knowledge Graph. A nascent research direction has been to study the utilization of such graph embeddings for the IR-centric task of entity ranking. In this work, we replicate the GEEER study of Gerritse et al. [9], which demonstrated improvements of Wiki2Vec embeddings on entity ranking tasks on the DBpediaV2 dataset. We further extend the study by exploring additional state-of-the-art entity embeddings, ERNIE [27] and E-BERT [19], and by including another test collection, TREC CAR, with queries that are not about person, location, and organization entities. We confirm the finding that entity embeddings are beneficial for the entity ranking task. Interestingly, we find that Wiki2Vec is competitive with ERNIE and E-BERT. Our code and data to aid reproducibility and further research are available at https://github.com/poojahoza/E3R-Replicability
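
A hedged sketch of the kind of embedding-based signal studied in this replication: a candidate entity is scored by its average embedding similarity to the entities linked in the query, and an initial candidate pool is re-ranked by that score. The embeddings dict stands in for a loaded Wiki2Vec, ERNIE, or E-BERT model; the exact scoring formula in the replicated study may differ.

import numpy as np

def cosine(a: np.ndarray, b: np.ndarray) -> float:
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

def embedding_score(
    candidate: str,
    query_entities: list[str],
    embeddings: dict[str, np.ndarray],
) -> float:
    """Average similarity between the candidate and the query's linked entities."""
    if candidate not in embeddings:
        return 0.0
    sims = [
        cosine(embeddings[candidate], embeddings[e])
        for e in query_entities
        if e in embeddings
    ]
    return float(np.mean(sims)) if sims else 0.0

def rerank(candidates: list[str], query_entities: list[str], embeddings) -> list[str]:
    # Re-rank an initial candidate pool by the embedding score (descending).
    return sorted(
        candidates,
        key=lambda c: embedding_score(c, query_entities, embeddings),
        reverse=True,
    )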