‘Just because you are right, doesn’t mean I am wrong’: Overcoming a bottleneck in development and evaluation of Open-Ended VQA tasks

Luo, Man; Sampat, Shailaja Keyur; Tallman, Riley; Zeng, Yankai; Vancha, Manuha; Sajja, Akarshan; Baral, Chitta

doi:10.18653/v1/2021.eacl-main.240

Citation Details

‘Just because you are right, doesn’t mean I am wrong’: Overcoming a bottleneck in development and evaluation of Open-Ended VQA tasks

GQA (CITATION) is a dataset for real-world visual reasoning and compositional question answering. We found that many answers predicted by the best vision-language models on the GQA dataset do not match the ground-truth answer but still are semantically meaningful and correct in the given context. In fact, this is the case with most existing visual question answering (VQA) datasets where they assume only one ground-truth answer for each question. We propose Alternative Answer Sets (AAS) of ground-truth answers to address this limitation, which is created automatically using off-the-shelf NLP tools. We introduce a semantic metric based on AAS and modify top VQA solvers to support multiple plausible answers for a question. We implement this approach on the GQA dataset and show the performance improvements. more »

Award ID(s):: 1816039

PAR ID:: 10353903

Author(s) / Creator(s):: Luo, Man; Sampat, Shailaja Keyur; Tallman, Riley; Zeng, Yankai; Vancha, Manuha; Sajja, Akarshan; Baral, Chitta

Editor(s):: Merlo, Paola; Tiedemann, Jorg; Tsarfaty, Reut

Date Published:: 2021-01-01

Journal Name:: Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume

Page Range / eLocation ID:: 2766 to 2771

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.18653/v1/2021.eacl-main.240

More Like this