
Title: Seeing the meaning: Vision meets semantics in solving pictorial analogy problems
We report a first effort to model the solution of meaningful four-term visual analogies, by combining a machine-vision model (ResNet50-A) that can classify pixel-level images into object categories with a cognitive model (BART) that takes semantic representations of words as input and identifies the semantic relations instantiated by a word pair. Each model achieves above-chance performance in selecting the best analogical option from a set of four. However, combining the visual and semantic models increases analogical performance above the level achieved by either model alone. The contribution of vision to reasoning may thus extend beyond simply generating verbal representations from images. These findings provide a proof of concept that a comprehensive model can solve semantically rich analogies from pixel-level inputs.
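As a rough illustration of the two-route architecture described in the abstract, the sketch below scores each candidate image of an A:B :: C:? analogy by combining a visual classifier's category distribution with a BART-style relation model over word embeddings. The function and variable names (visual_model, relation_model, word_vectors) and the 50/50 weighting are placeholders, not the published model.

```python
# Hypothetical sketch (not the authors' implementation) of combining a visual
# classifier with a semantic relation model to answer an A:B :: C:? analogy.
import numpy as np

def cosine(u, v):
    return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v) + 1e-9))

def score_candidate(img_a, img_b, img_c, img_d,
                    visual_model, relation_model, word_vectors):
    """Combined visual + semantic score for candidate image img_d."""
    # Visual route: each image becomes a distribution over object categories.
    p_a, p_b, p_c, p_d = (visual_model(img) for img in (img_a, img_b, img_c, img_d))

    # Semantic route: take the top category label per image and look up its
    # word embedding (the relation model operates on such lexical vectors).
    v_a, v_b, v_c, v_d = (word_vectors[int(np.argmax(p))] for p in (p_a, p_b, p_c, p_d))

    # Relation model: a vector of probabilities over learned semantic relations
    # for each word pair; analogous pairs should instantiate similar relations.
    semantic_score = cosine(relation_model(v_a, v_b), relation_model(v_c, v_d))

    # Purely visual consistency: the category-space change from A to B should
    # resemble the change from C to the candidate D.
    visual_score = cosine(p_b - p_a, p_d - p_c)

    return 0.5 * semantic_score + 0.5 * visual_score  # illustrative weighting

# The chosen answer is the candidate with the highest combined score.
```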
Authors:
Award ID(s):
1827374
Publication Date:
NSF-PAR ID:
10093529
Journal Name:
Proceedings of the Annual Conference of the Cognitive Science Society
ISSN:
1069-7977
Sponsoring Org:
National Science Foundation
More Like this
  1. Fitch, T. ; Lamm, C. ; Leder, H. ; Teßmar-Raible, K. (Ed.)
    Is analogical reasoning a task that must be learned to solve from scratch by applying deep learning models to massive numbers of reasoning problems? Or are analogies solved by computing similarities between structured representations of analogs? We address this question by comparing human performance on visual analogies created using images of familiar three-dimensional objects (cars and their subregions) with the performance of alternative computational models. Human reasoners achieved above-chance accuracy for all problem types, but made more errors in several conditions (e.g., when relevant subregions were occluded). We compared human performance to that of two recent deep learning models (Siamese Network and Relation Network) directly trained to solve these analogy problems, as well as to that of a compositional model that assesses relational similarity between part-based representations. The compositional model based on part representations, but not the deep learning models, generated qualitative performance similar to that of human reasoners. (A minimal sketch of relational similarity over part-based representations appears after this list.)
  2. By middle childhood, humans are able to learn abstract semantic relations (e.g., antonym, synonym, category membership) and use them to reason by analogy. A deep theoretical challenge is to show how such abstract relations can arise from nonrelational inputs, thereby providing key elements of a protosymbolic representation system. We have developed a computational model that exploits the potential synergy between deep learning from “big data” (to create semantic features for individual words) and supervised learning from “small data” (to create representations of semantic relations between words). Given as inputs labeled pairs of lexical representations extracted by deep learning, the model creates augmented representations by remapping features according to the rank of differences between values for the two words in each pair. These augmented representations aid in coping with the feature alignment problem (e.g., matching those features that make “love-hate” an antonym with the different features that make “rich-poor” an antonym). The model extracts weight distributions that are used to estimate the probabilities that new word pairs instantiate each relation, capturing the pattern of human typicality judgments for a broad range of abstract semantic relations. A measure of relational similarity can be derived and used to solve simple verbal analogies with human-level accuracy. Because each acquired relation has a modular representation, basic symbolic operations are enabled (notably, the converse of any learned relation can be formed without additional training). Abstract semantic relations can be induced by bootstrapping from nonrelational inputs, thereby enabling relational generalization and analogical reasoning. (A schematic sketch of the rank-based augmentation and relation-probability steps appears after this list.)
  3. Analogical reasoning fundamentally involves exploiting redundancy in a given task, but there are many different ways an intelligent agent can choose to define and exploit redundancy, often resulting in very different levels of task performance. We explore such variations in analogical reasoning within the domain of geometric matrix reasoning tasks, namely on the Raven’s Standard Progressive Matrices intelligence test. We show how different analogical constructions used by the same basic visual-imagery-based computational model—varying only in how they “slice” a matrix problem into parts and do search and optimization within/across these parts—achieve very different levels of test performance, ranging from 13/60 correct all the way up to 57/60 correct. Our findings suggest that the ability to select or build effective high-level analogical constructions can be as important as an agent’s competencies in low-level reasoning skills, which raises interesting open questions about the extent to which building the “right” analogies might contribute to individual differences in human matrix reasoning performance, and how intelligent agents might learn to build or select from among different analogical constructions in the first place. (A toy illustration of row- versus column-based constructions appears after this list.)
  4. We see the external world as consisting not only of objects and their parts, but also of relations that hold between them. Visual analogy, which depends on similarities between relations, provides a clear example of how perception supports reasoning. Here we report an experiment in which we quantitatively measured the human ability to find analogical mappings between parts of different objects, where the objects to be compared were drawn either from the same category (e.g., images of two mammals, such as a dog and a horse), or from two dissimilar categories (e.g., a chair image mapped to a cat image). Humans showed systematic mapping patterns, but with greater variability in mapping responses when objects were drawn from dissimilar categories. We simulated the human response of analogical mapping using a computational model of mapping between 3D objects, visiPAM (visual Probabilistic Analogical Mapping). VisiPAM takes point-cloud representations of two 3D objects as inputs, and outputs the mapping between analogous parts of the two objects. VisiPAM consists of a visual module that constructs structural representations of individual objects, and a reasoning module that identifies a probabilistic mapping between parts of the two 3D objects. Model simulations not only capture the qualitative pattern of human mapping performance across conditions, but also approach human-level reliability in solving visual analogy problems. (A simplified sketch of probabilistic part mapping appears after this list.)
  5. Domain adaptation is critical for success in new, unseen environments. Adversarial adaptation models have shown tremendous progress towards adapting to new environments by focusing either on discovering domain invariant representations or by mapping between unpaired image domains. While feature space methods are difficult to interpret and sometimes fail to capture pixel-level and low-level domain shifts, image space methods sometimes fail to incorporate high level semantic knowledge relevant for the end task. We propose a model which adapts between domains using both generative image space alignment and latent representation space alignment. Our approach, Cycle-Consistent Adversarial Domain Adaptation (CyCADA), guides transfer between domains according to a specific discriminatively trained task and avoids divergence by enforcing consistency of the relevant semantics before and after adaptation. We evaluate our method on a variety of visual recognition and prediction settings, including digit classification and semantic segmentation of road scenes, advancing state-of-the-art performance for unsupervised adaptation from synthetic to real world driving domains. (A schematic of the combined CyCADA objective appears after this list.)
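For item 1 above, the following is a minimal sketch of what relational similarity between part-based representations could look like in code: each relation between two parts is encoded as a feature-difference vector, and analogs are compared by the similarity of corresponding relation vectors. The encoding and the one-to-one pairing of relations are assumptions for illustration, not the compositional model evaluated in that study.

```python
# Illustrative sketch of relational similarity over part-based representations
# (assumed encoding; not the compositional model from item 1).
import numpy as np

def relation_vector(part_x, part_y):
    # The relation between two parts is encoded as their feature difference.
    return part_y - part_x

def relational_similarity(source_part_pairs, target_part_pairs):
    """Mean cosine similarity between corresponding source and target relations.

    Each argument is a list of (part_i, part_j) feature-vector pairs, with the
    target pairs assumed to correspond one-to-one with the source pairs.
    """
    sims = []
    for (sx, sy), (tx, ty) in zip(source_part_pairs, target_part_pairs):
        r_s, r_t = relation_vector(sx, sy), relation_vector(tx, ty)
        denom = np.linalg.norm(r_s) * np.linalg.norm(r_t) + 1e-9
        sims.append(float(np.dot(r_s, r_t) / denom))
    return float(np.mean(sims))
```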
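Item 2 describes two concrete steps that translate naturally into code: augmenting each word pair by reordering features according to the rank of the between-word differences, and learning per-relation weights that yield a probability that a new pair instantiates the relation. The sketch below uses a plain logistic-regression classifier as a stand-in for the model's Bayesian weight estimation; treat it as an approximation of the idea, not the published BART implementation.

```python
# Rank-based pair augmentation plus a per-relation classifier (simplified;
# logistic regression here stands in for BART's Bayesian weight learning).
import numpy as np
from sklearn.linear_model import LogisticRegression

def augment_pair(w1, w2):
    """Reorder the two word vectors by the rank of |w1 - w2| per feature."""
    order = np.argsort(np.abs(w1 - w2))            # rank features by difference
    return np.concatenate([w1[order], w2[order]])  # augmented pair representation

def train_relation(positive_pairs, negative_pairs):
    """Fit one relation (e.g., antonym) from labeled word-vector pairs."""
    X = np.array([augment_pair(a, b) for a, b in positive_pairs + negative_pairs])
    y = np.array([1] * len(positive_pairs) + [0] * len(negative_pairs))
    return LogisticRegression(max_iter=1000).fit(X, y)

def relation_probability(relation_model, w1, w2):
    """Estimated probability that the pair (w1, w2) instantiates the relation."""
    x = augment_pair(w1, w2).reshape(1, -1)
    return float(relation_model.predict_proba(x)[0, 1])
```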
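For item 3, here is a toy illustration (not the paper's imagery-based model) of how two different analogical constructions over a 3x3 matrix problem can yield different answers: one slices the problem by rows and the other by columns, and each scores candidates by how well the candidate completes the analogous transformation.

```python
# Toy row- vs. column-based analogical constructions for a 3x3 matrix problem.
# Images are assumed to be NumPy arrays of identical shape; matrix[2][2] is the
# missing cell. This is an illustration only, not the model from item 3.
import numpy as np

def transform_match(x, y, u, v):
    """How closely the change x -> y matches the change u -> v (higher is better)."""
    return -float(np.linalg.norm((y - x) - (v - u)))

def solve(matrix, candidates, construction="row"):
    scores = []
    for cand in candidates:
        if construction == "row":
            # Analogy: first row's left-to-right change vs. last row's change.
            s = transform_match(matrix[0][1], matrix[0][2], matrix[2][1], cand)
        else:
            # Analogy: first column's top-to-bottom change vs. last column's change.
            s = transform_match(matrix[1][0], matrix[2][0], matrix[1][2], cand)
        scores.append(s)
    return int(np.argmax(scores))  # index of the best-scoring candidate
```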
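For item 4, the sketch below shows one simple way a probabilistic mapping between parts could be computed: cosine similarities between part feature vectors are turned into a soft assignment by a temperature-scaled softmax. The feature encoding and the softmax step are assumptions for illustration; visiPAM's actual reasoning module operates over structural (relational) representations built from point clouds.

```python
# Simplified probabilistic part mapping: similarity -> softmax assignment.
# (Illustrative only; visiPAM maps parts using structural representations.)
import numpy as np

def soft_part_mapping(source_parts, target_parts, temperature=0.1):
    """source_parts: (n, d) array; target_parts: (m, d) array.
    Returns an (n, m) matrix whose rows are mapping probabilities."""
    s = source_parts / (np.linalg.norm(source_parts, axis=1, keepdims=True) + 1e-9)
    t = target_parts / (np.linalg.norm(target_parts, axis=1, keepdims=True) + 1e-9)
    sim = s @ t.T                                  # cosine similarity per part pair
    logits = sim / temperature
    logits -= logits.max(axis=1, keepdims=True)    # numerical stability
    probs = np.exp(logits)
    return probs / probs.sum(axis=1, keepdims=True)
```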
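For item 5, CyCADA's training objective combines several loss terms: a task loss on source images translated to the target style, pixel-level adversarial (GAN) losses in both translation directions, a cycle-consistency loss, a feature-level adversarial loss, and a semantic-consistency loss that keeps the source task model's labels stable under translation. The helper below only assembles precomputed terms with illustrative weights; it is a schematic, not the authors' training code.

```python
# Schematic combination of the CyCADA loss terms (weights are illustrative).
def cycada_objective(task_loss, gan_loss_src_to_tgt, gan_loss_tgt_to_src,
                     cycle_loss, feature_gan_loss, semantic_consistency_loss,
                     weights=(1.0, 1.0, 1.0, 10.0, 1.0, 1.0)):
    """Weighted sum of the six losses described above; each argument is a scalar
    (or an autograd tensor) already computed for the current minibatch."""
    terms = (task_loss, gan_loss_src_to_tgt, gan_loss_tgt_to_src,
             cycle_loss, feature_gan_loss, semantic_consistency_loss)
    return sum(w * t for w, t in zip(weights, terms))
```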