Many mobile applications (i.e., apps) include UI widgets to use or collect users' sensitive data. Thus, to identify suspicious sensitive data usage such as UI-permission mismatch, it is crucial to understand the intentions of UI widgets. However, many UI widgets leverage icons of specific shapes (object icons) and icons embedded with text (text icons) to express their intentions, posing challenges for existing detection techniques that analyze only textual data to identify sensitive UI widgets. In this work, we propose a novel app analysis framework, ICONINTENT, that synergistically combines program analysis and icon classification to identify sensitive UI widgets in Android apps. ICONINTENT automatically associates UI widgets and icons via static analysis on an app's UI layout files and code, and then adapts computer vision techniques to classify the associated icons into eight categories of sensitive data. Our evaluations of ICONINTENT on 150 apps from Google Play show that ICONINTENT can detect 248 sensitive UI widgets in 97 apps, achieving a precision of 82.4%. When combined with SUPOR, the state-of-the-art sensitive UI widget identification technique based on text analysis, SUPOR + ICONINTENT can detect 487 sensitive UI widgets (101.2% improvement over SUPOR only), and reduces the suspicious permissions to be inspected by 50.7% (129.4% improvement over SUPOR only).
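One step the abstract describes is associating UI widgets with the icons they display via static analysis of an app's layout files. The snippet below is a minimal sketch of that idea only, not ICONINTENT's actual implementation: it parses a single Android layout XML (the file path and the widget/attribute coverage are illustrative assumptions) and maps image widgets to the drawable resources they reference, which would then be handed to an icon classifier.

```python
# Minimal sketch (not ICONINTENT's implementation) of the layout-analysis step:
# parse an Android layout XML and map each image widget to the drawable icon it
# displays. The file path and widget/attribute coverage are illustrative.
import xml.etree.ElementTree as ET

ANDROID_NS = "{http://schemas.android.com/apk/res/android}"

def widget_icon_pairs(layout_path):
    """Return (widget_id, icon_resource) pairs found in one layout file."""
    tree = ET.parse(layout_path)
    pairs = []
    for elem in tree.iter():
        # Only widgets that render drawables are of interest here.
        if elem.tag not in ("ImageView", "ImageButton"):
            continue
        widget_id = elem.get(ANDROID_NS + "id", "<anonymous>")
        icon = elem.get(ANDROID_NS + "src")
        if icon and icon.startswith("@drawable/"):
            pairs.append((widget_id, icon.removeprefix("@drawable/")))
    return pairs

if __name__ == "__main__":
    # Hypothetical layout path; each icon would next be classified as sensitive or not.
    for wid, icon in widget_icon_pairs("res/layout/activity_main.xml"):
        print(f"{wid} -> {icon}")
```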
Parsing and Summarizing Infographics with Synthetically Trained Icon Detection
Widely used in news, business, and educational media, infographics are handcrafted to effectively communicate messages about complex and often abstract topics, including "ways to conserve the environment" and "coronavirus prevention". The computational understanding of infographics required for future applications like automatic captioning, summarization, search, and question-answering will depend on being able to parse the visual and textual elements contained within them. However, being composed of stylistically and semantically diverse visual and textual elements, infographics pose challenges for current AI systems. While automatic text extraction works reasonably well on infographics, standard object detection algorithms fail to identify the stand-alone visual elements in infographics that we refer to as "icons". In this paper, we propose a novel approach to train an object detector using synthetically generated data, and show that it succeeds at generalizing to detecting icons within in-the-wild infographics. We further pair our icon detection approach with an icon classifier and a state-of-the-art text detector to demonstrate three demo applications: topic prediction, multi-modal summarization, and multi-modal search. Parsing the visual and textual elements within infographics provides us with the first steps towards automatic infographic understanding.
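The core idea of training the detector on synthetically generated data can be illustrated with a small compositing script. The sketch below is a simplification under stated assumptions, not the paper's pipeline: it pastes randomly scaled icons onto a background canvas and records their bounding boxes as detector ground truth; the directory names and scaling ranges are hypothetical.

```python
# A minimal sketch (assumes directories of icon PNGs and background images exist;
# the paper's actual augmentation pipeline is more elaborate) of generating
# synthetic detection data by compositing icons onto canvases.
import random
from pathlib import Path
from PIL import Image

def synthesize(background_path, icon_paths, n_icons=5):
    """Paste random icons onto a background; return the image and its boxes."""
    canvas = Image.open(background_path).convert("RGBA")
    boxes = []  # (x0, y0, x1, y1) ground-truth boxes for the detector
    for icon_path in random.sample(icon_paths, k=min(n_icons, len(icon_paths))):
        icon = Image.open(icon_path).convert("RGBA")
        scale = random.uniform(0.05, 0.2)  # icons are small relative to the canvas
        w = max(1, int(canvas.width * scale))
        h = max(1, int(icon.height * w / icon.width))
        if w >= canvas.width or h >= canvas.height:
            continue  # skip icons that would not fit on this canvas
        icon = icon.resize((w, h))
        x = random.randint(0, canvas.width - w)
        y = random.randint(0, canvas.height - h)
        canvas.alpha_composite(icon, (x, y))
        boxes.append((x, y, x + w, y + h))
    return canvas, boxes

if __name__ == "__main__":
    icons = sorted(Path("icons").glob("*.png"))       # hypothetical icon library
    image, boxes = synthesize("backgrounds/canvas.png", icons)
    image.save("synthetic_000.png")
    print(boxes)
```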
- Award ID(s): 1901030
- PAR ID: 10300546
- Date Published:
- Journal Name: 2021 IEEE 14th Pacific Visualization Symposium (PacificVis)
- Page Range / eLocation ID: 31 to 40
- Format(s): Medium: X
- Sponsoring Org: National Science Foundation
More Like this
-
Increasingly, icons are being proposed to concisely convey privacy-related information and choices to users. However, complex privacy concepts can be difficult to communicate. We investigate which icons effectively signal the presence of privacy choices. In a series of user studies, we designed and evaluated icons and accompanying textual descriptions (link texts) conveying choice, opting-out, and sale of personal information; the latter is an opt-out mandated by the California Consumer Privacy Act (CCPA). We identified icon-link text pairings that conveyed the presence of privacy choices without creating misconceptions, with a blue stylized toggle icon paired with "Privacy Options" performing best. The two CCPA-mandated link texts ("Do Not Sell My Personal Information" and "Do Not Sell My Info") accurately communicated the presence of do-not-sell opt-outs with most icons. Our results provide insights for the design of privacy choice indicators and highlight the necessity of incorporating user testing into policy making.
-
Multimodal machine learning algorithms aim to learn visual-textual correspondences. Previous work suggests that concepts with concrete visual manifestations may be easier to learn than concepts with abstract ones. We give an algorithm for automatically computing the visual concreteness of words and topics within multimodal datasets. We apply the approach in four settings, ranging from image captions to images/text scraped from historical books. In addition to enabling explorations of concepts in multimodal datasets, our concreteness scores predict the capacity of machine learning algorithms to learn textual/visual relationships. We find that 1) concrete concepts are indeed easier to learn; 2) the large number of algorithms we consider have similar failure cases; 3) the precise positive relationship between concreteness and performance varies between datasets. We conclude with recommendations for using concreteness scores to facilitate future multimodal research.
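As a rough illustration of what a visual concreteness score can look like, the sketch below scores a word by how often the nearest visual neighbors of images tagged with that word are also tagged with it, normalized by the word's overall frequency. This only mirrors the intuition in the abstract; the authors' exact scoring function may differ, and the features, tags, and neighborhood size here are assumptions.

```python
# Hedged sketch of one possible visual concreteness score (not necessarily the
# paper's formula). Assumes precomputed image features and per-image word sets.
import numpy as np
from sklearn.neighbors import NearestNeighbors

def concreteness(word, image_feats, image_words, k=25):
    """image_feats: (N, D) array; image_words: list of word sets, one per image."""
    nn = NearestNeighbors(n_neighbors=k + 1).fit(image_feats)
    tagged = [i for i, ws in enumerate(image_words) if word in ws]
    if not tagged:
        return 0.0
    _, idx = nn.kneighbors(image_feats[tagged])
    # Fraction of neighbors (excluding the query image itself) that share the word,
    # normalized by how frequent the word is across the whole collection.
    hits = np.mean([[word in image_words[j] for j in row[1:]] for row in idx])
    base = len(tagged) / len(image_words)
    return hits / base if base > 0 else 0.0
```

Under this formulation, a higher score means images associated with the word cluster tightly in visual feature space, which matches the abstract's claim that concrete concepts are easier for multimodal models to learn.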
-
We present a multimodal deep learning framework that can generate summarization text supporting the main idea of an information graphic for presentation to a person who is blind or visually impaired. The framework utilizes the visual, textual, positional, and size characteristics extracted from the image to create the summary. Different and complementary neural architectures are optimized for each task using crowdsourced training data. From our quantitative experiments and results, we explain the reasoning behind our framework and show the effectiveness of our models. Our qualitative results showcase text generated from our framework and show that Mechanical Turk participants favor it over other automatic and human-generated summarizations. We describe the design and results of an experiment to evaluate the utility of our system for people who have visual impairments in the context of understanding Twitter Tweets containing line graphs.
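The framework is described as combining visual, textual, positional, and size characteristics of graphic elements. The sketch below is not the authors' architecture; it only illustrates one plausible fusion step, concatenating per-element feature vectors (with illustrative dimensions) and projecting them into a shared embedding that a text-generation decoder could condition on.

```python
# Minimal sketch (not the authors' architecture) of fusing visual, textual,
# positional, and size features into one embedding per graphic element.
# All dimensions below are illustrative assumptions.
import torch
import torch.nn as nn

class MultimodalFusion(nn.Module):
    def __init__(self, visual_dim=2048, text_dim=300, geom_dim=4, hidden=512):
        super().__init__()
        # geom_dim covers position (x, y) and size (width, height) of each element
        self.proj = nn.Linear(visual_dim + text_dim + geom_dim, hidden)

    def forward(self, visual, text, geometry):
        fused = torch.cat([visual, text, geometry], dim=-1)
        return torch.relu(self.proj(fused))  # one fused vector per graphic element

if __name__ == "__main__":
    fusion = MultimodalFusion()
    v = torch.randn(8, 2048)   # e.g. CNN features for each element crop
    t = torch.randn(8, 300)    # e.g. averaged word embeddings of element text
    g = torch.randn(8, 4)      # normalized position and size
    print(fusion(v, t, g).shape)  # torch.Size([8, 512])
```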