Title: LPGA: Line-of-Sight Parsing with Graph-Based Attention for Math Formula Recognition
We present a model for recognizing typeset math formula images from connected components or symbols. In our approach, connected components are used to construct a line-of-sight (LOS) graph. The graph is used both to reduce the search space for formula structure interpretations, and to guide a classification attention model using separate channels for inputs and their local visual context. For classification, we used visual densities with Random Forests for initial development, and then converted this to a Convolutional Neural Network (CNN) with a second branch to capture context for each input image. Formula structure is extracted as a directed spanning tree from a weighted LOS graph using Edmonds' algorithm. We obtain strong results for formulas without grids or matrices in the InftyCDB-2 dataset (90.89% from components, 93.5% from symbols). Using tools from the CROHME handwritten formula recognition competitions, we were able to compile all symbol and structure recognition errors for analysis. Our data and source code are publicly available.
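As a concrete illustration of the structure-extraction step, the sketch below uses networkx's implementation of Edmonds' algorithm to pull a maximum-weight directed spanning tree (arborescence) out of a small weighted graph. The symbols, edges, and weights are hypothetical stand-ins, not the paper's actual features or scores.

```python
# A minimal sketch of extracting formula structure as a directed spanning
# tree from a weighted LOS graph, as described in the abstract. The graph
# below is an illustrative placeholder.
import networkx as nx

# Hypothetical weighted line-of-sight graph over recognized symbols.
# Edge weights stand in for classifier scores on spatial relationships
# (e.g., horizontal, superscript).
G = nx.DiGraph()
G.add_edge("x", "2", weight=0.92)   # e.g., 'superscript' score
G.add_edge("x", "+", weight=0.85)   # e.g., 'horizontal' score
G.add_edge("+", "y", weight=0.88)
G.add_edge("2", "+", weight=0.10)   # low-scoring alternative edge

# Edmonds' algorithm yields the maximum-weight arborescence, which serves
# as the formula's structure interpretation.
tree = nx.maximum_spanning_arborescence(G, attr="weight")
print(sorted(tree.edges(data="weight")))
```

Here the low-weight alternative edge is pruned, leaving a single tree rooted at "x" that spans every symbol.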
Award ID(s): 1717997
PAR ID: 10124326
Author(s) / Creator(s): ; ; ;
Date Published:
Journal Name: Proceedings of the International Conference on Document Analysis and Recognition
Format(s): Medium: X
Sponsoring Org: National Science Foundation
More Like this
  1. We present a visual search engine for graphics such as math, chemical diagrams, and figures. Graphics are represented using Line-of-Sight (LOS) graphs, with symbols connected only when they can 'see' each other along an unobstructed line. Symbol identities may be provided (e.g., in PDF) or taken from Optical Character Recognition applied to images. Graphics are indexed by pairs of symbols that 'see' each other using their labels, spatial displacement, and size ratio. Retrieval has two layers: the first matches query symbol pairs in an inverted index, while the second aligns candidates with the query and scores the resulting matches using the identity and relative position of symbols. For PDFs, we also introduce a new tool that quickly extracts characters and their locations. We have applied our model to the NTCIR-12 Wikipedia Formula Browsing Task, and found that the method can locate relevant matches without unification of symbols or using a math expression grammar. In the future, one might index LOS graphs for entire pages and search for text and graphics. Our source code has been made publicly available. A sketch of the pair-based index follows.
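The sketch below illustrates the first retrieval layer: an inverted index keyed by symbol pairs that 'see' each other. The key format, the choice to encode displacement as an angle between symbol centers, and the example entries are assumptions for illustration, not the system's exact data layout.

```python
# A minimal sketch of a pair-based inverted index over symbols that 'see'
# each other, storing label pair, spatial displacement, and size ratio.
from collections import defaultdict
import math

def pair_key(sym_a, sym_b):
    """Order-independent key for two symbols that 'see' each other."""
    return tuple(sorted((sym_a, sym_b)))

# Inverted index: symbol pair -> list of (doc_id, displacement_angle, size_ratio)
index = defaultdict(list)

def add_pair(doc_id, sym_a, pos_a, size_a, sym_b, pos_b, size_b):
    dx, dy = pos_b[0] - pos_a[0], pos_b[1] - pos_a[1]
    displacement = math.atan2(dy, dx)           # angle between symbol centers
    size_ratio = size_a / size_b if size_b else 0.0
    index[pair_key(sym_a, sym_b)].append((doc_id, displacement, size_ratio))

# First retrieval layer: fetch all candidates sharing a query symbol pair;
# the second layer would then align and score these candidates.
add_pair("doc1", "x", (0, 0), 10, "2", (8, 6), 6)
print(index[pair_key("x", "2")])
```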
  2. We present a new visual parsing method based on convolutional neural networks for handwritten mathematical formulas. The Query-Driven Global Graph Attention (QD-GGA) parsing model employs multi-task learning, and uses a single feature representation for locating, classifying, and relating symbols. First, a Line-Of-Sight (LOS) graph is computed over the handwritten strokes in a formula. Second, class distributions for LOS nodes and edges are obtained using query-specific feature filters (i.e., attention) in a single feed-forward pass. Finally, a Maximum Spanning Tree (MST) is extracted from the weighted graph. Our preliminary results show that this is a promising new approach for visual parsing of handwritten formulas. Our data and source code are publicly available. A sketch of the line-of-sight test follows.
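The sketch below shows the geometric test behind LOS graph construction: two nodes are connected when the segment between their centers is not blocked by any other node's bounding box. Sampling points along the segment is a simplified stand-in for the actual visibility geometry.

```python
# A rough sketch of a line-of-sight test over stroke bounding boxes,
# deciding which node pairs become edges in the LOS graph.
def blocked(p, q, boxes, samples=50):
    """True if the segment p->q passes through any bounding box in `boxes`."""
    for t in (i / samples for i in range(1, samples)):
        x = p[0] + t * (q[0] - p[0])
        y = p[1] + t * (q[1] - p[1])
        if any(x0 <= x <= x1 and y0 <= y <= y1 for (x0, y0, x1, y1) in boxes):
            return True
    return False

def los_edges(centers, boxes):
    """Return index pairs (i, j) whose centers 'see' each other."""
    edges = []
    for i in range(len(centers)):
        for j in range(i + 1, len(centers)):
            others = [b for k, b in enumerate(boxes) if k not in (i, j)]
            if not blocked(centers[i], centers[j], others):
                edges.append((i, j))
    return edges

# Three strokes in a row: the outer pair is blocked by the middle stroke.
centers = [(0, 0), (5, 0), (10, 0)]
boxes = [(-1, -1, 1, 1), (4, -1, 6, 1), (9, -1, 11, 1)]
print(los_edges(centers, boxes))   # [(0, 1), (1, 2)]
```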
  3. Vision-based methods are commonly used in robotic arm activity recognition. These approaches typically rely on line-of-sight (LoS) and raise privacy concerns, particularly in smart home applications. Passive Wi-Fi sensing represents a new paradigm for recognizing human and robotic arm activities, utilizing channel state information (CSI) measurements to identify activities in indoor environments. In this paper, a novel machine learning approach based on discrete wavelet transform and vision transformers for robotic arm activity recognition from CSI measurements in indoor settings is proposed. This method outperforms convolutional neural network (CNN) and long short-term memory (LSTM) models in robotic arm activity recognition, particularly when LoS is obstructed by barriers, without relying on external or internal sensors or visual aids. Experiments are conducted using four different data collection scenarios and four different robotic arm activities. Performance results demonstrate that wavelet transform can significantly enhance the accuracy of vision transformer networks in robotic arm activity recognition. A sketch of the wavelet step follows.
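The sketch below illustrates the discrete wavelet transform preprocessing step, assuming the PyWavelets library; the wavelet family, decomposition level, and soft-thresholding denoising are illustrative choices, not the paper's exact configuration.

```python
# A minimal sketch of DWT preprocessing for a CSI time series, assuming
# PyWavelets. The synthetic signal stands in for real CSI measurements.
import numpy as np
import pywt

# Hypothetical CSI stream: 1000 time samples for one subcarrier.
csi = np.random.randn(1000)

# Multi-level decomposition separates coarse activity patterns
# (approximation) from fine-grained motion detail (detail coefficients).
coeffs = pywt.wavedec(csi, wavelet="db4", level=3)
approx, *details = coeffs
print([c.shape for c in coeffs])

# One common option: shrink small detail coefficients (soft thresholding)
# before reconstructing the signal fed to the classifier.
threshold = 0.5
denoised = [approx] + [pywt.threshold(d, threshold, mode="soft") for d in details]
clean = pywt.waverec(denoised, wavelet="db4")
```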
  4. To fill a gap in online educational tools, we are working to support search in lecture videos using formulas from lecture notes and vice versa. We use an existing system to convert single-shot lecture videos to keyframe images that capture whiteboard contents along with the times they appear. We train classifiers for handwritten symbols using the CROHME dataset, and for LaTeX symbols using generated images. Symbols detected in video keyframes and LaTeX formula images are indexed using Line-of-Sight graphs. For search, we look up pairs of symbols that can 'see' each other, and connected pairs are merged to identify the largest match within each indexed image. We rank matches using symbol class probabilities and angles between symbol pairs. We demonstrate how our method effectively locates formulas between typeset and handwritten images using a set of linear algebra lectures. By combining our search engine (Tangent-V) with temporal keyframe metadata, we are able to navigate to where a query formula in LaTeX is first handwritten in a lecture video. Our system is available as open-source. For other domains, only the OCR modules require updating. A sketch of the ranking idea follows.
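The sketch below illustrates the ranking idea: matches are scored by combining symbol class probabilities with agreement between query and candidate pair angles. The scoring formula is an assumption for illustration, not Tangent-V's exact ranking function.

```python
# A rough sketch of pair-based match scoring: confident symbol classes and
# similar pair angles raise a candidate's rank.
import math

def pair_score(prob_a, prob_b, query_angle, cand_angle):
    """Score one matched symbol pair via class probabilities and angle match."""
    # Wrap the angle difference into [0, pi] before comparing.
    angle_diff = abs(math.atan2(math.sin(query_angle - cand_angle),
                                math.cos(query_angle - cand_angle)))
    angle_sim = 1.0 - angle_diff / math.pi   # 1 when aligned, 0 when opposite
    return prob_a * prob_b * angle_sim

def rank_matches(matches):
    """Each match is a list of (prob_a, prob_b, q_angle, c_angle) pairs."""
    return sorted(matches,
                  key=lambda m: sum(pair_score(*p) for p in m),
                  reverse=True)

candidates = [
    [(0.9, 0.8, 0.0, 0.1)],            # close angle, confident symbols
    [(0.9, 0.8, 0.0, math.pi / 2)],    # same symbols, rotated pair
]
print(rank_matches(candidates)[0])     # the well-aligned match ranks first
```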
  5. Visual relationship reasoning is a crucial yet challenging task for understanding rich interactions across visual concepts. For example, a relationship 'man, open, door' involves a complex relation 'open' between concrete entities 'man, door'. While much of the existing work has studied this problem in the context of still images, understanding visual relationships in videos has received limited attention. Due to their temporal nature, videos enable us to model and reason about a more comprehensive set of visual relationships, such as those requiring multiple (temporal) observations (e.g., 'man, lift up, box' vs. 'man, put down, box'), as well as relationships that are often correlated through time (e.g., 'woman, pay, money' followed by 'woman, buy, coffee'). In this paper, we construct a Conditional Random Field on a fully-connected spatio-temporal graph that exploits the statistical dependency between relational entities spatially and temporally. We introduce a novel gated energy function parametrization that learns adaptive relations conditioned on visual observations. Our model optimization is computationally efficient, and its space complexity is significantly amortized through our proposed parametrization. Experimental results on benchmark video datasets (ImageNet Video and Charades) demonstrate state-of-the-art performance across three standard relationship reasoning tasks: Detection, Tagging, and Recognition. A sketch of a gated energy term follows.
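The sketch below shows the general shape of a gated energy term: a sigmoid gate, conditioned on the visual observation, adaptively scales each relation's contribution. The paper's actual CRF parametrization is more involved, so treat this only as a loose illustration under assumed shapes and names.

```python
# A loose sketch of a gated pairwise energy over relation labels for one
# entity pair, assuming a simple sigmoid gate over a visual feature vector.
import numpy as np

rng = np.random.default_rng(0)
dim, n_relations = 16, 5

W_gate = rng.standard_normal((n_relations, dim))     # hypothetical gate weights
W_energy = rng.standard_normal((n_relations, dim))   # hypothetical base weights

def gated_energy(x):
    """Energy over relation labels for one entity pair's visual feature x.

    The sigmoid gate adaptively scales each relation's base energy
    conditioned on the observation, as motivated in the abstract above.
    """
    gate = 1.0 / (1.0 + np.exp(-W_gate @ x))         # per-relation gate in (0, 1)
    base = W_energy @ x                              # per-relation base energy
    return gate * base                               # lower energy = more likely

x = rng.standard_normal(dim)                         # one pair's visual feature
print(gated_energy(x))
```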