Visual Parsing with Query-Driven Global Graph Attention (QD-GGA): Preliminary Results for Handwritten Math Formula Recognition

Mahdavi, Mahshad; Sun, Leilei; Zanibbi, Richard

doi:10.1109/CVPRW50498.2020.00293

Citation Details

Visual Parsing with Query-Driven Global Graph Attention (QD-GGA): Preliminary Results for Handwritten Math Formula Recognition

We present a new visual parsing method based on convolutional neural networks for handwritten mathematical formulas. The Query-Driven Global Graph Attention (QD- GGA) parsing model employs multi-task learning, and uses a single feature representation for locating, classifying, and relating symbols. First, a Line-Of-Sight (LOS) graph is computed over the handwritten strokes in a formula. Second, class distributions for LOS nodes and edges are obtained using query-specific feature filters (i.e., attention) in a single feed-forward pass. Finally, a Maximum Spanning Tree (MST) is extracted from the weighted graph. Our preliminary results show that this is a promising new approach for visual parsing of handwritten formulas. Our data and source code are publicly available. more »

Award ID(s):: 1717997

PAR ID:: 10198732

Author(s) / Creator(s):: Mahdavi, Mahshad; Sun, Leilei; Zanibbi, Richard

Date Published:: 2020-06-01

Journal Name:: Proc. CVPR Workshop on Text and Documents ion the Deep Learning Era

Page Range / eLocation ID:: 2429 to 2438

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1109/CVPRW50498.2020.00293

More Like this