Summarizing Lecture Videos by Key Handwritten Content Regions

Urala Kota, Bhargava; Ahmed, Saleem; Stone, Alexander; Davila, Kenny; Setlur, Srirangaraj; Govindaraju, Venu

doi:10.1109/ICDARW.2019.30058

Citation Details

Summarizing Lecture Videos by Key Handwritten Content Regions

We introduce a novel method for summarization of whiteboard lecture videos using key handwritten content regions. A deep neural network is used for detecting bounding boxes that contain semantically meaningful groups of handwritten content. A neural network embedding is learnt, under triplet loss, from the detected regions in order to discriminate between unique handwritten content. The detected regions along with embeddings at every frame of the lecture video are used to extract unique handwritten content across the video which are presented as the video summary. Additionally, a spatiotemporal index is constructed from the video which records the time and location of each individual summary region in the video which can potentially be used for content-based search and navigation. We train and test our methods on the publicly available AccessMath dataset. We use the DetEval scheme to benchmark our summarization by recall of unique ground truth objects (92.09%) and average number of summary regions (128) compared to the ground truth (88). more »

Award ID(s):: 1640867

NSF-PAR ID:: 10188708

Author(s) / Creator(s):: Urala Kota, Bhargava; Ahmed, Saleem; Stone, Alexander; Davila, Kenny; Setlur, Srirangaraj; Govindaraju, Venu

Date Published:: 2019-09-01

Journal Name:: 2019 International Conference on Document Analysis and Recognition Workshops (ICDARW)

Page Range / eLocation ID:: 13 to 18

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1109/ICDARW.2019.30058

More Like this