NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Olala: object-level active learning based layout annotation

Z Shen, J Zhao (December 2022, Empirical Methods in Natural Language Processing)
LayoutParser: A Unified Toolkit for Deep Learning Based Document Image Analysis

https://doi.org/10.1007/978-3-030-86549-8_9

Shen, Zejiang; Zhang, Ruochen; Dell, Melissa; Lee, Benjamin; Carlson, Jacob; Li, Weining (January 2021, Proceedings of the International Conference on Document Analysis and Recognition)

Full Text Available
A Large Dataset of Historical Japanese Documents with Complex Layouts

Shen, Zejiang; Zhang, Kaixuan; Dell, Melissa (January 2020, IEEE/CVF Conference on Computer Vision and Pattern Recognition)
null (Ed.)
Deep learning-based approaches for automatic document layout analysis and content extraction have the potential to unlock rich information trapped in historical documents on a large scale. One major hurdle is the lack of large datasets for training robust models. In particular, little training data exist for Asian languages. To this end, we present HJDataset, a Large Dataset of Historical Japanese Documents with Complex Layouts. It contains over 250,000 layout element annotations of seven types. In addition to bounding boxes and masks of the content regions, it also includes the hierarchical structures and reading orders for layout elements. The dataset is constructed using a combination of human and machine efforts. A semi-rule based method is developed to extract the layout elements, and the results are checked by human inspectors. The resulting large-scale dataset is used to provide baseline performance analyses for text region detection using state-of-the-art deep learning models. And we demonstrate the usefulness of the dataset on real-world document digitization tasks.
more » « less
Full Text Available
A Large Dataset of Historical Japanese Documents with Complex Layouts.

https://doi.org/10.1109/CVPRW50498.2020.00282

Shen, Zejiang; Zhang, Kaixuan; Dell, Melissa (January 2020, IEEE/CVF Conference on Computer Vision and Pattern Recognition)
null (Ed.)
Full Text Available
Information Extraction from Text Regions with Complex Tabular Structure.

Zhang, Kaixuan; Shen, Zejiang; Zhou, Jie; Dell, Melissa (January 2019, Conference on Neural Information Processing Systems)
null (Ed.)
Recent innovations have improved layout analysis of document images, significantly improving our ability to identify text and non-text regions. However, extracting information from within text regions remains quite challenging because the text region may have a complex structure. In this paper, we present a new dataset with complex tabular structure, and propose new methods to robustly retrieve information from the complex text region.
more » « less
Full Text Available

Search for: All records