MouseScholar: Evaluating an Image+Text Search System for Biocuration

Trabucco, Juan Trelles; Floricel, Carla; Arighi, Cecilia; Shatkay, Hagit; Raciti, Daniela; Ringwald, Martin; Marai, G Elisabeta

doi:10.1109/BIBM58861.2023.10385503

Citation Details

MouseScholar: Evaluating an Image+Text Search System for Biocuration

Biocuration is the process of analyzing biological or biomedical articles to organize biological data into data repositories using taxonomies and ontologies. Due to the expanding number of articles and the relatively small number of biocurators, automation is desired to improve the workflow of assessing articles worth curating. As figures convey essential information, automatically integrating images may improve curation. In this work, we instantiate and evaluate a first-in-kind, hybrid image+text document search system for biocuration. The system, MouseScholar, leverages an image modality taxonomy derived in collaboration with biocurators, in addition to figure segmentation, and classifiers components as a back-end and a streamlined front-end interface to search and present document results. We formally evaluated the system with ten biocurators on a mouse genome informatics biocuration dataset and collected feedback. The results demonstrate the benefits of blending text and image information when presenting scientific articles for biocuration. more »

Award ID(s):: 2320261

PAR ID:: 10536556

Author(s) / Creator(s):: Trabucco, Juan Trelles; Floricel, Carla; Arighi, Cecilia; Shatkay, Hagit; Raciti, Daniela; Ringwald, Martin; Marai, G Elisabeta

Publisher / Repository:: IEEE Xplore

Date Published:: 2023-12-05

ISBN:: 979-8-3503-3748-8

Page Range / eLocation ID:: 1473-1480

Subject(s) / Keyword(s):: document search biocuration

Format(s):: Medium: X

Location:: Istanbul, Turkiye

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1109/BIBM58861.2023.10385503

More Like this