PubMed Author-assigned Keyword Extraction (PubMedAKE) Benchmark

Sheng, Jiasheng; Gero, Zelalem; Ho, Joyce C.

doi:10.1145/3511808.3557675

Citation Details

PubMed Author-assigned Keyword Extraction (PubMedAKE) Benchmark

With the ever-increasing abundance of biomedical articles, improving the accuracy of keyword search results becomes crucial for ensuring reproducible research. However, keyword extraction for biomedical articles is hard due to the existence of obscure keywords and the lack of a comprehensive benchmark. PubMedAKE is an author-assigned keyword extraction dataset that contains the title, abstract, and keywords of over 843,269 articles from the PubMed open access subset database. This dataset, publicly available on Zenodo, is the largest keyword extraction benchmark with sufficient samples to train neural networks. Experimental results using state-of-the-art baseline methods illustrate the need for developing automatic keyword extraction methods for biomedical literature. more »

Award ID(s):: 2145411 1838200 2124104

PAR ID:: 10419384

Author(s) / Creator(s):: Sheng, Jiasheng; Gero, Zelalem; Ho, Joyce C.

Date Published:: 2022-10-17

Journal Name:: CIKM '22: Proceedings of the 31st ACM International Conference on Information & Knowledge Management

Page Range / eLocation ID:: 4470 to 4474

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1145/3511808.3557675

More Like this