Plant science corpus

Shiu, Shin-Han

doi:10.5281/zenodo.10022686

Citation Details

Plant science corpus

The plant science corpus consists of the titles and abstracts of plant science articles in PubMed published prior to 2021 with a small number of 2021 records due to modification of records. The columns are: Index: integer index serving as identifier PMID: PubMed identifier Date: Publication date Journal: journal where the article was published Title: Title of the article Abstract: Abstract of the article Corpus: Title and abstract combined Text classification score: plant science record prediction model score Preprocessed corpus: Corpus after lower-casing, stop word removal, removal of non-alphanumeric and non-white space characters, lemmitisation Topic: index of topics after topic modeling more »

Award ID(s):: 2107215

PAR ID:: 10475805

Author(s) / Creator(s):: Shiu, Shin-Han

Publisher / Repository:: Zenodo

Date Published:: 2023-01-01

Format(s):: Medium: X

Location:: Michigan State University

Sponsoring Org:: National Science Foundation

Dataset:
https://doi.org/10.5281/zenodo.10022686

More Like this