Search for: All records

Award ID contains: 2145411

« Prev Next »

Total Resources

7

Resource Type
Conference Paper

5

Conference Proceeding

0

Dataset

2

Journal Article

0

Workshop Report

0

Availability
Full Text / Resource Available

7

Citation Only

0

Save Results
Excel (limit 2000)
CSV (limit 5000)
XML (limit 5000)

Have feedback or suggestions for a way to improve these results?
!

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

PubMed Author-assigned Keyword Extraction (PubMedAKE) Benchmark

https://doi.org/10.1145/3511808.3557675

Sheng, Jiasheng ; Gero, Zelalem ; Ho, Joyce C. ( October 2022 , CIKM '22: Proceedings of the 31st ACM International Conference on Information & Knowledge Management)

With the ever-increasing abundance of biomedical articles, improving the accuracy of keyword search results becomes crucial for ensuring reproducible research. However, keyword extraction for biomedical articles is hard due to the existence of obscure keywords and the lack of a comprehensive benchmark. PubMedAKE is an author-assigned keyword extraction dataset that contains the title, abstract, and keywords of over 843,269 articles from the PubMed open access subset database. This dataset, publicly available on Zenodo, is the largest keyword extraction benchmark with sufficient samples to train neural networks. Experimental results using state-of-the-art baseline methods illustrate the need for developing automatic keyword extraction methods for biomedical literature.
more » « less
Full Text Available
PGB: A PubMed Graph Benchmark for Heterogeneous Network Representation Learning

https://doi.org/10.5281/zenodo.6406776

W., Eric Lee ; C., Joyce Ho ( January 2022 , Zenodo)

PubMed Graph Benchmark (PGB) aggregates the metadata associated with the biomedical articles from PubMed into a unified source. The benchmark contains metadata including title, abstract, authors, in/out citations, MeSH terms, MeSH hierarchy, venue, publication type, and chemicals.

more » « less
SR-CoMbEr: Heterogeneous Network Embedding Using Community Multi-view Enhanced Graph Convolutional Network for Automating Systematic Reviews

Lee, Eric W. ; Ho, Joyce C. ( April 2023 , Advances in Information Retrieval: 45th European Conference on Information Retrieval)

Systematic reviews (SRs) are a crucial component of evidence-based clinical practice. Unfortunately, SRs are labor-intensive and unscalable with the exponential growth in literature. Automating evidence synthesis using machine learning models has been proposed but solely focuses on the text and ignores additional features like citation information. Recent work demonstrated that citation embeddings can outperform the text itself, suggesting that better network representation may expedite SRs. Yet, how to utilize the rich information in heterogeneous information networks (HIN) for network embeddings is understudied. Existing HIN models fail to produce a high-quality embedding compared to simply running state-of-the-art homogeneous network models. To address existing HIN model limitations, we propose SR-CoMbEr, a community-based multi-view graph convolutional network for learning better embeddings for evidence synthesis. Our model automatically discovers article communities to learn robust embeddings that simultaneously encapsulate the rich semantics in HINs. We demonstrate the effectiveness of our model to automate 15 SRs.
more » « less
Full Text Available
Neighborhood-Regularized Self-Training for Learning with Few Labels

Xu, Ran ; Yu, Yue ; Cui, Hejie ; Kan, Xuan ; Zhu, Yanqiao ; Ho, Joyce ; Zhang, Chao ; Yang, Carl ( January 2023 , Thirty-Seventh AAAI Conference on Artificial Intelligence)

Full Text Available
Weakly-Supervised Scientific Document Classification via Retrieval-Augmented Multi-Stage Training

Xu, Ran ; Yu, Yue ; Ho, Joyce ; Yang, Carl ( January 2023 , 46th International ACM SIGIR Conference on Research and Development in Information Retrieval)

Full Text Available
PubMed-OA-Extraction-dataset

https://doi.org/10.5281/zenodo.6330817

Sheng, Jiasheng ( January 2022 , Zenodo)

This is the train-test-validation dataset for pubmed open-access articles keyphrase extraction task. The small_* file contains the all articles that have 5to 25 extractive keyphrases (keyphrase in the article that is inside the abstract of the article).

more » « less
Counterfactual and Factual Reasoning over Hypergraphs for Interpretable Clinical Predictions on EHR

Xu, Ran ; Yu, Yue ; Zhang, Chao ; Ali, Mohammed K ; Ho, Joyce C ; Yang, Carl ( January 2022 , Proceedings of the 2nd Machine Learning for Health symposium)

Electronic Health Record modeling is crucial for digital medicine. However, existing models ignore higher-order interactions among medical codes and their causal relations towards downstream clinical predictions. To address such limitations, we propose a novel framework CACHE, to provide effective and insightful clinical predictions based on hypergraph representation learning and counterfactual and factual reasoning techniques. Experiments on two real EHR datasets show the superior performance of CACHE. Case studies with a domain expert illustrate a preferred capability of CACHE in generating clinically meaningful interpretations towards the correct predictions.
more » « less
Full Text Available