Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher.
Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?
Some links on this page may take you to non-federal websites. Their policies may differ from this site.
-
Free, publicly-accessible full text available December 13, 2024
-
Purpose This study aims to evaluate a method of building a biomedical knowledge graph (KG).
Design/methodology/approach This research first constructs a COVID-19 KG on the COVID-19 Open Research Data Set, covering information over six categories (i.e. disease, drug, gene, species, therapy and symptom). The construction used open-source tools to extract entities, relations and triples. Then, the COVID-19 KG is evaluated on three data-quality dimensions: correctness, relatedness and comprehensiveness, using a semiautomatic approach. Finally, this study assesses the application of the KG by building a question answering (Q&A) system. Five queries regarding COVID-19 genomes, symptoms, transmissions and therapeutics were submitted to the system and the results were analyzed.
Findings With current extraction tools, the quality of the KG is moderate and difficult to improve, unless more efforts are made to improve the tools for entity extraction, relation extraction and others. This study finds that comprehensiveness and relatedness positively correlate with the data size. Furthermore, the results indicate the performances of the Q&A systems built on the larger-scale KGs are better than the smaller ones for most queries, proving the importance of relatedness and comprehensiveness to ensure the usefulness of the KG.
Originality/value The KG construction process, data-quality-based and application-based evaluations discussed in this paper provide valuable references for KG researchers and practitioners to build high-quality domain-specific knowledge discovery systems.
-
Abstract High throughput drug screening is an established approach to investigate tumor biology and identify therapeutic leads. Traditional platforms use two-dimensional cultures which do not accurately reflect the biology of human tumors. More clinically relevant model systems such as three-dimensional tumor organoids can be difficult to scale and screen. Manually seeded organoids coupled to destructive endpoint assays allow for the characterization of treatment response, but do not capture transitory changes and intra-sample heterogeneity underlying clinically observed resistance to therapy. We present a pipeline to generate bioprinted tumor organoids linked to label-free, time-resolved imaging via high-speed live cell interferometry (HSLCI) and machine learning-based quantitation of individual organoids. Bioprinting cells gives rise to 3D structures with unaltered tumor histology and gene expression profiles. HSLCI imaging in tandem with machine learning-based segmentation and classification tools enables accurate, label-free parallel mass measurements for thousands of organoids. We demonstrate that this strategy identifies organoids transiently or persistently sensitive or resistant to specific therapies, information that could be used to guide rapid therapy selection.