A lexical-based approach for exhaustive detection of missing hierarchical IS-A relations in SNOMED CT

Zheng, Fengbo; Shi, Jay; Cui, Licong


Incompleteness of ontologies affects the quality of downstream ontology-based applications. In this paper, we introduce a novel lexical-based approach to automatically detect potentially missing hierarchical IS-A relations in SNOMED CT. We model each concept with an enriched set of lexical features by leveraging words and noun phrases in the names of the concept itself and of its ancestors. We then perform subset inclusion checking to suggest potentially missing IS-A relations between concepts. Applied to the September 2017 release of SNOMED CT (US edition), our approach suggested a total of 38,615 potentially missing IS-A relations. For evaluation, a domain expert reviewed a random sample of 100 suggested missing IS-A relations selected from the "Clinical finding" sub-hierarchy and confirmed that 90 are valid (a precision of 90%). Additional review of the invalid suggestions further revealed incorrect existing IS-A relations. Our results demonstrate that systematic analysis of the enriched lexical features of concepts is an effective approach to identifying potentially missing hierarchical IS-A relations in SNOMED CT.
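The subset inclusion check described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it uses single words only (no noun phrases), a toy four-concept hierarchy with made-up identifiers, and it assumes that a concept whose enriched feature set properly contains another concept's set is the more specific of the two. The function names `ancestors`, `enriched_features`, and `suggest_missing_is_a` are hypothetical.

```python
import re
from typing import Dict, FrozenSet, List, Set, Tuple

# Toy data (hypothetical identifiers, not real SNOMED CT concept IDs).
NAMES: Dict[str, str] = {
    "finding": "Finding",
    "fracture": "Fracture",
    "femur_fracture": "Fracture of femur",
    "open_femur_fracture": "Open fracture of femur",
}
# Existing IS-A relations: child -> list of direct parents.
PARENTS: Dict[str, List[str]] = {
    "fracture": ["finding"],
    "femur_fracture": ["fracture"],
    "open_femur_fracture": ["fracture"],  # IS-A to femur_fracture is absent
}


def ancestors(concept: str, parents: Dict[str, List[str]]) -> Set[str]:
    """Return all transitive ancestors of a concept."""
    seen: Set[str] = set()
    stack = list(parents.get(concept, []))
    while stack:
        node = stack.pop()
        if node not in seen:
            seen.add(node)
            stack.extend(parents.get(node, []))
    return seen


def enriched_features(concept: str, names: Dict[str, str],
                      parents: Dict[str, List[str]]) -> FrozenSet[str]:
    """Words in the concept's name plus words in its ancestors' names."""
    words: Set[str] = set()
    for c in {concept} | ancestors(concept, parents):
        words.update(re.findall(r"[a-z0-9]+", names[c].lower()))
    return frozenset(words)


def suggest_missing_is_a(names: Dict[str, str],
                         parents: Dict[str, List[str]]) -> List[Tuple[str, str]]:
    """Suggest (child, parent) pairs that are not yet linked and whose
    feature sets satisfy features(parent) < features(child)."""
    anc = {c: ancestors(c, parents) for c in names}
    feats = {c: enriched_features(c, names, parents) for c in names}
    suggestions = []
    for child in names:
        for parent in names:
            if child == parent:
                continue
            # Skip pairs already connected in either direction.
            if parent in anc[child] or child in anc[parent]:
                continue
            if feats[parent] < feats[child]:  # proper subset
                suggestions.append((child, parent))
    return sorted(suggestions)


print(suggest_missing_is_a(NAMES, PARENTS))
```

On this toy input the only unlinked pair that passes the check is "Open fracture of femur" under "Fracture of femur". The pairwise loop is quadratic in the number of concepts, so a run over a full SNOMED CT release would need an indexed or partitioned comparison rather than this brute-force form.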

Award ID(s): 1931134

PAR ID: 10286742

Author(s) / Creator(s): Zheng, Fengbo; Shi, Jay; Cui, Licong

Date Published: 2020-01-01

Journal Name: AMIA Annual Symposium proceedings

Page Range / eLocation ID: 1392-1401

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Conference Paper:
The DOI is not currently available.
