Title: Construction and evaluation of a domain-specific knowledge graph for knowledge discovery
Purpose

This study aims to evaluate a method of building a biomedical knowledge graph (KG).

Design/methodology/approach

This research first constructs a COVID-19 KG from the COVID-19 Open Research Data Set, covering information across six categories (i.e. disease, drug, gene, species, therapy and symptom). The construction uses open-source tools to extract entities, relations and triples. Then, the COVID-19 KG is evaluated on three data-quality dimensions (correctness, relatedness and comprehensiveness) using a semiautomatic approach. Finally, this study assesses the application of the KG by building a question answering (Q&A) system. Five queries regarding COVID-19 genomes, symptoms, transmissions and therapeutics were submitted to the system and the results were analyzed.
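As a rough illustration of the triple-based representation and the Q&A lookup described above, the following minimal Python sketch stores a few (head, relation, tail) triples and answers a query by direct lookup. The triples, relation names and helper functions are hypothetical and chosen only for illustration; they are not taken from the paper's KG, its extraction tools or its Q&A system.

```python
from collections import defaultdict
from typing import NamedTuple


class Triple(NamedTuple):
    head: str
    relation: str
    tail: str


# Toy triples for illustration only; not taken from the paper's KG.
TRIPLES = [
    Triple("SARS-CoV-2", "causes", "COVID-19"),
    Triple("COVID-19", "has_symptom", "fever"),
    Triple("COVID-19", "has_symptom", "cough"),
    Triple("remdesivir", "treats", "COVID-19"),
]


def build_index(triples):
    """Index tail entities by (head, relation) for direct lookups."""
    index = defaultdict(set)
    for t in triples:
        index[(t.head, t.relation)].add(t.tail)
    return index


def answer(index, head, relation):
    """Return the sorted tails for a (head, relation) query, or [] if unknown."""
    return sorted(index.get((head, relation), set()))


index = build_index(TRIPLES)
symptoms = answer(index, "COVID-19", "has_symptom")
print(symptoms)
```

In a sketch like this, a query whose triple was never extracted simply returns an empty list, which is one way to see why the comprehensiveness of the KG matters for a Q&A application.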

Findings

With current extraction tools, the quality of the KG is moderate and difficult to improve unless more effort is made to improve the tools for entity extraction, relation extraction and other tasks. This study finds that comprehensiveness and relatedness positively correlate with the data size. Furthermore, the results indicate that Q&A systems built on larger-scale KGs perform better than those built on smaller ones for most queries, proving the importance of relatedness and comprehensiveness in ensuring the usefulness of the KG.

Originality/value

The KG construction process and the data-quality-based and application-based evaluations discussed in this paper provide valuable references for KG researchers and practitioners to build high-quality domain-specific knowledge discovery systems.

 
Award ID(s):
2225229
NSF-PAR ID:
10472133
Author(s) / Creator(s):
; ; ; ;
Publisher / Repository:
Emerald Publishing Limited
Date Published:
Journal Name:
Information Discovery and Delivery
ISSN:
2398-6247
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Purpose

    This study aims to present the evaluation of a competency-based online professional development training program, PhD Progression, tied to a digital badge system, created to support PhD students across fields.

    Design/methodology/approach

    This study took place at Boston University, a large, nonprofit, Carnegie Classified R1 research-intensive institution located in the northeastern region of the USA. Through internal campus collaborations, the authors developed a PhD core capacities framework. Building from this framework, the authors designed the first learning level of the program and ran a pilot study with PhD students from various fields and at different stages of their PhD. Using surveys and focus groups, the authors collected both quantitative and qualitative data to evaluate this program.

    Findings

    The quantitative and qualitative data show that the majority of the PhD student participants found the contents of the competency-based training program useful, appropriate for building skills and knowledge and therefore relevant for both their degree progress and their future job. Gaining digital badges significantly increased their motivation to complete training modules.

    Practical implications

    This type of resource is scalable to other institutions that wish to provide self-paced professional development support to their PhD students while rewarding them for investing time in building professional skills and enabling them to showcase these skills to potential employers.

    Originality/value

    This study demonstrates, for the first time, that tying a digital badging system to a competency-based professional development program significantly motivates PhD students to set professional development goals and invest time in building skills.

     
  2. Hoppey, David ; Hall, Katrina ; Lynch, Megan (Ed.)
    Purpose

    There is some evidence to suggest that the historical challenge associated with recruiting and retaining Black and Brown Science, Technology, Engineering and Math (STEM) collegians is tied to their early teaching and learning experiences in Mathematics. This paper describes a National Science Foundation (NSF) funded project (NSF #2151043) whose goal is to attract, prepare and retain math teachers of color in high-need school districts and to ensure that those teachers remain in the field long enough to make a meaningful impact on the minds and hearts of BIPOC students who are often extrinsically and intrinsically discouraged from pursuing careers in STEM professions.

    Design/methodology/approach

    This mixed-methods study, which began in the summer of 2023, seeks to recruit, prepare, support and retain nineteen (19) Black and Brown math teachers for two (2) high-need urban school districts. The expectancy value theory will be used to explain the performance, persistence, and choices of the teachers, while grounded theory will be utilized to understand the impact of the intensive mentorship and wellness coaching that is applied over the first year of their preservice preparation and subsequent in-service years.

    Findings

    Measures of project efficacy won’t begin until 2025 and as such there are no findings or implications to draw from for the study at this time.

    Originality/value

    The intention of this paper is to augment the body of knowledge on recruiting and retaining Black and Brown math teachers for urban schools where the need for quality STEM teachers is critical.

     
  3. Purpose

    The purpose of this study was to examine the experiences of multiple campus teams as they engaged in the assessment of their science, technology, engineering and mathematics (STEM) mentoring ecosystems within a peer assessment dialogue exercise.

    Design/methodology/approach

    This project utilized a qualitative multicase study method involving six campus teams, drawing upon completed inventory and visual mapping artefacts, session observations and debriefing interviews. The campuses included research universities, small colleges and minority-serving institutions (MSIs) across the United States of America. The authors analysed which features of the peer assessment dialogue exercise scaffolded participants' learning about ecosystem synergies and threats.

    Findings

    The results illustrated the benefit of instructor modelling, intra-team process time and multiple rounds of peer assessment. Participants gained new insights into their own campuses and an increased sense of possibility by dialoguing with peer campuses.

    Research limitations/implications

    This project involved teams from a small set of institutions, relying on observational and self-reported debriefing data. Future research could centre perspectives of institutional leaders.

    Practical implications

    The authors recommend dedicating time to the institutional assessment of mentoring ecosystems. Investing in a campus-wide mentoring infrastructure could align with campus equity goals.

    Originality/value

    In contrast to studies that have focussed solely on programmatic outcomes of mentoring, this study explored strategies to strengthen institutional mentoring ecosystems in higher education, with a focus on peer assessment, dialogue and learning exercises.

     
  4. Purpose

    This study aims to explore how network visualization provides opportunities for learners to explore data literacy concepts using locally and personally relevant data.

    Design/methodology/approach

    The researchers designed six locally relevant network visualization activities to support students’ data reasoning practices toward understanding aggregate patterns in data. Cultural historical activity theory (Engeström, 1999) guides the analysis to identify how network visualization activities mediate students’ emerging understanding of aggregate data sets.

    Findings

    Pre/posttest findings indicate that this implementation positively impacted students’ understanding of network visualization concepts, as they were able to identify and interpret key relationships from novel networks. Interaction analysis (Jordan and Henderson, 1995) of video data revealed nuances of how activities mediated students’ improved ability to interpret network data. Some challenges noted in other studies, such as students’ tendency to focus on familiar concepts, were also observed, and teachers supported conversations to help students move beyond them.

    Originality/value

    To the best of the authors’ knowledge, this is the first study that supported elementary students in exploring data literacy through network visualization. The authors discuss how network visualizations and locally/personally meaningful data provide opportunities for learning data literacy concepts across the curriculum.

     
  5. Purpose

    The purpose of this study is to develop a deep learning framework for additive manufacturing (AM) that can detect different defect types without being trained on specific defect data sets and can be applied for real-time process control.

    Design/methodology/approach

    This study develops an explainable artificial intelligence (AI) framework, a zero-bias deep neural network (DNN) model for real-time defect detection during the AM process. In this method, the last dense layer of the DNN is replaced by two consecutive parts: a regular dense layer (denoted L1) for dimensional reduction, and a similarity matching layer (denoted L2) for equal-weight and non-biased cosine similarity matching. Grayscale images of 3D printed samples acquired during printing were used as the input to the zero-bias DNN.

    Findings

    This study demonstrates that the approach is capable of successfully detecting multiple types of defects, such as cracks, stringing and warping, with an accuracy of 99.5% and without any prior training on defective data sets.

    Practical implications

    Once the model is set up, the computational time for anomaly detection is shorter than the image acquisition time, indicating the potential for real-time process control. It can also be used to minimize manual processing in AI-enabled AM.

    Originality/value

    To the best of the authors’ knowledge, this is the first study to use a zero-bias DNN, an explainable AI approach, for defect detection in AM.

     
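The similarity matching layer mentioned in item 5 can be pictured with a generic cosine similarity classifier. The sketch below is a plain-Python illustration under stated assumptions, not the authors' zero-bias DNN: the prototype vectors, the feature vector and the function names are hypothetical, and the feature vector stands in for the output of the dimensional-reduction layer (L1).

```python
import math


def l2_normalize(v):
    """Scale a vector to unit length; a zero vector is returned unchanged."""
    norm = math.sqrt(sum(x * x for x in v))
    if norm == 0.0:
        return list(v)
    return [x / norm for x in v]


def cosine_scores(features, prototypes):
    """Cosine similarity between one feature vector and each class prototype.

    No bias term is added and every vector is normalized, so the score
    depends only on direction, not on magnitude.
    """
    f = l2_normalize(features)
    return [sum(a * b for a, b in zip(f, l2_normalize(p))) for p in prototypes]


def classify(features, prototypes):
    """Return the index of the best-matching prototype and all scores."""
    scores = cosine_scores(features, prototypes)
    best = max(range(len(scores)), key=scores.__getitem__)
    return best, scores


# Hypothetical 2-D prototypes, one per class (illustrative values only).
prototypes = [[1.0, 0.0], [0.0, 1.0]]
label, scores = classify([3.0, 0.5], prototypes)
print(label, scores)
```

Because the scores are bounded and magnitude-independent, scaling the input features by a constant leaves the scores essentially unchanged, which is the property a bias-free cosine matching step relies on.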