Title: Preliminary Experiments with Transformer based Approaches To Automatically Inferring Domain Models from Textbooks
Abstract: Domain modeling is a central component of education technologies, as it represents the target domain that students are supposed to train on and eventually master. Automatically generating domain models can yield substantial cost and scalability benefits. Automatically extracting key concepts or knowledge components from, for instance, textbooks can enable automatic or semi-automatic processes for creating domain models. In this work, we explore the use of transformer-based pre-trained models for the task of keyphrase extraction. Specifically, we investigate and evaluate four variants of BERT, a pre-trained transformer-based architecture, that differ in their training data, training objective, or training strategy, and use them to extract knowledge components from textbooks in the intro-to-programming domain. We report results obtained with the following BERT-based models: BERT, CodeBERT, SciBERT, and RoBERTa.
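As a concrete illustration of the kind of pipeline the abstract describes, the sketch below frames keyphrase (knowledge-component) extraction as BIO token classification and swaps in the four encoder variants via their public Hugging Face checkpoint names. This is a hedged sketch under those assumptions, not the authors' implementation; the label set and example sentence are illustrative only.

```python
# Hypothetical sketch (not the authors' released code): keyphrase /
# knowledge-component extraction framed as BIO token classification,
# with the four encoder variants swapped in via Hugging Face checkpoints.
import torch
from transformers import AutoTokenizer, AutoModelForTokenClassification

CHECKPOINTS = {
    "BERT": "bert-base-uncased",
    "RoBERTa": "roberta-base",
    "SciBERT": "allenai/scibert_scivocab_uncased",
    "CodeBERT": "microsoft/codebert-base",
}
LABELS = ["O", "B-KP", "I-KP"]  # illustrative BIO tags marking keyphrase spans


def load_extractor(variant: str):
    """Load a pre-trained encoder with a (randomly initialized) token-classification head."""
    name = CHECKPOINTS[variant]
    tokenizer = AutoTokenizer.from_pretrained(name)
    model = AutoModelForTokenClassification.from_pretrained(name, num_labels=len(LABELS))
    return tokenizer, model


tokenizer, model = load_extractor("SciBERT")
sentence = "A for loop repeats a block of statements a fixed number of times."
inputs = tokenizer(sentence, return_tensors="pt")
with torch.no_grad():
    logits = model(**inputs).logits        # (1, seq_len, num_labels)
tags = [LABELS[i] for i in logits.argmax(-1)[0].tolist()]
# In practice the head is fine-tuned on annotated textbook sentences first;
# contiguous B-KP/I-KP tags are then read off as candidate knowledge components.
```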
Award ID(s): 1822752
PAR ID: 10367869
Author(s) / Creator(s):
Editor(s): Antonija Mitrovic and Nigel Bosch
Date Published:
Journal Name: Proceedings of the 15th International Conference on Educational Data Mining
Page Range / eLocation ID: 667–672
Format(s): Medium: X
Sponsoring Org: National Science Foundation
More Like this
  1. Recent years have witnessed the enormous success of text representation learning in a wide range of text mining tasks. Earlier word embedding learning approaches represent words as fixed low-dimensional vectors to capture their semantics, and the word embeddings so learned are used as the input features of task-specific models. Recently, pre-trained language models (PLMs), which learn universal language representations by pre-training Transformer-based neural models on large-scale text corpora, have revolutionized the natural language processing (NLP) field. Such pre-trained representations encode generic linguistic features that can be transferred to almost any text-related application. PLMs outperform previous task-specific models in many applications because they only need to be fine-tuned on the target corpus instead of being trained from scratch. In this tutorial, we introduce recent advances in pre-trained text embeddings and language models, as well as their applications to a wide range of text mining tasks. Specifically, we first overview a set of recently developed self-supervised and weakly supervised text embedding methods and pre-trained language models that serve as the foundation for downstream tasks. We then present several new methods based on pre-trained text embeddings and language models for various text mining applications such as topic discovery and text classification. We focus on methods that are weakly supervised, domain-independent, language-agnostic, effective, and scalable for mining and discovering structured knowledge from large-scale text corpora. Finally, we demonstrate with real-world datasets how pre-trained text representations help mitigate the human annotation burden and facilitate automatic, accurate, and efficient text analyses. (A minimal embedding-based sketch of this idea appears after this list.)
  2. Neural networks provide new possibilities to automatically learn complex language patterns and query-document relations. Neural IR models have achieved promising results in learning query-document relevance patterns, but few explorations have been done on understanding the text content of a query or a document. This paper studies leveraging a recently proposed contextual neural language model, BERT, to provide deeper text understanding for IR. Experimental results demonstrate that the contextual text representations from BERT are more effective than traditional word embeddings. Compared to bag-of-words retrieval models, the contextual language model can better leverage language structures, bringing large improvements on queries written in natural language. Combining this text understanding ability with search knowledge leads to an enhanced pre-trained BERT model that can benefit related search tasks where training data are limited. (A hedged re-ranking sketch of this setup appears after this list.)
  3. Systems for knowledge-intensive tasks such as open-domain question answering (QA) usually consist of two stages: efficient retrieval of relevant documents from a large corpus and detailed reading of the selected documents. This is usually done through two separate models: a retriever that encodes the query and finds nearest neighbors, and a reader based on Transformers. These two components are usually modeled separately, which necessitates a cumbersome implementation and is awkward to optimize in an end-to-end fashion. In this paper, we revisit this design and eschew the separate architecture and training in favor of a single Transformer that performs retrieval as attention (RAA), with end-to-end training based solely on supervision from the end QA task. We demonstrate for the first time that an end-to-end trained single Transformer can achieve both competitive retrieval and QA performance on in-domain datasets, matching or even slightly outperforming state-of-the-art dense retrievers and readers. Moreover, end-to-end adaptation of our model significantly boosts its performance on out-of-domain datasets in both supervised and unsupervised settings, making our model a simple and adaptable end-to-end solution for knowledge-intensive tasks. (A sketch of the conventional two-stage retriever it replaces appears after this list.)