K-means clustering is a widely used machine learning method for identifying patterns in large datasets. Recently, semidefinite programming (SDP) relaxations have been proposed for solving the K-means optimization problem, which enjoy strong statistical optimality guarantees. However, the prohibitive cost of implementing an SDP solver renders these guarantees inaccessible to practical datasets. In contrast, nonnegative matrix factorization (NMF) is a simple clustering algorithm widely used by machine learning practitioners, but it lacks a solid statistical underpinning and theoretical guarantees. In this paper, we consider an NMF-like algorithm that solves a nonnegative low-rank restriction of the SDP-relaxed K-means formulation using a nonconvex Burer--Monteiro factorization approach. The resulting algorithm is as simple and scalable as state-of-the-art NMF algorithms while also enjoying the same strong statistical optimality guarantees as the SDP. In our experiments, we observe that our algorithm achieves significantly smaller mis-clustering errors compared to the existing state-of-the-art while maintaining scalability.
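As a rough illustration of the Burer--Monteiro idea described above: with affinity matrix A = XXᵀ, the SDP variable Z is replaced by a nonnegative low-rank factorization Z = UUᵀ and optimized directly over U. The sketch below is a minimal caricature of this approach, not the paper's actual algorithm: the SDP constraints Z1 = 1 and tr(Z) = k are only crudely approximated by a heuristic row normalization, and the step size is illustrative.

```python
import numpy as np

def bm_kmeans(X, k, n_iter=500, lr=1e-3, seed=0):
    """Projected gradient ascent on f(U) = <XX^T, U U^T> over U >= 0,
    a caricature of the Burer-Monteiro factorization Z = U U^T of the
    K-means SDP. The constraints Z1 = 1 and tr(Z) = k are only
    approximated here by a heuristic row normalization of U."""
    rng = np.random.default_rng(seed)
    A = X @ X.T                                   # pairwise affinity matrix
    U = np.abs(rng.standard_normal((X.shape[0], k)))
    for _ in range(n_iter):
        U += lr * 2 * A @ U                       # gradient of <A, U U^T> w.r.t. U
        U = np.maximum(U, 0.0)                    # keep the low-rank factor nonnegative
        U /= np.maximum(U.sum(axis=1, keepdims=True), 1e-12)
    return U.argmax(axis=1)                       # read cluster labels off the factor
```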
Analysis of student essays in an introductory physics course using natural language processing
We analyzed essays written on various topics in an introductory physics course using two unsupervised machine learning algorithms. One was Latent Dirichlet Allocation (LDA), an algorithm for extracting abstract topics from a collection of text documents. The other was Non-negative Matrix Factorization (NMF), which is used for similar purposes but also in other domains such as image recognition. We applied the two algorithms to a dataset consisting of N = 683 student essays. Although there are some important built-in differences between LDA and NMF, they by and large found similar topics in our data. This offers instructors a promising and productive way of accessing useful information about their students' written work, especially in large-enrollment classes.
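A minimal sketch of this kind of topic-model comparison using scikit-learn; the placeholder essays, topic count, and vectorizer settings are illustrative assumptions, not the authors' actual pipeline.

```python
from sklearn.feature_extraction.text import CountVectorizer, TfidfVectorizer
from sklearn.decomposition import LatentDirichletAllocation, NMF

essays = [                      # placeholder documents, not the real dataset
    "energy is conserved as the ball falls and speeds up",
    "the net force on the cart determines its acceleration",
    "potential energy converts to kinetic energy near the ground",
    "friction is a contact force that opposes the cart's motion",
]
n_topics = 2                    # illustrative; chosen per dataset in practice

# LDA models raw term counts; NMF is commonly paired with TF-IDF weights.
counts = CountVectorizer(stop_words="english").fit_transform(essays)
tfidf_vectorizer = TfidfVectorizer(stop_words="english")
tfidf = tfidf_vectorizer.fit_transform(essays)

lda = LatentDirichletAllocation(n_components=n_topics, random_state=0).fit(counts)
nmf = NMF(n_components=n_topics, random_state=0).fit(tfidf)

# Inspect the top words of each NMF topic for comparison with LDA's topics.
terms = tfidf_vectorizer.get_feature_names_out()
for i, component in enumerate(nmf.components_):
    top = component.argsort()[::-1][:5]
    print(f"topic {i}:", [terms[j] for j in top])
```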
- Award ID(s): 2300645
- PAR ID: 10610778
- Publisher / Repository: American Association of Physics Teachers
- Date Published:
- Page Range / eLocation ID: 58 to 63
- Format(s): Medium: X
- Location: Sacramento, CA
- Sponsoring Org: National Science Foundation
More Like this
In this paper we propose a quasi-Newton algorithm for the celebrated nonnegative matrix factorization (NMF) problem. The proposed algorithm falls into the general framework of Gauss-Newton and Levenberg-Marquardt methods; however, these methods cannot directly handle the nonnegativity constraints present in NMF. One of the key contributions of this paper is to apply the alternating direction method of multipliers (ADMM) to obtain the iterative update from this Gauss-Newton-like algorithm. Furthermore, we carefully study the structure of the Jacobian Gramian matrix given by the Gauss-Newton updates and design a way of exactly inverting this matrix with complexity $\mathcal{O}(mnk)$, a significant reduction from the $\mathcal{O}((m+n)^3k^3)$ complexity of a naive implementation. The resulting algorithm, which we call NLS-ADMM, enjoys the fast convergence rate brought by the quasi-Newton algorithmic framework while maintaining a low per-iteration complexity similar to that of alternating algorithms. Numerical experiments on synthetic data confirm the efficiency of our proposed algorithm.
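Since this abstract centers on using ADMM to impose nonnegativity inside a Newton-type step, a minimal sketch of that splitting idea on a single nonnegative least-squares subproblem may help. This is an illustration of the constraint handling only, under my own simplified formulation, not the paper's full NLS-ADMM algorithm.

```python
import numpy as np

def nnls_admm(W, V, rho=1.0, n_iter=100):
    """ADMM for min_{H >= 0} 0.5 * ||V - W H||_F^2, via the splitting
    H = Z with Z constrained to the nonnegative orthant. Illustrates
    the constraint handling only, not the full NLS-ADMM scheme."""
    k, n = W.shape[1], V.shape[1]
    Z = np.zeros((k, n))                 # nonnegative copy of H
    U = np.zeros((k, n))                 # scaled dual variable
    G = W.T @ W + rho * np.eye(k)        # penalized Gramian; factor once
    L = np.linalg.cholesky(G)
    WtV = W.T @ V
    for _ in range(n_iter):
        rhs = WtV + rho * (Z - U)        # H-update: ridge-regularized LS
        H = np.linalg.solve(L.T, np.linalg.solve(L, rhs))
        Z = np.maximum(H + U, 0.0)       # Z-update: project onto H >= 0
        U += H - Z                       # dual ascent on the consensus H = Z
    return Z
```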
Nonnegative Matrix Factorization (NMF) is broadly used to determine class membership in a variety of clustering applications. From movie recommendations and image clustering to visual feature extraction, NMF can be applied to a large number of knowledge discovery and data mining problems. Traditional optimization methods, such as the Multiplicative Updating Algorithm (MUA), solve the NMF problem by utilizing an auxiliary function that ensures the objective decreases monotonically. Although the objective in MUA converges, there exists no proof that the learned matrix factors converge as well. Without this rigorous analysis, the clustering performance and stability of NMF algorithms cannot be guaranteed. To address this knowledge gap, in this article we study the factor-bounded NMF problem and provide a solution algorithm with convergence proven by rigorous mathematical analysis, which ensures that both the objective and the matrix factors converge. In addition, we show the relationship between MUA and our solution, followed by an analysis of the convergence of MUA. Experiments on both toy data and real-world datasets validate the correctness of our proposed method and its utility as an effective clustering algorithm.
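For reference, the classic multiplicative updates (Lee--Seung) that MUA-style analyses build on look like the following. This is a plain illustrative baseline without the factor bounds the article introduces to obtain provable factor convergence.

```python
import numpy as np

def nmf_multiplicative(V, k, n_iter=200, eps=1e-10, seed=0):
    """Classic multiplicative updates for min_{W,H >= 0} ||V - W H||_F^2.
    Each factor is multiplied by a ratio that equals 1 at a stationary
    point, so the objective is non-increasing; note this alone does not
    prove that W and H themselves converge, which is the article's point."""
    rng = np.random.default_rng(seed)
    m, n = V.shape
    W = rng.random((m, k)) + eps
    H = rng.random((k, n)) + eps
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H
```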
Linear discriminant analysis (LDA) is widely used for dimensionality reduction in supervised learning settings. The traditional LDA objective minimizes a ratio of squared Euclidean distances, which may not perform optimally on noisy datasets. Multiple robust LDA objectives have been proposed to address this problem, but their implementations have two major limitations. First, their mean calculations use the squared ℓ2-norm distance to center the data, which is not valid when the objective does not use the Euclidean distance. Second, there is no generalized optimization algorithm for solving the different robust LDA objectives. In addition, most existing algorithms can only guarantee a locally optimal solution rather than a globally optimal one. In this paper, we review multiple robust loss functions and propose a new, generalized robust objective for LDA. Moreover, to better remove the mean from the data, our objective centers the data optimally through learning. As an important algorithmic contribution, we derive an efficient iterative algorithm to optimize the resulting non-smooth and non-convex objective function. We theoretically prove that our solution algorithm guarantees that both the objective and the solution sequences converge to globally optimal solutions at a sub-linear convergence rate. The experimental results demonstrate the effectiveness of our new method, achieving significant improvements over the competing methods.
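For context, a minimal sketch of the classical (non-robust) LDA projection that such robust objectives generalize: build within- and between-class scatter matrices and take the top generalized eigenvectors. This is a textbook baseline, not the paper's robust objective or algorithm; the small ridge term is an assumption added for numerical stability.

```python
import numpy as np
from scipy.linalg import eigh

def lda_project(X, y, dim):
    """Classical LDA: project onto the top generalized eigenvectors of
    the between-class scatter Sb relative to the within-class scatter Sw.
    The small ridge on Sw is an added assumption for numerical stability."""
    d = X.shape[1]
    mu = X.mean(axis=0)
    Sw, Sb = np.zeros((d, d)), np.zeros((d, d))
    for c in np.unique(y):
        Xc = X[y == c]
        mc = Xc.mean(axis=0)
        Sw += (Xc - mc).T @ (Xc - mc)
        Sb += len(Xc) * np.outer(mc - mu, mc - mu)
    vals, vecs = eigh(Sb, Sw + 1e-8 * np.eye(d))  # generalized eigenproblem
    order = np.argsort(vals)[::-1][:dim]          # largest ratios first
    return X @ vecs[:, order]
```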