NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Unpacking Response Process Issues Encountered When Developing a Mathematics Teachers’ Pedagogical Content Knowledge (PCK) Assessment

https://doi.org/10.1080/19477503.2023.2201115

Epstein, Martha L.; Malik, Hamza; Wang, Kun; Orrill, Chandra H. (July 2023, Investigations in Mathematics Learning)

Full Text Available
The Impact of Sample Size and Various Other Factors on Estimation of Dichotomous Mixture IRT Models

https://doi.org/10.1177/00131644221094325

Sen, Sedat; Cohen, Allan S. (May 2022, Educational and Psychological Measurement)

The purpose of this study was to examine the effects of different data conditions on item parameter recovery and classification accuracy of three dichotomous mixture item response theory (IRT) models: the Mix1PL, Mix2PL, and Mix3PL. Manipulated factors in the simulation included the sample size (11 different sample sizes from 100 to 5000), test length (10, 30, and 50), number of classes (2 and 3), the degree of latent class separation (normal/no separation, small, medium, and large), and class sizes (equal vs. nonequal). Effects were assessed using root mean square error (RMSE) and classification accuracy percentage computed between true parameters and estimated parameters. The results of this simulation study showed that more precise estimates of item parameters were obtained with larger sample sizes and longer test lengths. Recovery of item parameters decreased as the number of classes increased with the decrease in sample size. Recovery of classification accuracy for the conditions with two-class solutions was also better than that of three-class solutions. Results of both item parameter estimates and classification accuracy differed by model type. More complex models and models with larger class separations produced less accurate results. The effect of the mixture proportions also differentially affected RMSE and classification accuracy results. Groups of equal size produced more precise item parameter estimates, but the reverse was the case for classification accuracy results. Results suggested that dichotomous mixture IRT models required more than 2,000 examinees to be able to obtain stable results as even shorter tests required such large sample sizes for more precise estimates. This number increased as the number of latent classes, the degree of separation, and model complexity increased.
more » « less
Full Text Available
Teacher-Responses: Highlight characteristics of low response process validity for item(s) measuring teachers' pedagogical content knowledge

Epstein, M. L. (January 2022, Proceedings of the forty-fourth annual meeting of the North American Chapter of the International Group for the Psychology of Mathematics Education)
Lischka, A. E. (Ed.)
Response Process Validity (RPV) reflects the degree to which items are interpreted as intended by item developers. In this study, teacher responses to constructed response (CR) items to assess pedagogical content knowledge (PCK) of middle school mathematics teachers were evaluated to determine what types of teacher responses signaled weak RPV. We analyzed 38 CR pilot items on proportional reasoning across up to 13 middle school mathematics teachers per item. By coding teacher responses and using think-alouds, we found teachers' responses deemed indicative of low item RPV often had one of the following characteristics: vague answers, unanticipated assumptions, a focus on unintended topics, and paraphrasing. To develop a diverse pool of items with strong RPV, we suggest it is helpful to be aware of these symptoms, use them to consider how to improve items, and then revise and retest items accordingly.
more » « less
Full Text Available
A model comparison approach to posterior predictive model checks in Bayesian confirmatory factor analysis.

https://doi.org/10.1080/10705511.2021.2012682

Zhang, J. (January 2022, Structural equation modeling)

Full Text Available
Modification Indices for Diagnostic Classification Models

Brown C. & Templin, J. (January 2022, Multivariate behavioral research)

Full Text Available
A Gibbs Sampling Algorithm with Monotonicity Constraints for Diagnostic Classification Models

https://doi.org/10.1007/s00357-021-09392-7

Yamaguchi, Kazuhiro; Templin, Jonathan (July 2021, Journal of Classification)
null (Ed.)
Full Text Available
Integrating a Statistical Topic Model and a Diagnostic Classification Model for Analyzing Items in a Mixed Format Assessment

https://doi.org/10.3389/fpsyg.2020.579199

Choi, H.-J.; Kim, Seohyun; Cohen, Allan S.; Templin, Jonathan; Copur-Gencturk, Yasemin (February 2021, Frontiers in Psychology)
null (Ed.)
Selected response items and constructed response (CR) items are often found in the same test. Conventional psychometric models for these two types of items typically focus on using the scores for correctness of the responses. Recent research suggests, however, that more information may be available from the CR items than just scores for correctness. In this study, we describe an approach in which a statistical topic model along with a diagnostic classification model (DCM) was applied to a mixed item format formative test of English and Language Arts. The DCM was used to estimate students’ mastery status of reading skills. These mastery statuses were then included in a topic model as covariates to predict students’ use of each of the latent topics in their written answers to a CR item. This approach enabled investigation of the effects of mastery status of reading skills on writing patterns. Results indicated that one of the skills, Integration of Knowledge and Ideas, helped detect and explain students’ writing patterns with respect to students’ use of individual topics.
more » « less
Full Text Available
Sample Size Requirements for Applying Diagnostic Classification Models

https://doi.org/10.3389/fpsyg.2020.621251

Sen, Sedat; Cohen, Allan S. (January 2021, Frontiers in Psychology)
null (Ed.)
Results of a comprehensive simulation study are reported investigating the effects of sample size, test length, number of attributes and base rate of mastery on item parameter recovery and classification accuracy of four DCMs (i.e., C-RUM, DINA, DINO, and LCDMREDUCED). Effects were evaluated using bias and RMSE computed between true (i.e., generating) parameters and estimated parameters. Effects of simulated factors on attribute assignment were also evaluated using the percentage of classification accuracy. More precise estimates of item parameters were obtained with larger sample size and longer test length. Recovery of item parameters decreased as the number of attributes increased from three to five but base rate of mastery had a varying effect on the item recovery. Item parameter and classification accuracy were higher for DINA and DINO models.
more » « less
Full Text Available
Designing assessment items for measuring PCK for proportional reasoning.

Orrill, C. H. (January 2021, Proceedings of the forty-third annual meeting of the North American Chapter of the International Group for the Psychology of Mathematics Education)
Olanoff, D. (Ed.)
Full Text Available
Designing items for measuring PCK for proportional reasoning.

Orrill, C. H.; Epstein, M.; Wang, K.; Malik, H.; & Copur-Gencturk, Y. (January 2021, Proceedings of the fourth-third annual meeting of the North American Chapter of the International Group for the Psychology of Mathematics Education)
Olanoff, D.; Johnson, K.; & Spitzer, S. (Ed.)
The purpose of this poster is to report on findings from our development efforts. In prior papers, we have reflected on some of the challenges in writing items to measure teachers’ specialized content knowledge (e.g., Orrill et al, 2015). In this paper, we reflect on our analysis of think-aloud interviews to identify what we have learned about the development of PCK items for proportional reasoning.
more » « less
Full Text Available

« Prev Next »

Search for: All records