skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Supporting Theory Building in Design-Based Research through Large Scale Data-Based Models
Although the fields of educational data mining and learning analytics have grown in terms of the analytic sophistication and breadth of applications, the impact on theory-building has been limited. To move these fields forward, studies should not only be driven by learning theory but also the analytics should be used to inform theory. In this paper, we present an approach for integrating educational data mining models with design-based research approaches to promote theory-building that is informed by data-based models. This approach aligns theory, design of the learning environment, data collection, and analytic methods through iterations that focus on the refinement and improvement of all these components. We provide an example from our own work which is driven by a critical constructionist learning framework, the design and development of a digital learning environment for elementary-school aged children to learn about artificial intelligence within sociopolitical contexts, and the use of epistemic network analysis as a tool for modeling learning. We conclude with how this approach can be reciprocally beneficial in that educational data miners can use their models to inform theory and learning scientists can augment their theory-building practices through big data models.  more » « less
Award ID(s):
2448445
PAR ID:
10575522
Author(s) / Creator(s):
; ; ; ; ; ;
Editor(s):
Benjamin, Paaßen; Carrie, Demmans Epp
Publisher / Repository:
International Educational Data Mining Society
Date Published:
Format(s):
Medium: X
Right(s):
Creative Commons Attribution 4.0 International
Sponsoring Org:
National Science Foundation
More Like this
  1. null (Ed.)
    This essay presents the case of designing a learning analytics system using a theory of learning. Learning analytics systems are often institutional artifacts using data collected from and to support educational practice and practitioners including learners, teachers, and administrators. There is a substantial and growing body of work under the learning analytics banner. Much of it framed technically around data harvested from digital tools and presentation mechanisms called dashboards. Using a specific case involving a collaborative game-based education research project, this paper provides a broad, sociotechnical design perspective through three frame expanding sections: the educational game’s learning theory-driven approach, the information architecture of the learning analytics system, and the activity system that the information from learning analytics information are used within. This paper illustrates a portion of the conceptual landscape that can guide the design, development, and research for these data systems that are potentially consequential for students and educators. 
    more » « less
  2. Different analytic techniques operate optimally with different types of data. As the use of EHR-based analytics expands to newer tasks, data will have to be transformed into different representations, so the tasks can be optimally solved. We classified representations into broad categories based on their characteristics and proposed a new knowledge-driven representation for clinical data mining as well as trajectory mining, called Severity Encoding Variables (SEVs). Additionally, we studied which characteristics make representations most suitable for particular clinical analytics tasks including trajectory mining. Our evaluation shows that, for regression, most data representations performed similarly, with SEV achieving a slight (albeit statistically significant) advantage. For patients at high risk of diabetes, it outperformed the competing representation by (relative) 20%. For association mining, SEV achieved the highest performance. Its ability to constrain the search space of patterns through clinical knowledge was key to its success. 
    more » « less
  3. de Vries, E; Hod, Y; Ahn, J (Ed.)
    Researchers in the Learning Sciences take two prevalent stances: research as building theories or as developing designs. The connection between theories and designs is most often filled in by methods, but an alternative stance is possible: research as improving models. The modeling stance seeks parsimonious, useful, illuminating descriptions of learning activity systems. Models can help us understand and express how variability (in all its forms) plays into, is enacted during, and results from designed learning activities. Building models often requires employing multiple theories, methods, and design elements; a modeling stance recognizes that our research often elaborates a multi-level systems view. An explicit modeling stance may lead to developing descriptions of complex systems, inviting multi-stakeholder teamwork to improve these systems, integrating advances in learning analytics and educational data mining, and adding to ability of learning sciences research to tackle challenges at scale. 
    more » « less
  4. Technological developments have spawned a range of educational software that strives to enhance learning through personalized adaptation. The success of these systems depends on how accurate the knowledge state of individual learners is modeled over time. Computer scientists have been at the forefront of development for these kinds of distributed learning systems and have primarily relied on data-driven algorithms to trace knowledge acquisition in noisy and complex learning domains. Meanwhile, research psychologists have primarily relied on data collected in controlled laboratory settings to develop and validate theory-driven computational models, but have not devoted much exploration to learning in naturalistic environments. The two fields have largely operated in parallel despite considerable overlap in goals. We argue that mutual benefits would result from identifying and implementing more accurate methods to model the temporal dynamics of learning and forgetting for individual learners. Here we discuss recent efforts in developing adaptive learning technologies to highlight the strengths and weaknesses inherent in the typical approaches of both fields. We argue that a closer collaboration between the educational machine learning/data mining and cognitive psychology communities would be a productive and exciting direction for adaptive learning system application to move in. 
    more » « less
  5. null (Ed.)
    This paper proposes a special session on the use of computational methods for analyzing educational data. Computation has permeated all disciplines because it provides unique opportunities to represent knowledge and understand complex phenomena. In education, disciplines such as learning analytics and educational data mining have emerged to better understand educational phenomena. This special session will discuss three different approaches to use computational methods to analyze qualitative educational data. After the discussion, the participants will be able to implement these methods using R programming, while reflecting on how they can use these methods in their own context. 
    more » « less