

Title: Automatic text generation using deep learning: providing large-scale support for online learning communities
Participating in online communities has significant benefits for students’ learning in terms of motivation, persistence, and learning outcomes. However, maintaining and supporting online learning communities is very challenging and requires tremendous work, so automatic support is desirable. The purpose of this work is to explore the use of deep learning algorithms for automatic text generation in providing emotional and community support for a massive online learning community, Scratch. In particular, state-of-the-art deep learning language models, GPT-2 and a recurrent neural network (RNN), are trained on two million comments from the online learning community. We then conduct both a readability test and a human evaluation of the automatically generated results for offering support to the online students. The results show that the GPT-2 language model can provide timely, human-like replies in a style genuine to the data set and context for offering related support.
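The abstract mentions a readability test on the generated replies but does not name the metric. As a rough illustration only, the sketch below assumes the Flesch Reading Ease formula and a naive vowel-group syllable counter; both the choice of metric and the helper names (`count_syllables`, `flesch_reading_ease`) are assumptions, not the authors' method.

```python
import re


def count_syllables(word):
    """Approximate syllables as runs of vowels (a crude heuristic, assumed here)."""
    groups = re.findall(r"[aeiouy]+", word.lower())
    return max(1, len(groups))


def flesch_reading_ease(text):
    """Flesch Reading Ease: higher scores indicate easier text."""
    # Split on sentence-ending punctuation and drop empty fragments.
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    # Words are letter runs, optionally with one internal apostrophe.
    words = re.findall(r"[A-Za-z]+(?:'[A-Za-z]+)?", text)
    if not sentences or not words:
        raise ValueError("text must contain at least one sentence and one word")
    syllables = sum(count_syllables(w) for w in words)
    words_per_sentence = len(words) / len(sentences)
    syllables_per_word = syllables / len(words)
    return 206.835 - 1.015 * words_per_sentence - 84.6 * syllables_per_word


if __name__ == "__main__":
    # Hypothetical reply text, not taken from the Scratch data set.
    print(round(flesch_reading_ease("Nice work. I love it."), 2))
```

A short, plainly worded reply scores higher than a dense academic sentence, which is the kind of comparison a readability check on generated comments would rely on.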
Award ID(s):
1901704
NSF-PAR ID:
10315141
Author(s) / Creator(s):
; ;
Date Published:
Journal Name:
Interactive learning environments
ISSN:
1049-4820
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. This study aims to investigate the collaboration processes of immigrant families as they search for online information together. Immigrant English-language learning adults of lower socioeconomic status often work collaboratively with their children to search the internet. Family members rely on each other’s language and digital literacy skills in this collaborative process known as online search and brokering (OSB). While previous work has identified ecological factors that impact OSB, research has not yet distilled the specific learning processes behind such collaborations. Design/methodology/approach: For this study, the authors adhere to practices of a case study examination. This study’s participants included parents, grandparents and children aged 10–17 years. Most adults were born in Mexico, did not have a college degree, worked in service industries and represented a lower-SES population. This study conducted two to three separate in-home family visits per family with interviews and online search tasks. Findings: From a case study analysis of three families, this paper explores the funds of knowledge, resilience, ecological support and challenges that children and parents face as they engage in collaborative OSB experiences. This study demonstrates how in-home computer-supported collaborative processes are often informal, social, emotional and highly relevant to solving information challenges. Research limitations/implications: An intergenerational OSB process is different from collaborative online information problem-solving that happens between classroom peers or coworkers. This study’s research shows how both parents and children draw on their funds of knowledge, resilience and ecological support systems when they search collaboratively, with and for their family members, to problem solve. This is a case study of three families working in collaboration with each other.
This case study informs analytical generalizations and theory-building rather than statistical generalizations about families. Practical implications: Designers need to recognize that children and youth are using the same tools as adults to seek high-level critical information. This study’s model suggests that if parents and children are negotiating information seeking with the same technology tools but different funds of knowledge, experience levels and skills, the presentation of information (e.g. online search results, information visualizations) needs to accommodate different levels of understanding. This study recommends designers work closely with marginalized communities through participatory design methods to better understand how interfaces and visuals can help accommodate youth invisible work. Social implications: The authors have demonstrated in this study that learning and engaging in family online searching is not only vital to the development of individual and digital literacy skills, it is a part of family learning. While community services, libraries and schools have a responsibility to support individual digital and information literacy development, this study’s model highlights the need to recognize funds of knowledge, family resiliency and asset-based learning. Schools and teachers should identify and harness youth invisible work as a form of learning at home. The authors believe educators can do this by highlighting the importance of information problem solving in homes and youth in their families. Libraries and community centers also play a critical role in supporting parents and adults for technical assistance (e.g. WiFi access) and information resources. Originality/value: This study’s work indicates new conditions fostering productive joint media engagement (JME) around OSB. This study contributes a generative understanding that promotes studying and designing for JME, where family responsibility is the focus.

     
  2. There have been numerous demands for enhancements in the way undergraduate learning occurs today, especially at a time when the value of higher education continues to be called into question (The Boyer 2030 Commission, 2022). One type of demand has been for the increased integration of subjects/disciplines around relevant issues/topics—with a more recent trend of seeking transdisciplinary learning experiences for students (Sheets, 2016; American Association for the Advancement of Science, 2019). Transdisciplinary learning can be viewed as the holistic way of working equally across disciplines to transcend their own disciplinary boundaries to form new conceptual understandings as well as develop new ways in which to address complex topics or challenges (Ertas, Maxwell, Rainey, & Tanik, 2003; Park & Son, 2010). This transdisciplinary approach can be important as humanity’s problems are not typically discipline specific and require the convergence of competencies to lead to innovative thinking across fields of study. However, higher education continues to be siloed, which makes the authentic teaching of converging topics, such as innovation, human-technology interactions, climate concerns, or harnessing the data revolution, organizationally difficult (Birx, 2019; Serdyukov, 2017). For example, working across a university’s academic units to collaboratively teach, or co-teach, around topics of convergence is likely to be rejected by the university systems that have been built upon longstanding traditions. While disciplinary expertise is necessary and one of higher education’s strengths, the structures and academic rigidity that come along with the disciplinary silos can prevent modifications/improvements to the roles of academic units/disciplines that could better prepare students for the future of both work and learning.
The balancing of disciplinary structure with transdisciplinary approaches to solving problems and learning is a challenge that must be persistently addressed. These institutional challenges will only continue to limit universities seeking to scale transdisciplinary programs and experiment with novel ways to enhance the value of higher education for students and society. This then restricts innovations to teaching and also hinders the sharing of important practices across disciplines. To address these concerns, a National Science Foundation Improving Undergraduate STEM Education project team, which is the topic of this paper, has set the goal of developing/implementing/testing an authentically transdisciplinary and scalable educational model in an effort to help guide the transformation of traditional undergraduate learning to span academic silos. This educational model, referred to as the Mission, Meaning, Making (M3) program, is specifically focused on teaching the crosscutting practices of innovation by a) implementing co-teaching and co-learning from faculty and students across different academic units/colleges as well as b) offering learning experiences spanning multiple semesters that immerse students in a community that can nourish both their learning and innovative ideas. As a collaborative initiative, the M3 program is designed to synergize key strengths of an institution’s engineering/technology, liberal arts, and business colleges/units to create a transformative undergraduate experience focused on the pursuit of innovation—one that reaches the broader campus community, regardless of students’ backgrounds or majors. Throughout the development of this model, research was conducted to help identify institutional barriers to creating such a cross-college program at a research-intensive public university along with uncovering ways in which to address these barriers.
While data can show how students value and enjoy transdisciplinary experiences, universities are not likely to be structured in a way to support these educational initiatives and they will face challenges throughout their lifespan. These challenges can result from administration turnover (after which mutual agreements across colleges may vanish), continued disputes over academic territory, and challenges over resource allotments. Essentially, there may be little to no incentive for academic departments to engage in transdisciplinary programming within the existing structures of higher education. However, some insights and practices have emerged from this research project that can be useful in moving toward transdisciplinary learning around topics of convergence. Accordingly, the paper will highlight features of an educational model that spans disciplines along with the workarounds to current institutional barriers. This paper will also provide lessons learned related to 1) the potential pitfalls with educational programming becoming “un-disciplinary” rather than transdisciplinary, 2) ways in which to incentivize departments/faculty to engage in transdisciplinary efforts, and 3) new structures within higher education that can be used to help faculty/students/staff to more easily converge to increase access to learning across academic boundaries.
  3. Community colleges provide an important pathway for many prospective engineering graduates, especially those from traditionally underrepresented groups. However, due to a lack of facilities, resources, student demand and/or local faculty expertise, the breadth and frequency of engineering course offerings is severely restricted at many community colleges. This in turn presents challenges for students trying to maximize their transfer eligibility and preparedness. Through a grant from the National Science Foundation Improving Undergraduate STEM Education program (NSF IUSE), three community colleges from Northern California collaborated to increase the availability and accessibility of a comprehensive lower-division engineering curriculum, even at small-to-medium sized community colleges. This was accomplished by developing resources and teaching strategies that could be employed in a variety of delivery formats (e.g., fully online, online/hybrid, flipped face-to-face, etc.), providing flexibility for local community colleges to leverage according to their individual needs. This paper focuses on the iterative development, testing, and refining of the resources for an introductory Materials Science course with 3-unit lecture and 1-unit laboratory components. This course is required as part of recently adopted statewide model associate degree curricula for transfer into Civil, Mechanical, Aerospace, and Manufacturing engineering bachelor’s degree programs at California State Universities. However, offering such a course is particularly challenging for many community colleges, because of a lack of adequate expertise and/or laboratory facilities and equipment. Consequently, course resources were developed to help mitigate these challenges by streamlining preparation for instructors new to teaching the course, as well as minimizing the face-to-face use of traditional materials testing equipment in the laboratory portion of the course. 
These same resources can be used to support online hybrid and other alternative (e.g., emporium) delivery approaches. After initial pilot implementation of the course during the Spring 2015 semester by the curriculum designer in a flipped student-centered format, these same resources were then implemented by an instructor who had never previously taught the course, at a different community college that did not have its own materials laboratory facilities. A single site visit was arranged with a nearby community college to afford students an opportunity to complete certain lab activities using traditional materials testing equipment. Lessons learned during this attempt were used to inform curriculum revisions, which were evaluated in a repeat offering the following year. In all implementations of the course, student surveys and interviews were used to determine students’ perceptions of the effectiveness of the course resources, student use of these resources, and overall satisfaction with the course. Additionally, student performance on objective assessments was compared with that of traditional lecture delivery of the course by the curriculum designer in prior years. During initial implementations of the course, results from these surveys and assessments revealed low levels of student satisfaction with certain aspects of the flipped approach and course resources, as well as reduced learning among students at the alternate institution. Subsequent modifications to the curriculum and delivery approach were successful in addressing most of these deficiencies. 
  4. Abstract

    Event segmentation theory posits that people segment continuous experience into discrete events and that event boundaries occur when there are large transient increases in prediction error. Here, we set out to test this theory in the context of story listening, by using a deep learning language model (GPT-2) to compute the predicted probability distribution of the next word, at each point in the story. For three stories, we used the probability distributions generated by GPT-2 to compute the time series of prediction error. We also asked participants to listen to these stories while marking event boundaries. We used regression models to relate the GPT-2 measures to the human segmentation data. We found that event boundaries are associated with transient increases in Bayesian surprise but not with a simpler measure of prediction error (surprisal) that tracks, for each word in the story, how strongly that word was predicted at the previous time point. These results support the hypothesis that prediction error serves as a control mechanism governing event segmentation and point to important differences between operational definitions of prediction error.

     
  5.
    Virtual conversational assistants designed specifically for software engineers could have a huge impact on the time it takes for software engineers to get help. Research efforts are focusing on virtual assistants that support specific software development tasks such as bug repair and pair programming. In this paper, we study the use of online chat platforms as a resource towards collecting developer opinions that could potentially help in building opinion Q&A systems, as a specialized instance of virtual assistants and chatbots for software engineers. Opinion Q&A has a stronger presence in chats than in other developer communications, thus mining them can provide a valuable resource for developers in quickly getting insight about a specific development topic (e.g., What is the best Java library for parsing JSON?). We address the problem of opinion Q&A extraction by developing automatic identification of opinion-asking questions and extraction of participants’ answers from public online developer chats. We evaluate our automatic approaches on chats spanning six programming communities and two platforms. Our results show that a heuristic approach to opinion-asking questions works well (0.87 precision), and a deep learning approach customized to the software domain outperforms heuristics-based, machine-learning-based, and deep learning techniques for answer extraction in community question answering.
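Item 4 above contrasts two prediction-error measures computed from a language model's next-word distributions: surprisal and Bayesian surprise. The sketch below illustrates the difference on a toy three-word vocabulary. It assumes one common formulation (surprisal as the negative log probability of the observed word, Bayesian surprise as the KL divergence between the predicted distributions after and before the word); the exact definitions used in that study are not given in the abstract, and the distributions here are invented for illustration rather than produced by GPT-2.

```python
import math


def surprisal(predicted, observed_word):
    """Negative log2 probability (in bits) that the model gave the observed word."""
    return -math.log2(predicted[observed_word])


def bayesian_surprise(before, after):
    """KL divergence (in bits) from the earlier prediction to the updated one.

    Assumes both distributions share a vocabulary and that every word
    with nonzero probability in `after` also has nonzero probability in `before`.
    """
    return sum(
        p * math.log2(p / before[word])
        for word, p in after.items()
        if p > 0
    )


if __name__ == "__main__":
    # Toy next-word distributions at two consecutive time points (assumed values).
    before = {"cat": 0.5, "dog": 0.25, "storm": 0.25}
    after = {"cat": 0.25, "dog": 0.25, "storm": 0.5}

    print(surprisal(before, "storm"))        # cost of one specific observed word
    print(bayesian_surprise(before, after))  # how far the whole prediction shifted
```

Surprisal depends only on the probability assigned to the single word that occurred, whereas Bayesian surprise measures how much the entire predicted distribution moved, which is why the two can dissociate.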