NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

The Ohio Child Speech Corpus

https://doi.org/10.1016/j.specom.2025.103206

Wagner, Laura; Alghowinhem, Sharifa; Alwan, Abeer; Bowdrie, Kristina; Breazeal, Cynthia; Clopper, Cynthia G; Fosler-Lussier, Eric; Jamsek, Izabela A; Lander, Devan; Ramnath, Rajiv; et al (January 2025, Speech Communication)

This paper reports on the creation and composition of a new corpus of children’s speech, the Ohio Child Speech Corpus, which is publicly available on the Talkbank-CHILDES website. The audio corpus contains speech samples from 303 children ranging in age from 4 – 9 years old, all of whom participated in a seven-task elicitation protocol conducted in a science museum lab. In addition, an interactive social robot controlled by the researchers joined the sessions for approximately 60% of the children, and the corpus itself was collected in the peri‑pandemic period. Two analyses are reported that highlighted these last two features. One set of analyses found that the children spoke significantly more in the presence of the robot relative to its absence, but no effects of speech complexity (as measured by MLU) were found for the robot’s presence. Another set of analyses compared children tested immediately post-pandemic to children tested a year later on two school-readiness tasks, an Alphabet task and a Reading Passages task. This analysis showed no negative impact on these tasks for our highly-educated sample of children just coming off of the pandemic relative to those tested later. These analyses demonstrate just two possible types of questions that this corpus could be used to investigate.
more » « less
Full Text Available
The JIBO Kids Corpus: A speech dataset of child-robot interactions in a classroom environment

https://doi.org/10.1121/10.0034195

Shankar, Natarajan_Balaji; Afshan, Amber; Johnson, Alexander; Mahapatra, Aurosweta; Martin, Alejandra; Ni, Haolun; Park, Hae_Won; Perez, Marlen_Quintero; Yeung, Gary; Bailey, Alison; et al (November 2024, JASA Express Letters)

This paper describes an original dataset of children's speech, collected through the use of JIBO, a social robot. The dataset encompasses recordings from 110 children, aged 4–7 years old, who participated in a letter and digit identification task and extended oral discourse tasks requiring explanation skills, totaling 21 h of session data. Spanning a 2-year collection period, this dataset contains a longitudinal component with a subset of participants returning for repeat recordings. The dataset, with session recordings and transcriptions, is publicly available, providing researchers with a valuable resource to advance investigations into child language development.
more » « less
CHILDES English OCSC Corpus

https://doi.org/10.21415/60TV-D669

Wagner, Laura; Alghowinhem, Sharifa; Alwan, Abeer; Bowdrie, Kristina; Breazeal, Cynthia; Clopper, Cynthia; Fosler-Lussier, Eric; Jamsek, Izabella; Lander, Devan; Ramnath, Rajiv; et al (January 2024, TalkBank)

This corpus was collected in the Language Sciences Research Lab, a working lab embedded inside of a science museum: the Center of Science and Industry in Columbus, Ohio, USA. Participants were recruited from the floor of the museum and run in a semi-public space. Three distinctive features of the corpus are: (1) an interactive social robot (specifically, a Jibo robot) was present and participated in the sessions for roughly half the children; (2) all children were recorded with a lapel mic generating high quality audio (available through CHILDES), as well as a distal table mic generating low quality audio (available on request) to facilitate strong tests of automated speech processing on the data; and (3) the data were collected in the peri-pandemic period, beginning in the summer of 2021 just after COVID-19 restrictions were being eased and ending in the summer of 2022 – thus providing a snapshot of language development in a distinctive time of the world. A YouTube video on the Jibo robot is available here .
more » « less
Jibo Community Social Robot Research Platform @Scale

https://doi.org/10.1145/3610978.3638171

Park, Hae Won; Breazeal, Cynthia; Alghowinem, Sharifa; Ostrowski, Anastasia K; Ferguson, Jon; Zhang, Xiajie; Lee, Dong Won (March 2024, ACM)

Full Text Available
What are GANs?: Introducing Generative Adversarial Networks to Middle School Students

Ali, Safinah; DiPaola, Daniella; Breazeal, Cynthia (May 2021, Proceedings of the AAAI Conference on Artificial Intelligence)
null (Ed.)
Applications of Generative Machine Learning techniques such as Generative Adversarial Networks (GANs) are used to generate new instances of images, music, text, and videos. While GANs have now become commonplace on social media, a part of children’s lives, and have considerable ethical implications, existing K-12 AI education curricula do not include generative AI. We present a new module, “What are GANs?”, that teaches middle school students how GANs work and how they can create media using GANs. We developed an online, team-based game to simulate how GANs work. Students also interacted with up to four web tools that apply GANs to generate media. This module was piloted with 72 middle school students in a series of online workshops. We provide insight into student usage, understanding, and attitudes towards this lesson. Finally, we give suggestions for integrating this lesson into AI education curricula.
more » « less
Full Text Available
Lifelong Personalization via Gaussian Process Modeling for Long-Term HRI

https://doi.org/10.3389/frobt.2021.683066

Spaulding, Samuel; Shen, Jocelyn; Park, Hae Won; Breazeal, Cynthia (June 2021, Frontiers in Robotics and AI)
null (Ed.)
Across a wide variety of domains, artificial agents that can adapt and personalize to users have potential to improve and transform how social services are provided. Because of the need for personalized interaction data to drive this process, long-term (or longitudinal) interactions between users and agents, which unfold over a series of distinct interaction sessions, have attracted substantial research interest. In recognition of the expanded scope and structure of a long-term interaction, researchers are also adjusting the personalization models and algorithms used, orienting toward “continual learning” methods, which do not assume a stationary modeling target and explicitly account for the temporal context of training data. In parallel, researchers have also studied the effect of “multitask personalization,” an approach in which an agent interacts with users over multiple different tasks contexts throughout the course of a long-term interaction and learns personalized models of a user that are transferrable across these tasks. In this paper, we unite these two paradigms under the framework of “Lifelong Personalization,” analyzing the effect of multitask personalization applied to dynamic, non-stationary targets. We extend the multi-task personalization approach to the more complex and realistic scenario of modeling dynamic learners over time, focusing in particular on interactive scenarios in which the modeling agent plays an active role in teaching the student whose knowledge the agent is simultaneously attempting to model. Inspired by the way in which agents use active learning to select new training data based on domain context, we augment a Gaussian Process-based multitask personalization model with a mechanism to actively and continually manage its own training data, allowing a modeling agent to remove or reduce the weight of observed data from its training set, based on interactive context cues. We evaluate this method in a series of simulation experiments comparing different approaches to continual and multitask learning on simulated student data. We expect this method to substantially improve learning in Gaussian Process models in dynamic domains, establishing Gaussian Processes as another flexible modeling tool for Long-term Human-Robot Interaction (HRI) Studies.
more » « less
Full Text Available
Exploring Generative Models with Middle School Students

https://doi.org/10.1145/3411764.3445226

Ali, Safinah; DiPaola, Daniella; Lee, Irene; Hong, Jenna; Breazeal, Cynthia (May 2021, CHI '21: Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems)
null (Ed.)
Applications of generative models such as Generative Adversarial Networks (GANs) have made their way to social media platforms that children frequently interact with. While GANs are associated with ethical implications pertaining to children, such as the generation of Deepfakes, there are negligible efforts to educate middle school children about generative AI. In this work, we present a generative models learning trajectory (LT), educational materials, and interactive activities for young learners with a focus on GANs, creation and application of machine-generated media, and its ethical implications. The activities were deployed in four online workshops with 72 students (grades 5-9). We found that these materials enabled children to gain an understanding of what generative models are, their technical components and potential applications, and benefits and harms, while reflecting on their ethical implications. Learning from our findings, we propose an improved learning trajectory for complex socio-technical systems.
more » « less
Full Text Available
Developing Middle School Students' AI Literacy

https://doi.org/10.1145/3408877.3432513

Lee, Irene; Ali, Safinah; Zhang, Helen; DiPaola, Daniella; Breazeal, Cynthia (March 2021, SIGCSE '21: Proceedings of the 52nd ACM Technical Symposium on Computer Science Education)
null (Ed.)
In this experience report, we describe an AI summer workshop designed to prepare middle school students to become informed citizens and critical consumers of AI technology and to develop their foundational knowledge and skills to support future endeavors as AI-empowered workers. The workshop featured the 30-hour "Developing AI Literacy" or DAILy curriculum that is grounded in literature on child development, ethics education, and career development. The participants in the workshop were students between the ages of 10 and 14; 87% were from underrepresented groups in STEM and Computing. In this paper we describe the online curriculum, its implementation during synchronous online workshop sessions in summer of 2020, and preliminary findings on student outcomes. We reflect on the successes and lessons we learned in terms of supporting students' engagement and conceptual learning of AI, shifting attitudes toward AI, and fostering conceptions of future selves as AI-enabled workers. We conclude with discussions of the affordances and barriers to bringing AI education to students from underrepresented groups in STEM and Computing.
more » « less
Full Text Available
Teaching and learning with children: Impact of reciprocal peer learning with a social robot on children’s learning and emotive engagement

https://doi.org/10.1016/j.compedu.2020.103836

Chen, Huili; Park, Hae Won; Breazeal, Cynthia (June 2020, Computers & Education)

Full Text Available
Adapting K-12 AI Learning for Online Instruction. 2nd International Workshop on Education in Artificial Intelligence K-12

Ali, Safinah; DiPaola, Daniella; Lee, Irene; Jackson, David; Kiel, Jeff; Beal, Kerri; Zhang, Helen; Cheng, Yihong; Breazeal, Cynthia (January 2021, Proceedings of German Journal of Artificial Intelligence 2/2021)
null (Ed.)
Due to classrooms moving online during COVID-19, educators are faced with the challenge of adapting in-classroom curricula for online instructions. This poses challenges and opportunities for AI learning given the project-based learning approaches of existing curricula. We taught a 5-hour synchronous online class about AI to 17 middle school students. In this paper, we discuss challenges in adapting to online learning and future opportunities. Our contribution is valuable to educators and curriculum designers that are adapting their AI curricula for synchronous online learning.
more » « less
Full Text Available

« Prev Next »

Search for: All records