Becoming a proficient reader is a critical skill that supports future learning. Toward the end of the primary grades, reading becomes increasingly automatized, and children begin to transition from learning-to-read to reading-to-learn. Yet, the design of beginning reader books may be suboptimal for novice readers. Colorful illustrations that contain irrelevant information (i.e., seductive details) presented in close proximity to the text may increase attentional competition between these sources of information; thus, hampering decoding and reading comprehension. Study 1 examines this hypothesis by experimentally manipulating components of the book design (e.g., presence/absence of seductive details) and investigating its effect on attention and reading performance in first grade students. In Study 2, we conduct an analysis in which we identify common design features in books for beginning readers and examine the prevalence of design features that were fmmd to tax attention in Study 1 and in prior research. Collectively this work identifies an important opportunity in which instructional materials can be optimized to better support children as they learn-to-read. 
                        more » 
                        « less   
                    
                            
                            Deep Learning for Natural Language Processing: A Gentle Introduction
                        
                    
    
            Upon encountering this publication, one might ask the obvious question, "Why do we need another deep learning and natural language processing book?" Several excellent ones have been published, covering both theoretical and practical aspects of deep learning and its application to language processing. However, from our experience teaching courses on natural language processing, we argue that, despite their excellent quality, most of these books do not target their most likely readers. The intended reader of this book is one who is skilled in a domain other than machine learning and natural language processing and whose work relies, at least partially, on the automated analysis of large amounts of data, especially textual data. Such experts may include social scientists, political scientists, biomedical scientists, and even computer scientists and computational linguists with limited exposure to machine learning. Existing deep learning and natural language processing books generally fall into two camps. The first camp focuses on the theoretical foundations of deep learning. This is certainly useful to the aforementioned readers, as one should understand the theoretical aspects of a tool before using it. However, these books tend to assume the typical background of a machine learning researcher and, as a consequence, I have often seen students who do not have this background rapidly get lost in such material. To mitigate this issue, the second type of book that exists today focuses on the machine learning practitioner; that is, on how to use deep learning software, with minimal attention paid to the theoretical aspects. We argue that focusing on practical aspects is similarly necessary but not sufficient. Considering that deep learning frameworks and libraries have gotten fairly complex, the chance of misusing them due to theoretical misunderstandings is high. We have commonly seen this problem in our courses, too. This book, therefore, aims to bridge the theoretical and practical aspects of deep learning for natural language processing. We cover the necessary theoretical background and assume minimal machine learning background from the reader. Our aim is that anyone who took introductory linear algebra and calculus courses will be able to follow the theoretical material. To address practical aspects, this book includes pseudo code for the simpler algorithms discussed and actual Python code for the more complicated architectures. The code should be understandable by anyone who has taken a Python programming course. After reading this book, we expect that the reader will have the necessary foundation to immediately begin building real-world, practical natural language processing systems, and to expand their knowledge by reading research publications on these topics. https://doi.org/10.1017/9781009026222 
        more » 
        « less   
        
    
                            - Award ID(s):
- 2006583
- PAR ID:
- 10550333
- Publisher / Repository:
- Cambridge University Press
- Date Published:
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
- 
            
- 
            Culbertson, J.; Perfors, A; Rabagliati, H.; Ramenzoni, V. (Ed.)Learning to read is a critical skill; yet only a small portion of children in the United States are reading at or above grade level. Attention is one crucial process that affects the acquisition of reading skills. The process involves selectively choosing task relevant information and requires monitoring competing demands. Many books for beginning readers include illustrations, but this design choice may require learners to split their attention between multiple sources of information. This study employed eye tracking to examine whether embedding text within illustrations in children’s e-books inadvertently induces attentional competition. The results showed that spatially separating illustrations from the text in beginning reader books reduces attentional competition and improves children’s reading comprehension. This study shows that changes to the design of books for beginning readers can help promote literacy development in children.more » « less
- 
            Promoting early STEM knowledge helps to prepare children for formal schooling. Shared book reading may promote early STEM knowledge. This research examined the quality of available STEM books in children’s environments and investigated how such books influenced children’s learning in shared book reading contexts. In Study 1, we used both meaning-based human coding and computerized latent semantic analysis to categorize books based on the extent to which they provided support for encoding and demand for active recall. We found similarity in the ratings using the two approaches. Most books fell into categories characterized by low Support and Demand. In Study 2, we found that 4- to 5-year-olds learned more STEM facts when books were high in Support and/or Demand, although few books fell into those categories. This research highlights the importance that textual features of books play in promoting early STEM knowledge during shared book reading.more » « less
- 
            null (Ed.)Studies of eye movements during source code reading have supported the idea that reading source code differs fundamentally from reading natural text. The paper analyzed an existing data set of natural language and source code eye movement data using the E-Z reader model of eye movement control. The results show that the E-Z reader model can be used with natural text and with source code where it provides good predictions of eye movement duration. This result is confirmed by comparing model predictions to eye-movement data from this experiment and calculating the correlation score for each metric. Finally, it was found that gaze duration is influenced by token frequency in code and in natural text. The frequency effect is less pronounced on first fixation duration and single fixation duration. An eye movement control model for source code reading may open the door for tools in education and the industry to enhance program comprehension.more » « less
- 
            Through a computational reading of the online book reviewing community LibraryThing, we examine the dynamics of a collaborative tagging system and learn how its users refine and redefine literary genres. LibraryThing tags are overlapping and multi-dimensional, created in a shared space by thousands of users, including readers, bookstore owners, and librarians. A common understanding of genre is that it relates to the content of books, but this resource allows us to view genre as an intersection of user communities and reader values and interests. We explore different methods of computational genre measurement within the open space of user-created tags. We measure overlap between books, tags, and users, and we also measure the homogeneity of communities associated with genre tags and correlate this homogeneity with reviewing behavior.Finally, by analyzing the text of reviews, we identify the thematic signatures of genres on LibraryThing, revealing similarities and differences between them. These measurements are intended to elucidate the genre conceptions of the users, not, as in prior work, to normalize the tags or enforce a hierarchy. We find that LibraryThing users make sense of genre through a variety of values and expectations, many of which fall outside common definitions and understandings of genre.more » « less
 An official website of the United States government
An official website of the United States government 
				
			 
					 
					
 
                                    