Data users need relevant context and research expertise to effectively search for and identify relevant datasets. Leading data providers, such as the Inter‐university Consortium for Political and Social Research (ICPSR), offer standardized metadata and search tools to support data search. Metadata standards emphasize the machine‐readability of data and its documentation. There are opportunities to enhance dataset search by improving users' ability to learn about, and make sense of, information about data. Prior research has shown that context and expertise are two main barriers users face in effectively searching for, evaluating, and deciding whether to reuse data. In this paper, we propose a novel chatbot‐based search system, DataChat, that leverages a graph database and a large language model to provide novel ways for users to interact with and search for research data. DataChat complements data archives' and institutional repositories' ongoing efforts to curate, preserve, and share research data for reuse by making it easier for users to explore and learn about available research data. 
                        more » 
                        « less   
                    
                            
                            Interdisciplinary Aspirations and Disciplinary Archives: Losing and finding John M. Weatherby's Soo data
                        
                    
    
            As theorized in language documentation, archives serve to make research reproducible and to make primary data accessible for multiple audiences (Himmelmann 2006; Berez-Kroeker et al. 2018). Scholars in the emerging mid-20th-century field of African history emphasized these same priorities. Mid-century Africanist historians assembled large text collections but failed in a clearly stated disciplinary project to preserve them in accessible archives. This paper explores the relationship between institutional and social factors in data preservation through the story of audio recordings and field notes documenting Soo (Uganda: Kuliak/Nilo-Saharan) collected in the mid-20th century by Makerere University history PhD student John M. Weatherby. For decades, Weatherby struggled and failed to find an institutional home for his materials, which were nearly lost amid changing disciplinary trends. I encountered them only through informal social interactions in 2018 and have subsequently been depositing them in a language archive. The slide of Weatherby’s data into obscurity shows how archiving is inherently a disciplinary practice. Institutions intending to preserve data rose and fell with changing disciplinary paradigms, but Weatherby’s data were preserved through personal relationships. Despite a common emphasis on technical and institutional initiatives for archiving, the relational contexts of legacy materials are central to their preservation. 
        more » 
        « less   
        
    
                            - Award ID(s):
- 1911571
- PAR ID:
- 10376437
- Date Published:
- Journal Name:
- Language documentation and description
- Volume:
- 21
- ISSN:
- 2756-1224
- Page Range / eLocation ID:
- 101-139
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
- 
            
- 
            Abstract: Rowasu'u is a digital archives project that seeks to reunite A'uwẽ-Xavante individuals with researcher produced documentation of their ancestors, families, bodies, culture, and homelands and eventually provide a platform for the collection and preservation of community knowledge. A'uwẽ-Xavante have a long history of receiving academic researchers including anthropologists, geneticists, biomedical researchers, ecologists, and linguists, but they have had limited access to the documentation and other data produced through these encounters. The Rowasu'u project is working with scholars to compile and make accessible records of more than 60 years of decentralized academic research while partnering with A'uwẽ-Xavante communities historically positioned as the most prominent participants. Our larger aspiration is that in addition to supporting A'uwẽ-Xavante efforts to reclaim their history as recorded by scientists, Rowasu'u will advance Indigenous research governance and data sovereignty as human rights applicable to past as well as future research. This chapter discusses our early progress in developing Rowasu'u using Mukurtu CMS, including the challenges and complexities inherent in navigating local politics in the context of generations of marginalization and exclusion.more » « less
- 
            Rowasu'u is a digital archives project that seeks to reunite A'uwẽ-Xavante individuals with researcher produced documentation of their ancestors, families, bodies, culture, and homelands and eventually provide a platform for the collection and preservation of community knowledge. A'uwẽ-Xavante have a long history of receiving academic researchers including anthropologists, geneticists, biomedical researchers, ecologists, and linguists, but they have had limited access to the documentation and other data produced through these encounters. The Rowasu'u project is working with scholars to compile and make accessible records of more than 60 years of decentralized academic research while partnering with A'uwẽ-Xavante communities historically positioned as the most prominent participants. Our larger aspiration is that in addition to supporting A'uwẽ-Xavante efforts to reclaim their history as recorded by scientists, Rowasu'u will advance Indigenous research governance and data sovereignty as human rights applicable to past as well as future research. This chapter discusses our early progress in developing Rowasu'u using Mukurtu CMS, including the challenges and complexities inherent in navigating local politics in the context of generations of marginalization and exclusion.more » « less
- 
            null (Ed.)Biomedical research results in the collection and storage of increasingly large and complex data sets. Preserving those data so that they are discoverable, accessible, and interpretable accelerates scientific discovery and improves health outcomes, but requires that researchers, data curators, and data archivists consider the long-term disposition of data and the costs of preserving, archiving, and promoting access to them. Life Cycle Decisions for Biomedical Data examines and assesses approaches and considerations for forecasting costs for preserving, archiving, and promoting access to biomedical research data. This report provides a comprehensive conceptual framework for cost-effective decision making that encourages data accessibility and reuse for researchers, data managers, data archivists, data scientists, and institutions that support platforms that enable biomedical research data preservation, discoverability, and use.more » « less
- 
            Abstract The geographic center of El Niño has shifted from the tropical eastern Pacific (EP) in the 20th century to the tropical central Pacific (CP) in the 21st century. Analyzing data spanning 1948–2018, this study uncovers notable alterations in the impact of the changing El Niño patterns on California market squid (Doryteuthis opalescens) landings. While the traditional EP El Niño in the 20th century significantly reduces squid landings, this impact diminishes with the ascent of the CP type of El Niño in the 21st century. Remarkably, the CP‐I type of El Niño, a specific variant where warming occurs predominantly in the central Pacific and is often less intense but more frequent than traditional El Niño events, can even amplify squid landings. These transformations stem from variations in sea surface temperature, trade winds, and Sverdrup transport associated with different El Niño types. These findings suggest that the fishery community should consider developing adaptive approaches to address the evolving impacts of El Niño.more » « less
 An official website of the United States government
An official website of the United States government 
				
			 
					 
					
 
                                    