skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Interdisciplinary Aspirations and Disciplinary Archives: Losing and finding John M. Weatherby's Soo data
As theorized in language documentation, archives serve to make research reproducible and to make primary data accessible for multiple audiences (Himmelmann 2006; Berez-Kroeker et al. 2018). Scholars in the emerging mid-20th-century field of African history emphasized these same priorities. Mid-century Africanist historians assembled large text collections but failed in a clearly stated disciplinary project to preserve them in accessible archives. This paper explores the relationship between institutional and social factors in data preservation through the story of audio recordings and field notes documenting Soo (Uganda: Kuliak/Nilo-Saharan) collected in the mid-20th century by Makerere University history PhD student John M. Weatherby. For decades, Weatherby struggled and failed to find an institutional home for his materials, which were nearly lost amid changing disciplinary trends. I encountered them only through informal social interactions in 2018 and have subsequently been depositing them in a language archive. The slide of Weatherby’s data into obscurity shows how archiving is inherently a disciplinary practice. Institutions intending to preserve data rose and fell with changing disciplinary paradigms, but Weatherby’s data were preserved through personal relationships. Despite a common emphasis on technical and institutional initiatives for archiving, the relational contexts of legacy materials are central to their preservation.  more » « less
Award ID(s):
1911571
PAR ID:
10376437
Author(s) / Creator(s):
Date Published:
Journal Name:
Language documentation and description
Volume:
21
ISSN:
2756-1224
Page Range / eLocation ID:
101-139
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. The Florida Museum of Natural History (FLMNH) currently houses nearly 580,000 specimen-lots of close to 3 million specimens of recent mollusks. It is one of the largest collections of its type in North America and, except for new donations, its collection data is entirely accessible online. The collection has grown rapidly since its founding in the early 20th century, especially in the past 50 years. While global in scope, the collection is especially strong in material from North and South America, the Caribbean, Madagascar, Pacific Ocean islands, and south Asia. Holdings have been greatly enriched by biodiversity surveys conducted by museum staff and research associates and by acquisition of relinquished institutional and private collections. Field collections often have linked voucher specimens, tissues, sequence data, and digital images. The FLMNH mollusk collection is an important resource for systematics, biogeography, biodiversity studies, and education. 
    more » « less
  2. Data users need relevant context and research expertise to effectively search for and identify relevant datasets. Leading data providers, such as the Inter‐university Consortium for Political and Social Research (ICPSR), offer standardized metadata and search tools to support data search. Metadata standards emphasize the machine‐readability of data and its documentation. There are opportunities to enhance dataset search by improving users' ability to learn about, and make sense of, information about data. Prior research has shown that context and expertise are two main barriers users face in effectively searching for, evaluating, and deciding whether to reuse data. In this paper, we propose a novel chatbot‐based search system, DataChat, that leverages a graph database and a large language model to provide novel ways for users to interact with and search for research data. DataChat complements data archives' and institutional repositories' ongoing efforts to curate, preserve, and share research data for reuse by making it easier for users to explore and learn about available research data. 
    more » « less
  3. Abstract: Rowasu'u is a digital archives project that seeks to reunite A'uwẽ-Xavante individuals with researcher produced documentation of their ancestors, families, bodies, culture, and homelands and eventually provide a platform for the collection and preservation of community knowledge. A'uwẽ-Xavante have a long history of receiving academic researchers including anthropologists, geneticists, biomedical researchers, ecologists, and linguists, but they have had limited access to the documentation and other data produced through these encounters. The Rowasu'u project is working with scholars to compile and make accessible records of more than 60 years of decentralized academic research while partnering with A'uwẽ-Xavante communities historically positioned as the most prominent participants. Our larger aspiration is that in addition to supporting A'uwẽ-Xavante efforts to reclaim their history as recorded by scientists, Rowasu'u will advance Indigenous research governance and data sovereignty as human rights applicable to past as well as future research. This chapter discusses our early progress in developing Rowasu'u using Mukurtu CMS, including the challenges and complexities inherent in navigating local politics in the context of generations of marginalization and exclusion. 
    more » « less
  4. Rowasu'u is a digital archives project that seeks to reunite A'uwẽ-Xavante individuals with researcher produced documentation of their ancestors, families, bodies, culture, and homelands and eventually provide a platform for the collection and preservation of community knowledge. A'uwẽ-Xavante have a long history of receiving academic researchers including anthropologists, geneticists, biomedical researchers, ecologists, and linguists, but they have had limited access to the documentation and other data produced through these encounters. The Rowasu'u project is working with scholars to compile and make accessible records of more than 60 years of decentralized academic research while partnering with A'uwẽ-Xavante communities historically positioned as the most prominent participants. Our larger aspiration is that in addition to supporting A'uwẽ-Xavante efforts to reclaim their history as recorded by scientists, Rowasu'u will advance Indigenous research governance and data sovereignty as human rights applicable to past as well as future research. This chapter discusses our early progress in developing Rowasu'u using Mukurtu CMS, including the challenges and complexities inherent in navigating local politics in the context of generations of marginalization and exclusion. 
    more » « less
  5. null (Ed.)
    Biomedical research results in the collection and storage of increasingly large and complex data sets. Preserving those data so that they are discoverable, accessible, and interpretable accelerates scientific discovery and improves health outcomes, but requires that researchers, data curators, and data archivists consider the long-term disposition of data and the costs of preserving, archiving, and promoting access to them. Life Cycle Decisions for Biomedical Data examines and assesses approaches and considerations for forecasting costs for preserving, archiving, and promoting access to biomedical research data. This report provides a comprehensive conceptual framework for cost-effective decision making that encourages data accessibility and reuse for researchers, data managers, data archivists, data scientists, and institutions that support platforms that enable biomedical research data preservation, discoverability, and use. 
    more » « less