Title: Growing New Scholarly Communication Infrastructures for Sharing, Reusing, and Synthesizing Knowledge
Sharing, reuse, and synthesis of knowledge are central to the research process. These core functions are in theory served by the system of monographs, abstracts, and papers in journals and proceedings, with citation indices and search databases that comprise the core of our formal scholarly communication infrastructure; yet, converging lines of empirical and anecdotal evidence suggest that this system does not adequately act as infrastructure for synthesis. Emerging developments in new institutions for science, along with new technical infrastructures and tooling for decentralized knowledge work, offer new opportunities to prototype new technical infrastructures on top of a different installed base than the publish-or-perish, neoliberal academy. This workshop aims to integrate these developments and communities with CSCW’s deep roots in knowledge infrastructures and collaborative and distributed sensemaking, to stimulate and accelerate progress towards prototyping new scholarly communication infrastructures that are actually optimized for sharing, reusing, and synthesizing knowledge.
Award ID(s):
2046454
PAR ID:
10454102
Journal Name:
CSCW'22 Companion: Companion Publication of the 2022 Conference on Computer Supported Cooperative Work and Social Computing
Page Range / eLocation ID:
278-281
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like This
  1. Unlike aboveground utility systems, for which very detailed and accurate information exists, there is generally a dearth of good-quality data about underground utility infrastructures that provide vital services. To identify key strategies to improve the resilience of these underground systems, this paper presents mechanisms for successful engagement and collaboration among stakeholders and shared cross-sector system vulnerability concerns (including data availability) based on the innovative use of focus groups. Outputs from two virtual focus groups were used to obtain information from New York City area utilities and other stakeholders affected by underground infrastructure. There was strong agreement among participants that (1) a trusted agency in New York City government should manage a detailed map of underground infrastructure that would allow stakeholders to securely access appropriate information about underground systems on a need-to-know basis; (2) environmental risk factors, such as infrastructure age and condition, as well as location should be included; and (3) improved mechanisms for collaboration and sharing information are needed, especially during non-emergency situations. Stakeholders also highlighted the need for a regularly updated central database of relevant contacts at key organizations, since institutions often have a high employee turnover rate, which creates knowledge loss. The focus group script developed as part of this research was designed to be transferable to other cities to assess data needs and potential obstacles to stakeholder collaboration in the areas of underground infrastructure mapping and modeling.
  2. Recent CSCW research on the collaborative design and development of research infrastructures for the natural sciences has increasingly focused on the challenges of open data sharing. This qualitative study describes and analyzes how multidisciplinary, geographically distributed ocean scientists are integrating highly diverse data as part of an effort to develop a new research infrastructure to advance science. This paper identifies different kinds of coordination that are necessary to align processes of data collection, production, and analysis. Some of the hard work to integrate data is undertaken before data integration can even become a technical problem. After data integration becomes a technical problem, social and organizational means continue to be critical for resolving differences in assumptions, methods, practices, and priorities. This work calls attention to the diversity of coordinative, social, and organizational practices and concerns that are needed to integrate data and also how, in highly innovative work, the process of integrating data also helps to define scientific problem spaces themselves. 
  3. The last 15 years have seen a marked growth of data management and sharing policies among federal agencies in the US and Canada. While these policies have an undeniable impact in terms of increased publicly available datasets, they have also impacted the research practices of funded researchers and the services and infrastructure provided by institutions. Researchers and institutions alike share the responsibility to align practices with funding agency requirements concerning data management and sharing, but each stakeholder group has responded in ways that may not align with one another. This presentation delves into research resulting from the National Science Foundation-funded Realities of Academic Data Sharing (RADS) Initiative and provides a comprehensive comparative analysis of services and infrastructure of six academic institutions, as well as an overview of the overall impact of these policies for researchers and institutions. Insights into services, infrastructure, and impact can lead to the creation of streamlined pathways for enhancing institutional efficiencies in data management and sharing. 
  4. One of the most costly factors in providing a global computing infrastructure such as the WLCG is the human effort in deployment, integration, and operation of the distributed services supporting collaborative computing, data sharing and delivery, and analysis of extreme scale datasets. Furthermore, the time required to roll out global software updates, introduce new service components, or prototype novel systems requiring coordinated deployments across multiple facilities is often increased by communication latencies, staff availability, and in many cases expertise required for operations of bespoke services. While the WLCG (and distributed systems implemented throughout HEP) is a global service platform, it lacks the capability and flexibility of a modern platform-as-a-service including continuous integration/continuous delivery (CI/CD) methods, development-operations capabilities (DevOps, where developers assume a more direct role in the actual production infrastructure), and automation. Most importantly, tooling which reduces required training, bespoke service expertise, and the operational effort throughout the infrastructure, most notably at the resource endpoints (sites), is entirely absent in the current model. In this paper, we explore ideas and questions around potential NoOps models in this context: what is realistic given organizational policies and constraints? How should operational responsibility be organized across teams and facilities? What are the technical gaps? What are the social and cybersecurity challenges? Conversely what advantages does a NoOps model deliver for innovation and for accelerating the pace of delivery of new services needed for the HL-LHC era? We will describe initial work along these lines in the context of providing a data delivery network supporting IRIS-HEP DOMA R&D.
  5. Incomplete and inconsistent connections between institutional repository holdings and the global data infrastructure inhibit research data discovery and reusability. Preventing metadata loss on the path from institutional repositories to the global research infrastructure can substantially improve research data reusability. The Realities of Academic Data Sharing (RADS) Initiative, funded by the National Science Foundation, is investigating institutional processes for improving research data FAIRness. Focal points of the RADS inquiry are to understand where researchers are sharing their data and to assess metadata quality, i.e., completeness, at six Data Curation Network (DCN) academic institutions: Cornell University, Duke University, University of Michigan, University of Minnesota, Washington University in St. Louis, and Virginia Tech. RADS is examining where researchers are storing their data, considering local institutional repositories and other popular repositories, and analyzing the completeness of the research data metadata stored in these institutional and other repositories. Metadata FAIRness (Findable, Accessible, Interoperable, Reusable) is used as the metric to assess metadata quality as FAIR complete. Research findings show significant content loss when metadata from local institutional repositories are compared to metadata found in DataCite. After examining the factors contributing to this metadata loss, RADS investigators are developing a set of recommended best practices for institutions to increase the quality of their scholarly metadata. Further, documentation such as README files are of particular importance not only for data reuse, but as sources containing valuable metadata such as Persistent Identifiers (PIDs). DOIs and related PIDs such as ORCID and ROR are still rarely used in institutional repositories. More frequent use would have a positive effect on discoverability, interoperability and reusability, especially when transferring to global infrastructure.