Title: The New Information Retrieval Problem: Data Availability
ABSTRACT The goals of open science are driven by policies requiring data management, sharing, and accessibility. One way of measuring the impact of open science policies on scientific knowledge is to access data that has been prepared for re‐use. But how accessible and available are data resources? In this paper, we discuss a method for exploring and locating datasets made available by scientists from federally funded projects in the US. The data pathways method was tested on federal awards. Here we describe the method and the results from analyzing fifty federal awards granted by the National Science Foundation to pursue data resources and their availability in publications, data repositories, or institutional repositories. The data pathways approach contributes to the development of a practical approach to availability that captures the current ways in which data are accessible from federally funded science projects, ranging from institutional repositories, journal data deposit, PI and project web pages, and science data platforms, among other possibilities. This paper discusses some background and motivations for such a method, the method itself, the research design, barriers encountered when searching for data resources from projects, and how this method can be useful to future studies of data availability.
Award ID(s):
2020604 2020183 2429325
PAR ID:
10474780
Author(s) / Creator(s):
; ; ; ;
Publisher / Repository:
Proceedings of the Association for Information Science and Technology
Date Published:
Journal Name:
Proceedings of the Association for Information Science and Technology
Volume:
60
Issue:
1
ISSN:
2373-9231
Page Range / eLocation ID:
379 to 387
Subject(s) / Keyword(s):
Data availability Open Science Policies Data Management
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Beginning on or before 31 December 2025, all recipients of United States federal research funding will be required to make their federally funded scholarly outputs, including scientific data, freely available via public access venues with no delays or embargos. This paper focuses on research data as one of the key scholarly output types impacted by the requirements outlined in the Memorandum on Ensuring Free, Immediate and Equitable Access to Federally Funded Research issued by the US Office of Science and Technology Policy (OSTP), commonly called the “Nelson memo”. This paper sets out working definitions of four key terms: cost, price, reasonable, and allowable. Using these terms, we describe some of the pathways research data take to final publication, and summarize some of the extensive body of research on the costs of research data curation and sharing. In the process, we look at cost modeling experimentation in the fields of research data management and digital preservation to consider what might be relevant from their approaches. 
  2. Beginning on or before 31 December 2025, all recipients of United States federal research funding will be required to make their federally funded scholarly outputs, including scientific data, freely available via public access venues with no delays or embargos. This paper focuses on research data as one of the key scholarly output types impacted by the requirements outlined in the Memorandum on Ensuring Free, Immediate and Equitable Access to Federally Funded Research issued by the US Office of Science and Technology Policy (OSTP), commonly called the “Nelson memo”. This paper sets out working definitions of four key terms: cost, price, reasonable, and allowable. Using these terms, we describe some of the pathways research data take to final publication, and summarize some of the extensive body of research on the costs of research data curation and sharing. In the process, we look at cost modelling experimentation in the fields of research data management and digital preservation to consider what might be relevant from their approaches. 
  3. The last 15 years have seen a marked growth of data management and sharing policies among federal agencies in the US and Canada. While these policies have an undeniable impact in terms of increased publicly available datasets, they have also impacted the research practices of funded researchers and the services and infrastructure provided by institutions. Researchers and institutions alike share the responsibility to align practices with funding agency requirements concerning data management and sharing, but each stakeholder group has responded in ways that may not align with one another. This presentation delves into research resulting from the National Science Foundation-funded Realities of Academic Data Sharing (RADS) Initiative and provides a comprehensive comparative analysis of services and infrastructure of six academic institutions, as well as an overview of the overall impact of these policies for researchers and institutions. Insights into services, infrastructure, and impact can lead to the creation of streamlined pathways for enhancing institutional efficiencies in data management and sharing. 
  4. Abstract To meet the demands of technological change required for climate change mitigation, academic research must cover a broad range of climate solutions. Diverse participation in this research is important because research shows that a variety of backgrounds and problem-solving approaches are important to solving complex problems such as climate change. In our study, we examine the disciplinary and institutional diversity of federal funding for academic research on climate solutions (ARCS) in the United States. We identify $1.42 billion in federal funding for ARCS in fiscal years 2019 and 2020. Our findings reveal that 85% of federal ARCS grants are awarded to Principal Investigators in engineering and the natural sciences. Additionally, institutions classified as having high research activity (R1s) receive over 60% of the ARCS funding per student. Tribal institutions, Historically Black Colleges and Universities, and Hispanic Serving Institutions collectively receive only $109.20 in ARCS funding per student, compared to $334.30 per student for other institution types. These disparities in federally funded ARCS grants are, in part, a consequence of the absence of policies that promote interdisciplinary collaboration and broader participation in academic research. We discuss the policy implications that have contributed to the identified inequities in ARCS funding and current policies that could enhance the distribution of ARCS in the future. We propose strategies for federally funded ARCS to support an equitable energy transition that addresses the needs of contemporary society and beyond.
  5. What is the relationship between Data Management Plans (DMPs), DMP guidance documents, and the reality of end-of-project data preservation and access? In this short paper we report on some preliminary findings of a 3-year investigation into the impact of DMPs on federally funded science in the United States. We investigated a small sample of publicly accessible DMPs (N=14) published using DMPTool. We found that while DMPs followed the National Science Foundation's guidelines, the pathways to the resulting research data are often obscure, vague, or not obvious. We define two “data pathways” as the search tactics and strategies deployed in order to find datasets. 