skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: DataVault: a data storage infrastructure for the Einstein Toolkit
Abstract Data sharing is essential in the numerical simulations research. We introduce a data repository, DataVault, which is designed for data sharing, search and analysis. A comparative study of existing repositories is performed to analyze features that are critical to a data repository. We describe the architecture, workflow, and deployment of DataVault, and provide three use-case scenarios for different communities to facilitate the use and application of DataVault. Potential features are proposed and we outline the future development for these features.  more » « less
Award ID(s):
2004879
PAR ID:
10320826
Author(s) / Creator(s):
; ; ;
Date Published:
Journal Name:
Classical and Quantum Gravity
Volume:
38
Issue:
13
ISSN:
0264-9381
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Data sharing is increasingly an expectation in health research as part of a general move toward more open sciences. In the United States, in particular, the implementation of the 2023 National Institutes of Health Data Management and Sharing Policy has made it clear that qualitative studies are not exempt from this data sharing requirement. Recognizing this trend, the Palliative Care Research Cooperative Group (PCRC) realized the value of creating a de-identified qualitative data repository to complement its existing de-identified quantitative data repository. The PCRC Data Informatics and Statistics Core leadership partnered with the Qualitative Data Repository (QDR) to establish the first serious illness and palliative care qualitative data repository in the U.S. We describe the processes used to develop this repository, called the PCRC-QDR, as well as our outreach and education among the palliative care researcher community, which led to the first ten projects to share the data in the new repository. Specifically, we discuss how we co-designed the PCRC-QDR and created tailored guidelines for depositing and sharing qualitative data depending on the original research context, establishing uniform expectations for key components of relevant documentation, and the use of suitable access controls for sensitive data. We also describe how PCRC was able to leverage its existing community to recruit and guide early depositors and outline lessons learned in evaluating the experience. This work advances the establishment of best practices in qualitative data sharing. 
    more » « less
  2. Abstract One of the goals of open science is to promote the transparency and accessibility of research. Sharing data and materials used in network research is critical to these goals. In this paper, we present recommendations for whether, what, when, and where network data and materials should be shared. We recommend that network data and materials should be shared, but access to or use of shared data and materials may be restricted if necessary to avoid harm or comply with regulations. Researchers should share the network data and materials necessary to reproduce reported results via a publicly accessible repository when an associated manuscript is published. To ensure the adoption of these recommendations, network journals should require sharing, and network associations and academic institutions should reward sharing. 
    more » « less
  3. Object signatures have been widely used in object detection and classification. Following a similar idea, the authors developed geometric signatures for architecture, engineering, and construction (AEC) objects such as footings, slabs, walls, beams, and columns. The signatures were developed both scientifically and empirically, by following a data-driven approach based on analysis of collected building information modeling (BIM) data using geometric theories. Rigorous geometric properties and statistical information were included in the developed geometric signatures. To enable an open access to BIM data using these signatures, the authors also initiated a BIM data repository with a preliminary collection of AEC objects and their geometric signatures. The developed geometric signatures were preliminarily tested by a small object classification experiment where 389 object instances from an architectural model were used. A rule-based algorithm developed using all parameter values of 14 features from the geometric signatures of the objects successfully classified 336 object instances into the correct categories of beams, columns, slabs, and walls. This higher than 85% accuracy showed the developed geometric signatures are promising. The collected and processed data were deposited into the Purdue University Research Repository (PURR) for sharing. 
    more » « less
  4. This data repository is for sharing all the dataset available for the NSF project titled: NSF Collaborative Research: Impacts of Evaporation and Waves on Groundwater Dynamics in Tidally Influenced Beaches. 
    more » « less
  5. Dietrich, Suzanne W; Zelinski, Mary B; Sluka, James P; Watanabe, Karen H (Ed.)
    Dataset for histology images from the ovaries of Japanese macaque (Macaca fuscata). These images are associated with the Multispecies Ovary Tissue Histology Electronic Repository (MOTHER), an online repository (https://mother-db.org) of ovary tissue histology digital images funded by NSF (DBI-2054061). Sharing these histology images will facilitate comparative studies of female reproductive strategies, enable the development of computational models to test hypotheses related to ovarian development and female reproduction, and serve as an educational resource thereby reducing the use of animals in research. See the README file (https://dataverse.asu.edu/file.xhtml?fileId=10645) for an overview of the dataset, including file naming conventions. 
    more » « less