skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Data for "Breaking the Paywall: The role of Open Journal System as key Open Science infrastructure"
Context This research was conducted within the NSF-SEEKCommons Project, a research initiative dedicated to supporting Open Science and Open Access in disciplinary research. The project has a special interest in understanding the role that critical infrastructure has in supporting open initiatives. The Open Journal System (OJS) serves as a long-standing fundamental piece for Open Access throughout the globe. Hence, it provides valuable information about experiences developing, deploying, and maintaining open technologies.  Methods We used mixed methods for our research, triangulating repository data, installation data, interviews, and documentary analysis. We collected repository data using a report generator (Kopp [2018] 2024) that uses repository metadata to present general statistics about a Git project. The resulting information was manually curated, disambiguated, and annotated to have a homogeneous set of developers with information about their institutional affiliation and country.    Names are normalized based on the information in qualitative interviews and by browsing the full-extent commits in the GitHub repository. Other sources for this were the institutional materials (available in current and archived versions of the PKP website), meeting minutes, the user forum, and further project documentation available online. GitHub handles are homologated to their most comprehensive version. For institutional and country affiliation, we resorted to GitHub profiles, PKP documentation and forums, institutional domains available in emails, and researchers' ORCID IDs.  Available files Information about the codebase (number of files, lines of code, and timestamp) organized by month, quarter, and semester. See file: OJS_GitStats_04-24.csv Information about the historical evolution of the codebase (number of files, lines of code, and timestamp), including a description of the top committers for each month. Commiters are described by including their institutional affiliation and country of origin. See file: OJS_DevStats_Institution-Country_1.tsv Information about the historical evolution of the codebase focusing on top committers, along with their institution and country. This file is formatted to map the co-occurrence of developers and attributes by month between 2004-2024.See file: OJS_DevStats_Institution-Country_2.tsv Selected fields to describe working and regularly maintained plugins for OJS as of October 2024. Includes name of the plugin, homepage, description, maintainer, and institutional affiliation. See file: OJS_Plugins_2024_Processed.tsv Details of the aggregated information included in Table 5 of the article.See file: OJS_Plugins_2024_Table5.tsv Snapshot to XML information of the plugin gallery of OJS (October 21) retrieved from PKP website (Smecher 2024)See file: OJS_Plugins_2024.csv Funding The SEEKCommons Project is funded by the U.S. National Science Foundation (NSF), grant #2226425  more » « less
Award ID(s):
2226425
PAR ID:
10621741
Author(s) / Creator(s):
;
Publisher / Repository:
Zenodo
Date Published:
Subject(s) / Keyword(s):
Science and technology studies Open Access Computer and information sciences FOS: Computer and information sciences Digital Infrastructure Academic Publishing Open Technologies Scientific Publishing Open Science Paywall Library sciences Free and Open Source Software FOSS
Format(s):
Medium: X
Right(s):
Creative Commons Attribution 4.0 International
Sponsoring Org:
National Science Foundation
More Like this
  1. This repository contains our raw datasets from channel measurements performed at the University of Utah campus. In addition, we have included a document that explains the setup and methodology used to collect this data, as well as a very brief discussion of results.  File organization: * documentation/ - Contains a .docx with the description of the setup and evaluation. * data/ - HDF5 files containing both metadata and raw IQ samples for each location at which data was collected. Notice we collected data at 14  different client locations. See map in the attached docx (skipped locations 12 and 16). We deployed 5 different receivers at 5 different rooftops. Due to resource constraints, one set of files contains data from 4 different locations whereas another set  contains information from the single remaining location. We have developed a set of python scripts that allow us to parse and analyze the data. Although not included here, they can be found in our public repository: https://github.com/renew-wireless/RENEWLab You can find the top script here.</p> For more information on the POWDER-RENEW project please visit the POWDER website. The RENEW part of the project focuses on the deployment of an open-source massive MIMO system. Please visit our website for more information.</p> 
    more » « less
  2. Open source software (OSS) underpins modern software infrastructure, yet many projects struggle with long- term sustainability. We introduce OSSPREY, an AI-powered platform that can predict the sustainability of any GitHub- hosted project. OSSPREY collects longitudinal socio-technical data, such as: commits, issues, and contributor interactions, and uses a transformer-based model to generate month-by-month sustainability forecasts. When project downturns are detected, it recommends evidence-based interventions drawn from published software engineering studies. OSSPREY integrates scraping, forecasting, and actionable guidance into an interactive dash- board, enabling maintainers to monitor project health, anticipate decline, and respond with targeted strategies. By connecting real- time project data with research-backed insights, OSSPREY offers a practical tool for sustaining OSS projects at scale. The codebase is linked to the project website at: https: //oss-prey.github.io/OSSPREY-Website/ The screencast is available at: https://www.youtube.com/ watch?v=N7a0v4hPylU 
    more » « less
  3. Abstract BackgroundAn updated version of the mwtab Python package for programmatic access to the Metabolomics Workbench (MetabolomicsWB) data repository was released at the beginning of 2021. Along with updating the package to match the changes to MetabolomicsWB’s ‘mwTab’ file format specification and enhancing the package’s functionality, the included validation facilities were used to detect and catalog file inconsistencies and errors across all publicly available datasets in MetabolomicsWB. ResultsThe MetabolomicsWB File Status website was developed to provide continuous validation of MetabolomicsWB data files and a useful interface to all found inconsistencies and errors. This list of detectable issues/errors include format parsing errors, format compliance issues, access problems via MetabolomicsWB’s REST interface, and other small inconsistencies that can hinder reusability. The website uses the mwtab Python package to pull down and validate each available analysis file and then generates an html report. The website is updated on a weekly basis. Moreover, the Python website design utilizes GitHub and GitHub.io, providing an easy to replicate template for implementing other metadata, virtual, and meta- repositories. ConclusionsThe MetabolomicsWB File Status website provides a metadata repository of validation metadata to promote the FAIR use of existing metabolomics datasets from the MetabolomicsWB data repository. 
    more » « less
  4. This dataset of 7304 aluminum grain boundaries provides comprehensive coverage of the 5D space of crystallographic character. The dataset and some of its characteristics are described in detail in https://doi.org/10.1016/j.actamat.2022.118006. The dataset here includes a zip file with all 7304 minimum energy grain boundary structure files, which are minimized dump files from LAMMPS. The dump files only include atoms +/- 15 angstroms from the grain boundary plane. The CSV file contains information about all 7304 grain boundaries, including information about the crystallographic character and a few computed properties. A README file provides a description of the columns of the CSV file. 
    more » « less
  5. Abstract This paper summarizes the open community conventions developed by the Ecological Forecasting Initiative (EFI) for the common formatting and archiving of ecological forecasts and the metadata associated with these forecasts. Such open standards are intended to promote interoperability and facilitate forecast communication, distribution, validation, and synthesis. For output files, we first describe the convention conceptually in terms of global attributes, forecast dimensions, forecasted variables, and ancillary indicator variables. We then illustrate the application of this convention to the two file formats that are currently preferred by the EFI, netCDF (network common data form), and comma‐separated values (CSV), but note that the convention is extensible to future formats. For metadata, EFI's convention identifies a subset of conventional metadata variables that are required (e.g., temporal resolution and output variables) but focuses on developing a framework for storing information about forecast uncertainty propagation, data assimilation, and model complexity, which aims to facilitate cross‐forecast synthesis. The initial application of this convention expands upon the Ecological Metadata Language (EML), a commonly used metadata standard in ecology. To facilitate community adoption, we also provide a Github repository containing a metadata validator tool and several vignettes in R and Python on how to both write and read in the EFI standard. Lastly, we provide guidance on forecast archiving, making an important distinction between short‐term dissemination and long‐term forecast archiving, while also touching on the archiving of code and workflows. Overall, the EFI convention is a living document that can continue to evolve over time through an open community process. 
    more » « less