Context This research was conducted within the NSF-SEEKCommons Project, a research initiative dedicated to supporting Open Science and Open Access in disciplinary research. The project has a special interest in understanding the role that critical infrastructure has in supporting open initiatives. The Open Journal System (OJS) serves as a long-standing fundamental piece for Open Access throughout the globe. Hence, it provides valuable information about experiences developing, deploying, and maintaining open technologies.  Methods We used mixed methods for our research, triangulating repository data, installation data, interviews, and documentary analysis. We collected repository data using a report generator (Kopp [2018] 2024) that uses repository metadata to present general statistics about a Git project. The resulting information was manually curated, disambiguated, and annotated to have a homogeneous set of developers with information about their institutional affiliation and country.    Names are normalized based on the information in qualitative interviews and by browsing the full-extent commits in the GitHub repository. Other sources for this were the institutional materials (available in current and archived versions of the PKP website), meeting minutes, the user forum, and further project documentation available online. GitHub handles are homologated to their most comprehensive version. For institutional and country affiliation, we resorted to GitHub profiles, PKP documentation and forums, institutional domains available in emails, and researchers' ORCID IDs.  Available files Information about the codebase (number of files, lines of code, and timestamp) organized by month, quarter, and semester. See file: OJS_GitStats_04-24.csv Information about the historical evolution of the codebase (number of files, lines of code, and timestamp), including a description of the top committers for each month. Commiters are described by including their institutional affiliation and country of origin. See file: OJS_DevStats_Institution-Country_1.tsv Information about the historical evolution of the codebase focusing on top committers, along with their institution and country. This file is formatted to map the co-occurrence of developers and attributes by month between 2004-2024.See file: OJS_DevStats_Institution-Country_2.tsv Selected fields to describe working and regularly maintained plugins for OJS as of October 2024. Includes name of the plugin, homepage, description, maintainer, and institutional affiliation. See file: OJS_Plugins_2024_Processed.tsv Details of the aggregated information included in Table 5 of the article.See file: OJS_Plugins_2024_Table5.tsv Snapshot to XML information of the plugin gallery of OJS (October 21) retrieved from PKP website (Smecher 2024)See file: OJS_Plugins_2024.csv Funding The SEEKCommons Project is funded by the U.S. National Science Foundation (NSF), grant #2226425 
                        more » 
                        « less   
                    
                            
                            caltechdata_api – v1.10.0
                        
                    
    
            Repository now has a full suite of automated tests. Outdated datacite43 files replaced with files from the current version of CaltechDATA. Migrated the repository to use a modern pyproject.toml and setup.cfg setup. Incorporated a workflow to update setup.cfg automatically when codemeta.json changes, via the codemeta2cff.yml GitHub Action. return_id option added to caltechdata_edit, which matched the behavior of caltechdata_write by returning the record id CLI supports a profile file with saved orcid and funding data, better orcid support, bug fixes, and many improvements to the validate function Example jupyter notebook added 
        more » 
        « less   
        
    
                            - Award ID(s):
- 2322420
- PAR ID:
- 10617398
- Publisher / Repository:
- CaltechDATA
- Date Published:
- Subject(s) / Keyword(s):
- GitHub IGA InvenioRDM metadata Python software
- Format(s):
- Medium: X
- Right(s):
- BSD 3 Clause
- Sponsoring Org:
- National Science Foundation
More Like this
- 
            
- 
            New functionality is added to the LAMMPS molecular simulation package, which increases the versatility with which LAMMPS can interface with supporting software and manipulate information associated with bonded force fields. We introduce the “type label” framework that allows atom types and their higher-order interactions (bonds, angles, dihedrals, and impropers) to be represented in terms of the standard atom type strings of a bonded force field. Type labels increase the human readability of input files, enable bonded force fields to be supported by the OpenKIM repository, simplify the creation of reaction templates for the REACTER protocol, and increase compatibility with external visualization tools, such as VMD and OVITO. An introductory primer on the forms and use of bonded force fields is provided to motivate this new functionality and serve as an entry point for LAMMPS and OpenKIM users unfamiliar with bonded force fields. The type label framework has the potential to streamline modeling workflows that use LAMMPS by increasing the portability of software, files, and scripts for preprocessing, running, and postprocessing a molecular simulation.more » « less
- 
            The ability to identify scholarly authors is central to bibliometric analysis. Efforts to disambiguate author names using algorithms or national or societal registries become less effective with increases in the number of publications from China and other nations where shared and similar names are prevalent. This work analyzes the adoption and integration of an open source, cross-national identification system, the Open Researcher and Contributor ID system (ORCID), in Web of Science metadata. Results at the article level show greater adoption, to date, of the ORCID iD in Europe as compared with Asia and the US. Focusing analysis on individual highly cited researchers with the shared Chinese surname “Wang,” results indicate wide scope for greater adoption of ORCID. The mechanisms for integrating ORCID iDs into articles also come into question in an analysis of co-authors of one particular highly cited researcher who have varying percentages of articles with ORCID iDs attached. These results suggest that systematic variations in adoption and integration of ORCID into publication metadata should be considered in any bibliometric analysis based on it.more » « less
- 
            Codes and data for "Large language models design sequence-defined macromolecules via evolutionary optimization" Note this repository contains codes and data files for the manuscript. This is a snapshot of the repository, frozen at the time of submission. Codes: LLM codes, other algorithms, postprocessing, visualization Data files: prompts, models, embeddings, LLM responsesmore » « less
- 
            null (Ed.)Abstract How can we evaluate the performance of a disambiguation method implemented on big bibliographic data? This study suggests that the open researcher profile system, ORCID, can be used as an authority source to label name instances at scale. This study demonstrates the potential by evaluating the disambiguation performances of Author-ity2009 (which algorithmically disambiguates author names in MEDLINE) using 3 million name instances that are automatically labeled through linkage to 5 million ORCID researcher profiles. Results show that although ORCID-linked labeled data do not effectively represent the population of name instances in Author-ity2009, they do effectively capture the ‘high precision over high recall’ performances of Author-ity2009. In addition, ORCID-linked labeled data can provide nuanced details about the Author-ity2009’s performance when name instances are evaluated within and across ethnicity categories. As ORCID continues to be expanded to include more researchers, labeled data via ORCID-linkage can be improved in representing the population of a whole disambiguated data and updated on a regular basis. This can benefit author name disambiguation researchers and practitioners who need large-scale labeled data but lack resources for manual labeling or access to other authority sources for linkage-based labeling. The ORCID-linked labeled data for Author-ity2009 are publicly available for validation and reuse.more » « less
 An official website of the United States government
An official website of the United States government 
				
			 
					 
					
