This paper presents an in-depth analysis of patterns and trends in the open-source software (OSS) contributions by the U.S. federal government agencies. OSS is a unique category of computer software notable for its publicly accessible source code and the rights it provides for modification and distribution for any purpose. Prompted by the Federal Source Code Policy (USCIO, 2016), Code.gov was established as a platform to facilitate the sharing of custom-developed software across various federal government agencies. This study leverages data from Code.gov, which catalogs OSS projects developed and shared by government agencies, and enhances this data with detailed development and contributor information from GitHub. By adopting a cost estimation methodology that is consistent with the U.S. national accounting framework for software investment proposed in Korkmaz et al. (2024), this research provides annual estimates of investment in OSS by government agencies for the 2009–2021 period. The findings indicate a significant investment by the federal government in OSS, with the 2021 investment estimated at around $407 million. This study not only sheds light on the government’s role in fostering OSS development but also offers a valuable framework for assessing the scope and value of OSS initiatives within the public sector. 
                        more » 
                        « less   
                    
                            
                            Open source software in the federal government: An analysis of Code.Gov
                        
                    
    
            Open source software (OSS) is ubiquitous, serving as specialized applications nurtured by devoted user communities, and as digital infrastructure underlying platforms used by millions of people. OSS is developed, maintained, and extended through the contribution of independent developers as well as people from businesses, universities, government research institutions, and nonprofits. Despite its prevalence, the scope and impact of OSS are not currently well-measured. Recent policies of the U.S. Federal Government promote sharing of software code developed by or for the Federal Government. While the policy to promote reusing and sharing of software created with public funding is relatively new, public funding plays an important and not fully accounted role in the creation of OSS. This paper aims to measure the scope and value of OSS development in the U.S. Federal Government. We collect data from Code.gov, the government’s platform for sharing OSS projects, and study contributions of agencies. The dataset contains 17K repositories from 21 agencies, with the majority of contributions originating from the DOE, NASA and GSA. In addition, we collect data on development activity (e.g., lines of code, contributors) of the repositories on GitHub, the largest hosting facility worldwide. Adopting a cost estimation model from software engineering, we generate estimates of investment in OSS that are consistent with the U.S. national accounting methods used for measuring software investment. Finally, we generate and analyze collaboration network resulting from cross-agency contributions to repositories and explore the centrality of agencies in the network. 
        more » 
        « less   
        
    
                            - Award ID(s):
- 2306160
- PAR ID:
- 10528118
- Publisher / Repository:
- 2023 Government Advances in Statistical Programming Conference
- Date Published:
- Format(s):
- Medium: X
- Institution:
- Computational Statistics for the Production of Official Statistics
- Sponsoring Org:
- National Science Foundation
More Like this
- 
            
- 
            Open source software (OSS) is software that anyone can review, modify, and distribute freely, usually with only minor restrictions such as giving credit to the creator of the work. The use of OSS is growing rapidly, due to its value in increasing firm and economy-wide productivity. Despite its widespread use, there is no standardized methodology for measuring the scope and impact of this fundamental intangible asset. This study presents a framework to measure the value of OSS using data collected from GitHub, the largest platform in the world with over 100 million developers. The data include over 7.6 million repositories where software is developed, stored, and managed. We collect information about contributors and development activity such as code changes and license detail. By adopting a cost estimation model from software engineering, we develop a methodology to generate estimates of investment in OSS that are consistent with the U.S. national accounting methods used for measuring software investment. We generate annual estimates of current and inflation-adjusted investment as well as the net stock of OSS for the 2009–2019 period. Our estimates show that the U.S. investment in 2019 was $37.8 billion with a current-cost net stock of $74.3 billion.more » « less
- 
            In recent years, the FAIR guiding principles and the broader concept of open science has grown in importance in academic research, especially as funding entities have aggressively promoted public sharing of research products. Key to public research sharing is deposition of datasets into online data repositories, but it can be a chore to transform messy unstructured data into the forms required by these repositories. To help generate Metabolomics Workbench depositions, we have developed the MESSES (Metadata from Experimental SpreadSheets Extraction System) software package, implemented in the Python 3 programming language and supported on Linux, Windows, and Mac operating systems. MESSES helps transform tabular data from multiple sources into a Metabolomics Workbench specific deposition format. The package provides three commands, extract, validate, and convert, that implement a natural data transformation workflow. Moreover, MESSES facilitates richer metadata capture than is typically attempted by manual efforts. The source code and extensive documentation is hosted on GitHub and is also available on the Python Package Index for easy installation.more » « less
- 
            1. The success of conservation initiatives often depends on the inclusion of diverse stakeholder interests in the decision-making process. Yet, there is a paucity of empirical knowledge concerning the factors that explain why stakeholders do—or do not—believe that they are meaningfully represented by government agencies. 2. Our study provides insight into the relationship between trust and stakeholder perceptions of inclusivity in public land management decisions. Here, we focus on the U.S. state of Alaska, where almost two-thirds of the land area are managed by the federal government. 3. We used structural equation modelling to test whether an individual's trust and the information sources used to learn about land management positively influenced perceived inclusivity. We conceptualized trust in terms of four dimensions that reflected an individual's disposition to trust, trust in the federal government, trust in shared values and trust that agencies adhere to a moral code. 4. We found that survey respondents across the U.S. state of Alaska had a limited disposition to trust others, did not trust federal land management agencies, did not believe agencies shared their values pertaining to protected area management and did not believe that agencies adhered to a moral code. 5. Beliefs about the morality of agencies were the primary driver of perceived inclusivity in land management decisions, indicating that agencies should focus on solving problems through deliberation and discussion about moral principles rather than by force. 6. Information acquired from professional, community-based or environmental advocacy exchanges also positively influenced perceived levels of involvement among stakeholders in resource management decisions. 7. These results provide a roadmap for how land management agencies can improve public relations and work towards a model of inclusive conservation around protected areas.more » « less
- 
            The United States takes a federalist approach to pandemic responses while the bulk of pandemic powers sits at the state level. Thus, comprehensive accounts of how state health officials managed the crisis and how the federal government affected those efforts are needed to better understand the governmental response to the COVID-19 pandemic. This article reports the results of semistructured interviews with 29 state-level policy actors from 16 US states. Interviewees discussed multiple aspects of the US federal COVID-19 response that affected the response in their states, including communications with the public, intergovernmental communications, and federal actions regarding various aspects of health service preparedness including emergency funding, procurement, testing capacity, vaccine development and distribution, and data systems. This research enriches the discussion about US pandemic preparedness and response, and indicates that alignment of public communications across government levels, enhanced intergovernmental communication, inclusion of rural perspectives, and federal investment in and sustainment of health service preparedness are key factors that can improve future US pandemic responses.more » « less
 An official website of the United States government
An official website of the United States government 
				
			 
					 
					
 
                                    