skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Measuring Public Open-Source Software in the Federal Government: An Analysis of Code.gov
This paper presents an in-depth analysis of patterns and trends in the open-source software (OSS) contributions by the U.S. federal government agencies. OSS is a unique category of computer software notable for its publicly accessible source code and the rights it provides for modification and distribution for any purpose. Prompted by the Federal Source Code Policy (USCIO, 2016), Code.gov was established as a platform to facilitate the sharing of custom-developed software across various federal government agencies. This study leverages data from Code.gov, which catalogs OSS projects developed and shared by government agencies, and enhances this data with detailed development and contributor information from GitHub. By adopting a cost estimation methodology that is consistent with the U.S. national accounting framework for software investment proposed in Korkmaz et al. (2024), this research provides annual estimates of investment in OSS by government agencies for the 2009–2021 period. The findings indicate a significant investment by the federal government in OSS, with the 2021 investment estimated at around $407 million. This study not only sheds light on the government’s role in fostering OSS development but also offers a valuable framework for assessing the scope and value of OSS initiatives within the public sector.  more » « less
Award ID(s):
2306160 2224441
PAR ID:
10528099
Author(s) / Creator(s):
;
Publisher / Repository:
The School of Statistics and the Center for Applied Statistics, Renmin University of China
Date Published:
Journal Name:
Journal of Data Science
ISSN:
1680-743X
Page Range / eLocation ID:
1 to 20
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Open source software (OSS) is ubiquitous, serving as specialized applications nurtured by devoted user communities, and as digital infrastructure underlying platforms used by millions of people. OSS is developed, maintained, and extended through the contribution of independent developers as well as people from businesses, universities, government research institutions, and nonprofits. Despite its prevalence, the scope and impact of OSS are not currently well-measured. Recent policies of the U.S. Federal Government promote sharing of software code developed by or for the Federal Government. While the policy to promote reusing and sharing of software created with public funding is relatively new, public funding plays an important and not fully accounted role in the creation of OSS. This paper aims to measure the scope and value of OSS development in the U.S. Federal Government. We collect data from Code.gov, the government’s platform for sharing OSS projects, and study contributions of agencies. The dataset contains 17K repositories from 21 agencies, with the majority of contributions originating from the DOE, NASA and GSA. In addition, we collect data on development activity (e.g., lines of code, contributors) of the repositories on GitHub, the largest hosting facility worldwide. Adopting a cost estimation model from software engineering, we generate estimates of investment in OSS that are consistent with the U.S. national accounting methods used for measuring software investment. Finally, we generate and analyze collaboration network resulting from cross-agency contributions to repositories and explore the centrality of agencies in the network. 
    more » « less
  2. Open source software (OSS) is software that anyone can review, modify, and distribute freely, usually with only minor restrictions such as giving credit to the creator of the work. The use of OSS is growing rapidly, due to its value in increasing firm and economy-wide productivity. Despite its widespread use, there is no standardized methodology for measuring the scope and impact of this fundamental intangible asset. This study presents a framework to measure the value of OSS using data collected from GitHub, the largest platform in the world with over 100 million developers. The data include over 7.6 million repositories where software is developed, stored, and managed. We collect information about contributors and development activity such as code changes and license detail. By adopting a cost estimation model from software engineering, we develop a methodology to generate estimates of investment in OSS that are consistent with the U.S. national accounting methods used for measuring software investment. We generate annual estimates of current and inflation-adjusted investment as well as the net stock of OSS for the 2009–2019 period. Our estimates show that the U.S. investment in 2019 was $37.8 billion with a current-cost net stock of $74.3 billion. 
    more » « less
  3. Over the past two decades, the U.S. federal government has sought to increase its capacity to find, apprehend, and deport noncitizens residing in the United States who have violated federal immigration laws. One way the federal government has done this is by partnering with state and local law enforcement agencies on immigration enforcement efforts. The present study analyzes the records of all 1,964,756 interior removals between fiscal years 2003 and 2015 to examine how, if at all, the types of criminal convictions leading to removal from the U.S. interior have changed during this period of heightened coordination between law enforcement agencies and whether there are differences by gender and region of origin in the types of convictions leading to removal. Findings show that as coordination between law enforcement agencies intensified, the proportion of individuals removed from the U.S. interior with either no criminal convictions or with a driving-related conviction as their most serious conviction increased. Findings also show that the proportion of individuals removed with no criminal convictions was greater for women than for men and that the share of individuals removed with a driving-related conviction as their most serious conviction was greater for Latin Americans than for individuals from all other regions. Given renewed investment in these types of law enforcement partnerships under the Trump administration, the patterns presented in this article may foreshadow trends to come. 
    more » « less
  4. 1. The success of conservation initiatives often depends on the inclusion of diverse stakeholder interests in the decision-making process. Yet, there is a paucity of empirical knowledge concerning the factors that explain why stakeholders do—or do not—believe that they are meaningfully represented by government agencies. 2. Our study provides insight into the relationship between trust and stakeholder perceptions of inclusivity in public land management decisions. Here, we focus on the U.S. state of Alaska, where almost two-thirds of the land area are managed by the federal government. 3. We used structural equation modelling to test whether an individual's trust and the information sources used to learn about land management positively influenced perceived inclusivity. We conceptualized trust in terms of four dimensions that reflected an individual's disposition to trust, trust in the federal government, trust in shared values and trust that agencies adhere to a moral code. 4. We found that survey respondents across the U.S. state of Alaska had a limited disposition to trust others, did not trust federal land management agencies, did not believe agencies shared their values pertaining to protected area management and did not believe that agencies adhered to a moral code. 5. Beliefs about the morality of agencies were the primary driver of perceived inclusivity in land management decisions, indicating that agencies should focus on solving problems through deliberation and discussion about moral principles rather than by force. 6. Information acquired from professional, community-based or environmental advocacy exchanges also positively influenced perceived levels of involvement among stakeholders in resource management decisions. 7. These results provide a roadmap for how land management agencies can improve public relations and work towards a model of inclusive conservation around protected areas. 
    more » « less
  5. The analysis of the gender dynamics in scientific research and respective outputs is crucial for ensuring that science policy is inclusive and equitable. Similar to other research outputs such as publications and patents, open source software (OSS) projects are also developed by contributors from universities, government research institutions, and nonprofits, in addition to businesses. Despite its reach and continued rapid growth, reliable and comprehensive survey data on OSS does not exist, limiting insights into contributions by gender and policy- makers’ ability to assess trends in gender representation. Like in scientific research, the inclusion of diverse perspectives in software development enhances creativity and problem-solving. Using GitHub data, researchers have found positive correlations between gender diversity of an OSS development team and its productivity (Vasilescu et al., 2015; Ortu et al., 2017). Yet there is evidence of gender bias, with women facing higher standards to have their contributions accepted (Terrell et al., 2017; Imtiaz et al., 2019). This exploratory study aims to quantify gender differences in development and use (impact) of OSS using publicly available information collected from GitHub. We focus on software packages developed for programming language R, with the majority of contributors from academia. The paper asks (1) what are gender differences in the volume of contributions? (2) has gender representation shifted over time? (3) is there a correlation between the gender of contributors and the impact of a package? 
    more » « less