Abstract Research universities rely heavily on external funding to advance knowledge and generate economic growth. In the USA, tens of billions of dollars are spent each year on research and development with the federal government contributing over half of these funds. Yet a decline in relative federal funding highlights the role of other funders and their varying contractual terms. Specifically, nonfederal funders provide lower recovery of indirect costs. Using project-level university-sponsored research administrative records from four institutions, we examine indirect cost recovery. We find significant variation in the amount of indirect funding recovered—both across and within funders, as well as to different academic fields within a university. The distribution of sponsors in the overall research funding portfolio also impacts indirect cost recovery. The recovery variation has important implications for the sustainability and cross-subsidization of the university research enterprise. Together, our results show where universities are under-recovering indirect costs.
more »
« less
Ethical considerations in utilizing artificial intelligence for analyzing the NHGRI’s History of Genomics and Human Genome Project archives
Understanding “how to optimize the production of scientific knowledge” is paramount to those who support scientific research—funders as well as research institutions—to the communities served, and to researchers. Structured archives can help all involved to learn what decisions and processes help or hinder the production of new knowledge. Using artificial intelligence (AI) and large language models (LLMs), we recently created the first structured digital representation of the historic archives of the National Human Genome Research Institute (NHGRI), part of the National Institutes of Health. This work yielded a digital knowledge base of entities, topics, and documents that can be used to probe the inner workings of the Human Genome Project, a massive international public-private effort to sequence the human genome, and several of its offshoots like The Cancer Genome Atlas (TCGA) and the Encyclopedia of DNA Elements (ENCODE). The resulting knowledge base will be instrumental in understanding not only how the Human Genome Project and genomics research developed collaboratively, but also how scientific goals come to be formulated and evolve. Given the diverse and rich data used in this project, we evaluated the ethical implications of employing AI and LLMs to process and analyze this valuable archive. As the first computational investigation of the internal archives of a massive collaborative project with multiple funders and institutions, this study will inform future efforts to conduct similar investigations while also considering and minimizing ethical challenges. Our methodology and risk-mitigating measures could also inform future initiatives in developing standards for project planning, policymaking, enhancing transparency, and ensuring ethical utilization of artificial intelligence technologies and large language models in archive exploration.Author Contributions: Mohammad Hosseini: Investigation; Project Administration; Writing – original draft; Writing – review & editing. Spencer Hong: Conceptualization, Data curation, Investigation, Methodology, Software, Visualization, Writing – original draft, Writing – review & editing. Thomas Stoeger: Conceptualization; Investigation; Project Administration; Supervision; Writing – original draft; Writing – review & editing. Kristi Holmes: Funding acquisition, Supervision, Writing – review & editing. Luis A. Nunes Amaral: Funding acquisition, Supervision, Writing – review & editing. Christopher Donohue: Conceptualization, Project administration, Resources, Supervision, Writing – original draft, Writing – review & editing. Kris Wetterstrand: Conceptualization, Funding acquisition, Project administration.
more »
« less
- Award ID(s):
- 2410335
- PAR ID:
- 10589026
- Publisher / Repository:
- Lamar Soutter Library, UMass Chan Medical School
- Date Published:
- Journal Name:
- Journal of eScience Librarianship
- Volume:
- 13
- Issue:
- 1
- ISSN:
- 2161-3974
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Abstract In response to calls for public engagement on human genome editing (HGE), which intensified after the 2018 He Jiankui scandal that resulted in the implantation of genetically modified embryos, we detail an anticipatory approach to the governance of HGE. By soliciting multidisciplinary experts’ input on the drivers and uncertainties of HGE development, we developed a set of plausible future scenarios to ascertain publics values—specifically, their hopes and concerns regarding the novel technology and its applications. In turn, we gathered a subset of multidisciplinary experts to propose governance recommendations for HGE that incorporate identified publics’ values. These recommendations include: (1) continued participatory public engagement; (2) international harmonization and transparency of multiple governance levers such as professional and scientific societies, funders, and regulators; and (3) development of a formal whistleblower framework.more » « less
-
The flexibility and precision of CRISPR-Cas9 and related technologies have made these genome editing tools increasingly popular in agriculture, medicine, and basic science research for the past decade. Genome editing will continue to be relevant and utilized across diverse scientific fields in the future. Given this, students should be introduced to genome editing technologies and encouraged to consider their ethical implications early on in precollege biology curricula. Furthermore, instruction on this topic presents an opportunity to create partnerships between researchers and educators at the K-12 levels that can strengthen student engagement in science, technology, engineering, and mathematics. To this end, we present a 3-day student-centered learning program to introduce high school students to genome editing technologies through a hands-on base editing experiment in Escherichia coli, accompanied by a relevant background lecture and facilitated ethics discussion. This unique partnership aims to educate students and provides a framework for research institutions to implement genome editing outreach programs at local high schools. We have included all requisite materials, including lecture slides, worksheets, experimental protocols, and suggestions on active learning strategies for others to reproduce our program with their local communities.more » « less
-
This paper reports on a project funded through the Engineering Education and Centers (EEC) Division of the National Science Foundation. Since 2010, EEC has funded more than 500 proposals totaling over $150 million through engineering education research (EER) programs such as Research in Engineering Education (REE) and Research in the Formation of Engineers (RFE), to enhance understanding and improve practice. The resulting archive of robust qualitative and quantitative data represents a vast untapped potential to exponentially increase the impact of EEC funding and transform engineering education. But tapping this potential has thus far been an intractable problem, despite ongoing calls for data sharing by public funders of research. Changing the paradigm of single-use data collection requires actionable, proven practices for effective, ethical data sharing, coupled with sufficient incentives to both share and use existing data. To that end, this project draws together a team of experts to overcome substantial obstacles in qualitative data sharing by building a framework to guide secondary analysis in engineering education research (EER), and to test this framework using pioneering data sets. Herein, we report on accomplishments within the first year of the project during which time we gathered a group of 13 expert qualitative researchers to engage in the first of a series of working meetings intended to meet our project goals. We came into this first workshop with a potentially limiting definition of secondary data analysis and the idea that people would want to share existing datasets if we could find ways around anticipated hurdles. However, the workshop yielded a broader definition of secondary data analysis and revealed a stronger interest in creating new datasets designed for sharing rather than sharing existing datasets. Thus, we have reconceived our second phase as one that is a cohesive effort based on an inclusive “open cohort model” to pilot projects related to secondary data analysis.more » « less
-
Abstract As data sharing has become more prevalent, three pillars - archives, standards, and analysis tools - have emerged as critical components in facilitating effective data sharing and collaboration. This paper compares four freely available intracranial neuroelectrophysiology data repositories: Data Archive for the BRAIN Initiative (DABI), Distributed Archives for Neurophysiology Data Integration (DANDI), OpenNeuro, and Brain-CODE. The aim of this review is to describe archives that provide researchers with tools to store, share, and reanalyze both human and non-human neurophysiology data based on criteria that are of interest to the neuroscientific community. The Brain Imaging Data Structure (BIDS) and Neurodata Without Borders (NWB) are utilized by these archives to make data more accessible to researchers by implementing a common standard. As the necessity for integrating large-scale analysis into data repository platforms continues to grow within the neuroscientific community, this article will highlight the various analytical and customizable tools developed within the chosen archives that may advance the field of neuroinformatics.more » « less