Biobanks linked to electronic health records provide rich resources for health‐related research. With improvements in administrative and informatics infrastructure, the availability and utility of data from biobanks have dramatically increased. In this paper, we first aim to characterize the current landscape of available biobanks and to describe specific biobanks, including their place of origin, size, and data types. The development and accessibility of large‐scale biorepositories provide the opportunity to accelerate agnostic searches, expedite discoveries, and conduct hypothesis‐generating studies of disease‐treatment, disease‐exposure, and disease‐gene associations. Rather than designing and implementing a single study focused on a few targeted hypotheses, researchers can potentially use biobanks' existing resources to answer an expanded selection of exploratory questions as quickly as they can analyze them. However, there are many obvious and subtle challenges with the design and analysis of biobank‐based studies. Our second aim is to discuss statistical issues related to biobank research such as study design, sampling strategy, phenotype identification, and missing data. We focus our discussion on biobanks that are linked to electronic health records. Some of the analytic issues are illustrated using data from the Michigan Genomics Initiative and UK Biobank, two biobanks with two different recruitment mechanisms. We summarize the current body of literature for addressing these challenges and discuss some standing open problems. This work complements and extends recent reviews about biobank‐based research and serves as a resource catalog with analytical and practical guidance for statisticians, epidemiologists, and other medical researchers pursuing research using biobanks.
Biobanks are important in biomedical and public health research, and future healthcare research relies on their strength and capacity. However, there are financial challenges related to the operation of commercial biobanks and concerns around the commercialization of biobanks. Non-commercial biobanks depend on grant funding to operate and could be valuable to researchers if they can enable access to quality specimens at lower costs. The objective of this study is to estimate the value of specific biobank attributes. We used a rating-based conjoint experiment approach to study how researchers valued handling fee, access, quality, characterization, breadth of consent, access to key endemics, and time taken to fulfil requests. We found that researchers placed the greatest relative importance on the quality of specimens (26%), followed by the characterization of specimens (21%). Researchers with prior experience purchasing biological samples also valued access to key endemic in-country sites (11.6%) and low handling fees (5.5%) in biobanks.
more » « less- NSF-PAR ID:
- 10481083
- Publisher / Repository:
- Nature Publishing Group
- Date Published:
- Journal Name:
- Scientific Reports
- Volume:
- 13
- Issue:
- 1
- ISSN:
- 2045-2322
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Summary High‐quality microbiome research relies on the integrity, management and quality of supporting data. Currently biobanks and culture collections have different formats and approaches to data management. This necessitates a standard data format to underpin research, particularly in line with the FAIR data standards of findability, accessibility, interoperability and reusability. We address the importance of a unified, coordinated approach that ensures compatibility of data between that needed by biobanks and culture collections, but also to ensure linkage between bioinformatic databases and the wider research community.
-
Conference Title: 2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL) Conference Start Date: 2021, Sept. 27 Conference End Date: 2021, Sept. 30 Conference Location: Champaign, IL, USAMetadata are key descriptors of research data, particularly for researchers seeking to apply machine learning (ML) to the vast collections of digitized specimens. Unfortunately, the available metadata is often sparse and, at times, erroneous. Additionally, it is prohibitively expensive to address these limitations through traditional, manual means. This paper reports on research that applies machine-driven approaches to analyzing digitized fish images and extracting various important features from them. The digitized fish specimens are being analyzed as part of the Biology Guided Neural Networks (BGNN) initiative, which is developing a novel class of artificial neural networks using phylogenies and anatomy ontologies. Automatically generated metadata is crucial for identifying the high-quality images needed for the neural network's predictive analytics. Methods that combine ML and image informatics techniques allow us to rapidly enrich the existing metadata associated with the 7,244 images from the Illinois Natural History Survey (INHS) used in our study. Results show we can accurately generate many key metadata properties relevant to the BGNN project, as well as general image quality metrics (e.g. brightness and contrast). Results also show that we can accurately generate bounding boxes and segmentation masks for fish, which are needed for subsequent machine learning analyses. The automatic process outperforms humans in terms of time and accuracy, and provides a novel solution for leveraging digitized specimens in ML. This research demonstrates the ability of computational methods to enhance the digital library services associated with the tens of thousands of digitized specimens stored in open-access repositories worldwide.more » « less
-
Abstract Freezers with biospecimen deposits became biobanks and later were networked at the pan-European level in 2013 under the Biobanking and BioMolecular Resources Research Infrastructure—European Research Infrastructure Consortium (BBMRI-ERIC). Drawing on document analysis about the BBMRI-ERIC and multi-sited fieldwork with biobankers in Spain from a science and technology studies approach, we explore what biobanks are expected to do and become under the BBMRI-ERIC framework, and how infrastructural transitions promote particular transformations in biobanking practices. The primary purpose of biobanks in Europe is presented as being to become mediators in contemporary biomedical research (global sharing nodes) distribution, and distributed nodes of samples and their associated data. We argue that infrastructural transitions are complicated and heterogeneous, giving rise to unattended local concerns on adjusting their practices to fit into the BBMRI-ERIC framework, even for non-members, as the case of Spain illustrates, where “old practices” of collection and storage are questioned. In this article, we aim to encourage qualitative studies to explore the lags between pan-European policies and prospects, different contextual interpretations, and biobanking reconfigurations as an opportunity to explore what that lag is made of (e.g. tensions with “old practices,” unresolved conflicts with the national agendas, reservations on a possible centralization of the biobanking practices by regional biobanks, lack of funding, etc.). Such research could enrich not only policy guidance, but also the understanding of technoscientific infrastructures’ scalability.
-
Training in science, technology, engineering, and mathematics (STEM) is a top priority for driving economic growth and maintaining technological competitiveness. We propose that exposure to a rigorous research program as an undergraduate leads to success in a research STEM career. We compared the scientific outcomes of 88 participants from five National Science Foundation Research Experiences for Undergraduates (REU) Site programs with demographically-similar applicants to assess the impact that formal, organized, and funded undergraduate summer research experiences have on participants. Our study demonstrates that REU participants are more likely to pursue a PhD program and generate significantly more valued products, including presentations, publications, and awards, relative to applicants. We believe that key components of the program include funding for personal and professional needs; access to diverse intellectual, analytical, and field resources; and the presence of other undergraduate researchers who support each other and share their goals and interests.more » « less