skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Persistent, Global, Unique: The three key requirements for a trusted identifier system for physical samples
There is growing recognition that unambiguous citation and tracking of physical samples allows previously impossible linking of samples to data and publications, linking and integration of sample-based observations across data systems, and paves the road towards advanced data mining of sample-based data. And in recent years, there has been an uptake in the use of Persistent Identifiers (PIDs) for physical samples to support such citation and tracking. The IGSN (International Geo Sample Number) is a PID for physical samples. It was originally developed for the solid earth sciences, and has evolved into an international PID system with members in five continents and a network of active allocating agents. It has been adopted by a growing number and range of stakeholders worldwide, including national geological surveys, research infrastructure providers, collection curators, researchers, and data managers, and by other disciplines that need to refer to physical samples. Nearly 6.9 million samples have been registered with IGSNs so far. The IGSN system uses the Handle System (Kahn and Wilensky 1995; see also Handle.Net ® ) and has an international organization, IGSN e.V., to manage its governance structure and the technical architecture. The recent expansion of the IGSN beyond the geosciences into other domains such as biodiversity, archeology, and material sciences confirms the power of its concept and implementation, but imposes substantial pressures on the existing capacity and capabilities of the IGSN architecture and its governing organization. Modifications to the IGSN organizational and technical architecture are necessary at this point to keep pace with the growing demand and expectations. These changes are also necessary to ensure trustworthy and sustainable services for PID registration and resolution in a maturing research data ecosystem. The essential criteria for a trustworthy system include an organizational foundation that ensures longevity, sustainability, proper governance, and regular quality assessment of registration services. It also includes a reliable and secure technical platform, based on open standards, which is sufficiently scalable and flexible to accommodate the growing diversity of specimen types, use cases, and stakeholder requirements. In 2018, a major planning project for the IGSN was funded by the Alfred P. Sloan Foundation. An international group of experts participates in re-designing and improving the existing organization and technical architecture of the IGSN system, revising the current business model of the IGSN e.V. and professionalizing its operations. The goal is for the IGSN system to be able to respond to, and support in a sustainable manner, the rapidly growing demands of a global and increasingly multi-disciplinary user community, and to ensure that the IGSN will be a trustworthy, stable, and adaptable persistent identifier system for material samples, both technically and organizationally. The end result should also satisfy and facilitate participation across research domains, and will be a reliable component of the evolving research data ecosystem. Finally, it will ensure that the IGSN is recognized as a trusted partner by data infrastructure providers and the science community alike.  more » « less
Award ID(s):
1636653
PAR ID:
10124177
Author(s) / Creator(s):
; ; ;
Date Published:
Journal Name:
Biodiversity Information Science and Standards
Volume:
3
ISSN:
2535-0897
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. If you want great research analytics, you’re going to need great data governance for your research metadata. That’s a tall order when the best information often spans incompatible systems, departments, policies, and mindsets. Research metadata governance is a great illustration of how research analytics is not just a technology problem. It happens at the intersection of technology, policy, process, and culture. Do any of us feel like our organization is “nailing” metadata governance? What level of research analytics could you achieve if there could be more alignment and cooperation around this kind of governance? In this session, you will join your fellow attendees in a facilitated design thinking workshop around the culture and organizational patterns of research metadata governance. We will explore the problem space and the solution space together. And we’ll highlight the group’s best findings. Whether you’re at the technical implementation level or the VP level, you will have seen and experienced good and bad patterns of how research metadata is being governed at your institution. These experiences are valuable to your fellow community members. Using design thinking, we will explore the collective intelligence on this complex topic, shoulder to shoulder. You’ll leave with a better understanding of how the patterns and struggles of data governance at your institution correlate with those of others. You’ll gain new ideas on how to address the hard cultural and organizational problems of governance. And you’ll meet new colleagues who you can stay connected with. Finally, you’ll gain an appreciation of how design thinking techniques themselves can be useful for these types of challenges at your own organization, and how they help to create understanding and alignment. 
    more » « less
  2. Reddy, S.; Winter, J.S.; Padmanabhan, S. (Ed.)
    AI applications are poised to transform health care, revolutionizing benefits for individuals, communities, and health-care systems. As the articles in this special issue aptly illustrate, AI innovations in healthcare are maturing from early success in medical imaging and robotic process automation, promising a broad range of new applications. This is evidenced by the rapid deployment of AI to address critical challenges related to the COVID-19 pandemic, including disease diagnosis and monitoring, drug discovery, and vaccine development. At the heart of these innovations is the health data required for deep learning applications. Rapid accumulation of data, along with improved data quality, data sharing, and standardization, enable development of deep learning algorithms in many healthcare applications. One of the great challenges for healthcare AI is effective governance of these data—ensuring thoughtful aggregation and appropriate access to fuel innovation and improve patient outcomes and healthcare system efficiency while protecting the privacy and security of data subjects. Yet the literature on data governance has rarely looked beyond important pragmatic issues related to privacy and security. Less consideration has been given to unexpected or undesirable outcomes of healthcare in AI, such as clinician deskilling, algorithmic bias, the “regulatory vacuum”, and lack of public engagement. Amidst growing calls for ethical governance of algorithms, Reddy et al. developed a governance model for AI in healthcare delivery, focusing on principles of fairness, accountability, and transparency (FAT), and trustworthiness, and calling for wider discussion. Winter and Davidson emphasize the need to identify underlying values of healthcare data and use, noting the many competing interests and goals for use of health data—such as healthcare system efficiency and reform, patient and community health, intellectual property development, and monetization. Beyond the important considerations of privacy and security, governance must consider who will benefit from healthcare AI, and who will not. Whose values drive health AI innovation and use? How can we ensure that innovations are not limited to the wealthiest individuals or nations? As large technology companies begin to partner with health care systems, and as personally generated health data (PGHD) (e.g., fitness trackers, continuous glucose monitors, health information searches on the Internet) proliferate, who has oversight of these complex technical systems, which are essentially a black box? To tackle these complex and important issues, it is important to acknowledge that we have entered a new technical, organizational, and policy environment due to linked data, big data analytics, and AI. Data governance is no longer the responsibility of a single organization. Rather, multiple networked entities play a role and responsibilities may be blurred. This also raises many concerns related to data localization and jurisdiction—who is responsible for data governance? In this emerging environment, data may no longer be effectively governed through traditional policy models or instruments. 
    more » « less
  3. null (Ed.)
    Transitioning infrastructure governance for accelerating, increasingly uncertain, and increasingly complex environments is paramount for ensuring that critical and basic services are met during times of stability and instability. Yet the bureaucratic structures that dominate infrastructure organizations and their capacity to respond to increasing complexity remain poorly understood. To change infrastructure governance, it is critical to understand current conditions, the barriers to change, and the strategies needed to shift priorities and leadership strategy. The emergence of modern infrastructure bureaucratic and organizational structure is first explored. The need to rethink infrastructure as knowledge enterprises capable of making sense of changing conditions, and not simply as basic service providers, is discussed. Next, transformation of infrastructure governance is presented as both a challenge of organizational change as identity and power and leadership capacity to shift between stable and unstable conditions. Infrastructure bureaucracies should create capabilities to shift between periods of stability and instability, emphasizing flexibility where ad hoc teams are given power to make sense of changing conditions and steer the organization appropriately. Additionally, several critical factors must be addressed within organizational power structures, identities, and processes to facilitate change. Allowing infrastructure governance to persist in its current form is likely increasingly problematic for the future and may result in an increasing inability to maintain relevance. 
    more » « less
  4. Research has shown that creating an online learning community is vital in Model-Based Systems Engineering (MBSE) training programs and can be facilitated via the Community of Inquiry (CoI) framework. For professional learners, an online learning community is influenced by their organizational affiliations. The purpose of this research is to explore learning experiences in groups of professional learners with different and homogenous organizational affiliations in an asynchronous online MBSE module. Through the case study methodology, this research examines four sources of data from two cases: Case 1—learners from different organizations (n = 7); and Case 2—overwhelming majority of learners from the same organization (n = 19). Results showed that learners from the same organization reported higher social presence, which, in turn, corresponded to a higher cognitive presence and higher motivation for future MBSE learning. Based on our findings, we recommend that organizations seeking MBSE adoption coordinate with online course providers to create cohorts to participate in the same offerings to facilitate the process of learning community building. We also recommend MBSE course providers facilitate social interaction on multiple communication platforms and create orientation activities for learners from different organizations to promote social presence. 
    more » « less
  5. Abstract Open and practical exchange, dissemination, and reuse of specimens and data have become a fundamental requirement for life sciences research. The quality of the data obtained and thus the findings and knowledge derived is thus significantly influenced by the quality of the samples, the experimental methods, and the data analysis. Therefore, a comprehensive and precise documentation of the pre‐analytical conditions, the analytical procedures, and the data processing are essential to be able to assess the validity of the research results. With the increasing importance of the exchange, reuse, and sharing of data and samples, procedures are required that enable cross‐organizational documentation, traceability, and non‐repudiation. At present, this information on the provenance of samples and data is mostly either sparse, incomplete, or incoherent. Since there is no uniform framework, this information is usually only provided within the organization and not interoperably. At the same time, the collection and sharing of biological and environmental specimens increasingly require definition and documentation of benefit sharing and compliance to regulatory requirements rather than consideration of pure scientific needs. In this publication, we present an ongoing standardization effort to provide trustworthy machine‐actionable documentation of the data lineage and specimens. We would like to invite experts from the biotechnology and biomedical fields to further contribute to the standard. 
    more » « less