Abstract Material samples are indispensable data sources in many natural science, social science, and humanity disciplines. More and more researchers recognize that samples collected in one discipline can be of great value for another. This has motivated organizations that manage a large number of samples to make their holdings accessible to the world. Currently, multiple projects are working to connect natural history and other samples managed by individual institutions or individuals into a universe of samples that follow FAIR principles. This poster reports the progress of the US NSF‐funded iSamples project, in the context of other efforts initiated by US DOE, DiSCCo, BCoN, and GBIF. By October 2021, we will also be able to present an iSamples prototype. We encourage individual organizations that hold material samples to get to know these projects and help shape these projects to realize the goal of a global linked sample cloud that connects all material samples and is accessible to all.
more »
« less
Internet of Samples (iSamples): Toward an interdisciplinary cyberinfrastructure for material samples
Abstract Sampling the natural world and built environment underpins much of science, yet systems for managing material samples and associated (meta)data are fragmented across institutional catalogs, practices for identification, and discipline-specific (meta)data standards. The Internet of Samples (iSamples) is a standards-based collaboration to uniquely, consistently, and conveniently identify material samples, record core metadata about them, and link them to other samples, data, and research products. iSamples extends existing resources and best practices in data stewardship to render a cross-domain cyberinfrastructure that enables transdisciplinary research, discovery, and reuse of material samples in 21st century natural science.
more »
« less
- PAR ID:
- 10280283
- Date Published:
- Journal Name:
- GigaScience
- Volume:
- 10
- Issue:
- 5
- ISSN:
- 2047-217X
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
This dataset contains a complete export of all iSamples records as of April 21, 2025, in GeoParquet format. The dataset includes over 6.6 million sample records with rich metadata including geographic coordinates, material classifications, context categories, and related resources. The data was exported using the iSamples export client with the query 'source:*', capturing the complete state of the iSample.xyz repository. Each record includes sample identifiers, descriptions, classifications, geospatial information (using WGS 84 coordinate system), timestamps, and various categorical attributes. This GeoParquet file provides an efficient format for analyzing the global distribution and classification of physical samples across scientific domains. The dataset is valuable for researchers working with physical samples in geoscience, material science, biology, and related fields who need to discover, access, or analyze sample collections at scale.more » « less
-
Physical samples and their associated (meta)data underpin scientific discoveries across disciplines, and can enable new science when appropriately archived. However, there are significant gaps in community practices and infrastructure that currently prevent accurate provenance tracking, reproducibility, and attribution. For the vast majority of samples, descriptive metadata is often sparse, inaccessible, or absent. Samples and associated (meta)data may also be scattered across numerous physical collections, data repositories, laboratories, data files, and papers with no clear linkages or provenance tracking as new information is generated over time. The Physical Samples Curation Cluster has therefore developed ‘A Scientific Author Guide for Publishing Open Research Using Physical Samples.’ This involved synthesizing existing practices, community feedback, and assessing real-world examples to identify community and infrastructure needs. We identified areas of work needed to enable authors to efficiently reference samples and related data, link related samples and data, and track their use. Our goal is to help improve the discoverability, interoperability, use of physical samples and associated (meta)data into the future.more » « less
-
Meng, X-L (Ed.)Many data science students and practitioners are reluctant to adopt good coding practices as long as the code ‘works.’ However, code standards are an important part of modern data science practice, and they play an essential role in the development of data acumen. Good coding practices lead to more reliable code and save more time than they cost, making them important even for beginners. We believe that principled coding is vital for quality data science practice. To effectively instill these practices within academic programs, instructors and programs need to begin establishing these practices early, to reinforce them often, and to hold themselves to a higher standard while guiding students. We describe key aspects of good coding practices for data science, illustrating with examples in R and in Python, though similar standards are applicable to other software environments. Practical coding guidelines are organized into a top ten list.more » « less
-
Educational research supports incorporating active engagement into K-12 education using authentic STEM experiences. While there are discipline-specific resources to provide students with such experiences, there are limited transdisciplinary opportunities that integrate engineering education and technological skill-building to contextualize core scientific concepts. Here, we present an adaptable module that integrates hands-on technology education and place-based learning to improve student understanding of key chemistry concepts as they relate to local environmental science. The module also supports disciplinary core ideas, practices, and cross-cutting concepts in accordance with the Next Generation Science Standards. We field-tested our module in three different high school courses: Chemistry, Oceanography and Advanced Placement Environmental Science at schools in Washington, USA. Students built spectrophotometric pH sensors using readily available electronic components and calibrated them with known pH reference standards. Students then used their sensors to measure the pH of local environmental water samples. Assessments showed significant improvement in content knowledge in all three courses relating to environmental relevance of pH, and to the design, use and environmental application of sensors. Students also reported increased self-confidence in the material, even when their content knowledge remained the same. These findings suggest that classroom sensor building and collection of environmental data increases student understanding and self-confidence by connecting chemistry concepts to local environmental settings.more » « less
An official website of the United States government

