The open data movement has brought revolutionary changes to the field of mineralogy. With a growing number of datasets made available through community efforts, researchers are now able to explore new scientific topics such as mineral ecology, mineral evolution and new classification systems. The recent results have shown that the necessary open data coupled with data science skills and expertise in mineralogy will lead to impressive new scientific discoveries. Yet, feedback from researchers also reflects the needs for better FAIRness of open data, that is, findable, accessible, interoperable and reusable for both humans and machines. In this paper, we present our recent work on building the open data service of Mindat, one of the largest mineral databases in the world. In the past years, Mindat has supported numerous scientific studies but a machine interface for data access has never been established. Through the OpenMindat project we have achieved solid progress on two activities: (1) cleanse data and improve data quality, and (2) build a data sharing platform and establish a machine interface for data query and access. We hope OpenMindat will help address the increasing data needs from researchers in mineralogy for an internationally recognized authoritative database that is fully compliant with the FAIR guiding principles and helps accelerate scientific discoveries.
Minerals are information-rich materials that offer researchers a glimpse into the evolution of planetary bodies. Thus, it is important to extract, analyze, and interpret this abundance of information to improve our understanding of the planetary bodies in our solar system and the role our planet’s geosphere played in the origin and evolution of life. Over the past several decades, data-driven efforts in mineralogy have seen a gradual increase. The development and application of data science and analytics methods to mineralogy, while extremely promising, has also been somewhat ad hoc in nature. To systematize and synthesize the direction of these efforts, we introduce the concept of “Mineral Informatics,” which is the next frontier for researchers working with mineral data. In this paper, we present our vision for Mineral Informatics and the X-Informatics underpinnings that led to its conception, as well as the needs, challenges, opportunities, and future directions of the field. The intention of this paper is not to create a new specific field or a sub-field as a separate silo, but to document the needs of researchers studying minerals in various contexts and fields of study, to demonstrate how the systemization and enhanced access to mineralogical data will increase cross- and interdisciplinary studies, and how data science and informatics methods are a key next step in integrative mineralogical studies.
more » « less- Award ID(s):
- 2148939
- NSF-PAR ID:
- 10504187
- Publisher / Repository:
- De Gruyter
- Date Published:
- Journal Name:
- American Mineralogist
- Volume:
- 108
- Issue:
- 7
- ISSN:
- 0003-004X
- Page Range / eLocation ID:
- 1242 to 1257
- Subject(s) / Keyword(s):
- Informatics mineralogy FAIR data analytics
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
Abstract -
Understanding the mineralogy and geochemistry of the subsurface is key when assessing and exploring for mineral deposits. To achieve this goal, rapid acquisition and accurate interpretation of drill core data are essential. Hyperspectral shortwave infrared imaging is a rapid and non-destructive analytical method widely used in the minerals industry to map minerals with diagnostic features in core samples. In this paper, we present an automated method to interpret hyperspectral shortwave infrared data on drill core to decipher major felsic rock-forming minerals using supervised machine learning techniques for processing, masking, and extracting mineralogical and textural information. This study utilizes a co-registered training dataset that integrates hyperspectral data with quantitative scanning electron microscopy data instead of spectrum matching using a spectral library. Our methodology overcomes previous limitations in hyperspectral data interpretation for the full mineralogy (i.e., quartz and feldspar) caused by the need to identify spectral features of minerals; in particular, it detects the presence of minerals that are considered invisible in traditional shortwave infrared hyperspectral analysis.more » « less
-
Aperiodic discharge of brine at Blood Falls forms a red-tinged fan at the terminus of Taylor Glacier, Antarctica. Samples from this discharge provide an opportunity for mineralogical study at a Martian analogue study site. Environmental samples were collected in the field and analyzed in the laboratory using Fourier transform infrared, Raman, visible to near-infrared, and Mössbauer spectroscopies. Samples were further characterized using microprobe and inductively coupled plasma optical emission spectroscopy for chemistry, and x-ray diffraction, scanning electron microscopy, and transmission electron microscopy for mineralogy, crystallography, and chemistry. The mineralogy of these samples is dominated by the carbonate minerals calcite and aragonite, accompanied by quartz, feldspar, halide, and clay minerals. There is no strong evidence for crystalline iron oxide/hydroxide phases, but compositionally and morphologically diverse iron- and chlorine-rich amorphous nanospheres are found in many of the samples. These results showcase the strengths and weaknesses of different analytical methods and underscore the need for multiple complementary techniques to inform the complicated mineralogy at this locale. These analyses suggest that the red color at Blood Falls arises from oxidation of dissolved Fe 2+ in the subglacial fluid that transforms upon exposure to air to form nanospheres of amorphous hydroxylated mixed-valent iron-containing material, with color also influenced by other ions in those structures. Finally, the results provide a comprehensive mineralogical analysis previously missing from the literature for an analogue site with a well-studied sub-ice microbial community. Thus, this mineral assemblage could indicate a habitable environment if found elsewhere in the Solar System.more » « less
-
Abstract Plagioclase feldspar is the most abundant rock‐forming mineral in the crust of the Earth, Moon, and Mars and is also an important component in some minor bodies in the Solar System. The distribution, abundance, and precise composition of plagioclase on planetary surfaces from remote sensing data are important measurements for evaluating changing conditions during magma evolution. Optical constants are critical input parameters in radiative transfer theory, which enables modeling of spectra for the extraction of mineral abundances and grain sizes from a remotely sensed spectrum. Mid‐infrared (MIR) optical constants of most triclinic rock‐forming minerals are not available due to the complexity associated with the derivation of optical constants of low‐symmetry minerals. In this work, we have calculated the MIR optical constants of a labradorite single crystal using dispersion theory and laboratory reflectance spectra at non‐normal incidence. The optical constants we derived here will assist in modeling spectra in the MIR region and quantifying mineral composition, particle size, and abundances from remote sensing data.
-
Integrating mineralogy with data science is critical to modernizing Earth materials research and its applications to geosciences. Data were compiled on 95 650 garnet sample analyses from a variety of sources, ranging from large repositories (EarthChem, RRUFF, MetPetDB) to individual peer-reviewed literature. An important feature is the inclusion of mineralogical “dark data” from papers published prior to 1990. Garnets are commonly used as indicators of formation environments, which directly correlate with their geochemical properties; thus, they are an ideal subject for the creation of an extensive data resource that incorporates composition, locality information, paragenetic mode, age, temperature, pressure, and geochemistry. For the data extracted from existing databases and literature, we increased the resolution of several key aspects, including petrogenetic and paragenetic attributes, which we extended from generic material type (e.g., igneous, metamorphic) to more specific rock-type names (e.g., diorite, eclogite, skarn) and locality information, increasing specificity by examining the continent, country, area, geological context, longitude, and latitude. Likewise, we utilized end-member and quality index calculations to help assess the garnet sample analysis quality. This comprehensive dataset of garnet information is an open-access resource available in the Evolutionary System of Mineralogy Database (ESMD) for future mineralogical studies, paving the way for characterizing correlations between chemical composition and paragenesis through natural kind clustering (Chiama et al., 2022; https://doi.org/10.48484/camh-xy98). We encourage scientists to contribute their own unpublished and unarchived analyses to the growing data repositories of mineralogical information that are increasingly valuable for advancing scientific discovery.more » « less