skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: GeospaceLAB: Python package for managing and visualizing data in space physics
In the space physics community, processing and combining observational and modeling data from various sources is a demanding task because they often have different formats and use different coordinate systems. The Python package GeospaceLAB has been developed to provide a unified, standardized framework to process data. The package is composed of six core modules, including DataHub as the data manager, Visualization for generating publication quality figures, Express for higher-level interfaces of DataHub and Visualization , SpaceCoordinateSystem for coordinate system transformations, Toolbox for various utilities, and Configuration for preferences. The core modules form a standardized framework for downloading, storing, post-processing and visualizing data in space physics. The object-oriented design makes the core modules of GeospaceLAB easy to modify and extend. So far, GeospaceLAB can process more than twenty kinds of data products from nine databases, and the number will increase in the future. The data sources include, e.g., measurements by EISCAT incoherent scatter radars, DMSP, SWARM, and Grace satellites, OMNI solar wind data, and GITM simulations. In addition, the package provides an interface for the users to add their own data products. Hence, researchers can easily collect, combine, and view multiple kinds of data for their work using GeospaceLAB. Combining data from different sources will lead to a better understanding of the physics of the studied phenomena and may lead to new discoveries. GeospaceLAB is an open source software, which is hosted on GitHub. We welcome everyone in the community to contribute to its future development.  more » « less
Award ID(s):
2033787 1952737
PAR ID:
10386431
Author(s) / Creator(s):
; ; ; ; ; ; ;
Date Published:
Journal Name:
Frontiers in Astronomy and Space Sciences
Volume:
9
ISSN:
2296-987X
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Atmospheric aerosol and chemistry modules are key elements in Earth system models (ESMs), as they predict air pollutant concentrations and properties that can impact human health, weather, and climate. The current uncertainty in climate projections is partly due to the inaccurate representation of aerosol direct and indirect forcing. Aerosol/chemistry parameterizations used within ESMs and other atmospheric models span large structural and parameter uncertainties that are difficult to assess independently of their host models. Moreover, there is a strong need for a standardized interface between aerosol/chemistry modules and the host model to facilitate portability of aerosol/chemistry parameterizations from one model to another, allowing not only a comparison between different parameterizations within the same modeling framework, but also quantifying the impact of different model frameworks on aerosol/chemistry predictions. To address this need, we have initiated a new community effort to coordinate the construction of a Generalized Aerosol/Chemistry Interface (GIANT) for use across weather and climate models. We aim to organize a series of community workshops and hackathons to design and build GIANT, which will serve as the interface between a range of aerosol/chemistry modules and the physics and dynamics components of atmospheric host models. GIANT will leverage ongoing efforts at the U.S. modeling centers focused on building next-generation ESMs and the international AeroCom initiative to implement this common aerosol/chemistry interface. GIANT will create transformative opportunities for scientists and students to conduct innovative research to better characterize structural and parametric uncertainties in aerosol/chemistry modules, and to develop a common set of aerosol/chemistry parameterizations. 
    more » « less
  2. AbstractÐPrivacy of data as well as providing anonymization of data for various kinds of analysis have been addressed in the context of tabular transactional data which was mainstream. With the advent of the Internet and social networks, there is an emphasis on using different kinds of graphs for modeling and analysis. In addition to single graphs, the use of MultiLayer Networks (or MLNs) for modeling and analysis is becoming popular for complex data having multiple types of entities and relationships. They provide a better understanding of data as well as flexibility and efficiency of analysis. In this article, we understand the provenance of data privacy and some of the thinking on extending it to graph data models. We will focus on the issues of data privacy for models that are different from traditional data models and discuss alternatives. We will also consider privacy from a visualization perspective as we have developed a community Dashboard for MLN generation, analysis, and visualization based on our research. 
    more » « less
  3. Abstract Soil microbial communities play critical roles in various ecosystem processes, but studies at a large spatial and temporal scale have been challenging due to the difficulty in finding the relevant samples in available data sets as well as the lack of standardization in sample collection and processing. The National Ecological Observatory Network (NEON) has been collecting soil microbial community data multiple times per year for 47 terrestrial sites in 20 eco‐climatic domains, producing one of the most extensive standardized sampling efforts for soil microbial biodiversity to date. Here, we introduce the neonMicrobe R package—a suite of downloading, preprocessing, data set assembly, and sensitivity analysis tools for NEON’s newly published 16S and ITS amplicon sequencing data products which characterize soil bacterial and fungal communities, respectively. neonMicrobe is designed to make these data more accessible to ecologists without assuming prior experience with bioinformatic pipelines. We describe quality control steps used to remove quality‐flagged samples, report on sensitivity analyses used to determine appropriate quality filtering parameters for the DADA2 workflow, and demonstrate the immediate usability of the output data by conducting standard analyses of soil microbial diversity. The sequence abundance tables produced byneonMicrobecan be linked to NEON’s other data products (e.g., soil physical and chemical properties, plant community composition) and soil subsamples archived in the NEON Biorepository. We provide recommendations for incorporatingneonMicrobeinto reproducible scientific workflows, discuss technical considerations for large‐scale amplicon sequence analysis, and outline future directions for NEON‐enabled microbial ecology. In particular, we believe that NEON marker gene sequence data will allow researchers to answer outstanding questions about the spatial and temporal dynamics of soil microbial communities while explicitly accounting for scale dependence. We expect that the data produced by NEON and theneonMicrobeR package will act as a valuable ecological baseline to inform and contextualize future experimental and modeling endeavors. 
    more » « less
  4. The National Science Foundation Ice Core Facility (NSF-ICF, fka NICL) is in the process of building a new facility including freezer and scientist support space. The facility is being designed to minimize environmental impacts while maximizing ice core curation and science support. In preparation for the new facility, we are updating research equipment and integrating ice core data collection and processing by assigning International Generic Sample Numbers (IGSN) to advance the “FAIR”ness and establish clear provenance of samples, fostering the next generation of linked research data products. The NSF-ICF team, in collaboration with the US ice core science community, has established a metadata schema for the assignment of IGSNs to ice cores and samples. In addition, in close coordination with the US ice core community, we are adding equipment modules that expand traditional sets of physical property, visual stratigraphy, and electrical conductance ice core measurements. One such module is an ice core hyperspectral imaging (HSI) system. Adapted for the cold laboratory settings, the SPECIM SisuSCS HSI system can collect up to 224 bands using a continuous line-scanning mode in the visible and near-infrared (VNIR) 400-1000 nm spectral region. A modular system design allows expansion of spectral properties in the future. The second module is an updated multitrack electrical conductance system. These new data will guide real time optimization of sampling for planned analyses during ice core processing, especially for ice with deformed or highly compressed layering. The aim is to facilitate the collection of robust, long-term, and FAIR data archives for every future ice core section processed at NSF-ICF. The NSF-ICF is fully funded by the National Science Foundation and operated by the U.S. Geological Survey. 
    more » « less
  5. Abstract Understanding patterns and drivers of species distribution and abundance, and thus biodiversity, is a core goal of ecology. Despite advances in recent decades, research into these patterns and processes is currently limited by a lack of standardized, high‐quality, empirical data that span large spatial scales and long time periods. The NEON fills this gap by providing freely available observational data that are generated during robust and consistent organismal sampling of several sentinel taxonomic groups within 81 sites distributed across the United States and will be collected for at least 30 years. The breadth and scope of these data provide a unique resource for advancing biodiversity research. To maximize the potential of this opportunity, however, it is critical that NEON data be maximally accessible and easily integrated into investigators' workflows and analyses. To facilitate its use for biodiversity research and synthesis, we created a workflow to process and format NEON organismal data into the ecocomDP (ecological community data design pattern) format that were available through the ecocomDP R package; we then provided the standardized data as an R data package (neonDivData). We briefly summarize sampling designs and data wrangling decisions for the major taxonomic groups included in this effort. Our workflows are open‐source so the biodiversity community may: add additional taxonomic groups; modify the workflow to produce datasets appropriate for their own analytical needs; and regularly update the data packages as more observations become available. Finally, we provide two simple examples of how the standardized data may be used for biodiversity research. By providing a standardized data package, we hope to enhance the utility of NEON organismal data in advancing biodiversity research and encourage the use of the harmonized ecocomDP data design pattern for community ecology data from other ecological observatory networks. 
    more » « less