Abstract Soil microbial communities play critical roles in various ecosystem processes, but studies at a large spatial and temporal scale have been challenging due to the difficulty in finding the relevant samples in available data sets as well as the lack of standardization in sample collection and processing. The National Ecological Observatory Network (NEON) has been collecting soil microbial community data multiple times per year for 47 terrestrial sites in 20 eco‐climatic domains, producing one of the most extensive standardized sampling efforts for soil microbial biodiversity to date. Here, we introduce the neonMicrobe R package—a suite of downloading, preprocessing, data set assembly, and sensitivity analysis tools for NEON’s newly published 16S and ITS amplicon sequencing data products which characterize soil bacterial and fungal communities, respectively. neonMicrobe is designed to make these data more accessible to ecologists without assuming prior experience with bioinformatic pipelines. We describe quality control steps used to remove quality‐flagged samples, report on sensitivity analyses used to determine appropriate quality filtering parameters for the DADA2 workflow, and demonstrate the immediate usability of the output data by conducting standard analyses of soil microbial diversity. The sequence abundance tables produced byneonMicrobecan be linked to NEON’s other data products (e.g., soil physical and chemical properties, plant community composition) and soil subsamples archived in the NEON Biorepository. We provide recommendations for incorporatingneonMicrobeinto reproducible scientific workflows, discuss technical considerations for large‐scale amplicon sequence analysis, and outline future directions for NEON‐enabled microbial ecology. In particular, we believe that NEON marker gene sequence data will allow researchers to answer outstanding questions about the spatial and temporal dynamics of soil microbial communities while explicitly accounting for scale dependence. We expect that the data produced by NEON and theneonMicrobeR package will act as a valuable ecological baseline to inform and contextualize future experimental and modeling endeavors.
more »
« less
NEON distributed initial soil characterization dataset (DP1.10047.001) modified for statistical analysis of organic carbon and extractable metals in Hall and Thompson (2021)
We compiled National Ecological Observatory Network (NEON) datasets related to the initial distributed soil sampling effort and subsetted them (removed samples with missing values for certain variables, and several samples with extreme values) for use in statistical analyses to describe relationships between soil organic carbon (SOC) and metals measured in several soil chemical extractions. The NEON provisional data products we used were DP1.10047.001 and DP1.10008.001, which were subsequently combined by NEON as a single data product DP1.10047.001, “Soil physical and chemical properties, distributed initial characterization”. These datasets were used for the analyses reported in a manuscript by Hall and Thompson (2021) in the Soil Science Society of America Journal.
more »
« less
- Award ID(s):
- 1802745
- PAR ID:
- 10387430
- Publisher / Repository:
- Environmental Data Initiative
- Date Published:
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
To understand controls on soil organic matter chemical composition across North America, we collected 13C NMR spectra and conducted and synthesized additional biogeochemical measurements from NEON Megapit soil samples as well as additional samples (total n = 42). This dataset supports the findings described in the associated manuscript by Hall, Ye et al. (2020).more » « less
-
Abstract Nitrogen (N) is a key limiting nutrient in terrestrial ecosystems, but there remain critical gaps in our ability to predict and model controls on soil N cycling. This may be in part due to lack of standardized sampling across broad spatial–temporal scales. Here, we introduce a continentally distributed, publicly available data set collected by the National Ecological Observatory Network (NEON) that can help fill these gaps. First, we detail the sampling design and methods used to collect and analyze soil inorganic N pool and net flux rate data from 47 terrestrial sites. We address methodological challenges in generating a standardized data set, even for a network using uniform protocols. Then, we evaluate sources of variation within the sampling design and compare measured net N mineralization to simulated fluxes from the Community Earth System Model 2 (CESM2). We observed wide spatiotemporal variation in inorganic N pool sizes and net transformation rates. Site explained the most variation in NEON’s stratified sampling design, followed by plots within sites. Organic horizons had larger pools and net N transformation rates than mineral horizons on a sample weight basis. The majority of sites showed some degree of seasonality in N dynamics, but overall these temporal patterns were not matched by CESM2, leading to poor correspondence between observed and modeled data. Looking forward, these data can reveal new insights into controls on soil N cycling, especially in the context of other environmental data sets provided by NEON, and should be leveraged to improve predictive modeling of the soil N cycle.more » « less
-
Microorganisms are ubiquitous in the biosphere, playing a crucial role in both biogeochemistry of the planet and human health. However, identifying these microorganisms and defining their function are challenging. Widely used approaches in comparative metagenomics, 16S amplicon sequencing and whole genome shotgun sequencing (WGS), have provided access to DNA sequencing analysis to identify microorganisms and evaluate diversity and abundance in various environments. However, advances in parallel high-throughput DNA sequencing in the past decade have introduced major hurdles, namely standardization of methods, data storage, reproducible interoperability of results, and data sharing. The National Ecological Observatory Network (NEON), established by the National Science Foundation, enables all researchers to address queries on a regional to continental scale around a variety of environmental challenges and provide high-quality, integrated, and standardized data from field sites across the U.S. As the amount of metagenomic data continues to grow, standardized procedures that allow results across projects to be assessed and compared is becoming increasingly important in the field of metagenomics. We demonstrate the feasibility of using publicly available NEON soil metagenomic sequencing datasets in combination with open access Metagenomics Rapid Annotation using the Subsystem Technology (MG-RAST) server to illustrate advantages of WGS compared to 16S amplicon sequencing. Four WGS and four 16S amplicon sequence datasets, from surface soil samples prepared by NEON investigators, were selected for comparison, using standardized protocols collected at the same locations in Colorado between April-July 2014. The dominant bacterial phyla detected across samples agreed between sequencing methodologies. However, WGS yielded greater microbial resolution, increased accuracy, and allowed identification of more genera of bacteria, archaea, viruses, and eukaryota, and putative functional genes that would have gone undetected using 16S amplicon sequencing. NEON open data will be useful for future studies characterizing and quantifying complex ecological processes associated with changing aquatic and terrestrial ecosystems.more » « less
-
Dataset Abstract A spatial variability study conducted across the LTER Main Site area (45 ha) at KBS prior to dividing the site into 1-ha experimental plots. During the 1988 growing season a stratified unaligned sampling scheme was used to collect 400-600 geo-referenced samples across the site (uniformly planted to a single variety of soybeans) for: geomorphological characteristics (microtopography, soil horizon depths, bulk density, texture); soil chemical characteristics (pH, NO3, NH4, total C, total N, moisture, inorganic P, trace metals); soil biological characteristics (N mineralization potentials, microbial biomass C, microbial biomass N, fungal/bacterial ratios, nematodes and other soil invertebrates; seed bank size); plant weed species abundance, weed biomass at peak standing crop); and insect characteristics (major pest and predator species). Most soil samples were taken before crop emergence, plant phenology samples were taken throughout the growing season, biomass samples were taken at physiological maturity, and insect samples were taken continuously. Dried soil and plant samples are archived for potential future analysis. original data source http://lter.kbs.msu.edu/datasets/6more » « less
An official website of the United States government
