skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.

Attention:

The NSF Public Access Repository (PAR) system and access will be unavailable from 11:00 PM ET on Thursday, June 12 until 2:00 AM ET on Friday, June 13 due to maintenance. We apologize for the inconvenience.


This content will become publicly available on December 1, 2025

Title: Data driven discovery and quantification of hyperspectral leaf reflectance phenotypes across a maize diversity panel
Abstract Estimates of plant traits derived from hyperspectral reflectance data have the potential to efficiently substitute for traits, which are time or labor intensive to manually score. Typical workflows for estimating plant traits from hyperspectral reflectance data employ supervised classification models that can require substantial ground truth datasets for training. We explore the potential of an unsupervised approach, autoencoders, to extract meaningful traits from plant hyperspectral reflectance data using measurements of the reflectance of 2151 individual wavelengths of light from the leaves of maize (Zea mays) plants harvested from 1658 field plots in a replicated field trial. A subset of autoencoder‐derived variables exhibited significant repeatability, indicating that a substantial proportion of the total variance in these variables was explained by difference between maize genotypes, while other autoencoder variables appear to capture variation resulting from changes in leaf reflectance between different batches of data collection. Several of the repeatable latent variables were significantly correlated with other traits scored from the same maize field experiment, including one autoencoder‐derived latent variable (LV8) that predicted plant chlorophyll content modestly better than a supervised model trained on the same data. In at least one case, genome‐wide association study hits for variation in autoencoder‐derived variables were proximal to genes with known or plausible links to leaf phenotypes expected to alter hyperspectral reflectance. In aggregate, these results suggest that an unsupervised, autoencoder‐based approach can identify meaningful and genetically controlled variation in high‐dimensional, high‐throughput phenotyping data and link identified variables back to known plant traits of interest.  more » « less
Award ID(s):
1954556
PAR ID:
10580003
Author(s) / Creator(s):
; ; ; ; ; ; ; ; ;
Publisher / Repository:
Wiley
Date Published:
Journal Name:
The Plant Phenome Journal
Volume:
7
Issue:
1
ISSN:
2578-2703
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. This study describes the evaluation of a range of approaches to semantic segmentation of hyperspectral images of sorghum plants, classifying each pixel as either nonplant or belonging to one of the three organ types (leaf, stalk, panicle). While many current methods for segmentation focus on separating plant pixels from background, organ-specific segmentation makes it feasible to measure a wider range of plant properties. Manually scored training data for a set of hyperspectral images collected from a sorghum association population was used to train and evaluate a set of supervised classification models. Many algorithms show acceptable accuracy for this classification task. Algorithms trained on sorghum data are able to accurately classify maize leaves and stalks, but fail to accurately classify maize reproductive organs which are not directly equivalent to sorghum panicles. Trait measurements extracted from semantic segmentation of sorghum organs can be used to identify both genes known to be controlling variation in a previously measured phenotypes (e.g., panicle size and plant height) as well as identify signals for genes controlling traits not previously quantified in this population (e.g., stalk/leaf ratio). Organ level semantic segmentation provides opportunities to identify genes controlling variation in a wide range of morphological phenotypes in sorghum, maize, and other related grain crops. 
    more » « less
  2. Abstract Understanding spatial and temporal variation in plant traits is needed to accurately predict how communities and ecosystems will respond to global change. The National Ecological Observatory Network’s (NEON’s) Airborne Observation Platform (AOP) provides hyperspectral images and associated data products at numerous field sites at 1 m spatial resolution, potentially allowing high‐resolution trait mapping. We tested the accuracy of readily available data products of NEON’s AOP, such as Leaf Area Index (LAI), Total Biomass, Ecosystem Structure (Canopy height model [CHM]), and Canopy Nitrogen, by comparing them to spatially extensive field measurements from a mesic tallgrass prairie. Correlations with AOP data products exhibited generally weak or no relationships with corresponding field measurements. The strongest relationships were between AOP LAI and ground‐measured LAI (r = 0.32) and AOP Total Biomass and ground‐measured biomass (r = 0.23). We also examined how well the full reflectance spectra (380–2,500 nm), as opposed to derived products, could predict vegetation traits using partial least‐squares regression (PLSR) models. Among all the eight traits examined, only Nitrogen had a validation of more than 0.25. For all vegetation traits, validation ranged from 0.08 to 0.29 and the range of the root mean square error of prediction (RMSEP) was 14–64%. Our results suggest that currently available AOP‐derived data products should not be used without extensive ground‐based validation. Relationships using the full reflectance spectra may be more promising, although careful consideration of field and AOP data mismatches in space and/or time, biases in field‐based measurements or AOP algorithms, and model uncertainty are needed. Finally, grassland sites may be especially challenging for airborne spectroscopy because of their high species diversity within a small area, mixed functional types of plant communities, and heterogeneous mosaics of disturbance and resource availability. Remote sensing observations are one of the most promising approaches to understanding ecological patterns across space and time. But the opportunity to engage a diverse community of NEON data users will depend on establishing rigorous links with in‐situ field measurements across a diversity of sites. 
    more » « less
  3. The role of intraspecific trait variation in functional ecology has gained traction in recent years as many papers have observed its importance in driving community diversity and ecology. Yet much of the work in this field relies on field-based trait surveys. Here, we used continuous canopy trait information derived from remote sensing data of a highly polymorphic tree species, Metrosideros polymorpha, to quantify environmental controls on intraspecific trait variation. M. polymorpha, an endemic, keystone tree species in Hawai’i, varies morphologically, chemically, and genetically across broad elevation and soil substrate age gradients, making it an ideal model organism to explore large-scale environmental drivers of intraspecific trait variation. M. polymorpha canopy reflectance (visible to shortwave infrared; 380–2510 nm) and light detection and ranging (LiDAR) data collected by the Global Airborne Observatory were modeled to canopy trait estimates of leaf mass per area, chlorophyll a and b, carotenoids, total carbon, nitrogen, phosphorus, phenols, cellulose, and top of canopy height using previously developed leaf chemometric equations. We explored how these derived traits varied across environmental gradients by extracting elevation, slope, aspect, precipitation, and soil substrate age data at canopy locations. We then obtained the feature importance values of the environmental factors in predicting each leaf trait by training random forest models to predict leaf traits individually. Of these environmental factors, elevation was the most important predictor for all canopy traits. Elevation not only affected canopy traits directly but also indirectly by influencing the relationships between soil substrate age and canopy traits as well as between nitrogen and other traits, as indicated by the change in slope between the variables at different elevation ranges. In conclusion, intraspecific variation in M. polymorpha traits derived from remote sensing adheres to known leaf economic spectrum (LES) patterns as well as interspecific LES traits previously mapped using imaging spectroscopy. 
    more » « less
  4. Plant traits are often measured in the field or laboratory to characterize stress responses. However, direct measurements are not always cost effective for broader sampling efforts, whereas indirect approaches such as reflectance spectroscopy could offer efficient and scalable alternatives. Here, we used field spectroscopy to assess whether (1) existing vegetation indices could predict leaf trait responses to heat stress, or if (2) partial least squares regression (PLSR) spectral models could quantify these trait responses. On several warm, sunny days, we measured leaf trait responses indicative of photosynthetic mechanisms, plant water status, and morphology, including electron transport rate (ETR), photochemical quenching (qP), leaf water potential (Ψleaf), and specific leaf area (SLA) in 51 urban trees from nine species. Concurrent measures of hyperspectral leaf reflectance from the same individuals were used to calculate vegetation indices for correlation with trait responses. We found that vegetation indices predicted only SLA robustly (R2 = 0.55), while PLSR predicted all leaf trait responses of interest with modest success (R2 = 0.36 to 0.58). Using spectral band subsets corresponding to commercially available drone-mounted hyperspectral cameras, as well as those selected for use in common multispectral satellite missions, we were able to estimate ETR, qP, and SLA with reasonable accuracy, highlighting the potential for large-scale prediction of these parameters. Overall, reflectance spectroscopy and PLSR can identify wavelengths and wavelength ranges that are important for remote sensing-based modeling of important functional trait responses of trees to heat stress over broad ranges. 
    more » « less
  5. Elizabeth Borer (Ed.)
    Understanding spatial and temporal variation in plant traits is needed to accurately predict how communities and ecosystems will respond to global change. The National Ecological Observatory Network’s (NEON’s) Airborne Observation Platform (AOP) provides hyperspectral images and associated data products at numerous field sites at 1 m spatial resolution, potentially allowing high-resolution trait mapping. We tested the accuracy of readily available data products of NEON’s AOP, such as Leaf Area Index (LAI), Total Biomass, Ecosystem Structure (Canopy height model [CHM]), and Canopy Nitrogen, by comparing them to spatially extensive field measurements from a mesic tallgrass prairie. Correlations with AOP data products exhibited generally weak or no relationships with corresponding field measurements. The strongest relationships were between AOP LAI and ground-measured LAI (r = 0.32) and AOP Total Biomass and ground-measured biomass (r = 0.23). We also examined how well the full reflectance spectra (380–2,500 nm), as opposed to derived products, could predict vegetation traits using partial least-squares regression (PLSR) models. Among all the eight traits examined, only Nitrogen had a validation of more than 0.25. For all vegetation traits, validation ranged from 0.08 to 0.29 and the range of the root mean square error of prediction (RMSEP) was 14–64%. Our results suggest that currently available AOP-derived data products should not be used without extensive ground-based validation. Relationships using the full reflectance spectra may be more promising, although careful consideration of field and AOP data mismatches in space and/or time, biases in field-based measurements or AOP algorithms, and model uncertainty are needed. Finally, grassland sites may be especially challenging for airborne spectroscopy because of their high species diversity within a small area, mixed functional types of plant communities, and heterogeneous mosaics of disturbance and resource availability. Remote sensing observations are one of the most promising approaches to understanding ecological patterns across space and time. But the opportunity to engage a diverse community of NEON data users will depend on establishing rigorous links with in-situ field measurements across a diversity of sites. 
    more » « less