skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Data driven discovery and quantification of hyperspectral leaf reflectance phenotypes across a maize diversity panel
Abstract Estimates of plant traits derived from hyperspectral reflectance data have the potential to efficiently substitute for traits, which are time or labor intensive to manually score. Typical workflows for estimating plant traits from hyperspectral reflectance data employ supervised classification models that can require substantial ground truth datasets for training. We explore the potential of an unsupervised approach, autoencoders, to extract meaningful traits from plant hyperspectral reflectance data using measurements of the reflectance of 2151 individual wavelengths of light from the leaves of maize (Zea mays) plants harvested from 1658 field plots in a replicated field trial. A subset of autoencoder‐derived variables exhibited significant repeatability, indicating that a substantial proportion of the total variance in these variables was explained by difference between maize genotypes, while other autoencoder variables appear to capture variation resulting from changes in leaf reflectance between different batches of data collection. Several of the repeatable latent variables were significantly correlated with other traits scored from the same maize field experiment, including one autoencoder‐derived latent variable (LV8) that predicted plant chlorophyll content modestly better than a supervised model trained on the same data. In at least one case, genome‐wide association study hits for variation in autoencoder‐derived variables were proximal to genes with known or plausible links to leaf phenotypes expected to alter hyperspectral reflectance. In aggregate, these results suggest that an unsupervised, autoencoder‐based approach can identify meaningful and genetically controlled variation in high‐dimensional, high‐throughput phenotyping data and link identified variables back to known plant traits of interest.  more » « less
Award ID(s):
1954556
PAR ID:
10643823
Author(s) / Creator(s):
 ;  ;  ;  ;  ;  ;  ;  ;  ;  
Publisher / Repository:
Wiley Blackwell (John Wiley & Sons)
Date Published:
Journal Name:
The Plant Phenome Journal
Volume:
7
Issue:
1
ISSN:
2578-2703
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract To predict ecological responses at broad environmental scales, grass species are commonly grouped into two broad functional types based on photosynthetic pathway. However, closely related species may have distinctive anatomical and physiological attributes that influence ecological responses, beyond those related to photosynthetic pathway alone. Hyperspectral leaf reflectance can provide an integrated measure of covarying leaf traits that may result from phylogenetic trait conservatism and/or environmental conditions. Understanding whether spectra‐trait relationships are lineage specific or reflect environmental variation across sites is necessary for using hyperspectral reflectance to predict plant responses to environmental changes across spatial scales. We measured hyperspectral leaf reflectance (400–2400 nm) and 12 structural, biochemical, and physiological leaf traits from five grass‐dominated sites spanning the Great Plains of North America. We assessed if variation in leaf reflectance spectra among grass species is explained more by evolutionary lineage (as captured by tribes or subfamilies), photosynthetic pathway (C3or C4), or site differences. We then determined whether leaf spectra can be used to predict leaf traits within and across lineages. Our results using redundancy analysis ordination (RDA) show that grass tribe identity explained more variation in leaf spectra (adjustedR2 = 0.12) than photosynthetic pathway, which explained little variation in leaf spectra (adjustedR2 = 0.00). Furthermore, leaf reflectance from the same tribe across multiple sites was more similar than leaf reflectance from the same site across tribes (adjustedR2 = 0.12 and 0.08, respectively). Across all sites and species, trait predictions based on spectra ranged considerably in predictive accuracies (R2 = 0.65 to <0.01), butR2was >0.80 for certain lineages and sites. The relationship between Vcmax, a measure of photosynthetic capacity, and spectra was particularly promising. Chloridoideae, a lineage more common at drier sites, appears to have distinct spectra‐trait relationships compared with other lineages. Overall, our results show that evolutionary relatedness explains more variation in grass leaf spectra than photosynthetic pathway or site, but consideration of lineage‐ and site‐specific trait relationships is needed to interpret spectral variation across large environmental gradients. 
    more » « less
  2. This study describes the evaluation of a range of approaches to semantic segmentation of hyperspectral images of sorghum plants, classifying each pixel as either nonplant or belonging to one of the three organ types (leaf, stalk, panicle). While many current methods for segmentation focus on separating plant pixels from background, organ-specific segmentation makes it feasible to measure a wider range of plant properties. Manually scored training data for a set of hyperspectral images collected from a sorghum association population was used to train and evaluate a set of supervised classification models. Many algorithms show acceptable accuracy for this classification task. Algorithms trained on sorghum data are able to accurately classify maize leaves and stalks, but fail to accurately classify maize reproductive organs which are not directly equivalent to sorghum panicles. Trait measurements extracted from semantic segmentation of sorghum organs can be used to identify both genes known to be controlling variation in a previously measured phenotypes (e.g., panicle size and plant height) as well as identify signals for genes controlling traits not previously quantified in this population (e.g., stalk/leaf ratio). Organ level semantic segmentation provides opportunities to identify genes controlling variation in a wide range of morphological phenotypes in sorghum, maize, and other related grain crops. 
    more » « less
  3. Abstract Understanding spatial and temporal variation in plant traits is needed to accurately predict how communities and ecosystems will respond to global change. The National Ecological Observatory Network’s (NEON’s) Airborne Observation Platform (AOP) provides hyperspectral images and associated data products at numerous field sites at 1 m spatial resolution, potentially allowing high‐resolution trait mapping. We tested the accuracy of readily available data products of NEON’s AOP, such as Leaf Area Index (LAI), Total Biomass, Ecosystem Structure (Canopy height model [CHM]), and Canopy Nitrogen, by comparing them to spatially extensive field measurements from a mesic tallgrass prairie. Correlations with AOP data products exhibited generally weak or no relationships with corresponding field measurements. The strongest relationships were between AOP LAI and ground‐measured LAI (r = 0.32) and AOP Total Biomass and ground‐measured biomass (r = 0.23). We also examined how well the full reflectance spectra (380–2,500 nm), as opposed to derived products, could predict vegetation traits using partial least‐squares regression (PLSR) models. Among all the eight traits examined, only Nitrogen had a validation of more than 0.25. For all vegetation traits, validation ranged from 0.08 to 0.29 and the range of the root mean square error of prediction (RMSEP) was 14–64%. Our results suggest that currently available AOP‐derived data products should not be used without extensive ground‐based validation. Relationships using the full reflectance spectra may be more promising, although careful consideration of field and AOP data mismatches in space and/or time, biases in field‐based measurements or AOP algorithms, and model uncertainty are needed. Finally, grassland sites may be especially challenging for airborne spectroscopy because of their high species diversity within a small area, mixed functional types of plant communities, and heterogeneous mosaics of disturbance and resource availability. Remote sensing observations are one of the most promising approaches to understanding ecological patterns across space and time. But the opportunity to engage a diverse community of NEON data users will depend on establishing rigorous links with in‐situ field measurements across a diversity of sites. 
    more » « less
  4. Abstract. Accurate assessment of leaf functional traits is crucial for a diverse range of applications from crop phenotyping to parameterizing global climate models. Leaf reflectance spectroscopy offers a promising avenue to advance ecological and agricultural research by complementing traditional, time-consuming gas exchange measurements. However, the development of robust hyperspectral models for predicting leaf photosynthetic capacity and associated traits from reflectance data has been hindered by limited data availability across species and environments. Here we introduce the Global Spectra-Trait Initiative (GSTI), a collaborative repository of paired leaf hyperspectral and gas exchange measurements from diverse ecosystems. The GSTI repository currently encompasses over 7500 observations from 397 species and 41 sites gathered from 36 published and unpublished studies, thereby offering a key resource for developing and validating hyperspectral models of leaf photosynthetic capacity. The GSTI database is developed on GitHub (https://github.com/plantphys/gsti, last access: 4 January 2026) and published to ESS-DIVE https://doi.org/10.15485/2530733, Lamour et al., 2025). It includes gas exchange data, derived photosynthetic parameters, and key leaf traits often associated with traditional gas exchange measurements such as leaf mass per area and leaf elemental composition. By providing a standardized repository for data sharing and analysis, we present a critical step towards creating hyperspectral models for predicting photosynthetic traits and associated leaf traits for terrestrial plants. 
    more » « less
  5. The role of intraspecific trait variation in functional ecology has gained traction in recent years as many papers have observed its importance in driving community diversity and ecology. Yet much of the work in this field relies on field-based trait surveys. Here, we used continuous canopy trait information derived from remote sensing data of a highly polymorphic tree species, Metrosideros polymorpha, to quantify environmental controls on intraspecific trait variation. M. polymorpha, an endemic, keystone tree species in Hawai’i, varies morphologically, chemically, and genetically across broad elevation and soil substrate age gradients, making it an ideal model organism to explore large-scale environmental drivers of intraspecific trait variation. M. polymorpha canopy reflectance (visible to shortwave infrared; 380–2510 nm) and light detection and ranging (LiDAR) data collected by the Global Airborne Observatory were modeled to canopy trait estimates of leaf mass per area, chlorophyll a and b, carotenoids, total carbon, nitrogen, phosphorus, phenols, cellulose, and top of canopy height using previously developed leaf chemometric equations. We explored how these derived traits varied across environmental gradients by extracting elevation, slope, aspect, precipitation, and soil substrate age data at canopy locations. We then obtained the feature importance values of the environmental factors in predicting each leaf trait by training random forest models to predict leaf traits individually. Of these environmental factors, elevation was the most important predictor for all canopy traits. Elevation not only affected canopy traits directly but also indirectly by influencing the relationships between soil substrate age and canopy traits as well as between nitrogen and other traits, as indicated by the change in slope between the variables at different elevation ranges. In conclusion, intraspecific variation in M. polymorpha traits derived from remote sensing adheres to known leaf economic spectrum (LES) patterns as well as interspecific LES traits previously mapped using imaging spectroscopy. 
    more » « less