Title: Semantic Segmentation of Sorghum Using Hyperspectral Data Identifies Genetic Associations
This study describes the evaluation of a range of approaches to semantic segmentation of hyperspectral images of sorghum plants, classifying each pixel as either nonplant or belonging to one of three organ types (leaf, stalk, panicle). While many current methods for segmentation focus on separating plant pixels from background, organ-specific segmentation makes it feasible to measure a wider range of plant properties. Manually scored training data for a set of hyperspectral images collected from a sorghum association population was used to train and evaluate a set of supervised classification models. Many algorithms show acceptable accuracy for this classification task. Algorithms trained on sorghum data are able to accurately classify maize leaves and stalks, but fail to accurately classify maize reproductive organs, which are not directly equivalent to sorghum panicles. Trait measurements extracted from semantic segmentation of sorghum organs can be used both to identify genes known to control variation in previously measured phenotypes (e.g., panicle size and plant height) and to identify signals for genes controlling traits not previously quantified in this population (e.g., stalk/leaf ratio). Organ-level semantic segmentation provides opportunities to identify genes controlling variation in a wide range of morphological phenotypes in sorghum, maize, and other related grain crops.
Journal Name:
Plant Phenomics
Page Range / eLocation ID:
1 to 11
Sponsoring Org:
National Science Foundation
More Like this
  1. Summary

    Maize (Zea mays L.), a model species for genetic studies, is one of the two most important crop species worldwide. The genome sequence of the reference genotype, B73, representative of the stiff stalk heterotic group was recently updated (AGPv4) using long‐read sequencing and optical mapping technology. To facilitate the use of AGPv4 and to enable functional genomic studies and association of genotype with phenotype, we determined expression abundances for replicated mRNA‐sequencing datasets from 79 tissues and five abiotic/biotic stress treatments revealing 36 207 expressed genes. Characterization of the B73 transcriptome across six organs revealed 4154 organ‐specific and 7704 differentially expressed (DE) genes following stress treatment. Gene co‐expression network analyses revealed 12 modules associated with distinct biological processes containing 13 590 genes providing a resource for further association of gene function based on co‐expression patterns. Presence−absence variants (PAVs) previously identified using whole genome resequencing data from 61 additional inbred lines were enriched in organ‐specific and stress‐induced DE genes suggesting that PAVs may function in phenological variation and adaptation to environment. Relative to core genes conserved across the 62 profiled inbreds, PAVs have lower expression abundances which are correlated with their frequency of dispersion across inbreds and on average have significantly fewer co‐expression network connections suggesting that a subset of PAVs may be on an evolutionary path to pseudogenization. To facilitate use by the community, we developed the Maize Genomics Resource website ( for viewing and data‐mining these resources and deployed two new views on the maize electronic Fluorescent Pictograph Browser (

  2. Plant communities are composed of complex phenotypes that not only differ among taxonomic groups and habitats but also change over time within a species. Restoration projects (e.g. translocations and reseeding) can introduce new functional variation in plants, which further diversifies phenotypes and complicates our ability to identify locally adaptive phenotypes for future restoration. Near‐infrared spectroscopy (NIRS) offers one approach to detect the chemical phenotypes that differentiate plant species, populations, and phenological states of individual plants over time. We use sagebrush (Artemisia spp.) as a case study to test the accuracy by which NIRS can classify variation within taxonomy and phenology of a plant that is extensively managed and restored. Our results demonstrated that NIRS can accurately classify species of sagebrush within a study site (75–96%), populations of sagebrush within a subspecies (99%), annual phenology within a population (>99%), and seasonal phenology within individual plants (>97%). Low classification accuracy by NIRS in some sites may reflect heterogeneity associated with natural hybridization, translocation of nonlocal seed sources from past restoration, or complex gene‐by‐environment interactions. Advances in our ability to detect and interpret spectral signals from plants may improve both the selection of seed sources for targeted conservation and the capacity to monitor long‐term changes in vegetation.

  3. Brassinosteroids (BRs) are a group of plant steroid hormones involved in regulating growth, development, and stress responses. Many components of the BR pathway have previously been identified and characterized. However, BR phenotyping experiments are typically performed on petri plates and/or in a low-throughput manner. Additionally, the BR pathway has extensive crosstalk with drought responses, but drought experiments are time-consuming and difficult to control. Thus, we developed Robotic Assay for Drought (RoAD) to perform BR and drought response experiments in soil-grown Arabidopsis plants. RoAD is equipped with a bench scale, a precisely controlled watering system, an RGB camera, and a laser profilometer. It performs daily weighing, watering, and imaging tasks and is capable of administering BR response assays by watering plants with Propiconazole (PCZ), a BR biosynthesis inhibitor. We developed image processing algorithms for both plant segmentation and phenotypic trait extraction in order to accurately measure traits in 2-dimensional (2D) and 3-dimensional (3D) spaces including plant surface area, leaf length, and leaf width. We then applied machine learning algorithms that utilized the extracted phenotypic parameters to identify image-derived traits that can distinguish control, drought, and PCZ-treated plants. We carried out PCZ and drought experiments on a set of BR mutants and Arabidopsis accessions with altered BR responses. Finally, we extended the RoAD assays to perform BR response assays using PCZ in Zea mays (maize) plants. This study establishes an automated and non-invasive robotic imaging system as a tool to accurately measure morphological and growth-related traits of Arabidopsis and maize plants, providing insights into the BR-mediated control of plant growth and stress responses.
  4. Abstract

    Classical genetic studies have identified many cases of pleiotropy where mutations in individual genes alter many different phenotypes. Quantitative genetic studies of natural genetic variants frequently examine one or a few traits, limiting their potential to identify pleiotropic effects of natural genetic variants. Widely adopted community association panels have been employed by plant genetics communities to study the genetic basis of naturally occurring phenotypic variation in a wide range of traits. High-density genetic marker data (18M markers) from 2 partially overlapping maize association panels comprising 1,014 unique genotypes grown in field trials across at least 7 US states and scored for 162 distinct trait data sets enabled the identification of 2,154 suggestive marker-trait associations and 697 confident associations in the maize genome using a resampling-based genome-wide association strategy. The precision of individual marker-trait associations was estimated to be 3 genes based on a reference set of genes with known phenotypes. Examples were observed of both genetic loci associated with variation in diverse traits (e.g., above-ground and below-ground traits), as well as individual loci associated with the same or similar traits across diverse environments. Many significant signals are located near genes whose functions were previously entirely unknown or estimated purely via functional data on homologs. This study demonstrates the potential of mining community association panel data using new higher-density genetic marker sets combined with resampling-based genome-wide association tests to develop testable hypotheses about gene functions, identify potential pleiotropic effects of natural genetic variants, and study genotype-by-environment interaction.

  5. Abstract Background

    Stalk lodging (mechanical failure of plant stems during windstorms) leads to global yield losses in cereal crops estimated to range from 5% to 25% annually. The cross-sectional morphology of plant stalks is a key determinant of stalk lodging resistance. However, previously developed techniques for quantifying cross-sectional morphology of plant stalks are relatively low-throughput and expensive, and often require specialized equipment and expertise. There is a need for a simple and cost-effective technique to quantify plant traits related to stalk lodging resistance in a high-throughput manner.


    A new phenotyping methodology was developed and applied to a range of plant samples including maize (Zea mays), sorghum (Sorghum bicolor), wheat (Triticum aestivum), poison hemlock (Conium maculatum), and Arabidopsis (Arabidopsis thaliana). The major diameter, minor diameter, rind thickness and number of vascular bundles were quantified for each of these plant types. Linear correlation analyses demonstrated strong agreement between the newly developed method and more time-consuming manual techniques (R² > 0.9). In addition, the new method was used to generate several specimen-specific finite element models of plant stalks. All the models compiled without issue and were successfully imported into finite element software for analysis. All the models demonstrated reasonable and stable solutions when subjected to realistic applied loads.


    A rapid, low-cost, and user-friendly phenotyping methodology was developed to quantify two-dimensional plant cross-sections. The methodology offers reduced sample preparation time and cost as compared to previously developed techniques. The new methodology employs a stereoscope and a semi-automated image processing algorithm. The algorithm can be used to produce specimen-specific, dimensionally accurate computational models (including finite element models) of plant stalks.
