

Title: Deep learning-based phenotyping for genome wide association studies of sudden death syndrome in soybean
Using a reliable and accurate method to phenotype disease incidence and severity is essential to unravel the complex genetic architecture of disease resistance in plants, and to develop disease-resistant cultivars. Genome-wide association studies (GWAS) involve phenotyping large numbers of accessions, and have been used for a myriad of traits. In field studies, genetic accessions are phenotyped across multiple environments and replications, which takes a significant amount of labor and resources. Deep learning (DL) techniques can be effective for image-based tasks; thus DL methods are becoming more routine for phenotyping traits to save time and effort. This research aims to conduct GWAS on sudden death syndrome (SDS) of soybean [Glycine max L. (Merr.)] using disease severity from both visual field ratings and DL-based (image-based) severity ratings collected from 473 accessions. Images were processed through a DL framework that identified soybean leaflets with SDS symptoms, and then quantified the disease severity on those leaflets into a few classes with a mean Average Precision of 0.34 on unseen test data. Both visual field ratings and image-based ratings identified significant single nucleotide polymorphism (SNP) markers associated with disease resistance. These significant SNP markers are either in the proximity of previously reported candidate genes for SDS or near potentially novel candidate genes. Four previously reported SDS QTL were identified that contained significant SNPs from this study, from both a visual field rating and an image-based rating. The results of this study provide an exciting avenue for using DL to capture complex phenotypic traits from images, yielding results that are comparable to, or more insightful than, subjective visual field phenotyping of disease symptoms.
Award ID(s):
1954556
PAR ID:
10403616
Journal Name:
Frontiers in Plant Science
Volume:
13
ISSN:
1664-462X
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract: Soybean (Glycine max [L.] Merr.) production is susceptible to biotic and abiotic stresses, exacerbated by extreme weather events. Water limiting stress, that is, drought, emerges as a significant risk for soybean production, underscoring the need for advancements in stress monitoring for crop breeding and production. This project combined multi‐modal information to identify the most effective and efficient automated methods to study drought response. We investigated a set of diverse soybean accessions using multiple sensors in a time series high‐throughput phenotyping manner to: (1) develop a pipeline for rapid classification of soybean drought stress symptoms, and (2) investigate methods for early detection of drought stress. We utilized high‐throughput time‐series phenotyping using unmanned aerial vehicles and sensors in conjunction with machine learning analytics, which offered a swift and efficient means of phenotyping. The visible bands were most effective in classifying the severity of canopy wilting stress after symptom emergence. Non‐visual bands in the near‐infrared region and short‐wave infrared region contribute to the differentiation of susceptible and tolerant soybean accessions prior to visual symptom development. We report pre‐visual detection of soybean wilting using a combination of different vegetation indices and spectral bands, especially in the red‐edge. These results can contribute to early stress detection methodologies and rapid classification of drought responses for breeding and production applications.
  2. Summary: A major challenge in global crop production is mitigating yield loss due to plant diseases. One of the best strategies to control these losses is through breeding for disease resistance. One barrier to the identification of resistance genes is the quantification of disease severity, which is typically based on the determination of a subjective score by a human observer. We hypothesized that image‐based, non‐destructive measurements of plant morphology over an extended period after pathogen infection would capture subtle quantitative differences between genotypes, and thus enable identification of new disease resistance loci. To test this, we inoculated a genetically diverse biparental mapping population of tomato (Solanum lycopersicum) with Ralstonia solanacearum, a soilborne pathogen that causes bacterial wilt disease. We acquired over 40,000 time‐series images of disease progression in this population, and developed an image analysis pipeline providing a suite of 10 traits to quantify bacterial wilt disease based on plant shape and size. Quantitative trait locus (QTL) analyses using image‐based phenotyping for single and multi‐traits identified QTLs that were both unique and shared compared with those identified by human assessment of wilting, and could detect QTLs earlier than human assessment. Expanding the phenotypic space of disease with image‐based, non‐destructive phenotyping both allowed earlier detection and identified new genetic components of resistance.
  3. Tomato (Solanum lycopersicum L.) is a widely used model plant species for dissecting the genomic bases of complex traits, and thus provides an optimal platform for modern “-omics” studies and genome-guided breeding. Genome-wide association studies (GWAS) have become a preferred approach for screening large diverse populations and many traits. Here, we present GWAS analysis of a collection of 115 landraces and 11 vintage and modern cultivars. A total of 26 conventional descriptors, 40 traits obtained by digital phenotyping, the fruit content of six carotenoids recorded at the early ripening (breaker) and red-ripe stages, and 21 climate-related variables were analyzed in the context of genetic diversity monitored in the 126 accessions. The data obtained from thorough phenotyping and the SNP diversity revealed by sequencing of ripe fruit transcripts of 120 of the tomato accessions were jointly analyzed to determine which genomic regions are implicated in the expressed phenotypic variation. This study reveals that the use of fruit RNA-Seq SNP diversity is effective not only for identification of genomic regions that underlie variation in fruit traits, but also of variation related to additional plant traits and adaptive responses to climate variation. These results allowed validation of our approach because different marker-trait associations mapped on chromosomal regions where other candidate genes for the same traits were previously reported. In addition, previously uncharacterized chromosomal regions were targeted as potentially involved in the expression of variable phenotypes, thus demonstrating that our tomato collection is a precious reservoir of diversity and an excellent tool for gene discovery.
  4. Abstract. Background: Alzheimer’s disease (AD) is a complex neurodegenerative disorder and the most common type of dementia. AD is characterized by a decline of cognitive function and brain atrophy, and is highly heritable with estimated heritability ranging from 60 to 80%. The most straightforward and widely used strategy to identify AD genetic basis is to perform genome-wide association study (GWAS) of the case-control diagnostic status. These GWAS studies have identified over 50 AD related susceptibility loci. Recently, imaging genetics has emerged as a new field where brain imaging measures are studied as quantitative traits to detect genetic factors. Given that many imaging genetics studies did not involve the diagnostic outcome in the analysis, the identified imaging or genetic markers may not be related or specific to the disease outcome. Results: We propose a novel method to identify disease-related genetic variants enriched by imaging endophenotypes, which are the imaging traits associated with both genetic factors and disease status. Our analysis consists of three steps: (1) map the effects of a genetic variant (e.g., single nucleotide polymorphism or SNP) onto imaging traits across the brain using a linear regression model, (2) map the effects of a diagnosis phenotype onto imaging traits across the brain using a linear regression model, and (3) detect SNP-diagnosis association via correlating the SNP effects with the diagnostic effects on the brain-wide imaging traits. We demonstrate the promise of our approach by applying it to the Alzheimer’s Disease Neuroimaging Initiative database. Among 54 AD related susceptibility loci reported in prior large-scale AD GWAS, our approach identifies 41 of those from a much smaller study cohort while the standard association approaches identify only two of those. Clearly, the proposed imaging endophenotype enriched approach can reveal promising AD genetic variants undetectable using the traditional method.
    Conclusion: We have proposed a novel method to identify AD genetic variants enriched by brain-wide imaging endophenotypes. This approach can not only boost detection power, but also reveal interesting biological pathways from genetic determinants to intermediate brain traits and to phenotypic AD outcomes.
  5. Chiang, Tzen-Yuh (Ed.)
    Pierce’s disease (PD), caused by the bacterium Xylella fastidiosa, is a deadly disease of grapevines. This study used 20 SSR markers to genotype 326 accessions of grape species collected from the southeastern and southwestern United States, Mexico and Costa Rica. Two hundred sixty-six of these accessions, and an additional 12 PD resistant hybrid cultivars developed from southeastern US grape species, were evaluated for PD resistance. Disease resistance was evaluated by quantifying the level of bacteria in stems and measuring PD symptoms on the canes and leaves. Both Bayesian clustering and principal coordinate analyses identified two groups with an east-west divide: group 1 consisted of grape species from the southeastern US and Mexico, and group 2 consisted of accessions collected from the southwestern US and Mexico. The Sierra Madre Oriental mountain range appeared to be a phylogeographic barrier. The state of Texas was identified as a potential hybridization zone. The hierarchical STRUCTURE analysis on each group showed clustering of unique grape species. An east-west divide was also observed for PD resistance. With the exception of Vitis candicans and V. cinerea accessions collected from Mexico, all other grape species as well as the resistant southeastern hybrid cultivars were susceptible to the disease. Southwestern US grape accessions from drier desert regions showed stronger resistance to the disease. Strong PD resistance was observed within three distinct genetic clusters of V. arizonica, which is adapted to drier environments and hybridizes freely with other species across its wide range.
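
The three-step imaging-endophenotype analysis summarized in item 4 can be illustrated with a minimal sketch. It is an assumption-laden toy, not the authors' implementation: the function names (`per_trait_slopes`, `snp_diagnosis_score`), the synthetic data, and the use of simple one-predictor regressions without covariates are all hypothetical choices made for illustration, and the published method may differ in its model and test statistic.

```python
import numpy as np


def per_trait_slopes(x, traits):
    """Slope of a one-predictor linear regression of each trait column on x.

    x      : shape (n_subjects,)
    traits : shape (n_subjects, n_traits)
    """
    xc = x - x.mean()
    tc = traits - traits.mean(axis=0)
    return xc @ tc / (xc @ xc)


def snp_diagnosis_score(snp, diagnosis, traits):
    """Correlate SNP effects with diagnosis effects across imaging traits."""
    snp_effects = per_trait_slopes(snp, traits)        # step 1
    dx_effects = per_trait_slopes(diagnosis, traits)   # step 2
    return np.corrcoef(snp_effects, dx_effects)[0, 1]  # step 3


# Synthetic illustration (all values are made up, not study data).
rng = np.random.default_rng(0)
n_subjects, n_traits = 400, 50
snp = rng.integers(0, 3, size=n_subjects).astype(float)        # genotype 0/1/2
diagnosis = rng.integers(0, 2, size=n_subjects).astype(float)  # case/control
pattern = rng.normal(size=n_traits)  # shared brain-wide effect pattern
traits = (
    np.outer(diagnosis, pattern)
    + np.outer(snp, 0.5 * pattern)
    + rng.normal(size=(n_subjects, n_traits))
)

score = snp_diagnosis_score(snp, diagnosis, traits)
print(f"SNP-diagnosis effect correlation: {score:.2f}")
```

Because the simulated SNP and diagnosis act on the traits through the same pattern, the score comes out strongly positive. In practice, the significance of such a score would need to be assessed, for example by permutation, which this sketch omits.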