Global Genotype by Environment Prediction Competition Reveals That Diverse Modeling Strategies Can Deliver Satisfactory Maize Yield Estimates

Washburn, Jacob D; Varela, José Ignacio; Xavier, Alencar; Chen, Qiuyue; Ertl, David; Gage, Joseph L; Holland, James B; Lima, Dayane Cristina; Romay, Maria Cinta; Lopez-Cruz, Marco; de_los_Campos, Gustavo; Barber, Wesley; Zimmer, Cristiano; Trucillo_Silva, Ignacio; Rocha, Fabiani; Rincent, Renaud; Ali, Baber; Hu, Haixiao; Runcie, Daniel E; Gusev, Kirill; Slabodkin, Andrei; Bax, Phillip; Aubert, Julie; Gangloff, Hugo; Mary-Huard, Tristan; Vanrenterghem, Theodore; Quesada-Traver, Carles; Yates, Steven; Ariza-Suárez, Daniel; Ulrich, Argeo; Wyler, Michele; Kick, Daniel R; Bellis, Emily S; Causey, Jason L; Soriano_Chavez, Emilio; Wang, Yixing; Piyush, Ved; Fernando, Gayara D; Hu, Robert K; Kumar, Rachit; Timon, Annan J; Venkatesh, Rasika; Segura_Abá, Kenia; Chen, Huan; Ranaweera, Thilanka; Shiu, Shin-Han; Wang, Peiran; Gordon, Max J; Amos, B K; Busato, Sebastiano; Perondi, Daniel; Gogna, Abhishek; Psaroudakis, Dennis; Chen, C_P James; Al-Mamun, Hawlader A; Danilevicz, Monica F; Upadhyaya, Shriprabha R; Edwards, David; de_Leon, Natalia

doi:10.1093/genetics/iyae195

Citation Details

This content will become publicly available on November 22, 2025

Global Genotype by Environment Prediction Competition Reveals That Diverse Modeling Strategies Can Deliver Satisfactory Maize Yield Estimates

Abstract Predicting phenotypes from a combination of genetic and environmental factors is a grand challenge of modern biology. Slight improvements in this area have the potential to save lives, improve food and fuel security, permit better care of the planet, and create other positive outcomes. In 2022 and 2023 the first open-to-the-public Genomes to Fields (G2F) initiative Genotype by Environment (GxE) prediction competition was held using a large dataset including genomic variation, phenotype and weather measurements and field management notes, gathered by the project over nine years. The competition attracted registrants from around the world with representation from academic, government, industry, and non-profit institutions as well as unaffiliated. These participants came from diverse disciplines include plant science, animal science, breeding, statistics, computational biology and others. Some participants had no formal genetics or plant-related training, and some were just beginning their graduate education. The teams applied varied methods and strategies, providing a wealth of modeling knowledge based on a common dataset. The winner’s strategy involved two models combining machine learning and traditional breeding tools: one model emphasized environment using features extracted by Random Forest, Ridge Regression and Least-squares, and one focused on genetics. Other high-performing teams’ methods included quantitative genetics, machine learning/deep learning, mechanistic models, and model ensembles. The dataset factors used, such as genetics; weather; and management data, were also diverse, demonstrating that no single model or strategy is far superior to all others within the context of this competition. more »

Award ID(s):: 2218206 2035472 2210431

PAR ID:: 10556562

Author(s) / Creator(s):: Washburn, Jacob D; Varela, José Ignacio; Xavier, Alencar; Chen, Qiuyue; Ertl, David; Gage, Joseph L; Holland, James B; Lima, Dayane Cristina; Romay, Maria Cinta; Lopez-Cruz, Marco; de_los_Campos, Gustavo; Barber, Wesley; Zimmer, Cristiano; Trucillo_Silva, Ignacio; Rocha, Fabiani; Rincent, Renaud; Ali, Baber; Hu, Haixiao; Runcie, Daniel E; Gusev, Kirill more » « less

Editor(s):: Sillanpää, Mikko

Publisher / Repository:: Oxford University Press

Date Published:: 2024-11-22

Journal Name:: GENETICS

ISSN:: 1943-2631

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
This content will become publicly available on November 22, 2025
Journal Article:
https://doi.org/10.1093/genetics/iyae195

More Like this