skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


This content will become publicly available on November 22, 2025

Title: Global Genotype by Environment Prediction Competition Reveals That Diverse Modeling Strategies Can Deliver Satisfactory Maize Yield Estimates
Abstract Predicting phenotypes from a combination of genetic and environmental factors is a grand challenge of modern biology. Slight improvements in this area have the potential to save lives, improve food and fuel security, permit better care of the planet, and create other positive outcomes. In 2022 and 2023 the first open-to-the-public Genomes to Fields (G2F) initiative Genotype by Environment (GxE) prediction competition was held using a large dataset including genomic variation, phenotype and weather measurements and field management notes, gathered by the project over nine years. The competition attracted registrants from around the world with representation from academic, government, industry, and non-profit institutions as well as unaffiliated. These participants came from diverse disciplines include plant science, animal science, breeding, statistics, computational biology and others. Some participants had no formal genetics or plant-related training, and some were just beginning their graduate education. The teams applied varied methods and strategies, providing a wealth of modeling knowledge based on a common dataset. The winner’s strategy involved two models combining machine learning and traditional breeding tools: one model emphasized environment using features extracted by Random Forest, Ridge Regression and Least-squares, and one focused on genetics. Other high-performing teams’ methods included quantitative genetics, machine learning/deep learning, mechanistic models, and model ensembles. The dataset factors used, such as genetics; weather; and management data, were also diverse, demonstrating that no single model or strategy is far superior to all others within the context of this competition.  more » « less
Award ID(s):
2218206 2035472 2210431
PAR ID:
10556562
Author(s) / Creator(s):
; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; more » ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; « less
Editor(s):
Sillanpää, Mikko
Publisher / Repository:
Oxford University Press
Date Published:
Journal Name:
GENETICS
ISSN:
1943-2631
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Many factors shape public perceptions of extreme weather risk; understanding these factors is important to encourage preparedness. This article describes a novel workshop designed to encourage individual and community decision-making about predicted storm surge flooding. Over 160 U.S. college students participated in this 4-h experience. Distinctive features included 1) two kinds of visualizations, standard weather forecasting graphics versus 3D computer graphics visualization; 2) narrative about a fictitious storm, role-play, and guided discussion of participants’ concerns; and 3) use of an “ethical matrix,” a collective decision-making tool that elicits diverse perspectives based on the lived experiences of diverse stakeholders. Participants experienced a narrative about a hurricane with potential for devastating storm surge flooding on a fictitious coastal college campus. They answered survey questions before, at key points during, and after the narrative, interspersed with forecasts leading to predicted storm landfall. During facilitated breakout groups, participants role-played characters and filled out an ethical matrix. Discussing the matrix encouraged consideration of circumstances impacting evacuation decisions. Participants’ comments suggest several components may have influenced perceptions of personal risk, risks to others, the importance of monitoring weather, and preparing for emergencies. Surprisingly, no differences between the standard forecast graphics versus the immersive, hyperlocal visualizations were detected. Overall, participants’ comments indicate the workshop increased appreciation of others’ evacuation and preparation challenges. 
    more » « less
  2. Abstract Next-generation surveys like the Legacy Survey of Space and Time (LSST) on the Vera C. Rubin Observatory (Rubin) will generate orders of magnitude more discoveries of transients and variable stars than previous surveys. To prepare for this data deluge, we developed the Photometric LSST Astronomical Time-series Classification Challenge (PLAsTiCC), a competition that aimed to catalyze the development of robust classifiers under LSST-like conditions of a nonrepresentative training set for a large photometric test set of imbalanced classes. Over 1000 teams participated in PLAsTiCC, which was hosted in the Kaggle data science competition platform between 2018 September 28 and 2018 December 17, ultimately identifying three winners in 2019 February. Participants produced classifiers employing a diverse set of machine-learning techniques including hybrid combinations and ensemble averages of a range of approaches, among them boosted decision trees, neural networks, and multilayer perceptrons. The strong performance of the top three classifiers on Type Ia supernovae and kilonovae represent a major improvement over the current state of the art within astronomy. This paper summarizes the most promising methods and evaluates their results in detail, highlighting future directions both for classifier development and simulation needs for a next-generation PLAsTiCC data set. 
    more » « less
  3. To be successful in their future careers, students must be able to process information, devise creative solutions, and apply previous knowledge to new situations. Learning through only traditional teaching practices that rely heavily on lecture format and memorization is insufficient to prepare students for the future. Interactive project-based learning that experiences productive failure provides the opportunity for students to problem-solve novel topics and potentially fail at finding the solution. Through explanation, elaboration, comparison of iterations, refinement, and implementations, students can be more prepared to solve future problems. Our study examined the benefits of productive failure on high school students from both formal and informal learning environments working in collaborative teams to design and create 3D plant models. This STEAM project integrates science, design, and technology through innovative learning experiences in plant and agricultural science using emergent technologies. This learning experience encourages students to work together in collaborative teams of self-identified science, technophile, and art students to create 3D models of plants used in research at the Donald Danforth Plant Science Center in St. Louis, MO. Students learn about scientific research, the importance of plants in our society, and practice science communication skills. To create the 3D models, students must learn-by-doing to become proficient in using previously unfamiliar 3D modeling software where their teachers are merely facilitators. Students become active participants in their own learning by overcoming challenges through research, troubleshooting, teamwork, and perseverance. We used a mixed-method assessment approach comparing pre- and post-reflection questions. Students experience many challenges with learning the 3D model programs. They reported that they overcame difficulties working with the 3D modeling programs primarily through help from others and consulting outside resources, such as YouTube videos, as well as through continued effort. Students indicated that they faced challenges when creating their models but recognized that this project was a learning experience. Productive failure through the process of struggling and learning from one’s mistakes can encourage positive learning outcomes and give students a better ability to overcome future challenges. 
    more » « less
  4. To be successful in their future careers, students must be able to process information, devise creative solutions, and apply previous knowledge to new situations. Learning through only traditional teaching practices that rely heavily on lecture format and memorization is insufficient to prepare students for the future. Interactive project-based learning that experiences productive failure provides the opportunity for students to problem-solve novel topics and potentially fail at finding the solution. Through explanation, elaboration, comparison of iterations, refinement, and implementations, students can be more prepared to solve future problems. Our study examined the benefits of productive failure on high school students from both formal and informal learning environments working in collaborative teams to design and create 3D plant models. This STEAM project integrates science, design, and technology through innovative learning experiences in plant and agricultural science using emergent technologies. This learning experience encourages students to work together in collaborative teams of self-identified science, technophile, and art students to create 3D models of plants used in research at the Donald Danforth Plant Science Center in St. Louis, MO. Students learn about scientific research, the importance of plants in our society, and practice science communication skills. To create the 3D models, students must learn-by-doing to become proficient in using previously unfamiliar 3D modeling software where their teachers are merely facilitators. Students become active participants in their own learning by overcoming challenges through research, troubleshooting, teamwork, and perseverance. We used a mixed-method assessment approach comparing pre- and post-reflection questions. Students experience many challenges with learning the 3D model programs. They reported that they overcame difficulties working with the 3D modeling programs primarily through help from others and consulting outside resources, such as YouTube videos, as well as through continued effort. Students indicated that they faced challenges when creating their models but recognized that this project was a learning experience. Productive failure through the process of struggling and learning from one’s mistakes can encourage positive learning outcomes and give students a better ability to overcome future challenges. 
    more » « less
  5. Abstract As an increasing number of machine learning (ML) products enter the research-to-operations (R2O) pipeline, researchers have anecdotally noted a perceived hesitancy by operational forecasters to adopt this relatively new technology. One explanation often cited in the literature is that this perceived hesitancy derives from the complex and opaque nature of ML methods. Because modern ML models are trained to solve tasks by optimizing a potentially complex combination of mathematical weights, thresholds, and nonlinear cost functions, it can be difficult to determine how these models reach a solution from their given input. However, it remains unclear to what degree a model’s transparency may influence a forecaster’s decision to use that model or if that impact differs between ML and more traditional (i.e., non-ML) methods. To address this question, a survey was offered to forecaster and researcher participants attending the 2021 NOAA Hazardous Weather Testbed (HWT) Spring Forecasting Experiment (SFE) with questions about how participants subjectively perceive and compare machine learning products to more traditionally derived products. Results from this study revealed few differences in how participants evaluated machine learning products compared to other types of guidance. However, comparing the responses between operational forecasters, researchers, and academics exposed notable differences in what factors the three groups considered to be most important for determining the operational success of a new forecast product. These results support the need for increased collaboration between the operational and research communities. Significance StatementParticipants of the 2021 Hazardous Weather Testbed Spring Forecasting Experiment were surveyed to assess how machine learning products are perceived and evaluated in operational settings. The results revealed little difference in how machine learning products are evaluated compared to more traditional methods but emphasized the need for explainable product behavior and comprehensive end-user training. 
    more » « less