Title: Global Genotype by Environment Prediction Competition Reveals That Diverse Modeling Strategies Can Deliver Satisfactory Maize Yield Estimates
Abstract: Predicting phenotypes from a combination of genetic and environmental factors is a grand challenge of modern biology. Slight improvements in this area have the potential to save lives, improve food and fuel security, permit better care of the planet, and create other positive outcomes. In 2022 and 2023 the first open-to-the-public Genomes to Fields (G2F) initiative Genotype by Environment (GxE) prediction competition was held using a large dataset including genomic variation, phenotype and weather measurements, and field management notes, gathered by the project over nine years. The competition attracted registrants from around the world, with representation from academic, government, industry, and non-profit institutions as well as unaffiliated individuals. These participants came from diverse disciplines, including plant science, animal science, breeding, statistics, computational biology, and others. Some participants had no formal genetics or plant-related training, and some were just beginning their graduate education. The teams applied varied methods and strategies, providing a wealth of modeling knowledge based on a common dataset. The winner’s strategy involved two models combining machine learning and traditional breeding tools: one model emphasized environment, using features extracted by Random Forest, Ridge Regression, and least squares, and one focused on genetics. Other high-performing teams’ methods included quantitative genetics, machine learning/deep learning, mechanistic models, and model ensembles. The dataset factors used, such as genetics, weather, and management data, were also diverse, demonstrating that no single model or strategy is far superior to all others within the context of this competition.
Award ID(s):
2218206 2035472 2210431
PAR ID:
10556562
Editor(s):
Sillanpää, Mikko
Publisher / Repository:
Oxford University Press
Date Published:
Journal Name:
GENETICS
ISSN:
1943-2631
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Many factors shape public perceptions of extreme weather risk; understanding these factors is important to encourage preparedness. This article describes a novel workshop designed to encourage individual and community decision-making about predicted storm surge flooding. Over 160 U.S. college students participated in this 4-h experience. Distinctive features included 1) two kinds of visualizations, standard weather forecasting graphics versus 3D computer graphics visualization; 2) narrative about a fictitious storm, role-play, and guided discussion of participants’ concerns; and 3) use of an “ethical matrix,” a collective decision-making tool that elicits diverse perspectives based on the lived experiences of diverse stakeholders. Participants experienced a narrative about a hurricane with potential for devastating storm surge flooding on a fictitious coastal college campus. They answered survey questions before, at key points during, and after the narrative, interspersed with forecasts leading to predicted storm landfall. During facilitated breakout groups, participants role-played characters and filled out an ethical matrix. Discussing the matrix encouraged consideration of circumstances impacting evacuation decisions. Participants’ comments suggest several components may have influenced perceptions of personal risk, risks to others, the importance of monitoring weather, and preparing for emergencies. Surprisingly, no differences between the standard forecast graphics versus the immersive, hyperlocal visualizations were detected. Overall, participants’ comments indicate the workshop increased appreciation of others’ evacuation and preparation challenges. 
  2. Abstract Next-generation surveys like the Legacy Survey of Space and Time (LSST) on the Vera C. Rubin Observatory (Rubin) will generate orders of magnitude more discoveries of transients and variable stars than previous surveys. To prepare for this data deluge, we developed the Photometric LSST Astronomical Time-series Classification Challenge (PLAsTiCC), a competition that aimed to catalyze the development of robust classifiers under LSST-like conditions of a nonrepresentative training set for a large photometric test set of imbalanced classes. Over 1000 teams participated in PLAsTiCC, which was hosted in the Kaggle data science competition platform between 2018 September 28 and 2018 December 17, ultimately identifying three winners in 2019 February. Participants produced classifiers employing a diverse set of machine-learning techniques including hybrid combinations and ensemble averages of a range of approaches, among them boosted decision trees, neural networks, and multilayer perceptrons. The strong performance of the top three classifiers on Type Ia supernovae and kilonovae represents a major improvement over the current state of the art within astronomy. This paper summarizes the most promising methods and evaluates their results in detail, highlighting future directions both for classifier development and simulation needs for a next-generation PLAsTiCC data set.
  3. To be successful in their future careers, students must be able to process information, devise creative solutions, and apply previous knowledge to new situations. Learning through only traditional teaching practices that rely heavily on lecture format and memorization is insufficient to prepare students for the future. Interactive project-based learning that incorporates productive failure provides the opportunity for students to problem-solve novel topics and potentially fail at finding the solution. Through explanation, elaboration, comparison of iterations, refinement, and implementations, students can be more prepared to solve future problems. Our study examined the benefits of productive failure on high school students from both formal and informal learning environments working in collaborative teams to design and create 3D plant models. This STEAM project integrates science, design, and technology through innovative learning experiences in plant and agricultural science using emergent technologies. This learning experience encourages students to work together in collaborative teams of self-identified science, technophile, and art students to create 3D models of plants used in research at the Donald Danforth Plant Science Center in St. Louis, MO. Students learn about scientific research and the importance of plants in our society, and practice science communication skills. To create the 3D models, students must learn by doing to become proficient in using previously unfamiliar 3D modeling software, with their teachers acting merely as facilitators. Students become active participants in their own learning by overcoming challenges through research, troubleshooting, teamwork, and perseverance. We used a mixed-method assessment approach comparing pre- and post-reflection questions. Students experienced many challenges in learning the 3D modeling programs. They reported that they overcame difficulties working with the 3D modeling programs primarily through help from others and consulting outside resources, such as YouTube videos, as well as through continued effort. Students indicated that they faced challenges when creating their models but recognized that this project was a learning experience. Productive failure through the process of struggling and learning from one’s mistakes can encourage positive learning outcomes and give students a better ability to overcome future challenges.
  4. Universities have been expanding undergraduate data science programs. Involving graduate students in these new opportunities can foster their growth as data science educators. We describe two programs that employ a near-peer mentoring structure, in which graduate students mentor undergraduates, to (a) strengthen their teaching and mentoring skills and (b) provide research and learning experiences for undergraduates from diverse backgrounds. In the Data Science for Social Good program, undergraduate participants work in teams to tackle a data science project with social impact. Graduate mentors guide project work and provide just-in-time teaching and feedback. The Stanford Mentoring in Data Science course offers training in effective and inclusive mentorship strategies. In an experiential learning framework, enrolled graduate students are paired with undergraduate students from non-R1 schools, whom they mentor through weekly one-on-one remote meetings. In end-of-program surveys, mentors reported growth through both programs. Drawing from these experiences, we developed a self-paced mentor training guide, which engages teaching, mentoring and project management abilities. These initiatives and the shared materials can serve as prototypes of future programs that cultivate mutual growth of both undergraduate and graduate students in a high-touch, inclusive, and encouraging environment.