The future of legume genetic data resources: Challenges, opportunities, and priorities

Bauchet, Guillaume J.; Bett, Kirstin E.; Cameron, Connor T.; Campbell, Jacqueline D.; Cannon, Ethalinda K. S.; Cannon, Steven B.  (ORCID:0000000327778034); Carlson, Joseph W.; Chan, Agnes; Cleary, Alan; Close, Timothy J.; Cook, Douglas R.; Cooksey, Amanda M.; Coyne, Clarice; Dash, Sudhansu; Dickstein, Rebecca; Farmer, Andrew D.; Fernández‐Baca, David; Hokin, Samuel; Jones, Elizabeth S.; Kang, Yun; Monteros, Maria J.; Muñoz‐Amatriaín, María; Mysore, Kirankumar S.; Pislariu, Catalina I.; Richards, Chris; Shi, Ainong; Town, Christopher D.; Udvardi, Michael; von Wettberg, Eric Bishop; Young, Nevin D.; Zhao, Patrick X.

doi:10.1002/leg3.16

Abstract

Legumes, comprising one of the largest, most diverse, and most economically important plant families, are the subject of vibrant research and development worldwide. Continued improvement of legume crops will benefit from the recent proliferation of genetic (including genomic) resources; but the diversity, scale, and complexity of these resources presents challenges to those managing and using them. A workshop held in March of 2019 addressed questions of data resources and priorities for the legumes. The workshop identified various needs and recommendations: (a) Develop strategies to effectively store, integrate, and relate genetic resources collected in different projects. (b) Leverage information collected across many legume species by standardizing data formats and ontologies, improving the state of metadata about datasets, and increasing use of the FAIR data principles. (c) Advocate for the critical role that curators exercise in integrating complex datasets into databases and adding high value metadata that enable downstream analytics and facilitate practical applications. (d) Implement standardized software and database development practices to best leverage limited developer time and expertise gained from the various legume (and other) species. (e) Develop tools and databases that can manage genetic information for the world's plant genetic resources, enabling efficient incorporation of important traits into breeding programs. (f) Centralize information on databases, tools, and training materials and establish funding streams to support training and outreach.

More Like this