A roadmap for the functional annotation of protein families: a community perspective

de Crécy-lagard, Valérie (ORCID:0000000299553785); Amorin de Hegedus, Rocio; Arighi, Cecilia (ORCID:0000000208034817); Babor, Jill; Bateman, Alex (ORCID:0000000269824660); Blaby, Ian (ORCID:0000000216313154); Blaby-Haas, Crysten; Bridge, Alan J. (ORCID:0000000321489135); Burley, Stephen K. (ORCID:0000000224879713); Cleveland, Stacey; Colwell, Lucy J.; Conesa, Ana (ORCID:000000019597311X); Dallago, Christian (ORCID:0000000346506181); Danchin, Antoine (ORCID:0000000263505001); de Waard, Anita (ORCID:0000000290344119); Deutschbauer, Adam; Dias, Raquel; Ding, Yousong (ORCID:0000000186100659); Fang, Gang; Friedberg, Iddo (ORCID:0000000217898000); Gerlt, John; Goldford, Joshua; Gorelik, Mark; Gyori, Benjamin M. (ORCID:0000000194395346); Henry, Christopher; Hutinet, Geoffrey; Jaroch, Marshall; Karp, Peter D.; Kondratova, Liudmyla; Lu, Zhiyong (ORCID:000000019998916X); Marchler-Bauer, Aron; Martin, Maria-Jesus; McWhite, Claire; Moghe, Gaurav D.; Monaghan, Paul; Morgat, Anne; Mungall, Christopher J. (ORCID:0000000266012165); Natale, Darren A.; Nelson, William C.; O’Donoghue, Seán; Orengo, Christine; O’Toole, Katherine H.; Radivojac, Predrag (ORCID:0000000267690793); Reed, Colbie; Roberts, Richard J.; Rodionov, Dmitri; Rodionova, Irina A. (ORCID:0000000265002758); Rudolf, Jeffrey D.; Saleh, Lana; Sheynkman, Gloria (ORCID:0000000242239947); Thibaud-Nissen, Francoise; Thomas, Paul D. (ORCID:0000000290743507); Uetz, Peter; Vallenet, David (ORCID:0000000166480332); Carter, Erica Watson; Weigele, Peter R. (ORCID:0000000336964541); Wood, Valerie (ORCID:0000000163307526); Wood-Charlson, Elisha M.; Xu, Jin

doi:10.1093/database/baac062

Citation Details

A roadmap for the functional annotation of protein families: a community perspective

Abstract Over the last 25 years, biology has entered the genomic era and is becoming a science of ‘big data’. Most interpretations of genomic analyses rely on accurate functional annotations of the proteins encoded by more than 500 000 genomes sequenced to date. By different estimates, only half the predicted sequenced proteins carry an accurate functional annotation, and this percentage varies drastically between different organismal lineages. Such a large gap in knowledge hampers all aspects of biological enterprise and, thereby, is standing in the way of genomic biology reaching its full potential. A brainstorming meeting to address this issue funded by the National Science Foundation was held during 3–4 February 2022. Bringing together data scientists, biocurators, computational biologists and experimentalists within the same venue allowed for a comprehensive assessment of the current state of functional annotations of protein families. Further, major issues that were obstructing the field were identified and discussed, which ultimately allowed for the proposal of solutions on how to move forward. more »

Award ID(s):: 2129768

PAR ID:: 10369262

Author(s) / Creator(s):: de Crécy-lagard, Valérie; Amorin de Hegedus, Rocio; Arighi, Cecilia; Babor, Jill; Bateman, Alex; Blaby, Ian; Blaby-Haas, Crysten; Bridge, Alan J.; Burley, Stephen K.; Cleveland, Stacey; Colwell, Lucy J.; Conesa, Ana; Dallago, Christian; Danchin, Antoine; de Waard, Anita; Deutschbauer, Adam; Dias, Raquel; Ding, Yousong; Fang, Gang; Friedberg, Iddo more » « less

Publisher / Repository:: Oxford University Press

Date Published:: 2022-08-12

Journal Name:: Database

Volume:: 2022

ISSN:: 1758-0463

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Journal Article:
https://doi.org/10.1093/database/baac062

More Like this