The ProteomeXchange consortium at 10 years: 2023 update

Deutsch, Eric W.; Bandeira, Nuno; Perez-Riverol, Yasset (ORCID:0000000165796941); Sharma, Vagisha; Carver, Jeremy J.; Mendoza, Luis; Kundu, Deepti J.; Wang, Shengbo; Bandla, Chakradhar; Kamatchinathan, Selvakumar; Hewapathirana, Suresh; Pullman, Benjamin S.; Wertz, Julie; Sun, Zhi; Kawano, Shin; Okuda, Shujiro (ORCID:0000000277048104); Watanabe, Yu; MacLean, Brendan; MacCoss, Michael J.; Zhu, Yunping (ORCID:0000000273207411); Ishihama, Yasushi; Vizcaíno, Juan Antonio (ORCID:0000000239054335)

doi:10.1093/nar/gkac1040

Abstract

Mass spectrometry (MS) is by far the most used experimental approach in high-throughput proteomics. The ProteomeXchange (PX) consortium of proteomics resources (http://www.proteomexchange.org) was originally set up to standardize data submission and dissemination of public MS proteomics data. It is now 10 years since the initial data workflow was implemented. In this manuscript, we describe the main developments in PX since the previous update manuscript in Nucleic Acids Research was published in 2020. The six members of the Consortium are PRIDE, PeptideAtlas (including PASSEL), MassIVE, jPOST, iProX and Panorama Public. We report the current data submission statistics, showcasing that the number of datasets submitted to PX resources has continued to increase every year. As of June 2022, more than 34 233 datasets had been submitted to PX resources, and from those, 20 062 (58.6%) just in the last three years. We also report the development of the Universal Spectrum Identifiers and the improvements in capturing the experimental metadata annotations. In parallel, we highlight that data re-use activities of public datasets continue to increase, enabling connections between PX resources and other popular bioinformatics resources, novel research and also new data resources. Finally, we summarise the current state-of-the-art in data management practices for sensitive human (clinical) proteomics data.

More Like this