Efficient Shared Peak Counting in Database Peptide Search Using Compact Data Structure for Fragment-Ion Index

Haseeb, Muhammad; Saeed, Fahad

doi:10.1109/BIBM47256.2019.8983152

Citation Details

Efficient Shared Peak Counting in Database Peptide Search Using Compact Data Structure for Fragment-Ion Index

Database search is the most commonly employed method for identification of peptides from MS/MS spectra data. The search involves comparing experimentally obtained MS/MS spectra against a set of theoretical spectra predicted from a protein sequence database. One of the most commonly employed similarity metrics for spectral comparison is the shared-peak count between a pair of MS/MS spectra. Most modern methods index all generated fragment-ion data from theoretical spectra to speed up the shared peak count computations between a given experimental spectrum and all theoretical spectra. However, the bottleneck for this method is the gigantic memory footprint of fragment-ion index that leads to non-scalable solutions. In this paper, we present a novel data structure, called Compact Fragment-Ion Index Representation (CFIR), that efficiently compresses highly redundant ion-mass information in the data to reduce the index size. Our proposed data structure outperforms all existing fragment-ion indexing data structures by at least 2× in memory consumption while exhibiting the same time complexity for index construction and peptide search. The results also show comparable indexing speed, search speed and speedup scalability for CFIR-index and the state-of-the-art algorithms. more »

Award ID(s):: 1925960

PAR ID:: 10140314

Author(s) / Creator(s):: Haseeb, Muhammad; Saeed, Fahad

Date Published:: 2019-11-01

Journal Name:: Proceedings of IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

Page Range / eLocation ID:: 275 to 278

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1109/BIBM47256.2019.8983152

More Like this