RUPEE: Scalable protein structure search using run position encoded residue descriptors

Ayoub, Ron; Lee, Yugyung

doi:10.1109/BIBM.2017.8217627

We have developed a fast, scalable, and purely geometric structure search combining techniques from information retrieval and big data with a novel approach to encoding sequences of torsion angles. Along the way, we introduce a new torsion angle plot that has no breaks in continuity yet maintains traditional torsion angle ranges, to assist in identifying separable regions of torsion angles. Subsequently, we introduce a new heuristic, which we call run position encoding, for handling the lack of specificity of items within character sequences containing runs of repeats. Compared with the output of the CATH structural scan, our response times are measured in seconds rather than minutes, and our average RMSDs and TM-scores are better. Our approach is a step towards a comprehensive indexing of protein structures scalable to millions of entries. Code and data are available at https://github.com/rayoub/rupee.
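The abstract names run position encoding without giving its details, so the following Python sketch is only one illustrative reading of the idea, not the authors' implementation. It assumes that each symbol in a run of repeats is tagged with its position inside that run, and that sequences are then compared by the Jaccard similarity of their k-shingles. The function names, the letters H and E standing in for residue descriptors, and the shingle length are all hypothetical choices made for this example.

```python
from itertools import groupby


def run_position_encode(descriptors):
    """Tag each symbol with its 0-based position inside its run of repeats.

    Assumed reading of "run position encoding": 'HHHE' becomes
    [('H', 0), ('H', 1), ('H', 2), ('E', 0)].
    """
    encoded = []
    for symbol, run in groupby(descriptors):
        for position, _ in enumerate(run):
            encoded.append((symbol, position))
    return encoded


def shingles(items, k=3):
    """Return the set of all contiguous length-k windows over items."""
    return {tuple(items[i:i + k]) for i in range(len(items) - k + 1)}


def jaccard(a, b):
    """Jaccard similarity of two sets (1.0 when both are empty)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


# Hypothetical descriptor strings: same symbols, different run lengths.
seq_a = "HHHHHHEEEE"
seq_b = "HHHEEEEEEE"

# Without position tags, long runs collapse into the same few shingles.
plain_similarity = jaccard(shingles(list(seq_a)), shingles(list(seq_b)))

# With position tags, shingles record how deep into a run they occur.
encoded_similarity = jaccard(
    shingles(run_position_encode(seq_a)),
    shingles(run_position_encode(seq_b)),
)

print(plain_similarity, encoded_similarity)
```

In this toy case the plain shingle sets of the two strings are identical even though the run lengths differ, which is the loss of specificity the abstract refers to. Tagging positions within runs makes the shingle sets diverge, so the similarity reflects the differing run lengths.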

Award ID(s): 1650549

PAR ID: 10051101

Author(s) / Creator(s): Ayoub, Ron; Lee, Yugyung

Date Published: 2017-11-01

Journal Name: Bioinformatics and Biomedicine (BIBM), 2017 IEEE International Conference on

Page Range / eLocation ID: 74 to 78

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Conference Paper: https://doi.org/10.1109/BIBM.2017.8217627
