Towards molecular structure discovery from cryo-ET density volumes via modelling auxiliary semantic prototypes

Nair, Ashwin; Li, Xingjian; Solanki, Bhupendra; Mukhopadhyay, Souradeep; Jha, Ankit; Rafid_Uddin, Mostofa; Singha, Mainak; Banerjee, Biplab; Xu, Min

doi:10.1093/bib/bbae570

Citation Details

Towards molecular structure discovery from cryo-ET density volumes via modelling auxiliary semantic prototypes

Abstract Cryo-electron tomography (cryo-ET) is confronted with the intricate task of unveiling novel structures. General class discovery (GCD) seeks to identify new classes by learning a model that can pseudo-label unannotated (novel) instances solely using supervision from labeled (base) classes. While 2D GCD for image data has made strides, its 3D counterpart remains unexplored. Traditional methods encounter challenges due to model bias and limited feature transferability when clustering unlabeled 2D images into known and potentially novel categories based on labeled data. To address this limitation and extend GCD to 3D structures, we propose an innovative approach that harnesses a pretrained 2D transformer, enriched by an effective weight inflation strategy tailored for 3D adaptation, followed by a decoupled prototypical network. Incorporating the power of pretrained weight-inflated Transformers, we further integrate CLIP, a vision-language model to incorporate textual information. Our method synergizes a graph convolutional network with CLIP’s frozen text encoder, preserving class neighborhood structure. In order to effectively represent unlabeled samples, we devise semantic distance distributions, by formulating a bipartite matching problem for category prototypes using a decoupled prototypical network. Empirical results unequivocally highlight our method’s potential in unveiling hitherto unknown structures in cryo-ET. By bridging the gap between 2D GCD and the distinctive challenges of 3D cryo-ET data, our approach paves novel avenues for exploration and discovery in this domain. more »

Award ID(s):: 2238093 2211597 2205148 2007595 1949629

PAR ID:: 10585952

Author(s) / Creator(s):: Nair, Ashwin; Li, Xingjian; Solanki, Bhupendra; Mukhopadhyay, Souradeep; Jha, Ankit; Rafid_Uddin, Mostofa; Singha, Mainak; Banerjee, Biplab; Xu, Min

Publisher / Repository:: Oxford University Press

Date Published:: 2024-11-22

Journal Name:: Briefings in Bioinformatics

Volume:: 26

Issue:: 1

ISSN:: 1467-5463

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1093/bib/bbae570

More Like this