On Learning Discriminative Features from Synthesized Data for Self-supervised Fine-Grained Visual Recognition

Wang, Zihu; Liu, Lingqiao; Weston, Scott_Ricardo Figueroa; Tian, Samuel; Li, Peng

doi:10.1007/978-3-031-73024-5_7

Citation Details

This content will become publicly available on November 24, 2025

On Learning Discriminative Features from Synthesized Data for Self-supervised Fine-Grained Visual Recognition

Self-Supervised Learning (SSL) has become a prominent approach for acquiring visual representations across various tasks, yet its application in fine-grained visual recognition (FGVR) is challenged by the intricate task of distinguishing subtle differences between categories. To overcome this, we introduce an novel strategy that boosts SSL's ability to extract critical discriminative features vital for FGVR. This approach creates synthesized data pairs to guide the model to focus on discriminative features critical for FGVR during SSL. We start by identifying non-discriminative features using two main criteria: features with low variance that fail to effectively separate data and those deemed less important by Grad-CAM induced from the SSL loss. We then introduce perturbations to these non-discriminative features while preserving discriminative ones. A decoder is employed to reconstruct images from both perturbed and original feature vectors to create data pairs. An encoder is trained on such generated data pairs to become invariant to variations in non-discriminative dimensions while focusing on discriminative features, thereby improving the model's performance in FGVR tasks. We demonstrate the promising FGVR performance of the proposed approach through extensive evaluation on a wide variety of datasets. more »

Award ID(s):: 1956313

PAR ID:: 10593477

Author(s) / Creator(s):: Wang, Zihu; Liu, Lingqiao; Weston, Scott_Ricardo Figueroa; Tian, Samuel; Li, Peng

Publisher / Repository:: 2024 European Conference on Computer Vision (ECCV)

Date Published:: 2024-11-24

Page Range / eLocation ID:: 101 to 117

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
This content will become publicly available on November 24, 2025
Book Chapter:
https://doi.org/10.1007/978-3-031-73024-5_7

More Like this