Elusive Images: Beyond Coarse Analysis for Fine-Grained Recognition

Anderson, Connor; Gwilliam, Matt; Gaskin, Evelyn; Farrell, Ryan

doi:10.1109/WACV57701.2024.00088

While the community has seen many advances in recent years to address the challenging problem of Fine-grained Visual Categorization (FGVC), progress seems to be slowing: new state-of-the-art methods often distinguish themselves by improving top-1 accuracy by mere tenths of a percent. However, across all of the now-standard FGVC datasets, there remain sizeable portions of the test data that none of the current state-of-the-art (SOTA) models can successfully predict. This paper provides a framework for identifying and studying the errors that current methods make across diverse fine-grained datasets. Three models of difficulty (Prediction Overlap, Prediction Rank and Pair-wise Class Confusion) are employed to highlight the most challenging sets of images and classes. Extensive experiments apply a range of standard and SOTA methods, evaluating them on multiple FGVC domains and datasets. Insights acquired from coupling these difficulty paradigms with the careful analysis of experimental results suggest crucial areas for future FGVC research, focusing critically on the set of elusive images that none of the current models can correctly classify. Code is available at catalys1.github.io/elusive-images-fgvc.
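
The abstract names Prediction Overlap and the set of elusive images without defining how they are computed. As a rough illustration only, the sketch below assumes that overlap means the number of models whose top-1 prediction matches the ground-truth label for a given image, and that an elusive image is one with an overlap of zero. The function names, model names, and toy data are hypothetical and are not taken from the paper or its released code.

```python
from typing import Dict, List, Sequence


def prediction_overlap(
    predictions: Dict[str, Sequence[int]], labels: Sequence[int]
) -> List[int]:
    """For each test image, count how many models predict its label correctly.

    Assumption: this is one plausible reading of "Prediction Overlap";
    the paper may define it differently.
    """
    for name, preds in predictions.items():
        if len(preds) != len(labels):
            raise ValueError(
                f"model {name!r} has {len(preds)} predictions, "
                f"expected {len(labels)}"
            )
    return [
        sum(1 for preds in predictions.values() if preds[i] == label)
        for i, label in enumerate(labels)
    ]


def elusive_images(overlap: Sequence[int]) -> List[int]:
    """Indices of images that no model classifies correctly (overlap of zero)."""
    return [i for i, count in enumerate(overlap) if count == 0]


# Toy data (hypothetical): top-1 predictions from three models on five images.
labels = [0, 1, 2, 1, 0]
predictions = {
    "model_a": [0, 1, 1, 2, 0],
    "model_b": [0, 2, 1, 2, 0],
    "model_c": [0, 1, 2, 2, 1],
}

overlap = prediction_overlap(predictions, labels)
elusive = elusive_images(overlap)
print(overlap)  # [3, 2, 1, 0, 2]
print(elusive)  # [3]
```

In this toy example the fourth image (index 3) is missed by every model, so under the stated assumption it would fall into the elusive set.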

Award ID(s): 1651832

PAR ID: 10630061

Author(s) / Creator(s): Anderson, Connor; Gwilliam, Matt; Gaskin, Evelyn; Farrell, Ryan

Publisher / Repository: IEEE

Date Published: 2024-01-03

ISSN: 2642-9381

ISBN: 979-8-3503-1892-0

Page Range / eLocation ID: 818 to 828

Format(s): Medium: X

Location: Waikoloa, HI, USA

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Conference Paper: https://doi.org/10.1109/WACV57701.2024.00088
