Multi-Context Dual Hyper-Prior Neural Image Compression

Khoshkhahtinat, Atefeh; Zafari, Ali; Mehta, Piyush M.; Akyash, Mohammad; Kashiani, Hossein; Nasrabadi, Nasser M.

doi:10.1109/ICMLA58977.2023.00091

Citation Details

Multi-Context Dual Hyper-Prior Neural Image Compression

Transform and entropy models are the two core components in deep image compression neural networks. Most existing learning-based image compression methods utilize convolutional-based transform, which lacks the ability to model long-range dependencies, primarily due to the limited receptive field of the convolution operation. To address this limitation, we propose a Transformer-based nonlinear transform. This transform has the remarkable ability to efficiently capture both local and global information from the input image, leading to a more decorrelated latent representation. In addition, we introduce a novel entropy model that incorporates two different hyperpriors to model cross-channel and spatial dependencies of the latent representation. To further improve the entropy model, we add a global context that leverages distant relationships to predict the current latent more accurately. This global context employs a causal attention mechanism to extract long-range information in a content-dependent manner. Our experiments show that our proposed framework performs better than the state-of-the-art methods in terms of rate-distortion performance. more »

Award ID(s):: 1650474

PAR ID:: 10496379

Author(s) / Creator(s):: Khoshkhahtinat, Atefeh; Zafari, Ali; Mehta, Piyush M.; Akyash, Mohammad; Kashiani, Hossein; Nasrabadi, Nasser M.

Publisher / Repository:: IEEE

Date Published:: 2023-12-15

Journal Name:: 2023 International Conference on Machine Learning and Applications (ICMLA)

ISBN:: 979-8-3503-4534-6

Page Range / eLocation ID:: 618 to 625

Format(s):: Medium: X

Location:: Jacksonville, FL, USA

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1109/ICMLA58977.2023.00091

More Like this