Title: An End-to-End Learning Architecture for Efficient Image Encoding and Deep Learning
Learning-based image/video codecs typically utilize the well-known auto-encoder structure, where the encoder transforms input data to a low-dimensional latent representation. Efficient latent encoding can reduce bandwidth needs during compression for transmission and storage. In this paper, we examine the effect of assigning high-level coarse grouping labels to each latent vector. Designing coding profiles for each latent group can achieve high-compression encoding. We show that such grouping can be learned via end-to-end optimization of the codec and the deep learning (DL) model to optimize rate-accuracy for a given data set. For cloud-based inference, the source encoder can select a coding profile based on its learned grouping and encode the data features accordingly. Our test results on image classification show that significant performance improvement can be achieved with learned grouping over its non-grouping counterpart.
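A minimal sketch (PyTorch) of the idea the abstract describes: an encoder produces a latent vector, a lightweight head assigns it a coarse group label, and each group selects a different coding profile, represented here as a per-group quantization step controlling rate. The module shapes, the soft profile selection, and the straight-through quantizer are illustrative assumptions, not the paper's implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class GroupedLatentCodec(nn.Module):
    def __init__(self, latent_dim=64, num_groups=4):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Conv2d(3, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(64, latent_dim),
        )
        # Head that predicts a coarse group label for each latent vector.
        self.group_head = nn.Linear(latent_dim, num_groups)
        # One learnable quantization step per group = one coding profile per group.
        self.q_steps = nn.Parameter(torch.linspace(0.5, 2.0, num_groups))

    def forward(self, x):
        z = self.encoder(x)                      # (B, latent_dim)
        group_logits = self.group_head(z)        # (B, num_groups)
        probs = F.softmax(group_logits, dim=-1)
        step = (probs @ self.q_steps).unsqueeze(-1)  # soft profile selection
        z_q = torch.round(z / step) * step       # quantize with the chosen step
        z_q = z + (z_q - z).detach()             # straight-through gradient
        return z_q, group_logits
```

At inference the source encoder would pick the hard argmax group and encode z with that group's profile; the soft selection above keeps training differentiable, so the grouping can be optimized end to end against the downstream classifier's rate-accuracy loss.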
Award ID(s):
1934568
NSF-PAR ID:
10350087
Author(s) / Creator(s):
; ;
Date Published:
Journal Name:
29th European Signal Processing Conference (EUSIPCO)
Page Range / eLocation ID:
691 to 695
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Much of sensory neuroscience focuses on sensory features that are chosen by the experimenter because they are thought to be behaviorally relevant to the organism. However, it is not generally known what these features are in complex, natural scenes. This work focuses on using the retinal encoding of natural movies to determine the presumably behaviorally relevant features that the brain represents. It is prohibitive to parameterize a natural movie and its respective retinal encoding fully. We use time within a natural movie as a proxy for the whole suite of features evolving across the scene. We then use a task-agnostic deep architecture, an encoder-decoder, to model the retinal encoding process and characterize its representation of "time in the natural scene" in a compressed latent space. In our end-to-end training, an encoder learns a compressed latent representation from a large population of salamander retinal ganglion cells responding to natural movies, while a decoder samples from this compressed latent space to generate the appropriate movie frame. By comparing latent representations of retinal activity from three movies, we find that the retina performs transfer learning to encode time: the precise, low-dimensional representation of time learned from one movie can be used to represent time in a different movie, with up to 17 ms resolution. We then show that static textures and velocity features of a natural movie are synergistic. The retina simultaneously encodes both to establish a generalizable, low-dimensional representation of time in the natural scene.
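A minimal sketch (PyTorch) of the encoder-decoder setup this abstract describes: a population response vector from retinal ganglion cells is compressed into a low-dimensional latent, and a decoder generates the corresponding movie frame. Cell count, latent size, and frame size are illustrative assumptions.

```python
import torch
import torch.nn as nn

NUM_CELLS, LATENT_DIM, FRAME = 512, 8, 64

encoder = nn.Sequential(                 # retinal responses -> compressed latent
    nn.Linear(NUM_CELLS, 128), nn.ReLU(),
    nn.Linear(128, LATENT_DIM),
)
decoder = nn.Sequential(                 # latent -> reconstructed movie frame
    nn.Linear(LATENT_DIM, 128), nn.ReLU(),
    nn.Linear(128, FRAME * FRAME),
    nn.Unflatten(1, (1, FRAME, FRAME)),
)

responses = torch.randn(32, NUM_CELLS)   # a batch of population responses
frames = decoder(encoder(responses))     # (32, 1, 64, 64) predicted frames
target = torch.randn_like(frames)        # placeholder for the true frames
loss = nn.functional.mse_loss(frames, target)
loss.backward()                          # end-to-end training signal
```

The transfer-learning test in the abstract would correspond to freezing a latent space trained on one movie and checking how well it indexes time in another.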
  2. Abstract

    Hydrologic signatures are quantitative metrics that describe a streamflow time series. Examples include annual maximum flow, baseflow index and recession shape descriptors. In this paper, we use machine learning (ML) to learn encodings that are optimal ML equivalents of hydrologic signatures, and that are derived directly from the data. We compare the learned signatures to classical signatures, interpret their meaning, and use them to build rainfall‐runoff models in otherwise ungauged watersheds. Our model has an encoder–decoder structure. The encoder is a convolutional neural net mapping historical flow and climate data to a low‐dimensional vector encoding, analogous to hydrological signatures. The decoder structure includes stores and fluxes similar to a classical hydrologic model. For each timestep, the decoder uses current climate data, watershed attributes and the encoding to predict the coefficients that distribute precipitation between stores, together with the store outflow coefficients. The model is trained end‐to‐end on the U.S. CAMELS watershed data set to minimize streamflow error. We show that learned signatures can extract new information from streamflow series, because using learned signatures as input to the process‐informed model improves prediction accuracy over benchmark configurations that use classical signatures or no signatures. We interpret learned signatures by correlation with classical signatures, and by using sensitivity analysis to assess their impact on modeled store dynamics. Learned signatures are spatially correlated and relate to streamflow dynamics including seasonality, high and low extremes, baseflow and recessions. We conclude that process‐informed ML models and other applications using hydrologic signatures may benefit from replacing expert‐selected signatures with learned signatures.
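A minimal sketch (PyTorch) of the structure this abstract describes: a 1-D convolutional encoder maps a historical flow/climate series to a small "learned signature" vector, and a per-timestep decoder combines current climate, watershed attributes, and the signature to predict coefficients for conceptual stores. All dimensions and the two-store layout are illustrative assumptions.

```python
import torch
import torch.nn as nn

SIG_DIM, N_ATTR, N_CLIM, N_STORES = 16, 10, 3, 2

encoder = nn.Sequential(                       # (B, channels, T) -> (B, SIG_DIM)
    nn.Conv1d(N_CLIM + 1, 32, 7, stride=2), nn.ReLU(),
    nn.Conv1d(32, 32, 7, stride=2), nn.ReLU(),
    nn.AdaptiveAvgPool1d(1), nn.Flatten(),
    nn.Linear(32, SIG_DIM),
)
decoder = nn.Sequential(                       # per-timestep coefficient head
    nn.Linear(SIG_DIM + N_ATTR + N_CLIM, 64), nn.ReLU(),
    nn.Linear(64, 2 * N_STORES),               # split fractions + outflow rates
    nn.Sigmoid(),                              # keep coefficients in (0, 1)
)

history = torch.randn(8, N_CLIM + 1, 3650)     # ~10 years of flow + climate
signature = encoder(history)                   # learned hydrologic signatures
step_in = torch.cat([signature,
                     torch.randn(8, N_ATTR),   # watershed attributes
                     torch.randn(8, N_CLIM)],  # current-timestep climate
                    dim=-1)
coeffs = decoder(step_in)                      # distributes rain, drains stores
```

In the paper's setup the coefficients would drive a differentiable store-and-flux model whose simulated streamflow is compared against observations, so gradients flow back through the decoder into the signature encoder.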

     
  3. This paper studies the unsupervised cross-domain translation problem by proposing a generative framework in which the probability distribution of each domain is represented by a generative cooperative network that consists of an energy-based model and a latent variable model. The use of a generative cooperative network enables maximum likelihood learning of the domain model by MCMC teaching, where the energy-based model seeks to fit the data distribution of the domain and distills its knowledge to the latent variable model via MCMC. Specifically, in the MCMC teaching process, the latent variable model, parameterized by an encoder-decoder, maps examples from the source domain to the target domain, while the energy-based model further refines the mapped results by Langevin revision such that the revised results match the examples in the target domain in terms of the statistical properties defined by the learned energy function. To build up a correspondence between two unpaired domains, the proposed framework simultaneously learns a pair of cooperative networks with cycle consistency, accounting for a two-way translation between the two domains, by alternating MCMC teaching. Experiments show that the proposed framework is useful for unsupervised image-to-image translation and unpaired image sequence translation.
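A minimal sketch (PyTorch) of the Langevin revision step this abstract describes: a generator (the latent variable model) maps a source-domain image to an initial translation, and the energy-based model refines it with a few Langevin steps. The step size, step count, and network shapes are illustrative assumptions; the full framework also trains both models by MCMC teaching and adds a cycle-consistent reverse pair.

```python
import torch
import torch.nn as nn

generator = nn.Sequential(                 # source image -> initial translation
    nn.Conv2d(3, 32, 3, padding=1), nn.ReLU(),
    nn.Conv2d(32, 3, 3, padding=1),
)
energy = nn.Sequential(                    # scalar energy of a target image
    nn.Conv2d(3, 32, 3, stride=2, padding=1), nn.ReLU(),
    nn.Flatten(), nn.LazyLinear(1),
)

def langevin_revise(x, n_steps=15, step=0.01):
    """Refine generator output so it better matches the target-domain energy."""
    x = x.clone().requires_grad_(True)
    for _ in range(n_steps):
        grad = torch.autograd.grad(energy(x).sum(), x)[0]
        # Gradient descent on the energy plus Gaussian diffusion noise.
        x = x - 0.5 * step ** 2 * grad + step * torch.randn_like(x)
        x = x.detach().requires_grad_(True)
    return x.detach()

source = torch.randn(4, 3, 32, 32)
revised = langevin_revise(generator(source))  # revision targets for "teaching"
```

In MCMC teaching, the revised samples serve as regression targets for the generator while the energy model is updated to separate them from real target-domain data.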
  4. Abstract
    Motivation: Single-cell sequencing assay for transposase-accessible chromatin (scATAC-seq) provides new opportunities to dissect epigenomic heterogeneity and elucidate transcriptional regulatory mechanisms. However, computational modeling of scATAC-seq data is challenging due to its high dimension, extreme sparsity, complex dependencies and high sensitivity to confounding factors from various sources.
    Results: Here, we propose a new deep generative model framework, named SAILER, for analyzing scATAC-seq data. SAILER aims to learn a low-dimensional nonlinear latent representation of each cell that defines its intrinsic chromatin state, invariant to extrinsic confounding factors like read depth and batch effects. SAILER adopts the conventional encoder-decoder framework to learn the latent representation but imposes additional constraints to ensure the independence of the learned representations from the confounding factors. Experimental results on both simulated and real scATAC-seq datasets demonstrate that SAILER learns better and biologically more meaningful representations of cells than other methods. Its noise-free cell embeddings bring significant benefits in downstream analyses: clustering and imputation based on SAILER result in 6.9% and 18.5% improvements over existing methods, respectively. Moreover, because no matrix factorization is involved, SAILER can easily scale to process millions of cells.
    Availability and implementation: The software is publicly available at https://github.com/uci-cbcl/SAILER.
    Supplementary information: Supplementary data are available at Bioinformatics online.
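A minimal sketch (PyTorch) of one common way to encourage the kind of invariance this abstract describes: a VAE encodes a cell's accessibility profile into a latent z, while known confounders (read depth, batch label) are fed to the decoder directly, so reconstruction does not require z to carry them. SAILER's exact objective may differ, and all sizes here are illustrative assumptions.

```python
import torch
import torch.nn as nn

N_PEAKS, Z_DIM, N_CONF = 2000, 10, 2

enc = nn.Linear(N_PEAKS, 2 * Z_DIM)              # outputs mean and log-variance
dec = nn.Sequential(nn.Linear(Z_DIM + N_CONF, 256), nn.ReLU(),
                    nn.Linear(256, N_PEAKS))     # logits over peak accessibility

x = (torch.rand(16, N_PEAKS) < 0.05).float()     # sparse binary scATAC profiles
conf = torch.randn(16, N_CONF)                   # read depth, batch, etc.

mu, logvar = enc(x).chunk(2, dim=-1)
z = mu + torch.randn_like(mu) * (0.5 * logvar).exp()   # reparameterization
recon = dec(torch.cat([z, conf], dim=-1))        # decoder sees the confounders
loss = (nn.functional.binary_cross_entropy_with_logits(recon, x)
        - 0.5 * (1 + logvar - mu.pow(2) - logvar.exp()).mean())  # ELBO
loss.backward()
```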
  5. Learning sensorimotor control policies from high-dimensional images crucially relies on the quality of the underlying visual representations. Prior work shows that structured latent spaces such as visual keypoints often outperform unstructured representations for robotic control. However, most of these representations, whether structured or unstructured, are learned in a 2D space even though the control tasks are usually performed in a 3D environment. In this work, we propose a framework to learn such a 3D geometric structure directly from images in an end-to-end unsupervised manner. The input images are embedded into latent 3D keypoints via a differentiable encoder which is trained to optimize both a multi-view consistency loss and a downstream task objective. These discovered 3D keypoints tend to meaningfully capture robot joints as well as object movements in a consistent manner across both time and 3D space. The proposed approach outperforms prior state-of-the-art methods across a variety of reinforcement learning benchmarks. Code and videos at https://buoyancy99.github.io/unsup-3d-keypoints/
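A minimal sketch (PyTorch) of the multi-view consistency idea this abstract describes: each camera view is encoded into a set of 3D keypoints, the keypoints are mapped into a shared world frame with camera extrinsics, and a consistency loss pulls the per-view predictions together. The encoder shape, keypoint count, and use of known extrinsics are illustrative assumptions.

```python
import torch
import torch.nn as nn

N_KP = 8

encoder = nn.Sequential(                      # image -> N_KP 3-D keypoints
    nn.Conv2d(3, 32, 3, stride=2, padding=1), nn.ReLU(),
    nn.AdaptiveAvgPool2d(1), nn.Flatten(),
    nn.Linear(32, N_KP * 3),
)

def to_world(kp_cam, R, t):
    """Map camera-frame keypoints (B, N, 3) to the world frame."""
    return kp_cam @ R.transpose(-1, -2) + t

views = [torch.randn(4, 3, 64, 64) for _ in range(2)]          # two cameras
extrinsics = [(torch.eye(3), torch.zeros(3)),
              (torch.eye(3), torch.tensor([0.1, 0.0, 0.0]))]

world_kps = []
for img, (R, t) in zip(views, extrinsics):
    kp = encoder(img).view(-1, N_KP, 3)
    world_kps.append(to_world(kp, R, t))

# Multi-view consistency: both cameras should see the same latent keypoints.
consistency_loss = nn.functional.mse_loss(world_kps[0], world_kps[1])
# In training this term is summed with the downstream RL task objective.
```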