

Title: Count Sketch with Zero Checking: Efficient Recovery of Heavy Components
The problem of recovering heavy components of a high-dimensional vector from compressed data is of great interest in a broad range of applications, such as feature extraction under scarce computing memory and distributed learning under limited bandwidth. Recently, a compression algorithm called count sketch has gained wide popularity for recovering heavy components in various fields. In this paper, we carefully analyze count sketch and show that its default recovery method, median filtering, has a distinct error pattern of reporting false positives. To counteract this error pattern, we propose a new scheme called zero checking, which adopts a two-step recovery approach to improve the probability of detecting false positives. Our proposed technique builds on rigorous error analysis, which enables us to optimize the selection of a key design parameter for maximum performance gain. The empirical results show that our scheme achieves better recovery accuracy than median filtering and requires fewer samples to accurately recover heavy components.
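To ground the terminology, below is a minimal count-sketch implementation with the default median-filtering estimator. The `looks_spurious` method is only an illustration of what a second-pass false-positive test could look like; it is not the paper's zero-checking rule, and all names and thresholds here are assumptions.

```python
import numpy as np

class CountSketch:
    """Minimal count sketch: d rows of w signed counters for an n-dim vector."""

    def __init__(self, d, w, n, seed=0):
        rng = np.random.default_rng(seed)
        self.d = d
        # One hash bucket and one random sign per (row, coordinate).
        self.buckets = rng.integers(0, w, size=(d, n))
        self.signs = rng.choice([-1, 1], size=(d, n))
        self.table = np.zeros((d, w))

    def update(self, i, value):
        """Add `value` to coordinate i of the sketched vector."""
        for r in range(self.d):
            self.table[r, self.buckets[r, i]] += self.signs[r, i] * value

    def estimate(self, i):
        """Median filtering: the default count-sketch estimator."""
        return np.median([self.signs[r, i] * self.table[r, self.buckets[r, i]]
                          for r in range(self.d)])

    def looks_spurious(self, i, tau):
        """Illustrative second-pass test (NOT the paper's exact rule):
        a genuinely heavy coordinate should leave a noticeable footprint in
        most rows, so many near-zero counters hint at a false positive."""
        near_zero = sum(abs(self.table[r, self.buckets[r, i]]) < tau
                        for r in range(self.d))
        return near_zero > self.d // 2

# Example: sketch a vector with two heavy entries plus noise, then recover.
cs = CountSketch(d=5, w=64, n=10_000, seed=1)
cs.update(7, 100.0)
cs.update(42, -80.0)
rng = np.random.default_rng(2)
for j in rng.integers(0, 10_000, size=500):
    cs.update(int(j), rng.normal())
heavy = [i for i in range(10_000) if abs(cs.estimate(i)) > 20]
print(heavy)  # expect [7, 42], possibly with occasional false positives
```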
Award ID(s):
1939553 1704274 1741338
NSF-PAR ID:
10273934
Author(s) / Creator(s):
Date Published:
Journal Name:
2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Page Range / eLocation ID:
5120 to 5124
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Chen, Ho-Lin; Evans, Constantine G. (Eds.)
    The field of chemical computation attempts to model computational behavior that arises when molecules, typically nucleic acids, are mixed together. By modeling this physical phenomenon at different levels of specificity, different operative computational behavior is observed. Thermodynamic binding networks (TBNs) form a highly abstracted model that focuses on which molecules are bound to each other in a "thermodynamically stable" sense. Stability is measured based only on how many bonds are formed and how many total complexes are in a configuration, without focusing on how molecules are binding or how they became bound. By abstracting away kinetic processes, TBNs attempt to naturally model the long-term behavior of a mixture (i.e., its thermodynamic equilibrium). We study the problem of signal amplification: detecting a small quantity of some molecule and amplifying its signal to something more easily detectable. This problem has natural applications such as disease diagnosis. By focusing on thermodynamically favored outcomes, we seek to design chemical systems that perform the task of signal amplification robustly, without relying on kinetic pathways that can be error prone and require highly controlled conditions (e.g., PCR amplification). It might appear that a small change in concentrations can result in only small changes to the thermodynamic equilibrium of a molecular system. However, we show that it is possible to design a TBN that can "exponentially amplify" a signal represented by a single copy of a monomer called the analyte: this TBN has exactly one stable state before adding the analyte and exactly one stable state afterward, and those two states "look very different" from each other. In particular, their difference is exponential in the number of types of molecules and their sizes. The system can be programmed to any desired level of resilience to false positives and false negatives. To prove these results, we introduce new concepts to the TBN model, particularly the notion of a TBN's entropy gap, which describes how unlikely the network is to be observed in an undesirable state, and feed-forward TBNs, which have a strong upper bound on the number of polymers in a stable configuration. We also show a corresponding negative result: a doubly exponential upper bound, meaning that no TBN can amplify a signal by an amount more than doubly exponential in the number and sizes of the different molecules that comprise it. We leave closing this gap as an open question: either prove an exponential upper bound, or give a construction with a doubly exponential difference between the stable configurations before and after the analyte is added. Our work informs the fundamental question of how a thermodynamic equilibrium can change as a result of a small change to the system (adding a single molecule copy). While exponential amplification is traditionally viewed as an inherently non-equilibrium phenomenon, we find that, in a strong sense, exponential amplification can occur at thermodynamic equilibrium as well, where the "effect" (e.g., fluorescence) is exponential in the types and complexity of the chemical components.
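For readers new to the model, the stability notions referenced above are usually formalized along these lines (a sketch of the standard TBN definitions from the literature; the notation is mine, not this abstract's):

```latex
% Standard TBN stability notions (sketch; notation assumed).
\text{saturated: } \operatorname{bonds}(\gamma) = \max_{\gamma'} \operatorname{bonds}(\gamma'),
\qquad
\text{stable: } |\gamma| = \max_{\gamma' \text{ saturated}} |\gamma'|
```

Here a configuration γ partitions the monomers into polymers and |γ| counts those polymers; the entropy gap referenced above is the polymer-count margin by which stable configurations beat every other saturated configuration.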
  2. In this article, we investigate the problem of simultaneous change point inference and structure recovery in the context of high-dimensional Gaussian graphical models with possible abrupt changes. In particular, motivated by neighborhood selection, we incorporate a threshold variable and an unknown threshold parameter into a joint sparse regression model that combines p l1-regularized node-wise regression problems. The change point estimator and the corresponding estimated coefficients of the precision matrices are obtained together. Based on this, a classifier is introduced to distinguish whether a change point exists. To recover the graphical structure correctly, a data-driven thresholding procedure is proposed. In theory, under some sparsity conditions and regularity assumptions, our method can correctly choose a homogeneous or heterogeneous model with high accuracy. Furthermore, in the latter case with a change point, we establish estimation consistency of the change point estimator while allowing the number of nodes to be much larger than the sample size. Moreover, it is shown that, in terms of structure recovery of Gaussian graphical models, the proposed thresholding procedure achieves model selection consistency and controls the number of false positives. The validity of our proposed method is justified via extensive numerical studies. Finally, we apply our proposed method to the S&P 500 dataset to show its empirical usefulness.
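A rough rendering of the kind of joint objective this describes (the threshold variable z, threshold parameter τ, shift coefficients δ_j, and tuning parameter λ are my paraphrase of the setup, not the authors' exact formulation):

```latex
\min_{\{\beta_j,\,\delta_j\},\,\tau} \;\sum_{j=1}^{p}
\left[ \frac{1}{n}\sum_{i=1}^{n}
  \Big( x_{ij} - x_{i,-j}^{\top}\big(\beta_j + \delta_j\,\mathbf{1}\{z_i > \tau\}\big) \Big)^{2}
  + \lambda \big( \|\beta_j\|_1 + \|\delta_j\|_1 \big) \right]
```

Each node j is regressed on the remaining nodes x_{i,-j}; a nonzero δ_j signals that node j's neighborhood changes once z_i crosses τ, and thresholding the fitted coefficients yields the recovered graph before and after the change point.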
  3.
    In learning-augmented algorithms, algorithms are enhanced using information from a machine learning algorithm. In turn, this suggests that we should tailor our machine-learning approach for the target algorithm. We here consider this synergy in the context of the learned count-min sketch from (Hsu et al., 2019). Learning here is used to predict heavy hitters from a data stream, which are counted explicitly outside the sketch. We show that an approximately sufficient statistic for the performance of the underlying count-min sketch is given by the coverage of the predictor, or the normalized L1 norm of keys that are filtered by the predictor to be explicitly counted. We show that machine learning models which are trained to optimize for coverage lead to large improvements in performance over prior approaches according to the average absolute frequency error. Our source code can be found at https://github.com/franklynwang/putting-the-learning-in-LAA. 
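The coverage statistic is easy to state in code; a minimal sketch (function and variable names are mine, not from the paper):

```python
def coverage(freq, predicted_heavy):
    """Normalized L1 mass of the keys the learned predictor filters out
    of the sketch for exact counting.

    freq: dict mapping key -> frequency in the stream
    predicted_heavy: set of keys the model flags as heavy hitters
    """
    total = sum(freq.values())
    filtered = sum(f for k, f in freq.items() if k in predicted_heavy)
    return filtered / total if total else 0.0

# Example: a predictor that catches the two heaviest of four keys.
freq = {"a": 50, "b": 30, "c": 15, "d": 5}
print(coverage(freq, predicted_heavy={"a", "b"}))  # 0.8
```

Training the predictor to maximize this quantity, rather than plain heavy/light classification accuracy, is the synergy the abstract refers to.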
  4. We introduce a new sub-linear space sketch, the "Weight-Median Sketch," for learning compressed linear classifiers over data streams while supporting the efficient recovery of large-magnitude weights in the model. This enables memory-limited execution of several statistical analyses over streams, including online feature selection, streaming data explanation, relative deltoid detection, and streaming estimation of pointwise mutual information. Unlike related sketches that capture the most frequently occurring features (or items) in a data stream, the Weight-Median Sketch captures the features that are most discriminative of one stream (or class) compared to another. The Weight-Median Sketch adopts the core data structure used in the Count-Sketch but, instead of sketching counts, captures sketched gradient updates to the model parameters. We provide a theoretical analysis that establishes recovery guarantees for batch and online learning, and we demonstrate empirical improvements in memory-accuracy trade-offs over alternative memory-budgeted methods, including count-based sketches and feature hashing.
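To make the core idea concrete, here is a toy version: a count-sketch table that accumulates SGD gradient updates instead of counts, with weights read back by the median estimator. The logistic-loss update and all parameter choices are illustrative, not the paper's exact algorithm.

```python
import numpy as np

class WeightMedianSketch:
    """Toy sketch of the idea: a count-sketch table that accumulates
    gradient updates to model weights rather than item counts."""

    def __init__(self, d, w, n, lr=0.1, seed=0):
        rng = np.random.default_rng(seed)
        self.buckets = rng.integers(0, w, size=(d, n))
        self.signs = rng.choice([-1, 1], size=(d, n))
        self.table = np.zeros((d, w))
        self.d, self.lr = d, lr

    def weight(self, i):
        """Median-of-signed-counters estimate of the i-th model weight."""
        return np.median([self.signs[r, i] * self.table[r, self.buckets[r, i]]
                          for r in range(self.d)])

    def sgd_step(self, x_idx, x_val, y):
        """One logistic-regression SGD step on a sparse example
        (indices x_idx, values x_val, label y in {-1, +1})."""
        margin = y * sum(v * self.weight(i) for i, v in zip(x_idx, x_val))
        g = -y / (1.0 + np.exp(margin))  # dloss/dmargin for logistic loss
        for i, v in zip(x_idx, x_val):
            for r in range(self.d):
                # Apply the signed, hashed gradient update to the table.
                self.table[r, self.buckets[r, i]] -= \
                    self.lr * g * v * self.signs[r, i]

# One toy update: an example with features {0: 1.0, 3: -2.0} and label +1.
wms = WeightMedianSketch(d=5, w=256, n=1000)
wms.sgd_step(x_idx=[0, 3], x_val=[1.0, -2.0], y=1)
print(wms.weight(0), wms.weight(3))
```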
  5. Detecting valuable anomalies with high accuracy and low latency from large amounts of streaming data is a challenge. This article focuses on a special kind of stream, the catalog stream, whose high-level structure can be leveraged to analyze the stream effectively. We first formulate anomaly detection in catalog streams as a constrained optimization problem based on a catalog stream matrix. Then, a novel filtering-identifying based anomaly detection algorithm (FIAD) is proposed, which includes two complementary strategies: true-event identifying and false-alarm filtering. Different kinds of attention windows are developed to provide corresponding data for the various algorithm components. The identifying strategy captures true events within a much smaller candidate set, while the filtering strategy removes a significant share of false positives. A scalable catalog stream processing framework, CSPF, is designed to support the proposed method efficiently. Extensive experiments are conducted on catalog stream data sets from an astronomy observation. The experimental results show that the proposed method achieves a false-positive rate as low as 0.04%, reduces false alarms by 98.6% compared with existing methods, and handles each catalog with a latency of 2.1 seconds. Furthermore, a total of 36 transient candidates were detected over one observation season.