Approximate Hybrid Binary-Unary Computing with Applications in BERT Language Model and Image Processing

Khataei, Alireza; Singh, Gaurav; Bazargan, Kia

doi:10.1145/3543622.3573181

Citation Details

Approximate Hybrid Binary-Unary Computing with Applications in BERT Language Model and Image Processing

We propose a novel method for approximate hardware implementation of univariate math functions with significantly fewer hardware resources compared to previous approaches. Examples of such functions include exp(x) and the activation function GELU(x), both used in transformer networks, gamma(x), which is used in image processing, and other functions such as tanh(x), cosh(x), sq(x), and sqrt(x). The method builds on previous works on hybrid binary-unary computing. The novelty in our approach is that we break a function into a number of sub-functions such that implementing each sub-function becomes cheap, and converting the output of the sub-functions to binary becomes almost trivial. Our method also uses self-similarity in functions to further reduce the cost. We compare our method to the conventional binary, previous stochastic computing, and hybrid binary-unary methods on several functions at 8-, 12-, and 16-bit resolutions. While preserving high accuracy, our method outperforms previous works in terms of hardware cost, e.g., tolerating less than 0.01 mean absolute error, our method reduces the (area x latency) cost on average by 5, 7, and 2 orders of magnitude, compared to the conventional binary, stochastic computing, and hybrid binary-unary methods, respectively. Ultimately, we demonstrate the potential benefits of our method for natural language processing and image processing applications. We deploy our method to implement major blocks in an encoding layer of BERT language model, and also the Roberts Cross edge detection algorithm. Both include non-linear functions. more »

Award ID(s):: 2016390

PAR ID:: 10478286

Author(s) / Creator(s):: Khataei, Alireza; Singh, Gaurav; Bazargan, Kia

Publisher / Repository:: ACM

Date Published:: 2023-02-12

Journal Name:: FPGA '23: Proceedings of the 2023 ACM/SIGDA International Symposium on Field Programmable Gate Arrays

ISBN:: 9781450394178

Page Range / eLocation ID:: 165 to 175

Subject(s) / Keyword(s):: hardware accelerators, approximate computing, unary computing, stochastic computing, BERT language model, image processing

Format(s):: Medium: X

Location:: Monterey CA USA

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1145/3543622.3573181

More Like this