Learning the sparsity for ReRAM: mapping and pruning sparse neural network for ReRAM based accelerator

Lin, Jilan; Zhu, Zhenhua; Wang, Yu; Xie, Yuan

doi:10.1145/3287624.3287715

Big data computing applications such as deep learning and graph analytic usually incur a large amount of data movements. Deploying such applications on conventional von Neumann architecture that separates the processing units and memory components likely leads to performance bottleneck due to the limited memory bandwidth. A common approach is to develop architecture and memory co-design methodologies to overcome the challenge. Our research follows the same strategy by leveraging resistive memory (ReRAM) to further enhance the performance and energy efficiency. Specifically, we employ the general principles behind processing-in-memory to design efficient ReRAM based accelerators that support both testing and training operations. Related circuit and architecture optimization will be discussed too.

More Like this