
Recent research in information theory has shed new light on ongoing attempts to open the black box of neural signal encoding. Inspired by the problem of lossy signal compression for wireless communication, this chapter introduces the Bitwise Bottleneck approach, which quantizes and encodes neural network activations from a floating-point representation into a low-precision fixed-point representation. Based on rate-distortion theory, the Bitwise Bottleneck attempts to determine the most significant bits in the activation representation by assigning and approximating sparse coefficients associated with the different bits. Under the constraint of a limited average code rate, the bottleneck minimizes the distortion to obtain optimal activation quantization in a flexible layer-by-layer manner. Experiments on ImageNet and other datasets show that, by minimizing the quantization distortion of each layer, a neural network with bottlenecks achieves state-of-the-art accuracy with low-precision activations. Meanwhile, by reducing the code rate, the proposed method improves memory and computational efficiency by more than six times compared with a deep neural network using the standard single-precision representation. This chapter is based on our related publication [1].
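To make the idea of weighting individual bits concrete, the following is a minimal sketch and not the chapter's actual algorithm. It assumes non-negative (post-ReLU) activations, uniform unsigned quantization, and a simple "keep the top-k most significant bits, then refit their coefficients by ordinary least squares" rule in place of the sparse-coefficient solver the abstract refers to. The function names `quantize_unsigned`, `bit_planes`, and `bitwise_bottleneck` are hypothetical and chosen only for this illustration.

```python
import numpy as np


def quantize_unsigned(x, num_bits=8):
    """Uniformly quantize non-negative activations to integers in [0, 2**num_bits - 1]."""
    x = np.asarray(x, dtype=np.float64)
    levels = 2 ** num_bits - 1
    max_val = x.max()
    # Assumption: the scale is set from the observed maximum activation.
    scale = max_val / levels if max_val > 0 else 1.0
    q = np.clip(np.round(x / scale), 0, levels).astype(np.int64)
    return q, scale


def bit_planes(q, num_bits=8):
    """Return an (N, num_bits) 0/1 matrix; column k holds bit k (least significant first)."""
    q = np.asarray(q, dtype=np.int64).ravel()
    return ((q[:, None] >> np.arange(num_bits)) & 1).astype(np.float64)


def bitwise_bottleneck(x, num_bits=8, keep_bits=4):
    """Keep the `keep_bits` most significant bits and refit one coefficient per kept bit.

    The coefficients minimize the squared distortion between the original
    activations and their reconstruction from the kept bit planes.
    """
    x = np.asarray(x, dtype=np.float64).ravel()
    q, _ = quantize_unsigned(x, num_bits)
    planes = bit_planes(q, num_bits)
    kept = np.arange(num_bits - keep_bits, num_bits)  # indices of the kept bits
    coeffs, *_ = np.linalg.lstsq(planes[:, kept], x, rcond=None)
    x_hat = planes[:, kept] @ coeffs
    mse = float(np.mean((x - x_hat) ** 2))
    return kept, coeffs, mse


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    activations = np.maximum(rng.normal(size=2000), 0.0)  # synthetic ReLU-like data
    for k in (2, 4, 8):
        _, _, mse = bitwise_bottleneck(activations, num_bits=8, keep_bits=k)
        print(f"bits kept: {k}, distortion (MSE): {mse:.6f}")
```

In this sketch, the number of kept bits plays the role of the code rate, and the refit mean squared error plays the role of the distortion: lowering `keep_bits` shrinks the representation at the cost of a larger reconstruction error.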
