TY - GEN
T1 - Towards acceleration of deep convolutional neural networks using stochastic computing
AU - Li, Ji
AU - Ren, Ao
AU - Li, Zhe
AU - Ding, Caiwen
AU - Yuan, Bo
AU - Qiu, Qinru
AU - Wang, Yanzhi
N1 - Publisher Copyright:
© 2017 IEEE.
PY - 2017/2/16
Y1 - 2017/2/16
N2 - In recent years, Deep Convolutional Neural Network (DCNN) has become the dominant approach for almost all recognition and detection tasks and outperformed humans on certain tasks. Nevertheless, the high power consumptions and complex topologies have hindered the widespread deployment of DCNNs, particularly in wearable devices and embedded systems with limited area and power budget. This paper presents a fully parallel and scalable hardware-based DCNN design using Stochastic Computing (SC), which leverages the energy-accuracy trade-off through optimizing SC components in different layers. We first conduct a detailed investigation of the Approximate Parallel Counter (APC) based neuron and multiplexer-based neuron using SC, and analyze the impacts of various design parameters, such as bit stream length and input number, on the energy/power/area/accuracy of the neuron cell. Then, from an architecture perspective, the influence of inaccuracy of neurons in different layers on the overall DCNN accuracy (i.e., software accuracy of the entire DCNN) is studied. Accordingly, a structure optimization method is proposed for a general DCNN architecture, in which neurons in different layers are implemented with optimized SC components, so as to reduce the area, power, and energy of the DCNN while maintaining the overall network performance in terms of accuracy. Experimental results show that the proposed approach can find a satisfactory DCNN configuration, which achieves 55X, 151X, and 2X improvement in terms of area, power and energy, respectively, while the error is increased by 2.86%, compared with the conventional binary ASIC implementation.
AB - In recent years, Deep Convolutional Neural Network (DCNN) has become the dominant approach for almost all recognition and detection tasks and outperformed humans on certain tasks. Nevertheless, the high power consumptions and complex topologies have hindered the widespread deployment of DCNNs, particularly in wearable devices and embedded systems with limited area and power budget. This paper presents a fully parallel and scalable hardware-based DCNN design using Stochastic Computing (SC), which leverages the energy-accuracy trade-off through optimizing SC components in different layers. We first conduct a detailed investigation of the Approximate Parallel Counter (APC) based neuron and multiplexer-based neuron using SC, and analyze the impacts of various design parameters, such as bit stream length and input number, on the energy/power/area/accuracy of the neuron cell. Then, from an architecture perspective, the influence of inaccuracy of neurons in different layers on the overall DCNN accuracy (i.e., software accuracy of the entire DCNN) is studied. Accordingly, a structure optimization method is proposed for a general DCNN architecture, in which neurons in different layers are implemented with optimized SC components, so as to reduce the area, power, and energy of the DCNN while maintaining the overall network performance in terms of accuracy. Experimental results show that the proposed approach can find a satisfactory DCNN configuration, which achieves 55X, 151X, and 2X improvement in terms of area, power and energy, respectively, while the error is increased by 2.86%, compared with the conventional binary ASIC implementation.
UR - http://www.scopus.com/inward/record.url?scp=85015332539&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85015332539&partnerID=8YFLogxK
U2 - 10.1109/ASPDAC.2017.7858306
DO - 10.1109/ASPDAC.2017.7858306
M3 - Conference contribution
AN - SCOPUS:85015332539
T3 - Proceedings of the Asia and South Pacific Design Automation Conference, ASP-DAC
SP - 115
EP - 120
BT - 2017 22nd Asia and South Pacific Design Automation Conference, ASP-DAC 2017
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 22nd Asia and South Pacific Design Automation Conference, ASP-DAC 2017
Y2 - 16 January 2017 through 19 January 2017
ER -