TY - GEN
T1 - A Unified DNN Weight Pruning Framework Using Reweighted Optimization Methods
AU - Zhang, Tianyun
AU - Ma, Xiaolong
AU - Zhan, Zheng
AU - Zhou, Shanglin
AU - Ding, Caiwen
AU - Fardad, Makan
AU - Wang, Yanzhi
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021/12/5
Y1 - 2021/12/5
N2 - To address the large model size and intensive computation requirement of deep neural networks (DNNs), weight pruning techniques have been proposed and generally fall into two categories, i.e., static regularization-based pruning and dynamic regularization-based pruning. However, the former method currently suffers either complex workloads or accuracy degradation, while the latter one takes a long time to tune the parameters to achieve the desired pruning rate without accuracy loss. In this paper, we propose a unified DNN weight pruning framework with dynamically updated regularization terms bounded by the designated constraint. Our proposed method increases the compression rate, reduces the training time and reduces the number of hyper-parameters compared with state-of-the-art ADMM-based hard constraint method.
AB - To address the large model size and intensive computation requirement of deep neural networks (DNNs), weight pruning techniques have been proposed and generally fall into two categories, i.e., static regularization-based pruning and dynamic regularization-based pruning. However, the former method currently suffers either complex workloads or accuracy degradation, while the latter one takes a long time to tune the parameters to achieve the desired pruning rate without accuracy loss. In this paper, we propose a unified DNN weight pruning framework with dynamically updated regularization terms bounded by the designated constraint. Our proposed method increases the compression rate, reduces the training time and reduces the number of hyper-parameters compared with state-of-the-art ADMM-based hard constraint method.
UR - http://www.scopus.com/inward/record.url?scp=85119453056&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85119453056&partnerID=8YFLogxK
U2 - 10.1109/DAC18074.2021.9586152
DO - 10.1109/DAC18074.2021.9586152
M3 - Conference contribution
AN - SCOPUS:85119453056
T3 - Proceedings - Design Automation Conference
SP - 493
EP - 498
BT - 2021 58th ACM/IEEE Design Automation Conference, DAC 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 58th ACM/IEEE Design Automation Conference, DAC 2021
Y2 - 5 December 2021 through 9 December 2021
ER -