Thread: Towards fine-grained precision reconfiguration in variable-precision neural network accelerator

Authors: Shichang Zhang, Ying Wang, Xiaoming Chen, Yinhe Han, Yujie Wang

DOI: 10.1587/ELEX.16.20190145

References (25)
Yunji Chen, Tao Luo, Shaoli Liu, Shijin Zhang, Liqiang He, Jia Wang, Ling Li, Tianshi Chen, Zhiwei Xu, Ninghui Sun, Olivier Temam. DaDianNao: A Machine-Learning Supercomputer. International Symposium on Microarchitecture, pp. 609-622 (2014). DOI: 10.1109/MICRO.2014.58
Chen Zhang, Peng Li, Guangyu Sun, Yijin Guan, Bingjun Xiao, Jason Cong. Optimizing FPGA-based Accelerator Design for Deep Convolutional Neural Networks. Field Programmable Gate Arrays, pp. 161-170 (2015). DOI: 10.1145/2684746.2689060
Kilian Weinberger, Wenlin Chen, Yixin Chen, James Wilson, Stephen Tyree. Compressing Neural Networks with the Hashing Trick. International Conference on Machine Learning, pp. 2285-2294 (2015).
Jiaxiang Wu, Cong Leng, Yuhang Wang, Qinghao Hu, Jian Cheng. Quantized Convolutional Neural Networks for Mobile Devices. Computer Vision and Pattern Recognition, pp. 4820-4828 (2016). DOI: 10.1109/CVPR.2016.521
Sajid Anwar, Kyuyeon Hwang, Wonyong Sung. Structured Pruning of Deep Convolutional Neural Networks. ACM Journal on Emerging Technologies in Computing Systems, vol. 13, pp. 32- (2017). DOI: 10.1145/3005348
Song Han, Xingyu Liu, Huizi Mao, Jing Pu, Ardavan Pedram, Mark A. Horowitz, William J. Dally. EIE: Efficient Inference Engine on Compressed Deep Neural Network. International Symposium on Computer Architecture, vol. 44, pp. 243-254 (2016). DOI: 10.1145/3007787.3001163
Sachin S. Talathi, V. Sreekanth Annapureddy, Darryl D. Lin. Fixed Point Quantization of Deep Convolutional Networks. International Conference on Machine Learning, pp. 2849-2858 (2016).
Shuchang Zhou, He Wen, Yuxin Wu, Yuheng Zou, Zekun Ni, Xinyu Zhou. DoReFa-Net: Training Low Bitwidth Convolutional Neural Networks with Low Bitwidth Gradients. arXiv: Neural and Evolutionary Computing (2016).
Yoshua Bengio, Matthieu Courbariaux, Ran El-Yaniv, Itay Hubara, Daniel Soudry. Quantized Neural Networks: Training Neural Networks with Low Precision Weights and Activations. Journal of Machine Learning Research, vol. 18, pp. 6869-6898 (2017).
Xushen Han, Dajiang Zhou, Shihao Wang, Shinji Kimura. CNN-MERP: An FPGA-based Memory-Efficient Reconfigurable Processor for Forward and Backward Propagation of Convolutional Neural Networks. 2016 IEEE 34th International Conference on Computer Design (ICCD), pp. 320-327 (2016). DOI: 10.1109/ICCD.2016.7753296