Distributed arithmetic realisation of cyclic convolution and its DFT application

Authors: H.-C. Chen, J.-I. Guo, C.-W. Jen, T.-S. Chang

DOI: 10.1049/IP-CDS:20041173

Keywords: Discrete Fourier transform; Circuit design; Systolic array; Block diagram; Row and column spaces; Mathematics; Algorithm; Circular convolution; Block (data storage); Arithmetic; Barrel shifter

Abstract: The authors present a new hardware-efficient group distributed arithmetic (GDA) design approach for the one-dimensional (1-D) discrete Fourier transform (DFT). The approach adopts distributed arithmetic (DA) computation and exploits the good features of cyclic convolution to facilitate an efficient realisation of the 1-D N-point DFT using small ROM modules, a barrel shifter, and N accumulators. The proposed GDA design approach is achieved by rearranging the contents of the ROM into several groups such that all the elements in a group can be accessed simultaneously in accumulating all the DFT outputs, to increase the ROM utilisation. Moreover, combining the symmetrical property of the DFT coefficients with the proposed GDA design approach requires only half of the ROM contents to be stored, which further reduces the ROM size by a factor of two. Realisation of the long-length DFT is formulated based on the data permutation of the rows and columns of the DFT matrix to directly partition the long-length DFT into short ones, so that long-length DFTs may be realised efficiently to achieve low hardware cost. This is termed the ‘block-based GDA design approach’. Compared with the existing systolic array designs and DA-based designs, the proposed design can reduce the delay–area product by 29%–68% based on a 0.35 μm CMOS cell library.
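The combination of a ROM, a barrel shifter and N accumulators mentioned in the abstract can be illustrated with a small software model. What follows is a minimal sketch of plain bit-serial distributed arithmetic applied to an N-point cyclic convolution; it is not the authors' GDA design, and it omits the ROM grouping and the coefficient-symmetry reduction. The function names, the integer coefficients and the 8-bit two's-complement input width are assumptions made for illustration. The idea shown is that every row of a cyclic-convolution matrix is a rotation of the first row, so a single ROM can serve all N outputs if its address is rotated by the output index.

```python
def rotate_right(addr, k, n):
    """Rotate an n-bit address right by k positions (software barrel shifter)."""
    k %= n
    mask = (1 << n) - 1
    return ((addr >> k) | (addr << (n - k))) & mask


def build_rom(h):
    """ROM for output 0: word[addr] = sum of h[(-m) % n] over the set bits m of addr."""
    n = len(h)
    c0 = [h[(-m) % n] for m in range(n)]
    return [sum(c0[m] for m in range(n) if (addr >> m) & 1)
            for addr in range(1 << n)]


def da_cyclic_convolution(h, x, bits=8):
    """Compute y[k] = sum_i h[(k - i) % n] * x[i] using ROM look-ups only.

    Assumes every x[i] fits in `bits`-bit two's complement (hypothetical width).
    """
    n = len(h)
    rom = build_rom(h)
    acc = [0] * n                      # one accumulator per output
    for j in range(bits):              # process one bit plane per step
        addr = 0
        for m in range(n):
            addr |= ((x[m] >> j) & 1) << m
        # The most significant bit carries negative weight in two's complement.
        weight = -(1 << j) if j == bits - 1 else (1 << j)
        for k in range(n):
            acc[k] += weight * rom[rotate_right(addr, k, n)]
    return acc


def direct_cyclic_convolution(h, x):
    """Reference implementation used only to check the DA model."""
    n = len(h)
    return [sum(h[(k - i) % n] * x[i] for i in range(n)) for k in range(n)]
```

In this model the ROM has 2^N words, which is practical only for short lengths; this exponential growth is what motivates small ROM modules and the partitioning of long transforms into short ones described in the abstract.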
