Dual-mode floating-point adder architectures

作者: Ahmet Akkaş

DOI: 10.1016/J.SYSARC.2008.05.004

关键词:

摘要: Most modern microprocessors provide multiple identical functional units to increase performance. This paper presents dual-mode floating-point adder architectures that support one higher precision addition and two parallel lower additions. A double implemented with the improved single-path algorithm is modified design a supports both single similar technique used quadruple implements two-path algorithm. The To estimate area worst-case delay, double, quadruple, adders are in VHDL using algorithms. correctness of all designs tested verified through extensive simulation. Synthesis results show designed require roughly 26% more 10% delay than same obtained for requires 33% 35% 13% 18% adders, respectively.

参考文章(38)
A. Klimovitski, Using SSE and SSE2 : Misconceptions and reality Intel Developer Update Magazine. ,(2001)
D.R. Lutz, C.N. Hinds, Accelerating floating-point 3D graphics for vector microprocessors asilomar conference on signals, systems and computers. ,vol. 1, pp. 355- 359 ,(2003) , 10.1109/ACSSC.2003.1291935
Stuart Franklin Oberman, Design issues in high performance floating point arithmetic units Stanford University. ,(1996)
Kurt Keutzer, David Chinnery, Closing the gap between ASIC & custom : tools and techniques for high-performance ASIC design Kluwer Academic Publishers. ,(2002)
E.M. Schwarz, R.M. Smith, C.A. Krygowski, The S/390 G5 floating point unit supporting hex and binary architectures symposium on computer arithmetic. pp. 258- 265 ,(1999) , 10.1109/ARITH.1999.762852
A. Beaumont-Smith, N. Burgess, S. Lefrere, C.C. Lim, Reduced latency IEEE floating-point standard adder architectures symposium on computer arithmetic. pp. 35- 42 ,(1999) , 10.1109/ARITH.1999.762826
Dong-Yan Huang, Xuesong Gong, Daqing Zhou, T. Miki, S. Hotani, Implementation of the MPEG-4 advanced audio coding encoder on ADSP-21060 SHARC international symposium on circuits and systems. ,vol. 3, pp. 544- 547 ,(1999) , 10.1109/ISCAS.1999.778903
V.G. Oklobdzija, An algorithmic and novel design of a leading zero detector circuit: comparison with logic synthesis IEEE Transactions on Very Large Scale Integration Systems. ,vol. 2, pp. 124- 128 ,(1994) , 10.1109/92.273153
Y. Wang, A. Mangaser, P. Srinivasan, A processor architecture for 3D graphics IEEE Computer Graphics and Applications. ,vol. 12, pp. 96- 105 ,(1992) , 10.1109/38.156019
Yun He, Chris H. Q. Ding, Using Accurate Arithmetics to Improve Numerical Reproducibility and Stability in Parallel Applications ieee international conference on high performance computing data and analytics. ,vol. 18, pp. 259- 277 ,(2001) , 10.1023/A:1008153532043