作者: P.-M. Seidel
DOI: 10.1109/MWSCAS.2003.1562547
关键词:
摘要: We propose optimizations for the IEEE floating-point fused multiply-add operation by considering multiple exclusive parallel computation paths in implementation. For proposed design we can show a significant performance improvement over conventional implementations. Considering variable latency implementation allows further reduction of average latency.