作者: Jordan Vrtanoski , Toni Draganov Stojanovski
DOI: 10.1109/TELFOR.2012.6419306
关键词:
摘要: OpenCL platform provides unified development environment for various multicore processors. In this paper, we evaluate the framework application in pattern recognition. We have selected most common algorithm Artificial Neural Networks (ANN) training — backpropagation parallelization with because of its high demand processing resources. will show a SIMD version suitable implementation. Our implementation showed 25.8 speedup execution on ATI 5870 GPU compared to Intel Xeon W3530 when MNIST handwritten digits data set.