A Nearest Neighbor Data Structure for Graphics Hardware

作者: Lawrence Cayton

DOI:

关键词: GraphicsDegree of parallelismCUDAComputer scienceParallel computingData structureNearest neighbor searchBottleneckMassively parallelGraphics hardware

摘要: Nearest neighbor search is a core computational task in database systems and throughout data analysis. It also major bottleneck, hence an enormous body of research has been devoted to structures algorithms for accelerating the task. Recent advances graphics hardware provide tantalizing speedups on variety tasks suggest alternate approach problem: simply run brute force massively parallel system. In this paper we marry approaches with novel structure that can effectively make use such as cards. The architectural complexities hardware—the high degree parallelism, small amount memory relative instruction throughput, single instruction, multiple design—present significant challenges design. Furthermore, applies perfectly hardware, leading one question whether intelligent algorithm or even hope outperform basic approach. Despite these misgivings, demonstrate our structure—termed Random Ball Cover—provides over GPUbased

参考文章(21)
Stephen M. Omohundro, Five Balltree Construction Algorithms ,(2009)
Sergey Brin, Near Neighbor Search in Large Metric Spaces very large data bases. pp. 574- 584 ,(1995)
C. Lauterbach, Q. Mo, D. Manocha, gProximity: Hierarchical GPU-based Operations for Collision and Distance Queries Computer Graphics Forum. ,vol. 29, pp. 419- 428 ,(2010) , 10.1111/J.1467-8659.2009.01611.X
Duy Nguyen-Tuong, Jan Peters, Using model knowledge for learning inverse dynamics international conference on robotics and automation. pp. 2677- 2682 ,(2010) , 10.1109/ROBOT.2010.5509858
Robert Krauthgamer, James R. Lee, Navigating nets: simple algorithms for proximity search symposium on discrete algorithms. pp. 798- 807 ,(2004) , 10.5555/982792.982913
Jeffrey K. Uhlmann, Satisfying general proximity / similarity queries with metric trees Information Processing Letters. ,vol. 40, pp. 175- 179 ,(1991) , 10.1016/0020-0190(91)90074-R
James W. Demmel, Vasily Volkov, Benchmarking GPUs to tune dense linear algebra ieee international conference on high performance computing data and analytics. pp. 31- ,(2008) , 10.5555/1413370.1413402
Tim Foley, Jeremy Sugerman, KD-tree acceleration structures for a GPU raytracer siggraph eurographics conference on graphics hardware. pp. 15- 22 ,(2005) , 10.1145/1071866.1071869
Nadathur Satish, Mark Harris, Michael Garland, Designing efficient sorting algorithms for manycore GPUs international parallel and distributed processing symposium. pp. 1- 10 ,(2009) , 10.1109/IPDPS.2009.5161005