中国科学技术大学学报 ›› 2017, Vol. 47 ›› Issue (10): 808-816.DOI: 10.3969/j.issn.0253-2778.2017.10.002

• 论著 • 上一篇    下一篇

基于OpenCL的加速鲁棒特征算法并行实现

郭景,陈贤富   

  1. 中国科学技术大学信息科学技术学院,安徽合肥 230027
  • 收稿日期:2017-04-01 修回日期:2017-08-05 出版日期:2017-10-31 发布日期:2017-10-31
  • 通讯作者: 陈贤富
  • 作者简介:郭景,男,1992年生,硕士生.研究方向:图像匹配、目标跟踪.E-mail:guojing6@mail.ustc.edu.cn

Parallel implementation of surf algorithm based on OpenCL

GUO Jing, CHEN Xianfu   

  1. School of Information Science and Technology, University of Science and Technology of China, Hefei 230027, China)
  • Received:2017-04-01 Revised:2017-08-05 Online:2017-10-31 Published:2017-10-31

摘要: 加速鲁棒特征算法(speed up robust features, SURF)的时间复杂度大,传统串行计算的方法,实时性难以保证.针对上述问题,提出一种基于OpenCL架构的SURF并行实现方法.首先对算法中的积分图的计算、Hessian响应图、特征点主方向、特征点描述等步骤实施数据并行和任务并行处理,并给出详细的算法流程.接着从OpenCL架构的数据传输、内存访问以及负载均衡等方面优化算法性能.实验结果表明,该算法对不同分辨率的图片均实现了10倍以上的加速比,一些高分辨率的图片甚至可以达到39.5倍,并且算法适用于多种通用计算平台.

关键词: 加速鲁棒特征, 开放运算语言, 图像处理器, 并行计算

Abstract: SURF algorithm has high computational complexity and can not meet the real-time requirement. To solve these problems, a parallel SURF algorithm based on OpenCL was presented. Firstly, data parallelism and task parallelism model were applied to the calculations of the integral images, Hessian detector, orientation and descriptor, and the detailed algorithm process was given. Secondly, the performance of the parallel algorithm was optimized from data transmission, memory access and load balancing. The experimental results show that the algorithm can achieve more than 10 times speedup for images with different resolution, and some high-resolution images can even reach up to 39.5 times. Furthermore, it can be applied to a variety of general purpose computing platforms.

Key words: SURF, OpenCL, GPU, parallel computing

中图分类号: