Computer and Modernization

• Applications and Development

GPU-based Parallel Algorithm for Cross-correlation Extrapolation

  

  1.  
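    (School of Atmospheric Science, Nanjing University of Information Science & Technology, Nanjing, Jiangsu 210044, China)
  • Received: 2013-10-29 Published: 2014-02-14 Released: 2014-02-14
  • About the authors: WANG Xing (王兴, 1983-), male, from Taizhou, Jiangsu, PhD candidate at the School of Atmospheric Science, Nanjing University of Information Science & Technology, research interest: meteorological information security technology; WANG Jiejun (王介君, 1982-), male, from Qingdao, Shandong, PhD candidate, research interest: atmospheric remote sensing science and technology; SUN Ning (孙宁, 1982-), male, from Yixing, Jiangsu, lecturer, PhD, research interest: climate change; WANG Yao (汪瑶, 1986-), female, from Siyang, Jiangsu, teaching assistant, master's degree, research interest: atmospheric science.
  • Funding:
    Jiangsu Province 2012 Graduate Research and Innovation Program for Regular Universities (CXZZ12_0513); National Key Technology R&D Program of China (2012BAH05B01)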

 
GPU-based Parallel Algorithm for Cross-correlation Extrapolation

  1.  
    (School of Atmospheric Science, Nanjing University of Information Science & Technology, Nanjing 210044, China)
  • Received: 2013-10-29 Online: 2014-02-14 Published: 2014-02-14

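Abstract: To overcome the high time complexity and long running time of the cross-correlation extrapolation algorithm, a fast GPU-based parallel algorithm is proposed and applied to the extrapolation forecast of cloud-to-ground (CG) lightning strike locations. The serial algorithm flow is analyzed first, and the algorithm is then analyzed and designed for parallel execution. Targeting the hardware architecture of AMD GPUs, OpenCL is used to further optimize the algorithm in terms of data transfer between main memory and device memory and of device memory access patterns. Finally, observed CG lightning monitoring data are compared with the extrapolation results computed by the algorithm, and the computational efficiency of the serial and parallel algorithms is analyzed at different precisions. The experimental results show that the algorithm makes full use of the GPU's parallel computing capability, increasing the calculation speed by nearly 17 times.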

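Key words: graphics processing unit, parallel computing, cross-correlation extrapolation, lightning extrapolation, Open Computing Language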

Abstract: To overcome the high time complexity and long running time of the cross-correlation extrapolation algorithm, a fast GPU-based parallel algorithm is presented and applied to extrapolating the locations of cloud-to-ground (CG) lightning flashes. We first analyze the serial algorithm flow and then design its parallel counterpart. Targeting the hardware architecture of AMD GPUs, we use OpenCL to further optimize the algorithm in two respects: data transfer between main memory and device memory, and device memory access patterns. Finally, we compare CG flash monitoring data against the extrapolation results computed by the algorithm, and analyze the efficiency of the serial and parallel algorithms at different precisions. The experimental results indicate that the algorithm takes full advantage of the GPU's parallel computing capability, increasing the calculation speed by nearly 17 times.
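The abstract does not give the algorithm's details, so the following is only a minimal serial sketch of the general cross-correlation extrapolation idea, assuming two successive gridded fields (for example, flash density maps) and a single motion vector for the whole grid. The function names `best_shift` and `extrapolate`, the grid layout, and the search radius are illustrative assumptions, not the authors' implementation or their OpenCL kernels.

```python
import numpy as np


def best_shift(prev, curr, max_shift):
    """Find the (dy, dx) displacement that best maps `prev` onto `curr`.

    Every candidate shift within +/- max_shift is scored by the Pearson
    correlation between the shifted window of `prev` and the interior of
    `curr`. Hypothetical helper for illustration only.
    """
    h, w = prev.shape
    m = max_shift
    core = curr[m:h - m, m:w - m]
    b = core - core.mean()
    best, best_r = (0, 0), -np.inf
    # Each (dy, dx) candidate is scored independently of the others,
    # which is why this search lends itself to one GPU work-item per shift.
    for dy in range(-m, m + 1):
        for dx in range(-m, m + 1):
            # Window of prev that lands on the interior after moving by (dy, dx)
            win = prev[m - dy:h - m - dy, m - dx:w - m - dx]
            a = win - win.mean()
            denom = np.sqrt((a * a).sum() * (b * b).sum())
            if denom == 0:
                continue  # constant window: correlation undefined
            r = (a * b).sum() / denom
            if r > best_r:
                best_r, best = r, (dy, dx)
    return best, best_r


def extrapolate(field, shift):
    """Move `field` by `shift` = (dy, dx), filling vacated cells with zeros."""
    dy, dx = shift
    h, w = field.shape
    out = np.zeros_like(field)
    dst_y = slice(max(dy, 0), h + min(dy, 0))
    dst_x = slice(max(dx, 0), w + min(dx, 0))
    src_y = slice(max(-dy, 0), h + min(-dy, 0))
    src_x = slice(max(-dx, 0), w + min(-dx, 0))
    out[dst_y, dst_x] = field[src_y, src_x]
    return out


if __name__ == "__main__":
    prev = np.zeros((20, 20))
    prev[5:8, 6:9] = np.arange(1, 10).reshape(3, 3)
    curr = extrapolate(prev, (2, 1))       # synthetic "next observation"
    shift, r = best_shift(prev, curr, 3)   # recover the motion vector
    forecast = extrapolate(curr, shift)    # advect one more time step
    print(shift, forecast.sum())
```

The exhaustive search costs on the order of (2m+1)² correlation evaluations per grid, each touching every interior cell, which is consistent with the high time complexity the abstract attributes to the serial algorithm.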

Key words: GPU, parallel computing, cross-correlation extrapolation, lightning extrapolation, OpenCL

CLC number: