计算机与现代化

• 算法设计与分析 • 上一篇    下一篇

基于OpenCL的雷达外推算法改进与优化

  

  1. (1.南京信息工程大学大气科学学院,江苏南京210044;2.南京信大气象科技有限公司,江苏南京210044)
  • 收稿日期:2014-05-12 出版日期:2014-08-15 发布日期:2014-08-19
  • 作者简介:王兴(1983-),男,江苏泰州人,南京信息工程大学大气科学学院博士研究生,研究方向:气象信息安全技术; 苗春生 (1954-),男,内蒙古呼和浩特人,教授,博士生导师,研究方向:中尺度天气动力学,天气气候预测和现代教育技术; 王秀君 (1988-),女,江苏盐城人,硕士,研究方向:3S集成与气象应用。
  • 基金资助:
    江苏省2012年度普通高校研究生科研创新计划项目(CXZZ12_0513); 国家科技支撑计划项目(2012BAH05B01)

Improvement and Optimization on Radar Extrapolation Algorithm Based on OpenCL

  1. (1. School of Atmospheric Science, Nanjing University of Information Science & Technology, Nanjing 210044, China;

    2. Nanjing Xinda Meteorological Science and Technology Co. Ltd., Nanjing 210044, China)
  • Received:2014-05-12 Online:2014-08-15 Published:2014-08-19

摘要:

基于雷达资料的外推是临近预报中重要的方法之一,随着全国气象雷达网络建设规模的不断提高以及观测资料精细化程度的提升
,基于区域乃至全国雷达拼图的外推预报,每次计算都需花费大量时间,甚至滞后于每6分钟一次的资料观测频次。为解决传统外推算法
运算复杂度高,实时性差的问题,运用OpenCL构建基于GPU的异构计算模型对外推算法进行并行化改进。然后逐步分析影响算法性能的瓶
颈,并通过改进和测试数据比对,阐述算法优化的过程。其中,内存与线程的映射优化、合理利用局部存储器作为高速缓存以及隐藏CPU
执行时间等方法不仅对本算法的执行效率带来显著提升,也可为其他基于OpenCL异构计算的优化提供参考。以AMD Graphic Core Next
和Northern Islands二代GPU架构作为测试平台,并以Intel CPU并行计算作为测试参考,测试结果表明,改进后的算法在硬件同等功耗
的情况下,计算性能提升15~22倍。

关键词: 雷达外推算法, 开放运算语言, 并行化计算, 异构计算, 图形处理器

Abstract:

 Extrapolation based on radar data is one of the important methods for weather nowcasting. With the
increasing scale of the national weather radar network construction, and the enhancing about the refinement of
meteorological observational data, the extrapolation forecast based on regional and even national radar puzzle,
the computation time is very long. The time waiting for extrapolation computation usually lags behind the data’s
observation frequency which is once every six minutes. To solve the problem that the traditional extrapolation
algorithm is of high computational complexity and poor real-time, this paper discusses the heterogeneous computing
model based on GPU, and presents a parallel algorithm with OpenCL to achieve high performance, then analyzes the
bottlenecks of this application, and discusses how to bring up the computation speed by algorithm process
improvement and test data comparison. Some methods such as optimizing the mapping relationship of memory and
threads, utilizing local memory as high speed cache, and hiding CPU execution time, not only bring the efficiency
of the algorithm significantly improved, but also provide a reference for other optimization based on OpenCL
heterogeneous computing. Using AMD Graphic Core Next and Northern Islands which are two generation GPU
architectures as test platforms, and using Intel CPU parallel computing as a test reference, the test results show
that the improved algorithm consuming the same power dissipation under different hardware, the computing
performance is improved 15-22 times.

Key words: radar extrapolation algorithm, OpenCL, parallel computing, heterogeneous computing, GPU

中图分类号: