计算机与现代化 ›› 2021, Vol. 0 ›› Issue (06): 91-95.

• 网络与通信 • 上一篇    下一篇

基于HBase的大数据架构下负载平衡技术

  

  1. (天津市气象局信息中心,天津  300074)
  • 出版日期:2021-07-05 发布日期:2021-07-05
  • 作者简介:雷鸣(1976—),男,山东莱州人,高级工程师,博士,研究方向:大数据应用,图像匹配,E-mail: lmeagle01@163.com; 姜罕盛(1986—),男,天津人,工程师,硕士,研究方向:大数据应用,E-mail: sembiyjhs@163.com。
  • 基金资助:
    国家自然科学基金资助项目(41575156);  天津市气象局科研项目(201914ybxm12)

Load Balancing Technology Under Big Data Architecture Based on HBase

  1. (Tianjin Meteorological Information Center, Tianjin 300074, China)
  • Online:2021-07-05 Published:2021-07-05

摘要: 随着气象数据规模和种类的不断增长,气象数据已经逐渐进入海量服务阶段,而基于大数据背景提供更敏捷的数据服务已经成为业务发展的迫切需求。本文针对气象中的半/非结构化数据,提出基于HBase系统的负载平衡算法和策略。在实际测试对比中发现,系统可以满足200多万个格点,100个并发的场景,查询速度在2 s以内,与未曾增加负载平衡算法相比,系统数据响应速度提升了42.69倍,能够有效地满足实际业务需要。

关键词: 负载平衡, HBase, 分布式存储, 大数据, HDFS

Abstract:  With the continuous growth of the scale and type of meteorological data, meteorological data has gradually entered the stage of massive service, and meanwhile providing more agile data services based on the background of big data has become an urgent demand for business development. In this paper, a load balancing algorithm and strategy based on HBase system is proposed for semi/unstructured meteorological data. In the actual test and comparison, it is found that the system can meet more than 2 million grid points and 100 concurrent scenarios, and the query speed is within 2 s. Compared with the load balancing algorithm which is not added load balance, the system data response speed is improved by 42.69 times, which can effectively meet the actual business needs.

Key words: load balance, HBase, distributed storage, big data, HDFS