计算机与现代化 ›› 2023, Vol. 0 ›› Issue (02): 28-33.

• 数据库与数据挖掘 • 上一篇    下一篇

基于半监督聚类的通信缺陷研判知识库构建及迭代技术

  

  1. (1. 国网江苏省电力有限公司信息通信分公司,江苏 南京 210024; 2. 南京南瑞信息通信科技有限公司,江苏 南京 210012; 3. 北京邮电大学网络与交换国家重点实验室,北京100876)
  • 出版日期:2023-04-10 发布日期:2023-04-10
  • 作者简介:洪涛(1994—),男,安徽南陵人,工程师,硕士,研究方向:电力通信,人工智能,E-mail: 1101650439@qq.com; 通信作者: 朱鹏宇(1993—),男,江苏盐城人,工程师,硕士,研究方向:电力通信,人工智能,E-mail: pyzhu2016@163.com; 郭波(1977—),男,江苏南京人,高级工程师,硕士,研究方向:电力通信,自动化,人工智能,E-mail: 13951983232@163.com; 王敬宇(1978—),男,北京人,教授,博士,研究方向:通信网络,人工智能,E-mail: wangjingyu@ebupt.com。
  • 基金资助:
    国家重点研发计划专项资助(2020YFB1807801);国家自然科学基金资助项目(62071067, 62001054);国家电网公司科技项目(5700-202040367A-0-0-00)

Communication Fault Diagnosis Knowledge Base Construction and Iteration Based on Semi-supervised Clustering

  1. (1. Information and Communication Branch of State Grid Jiangsu Electric Power Co., Ltd., Nanjing 210024, China;
    2. Nanjing NARI Information and Communication Technology Co., Ltd., Nanjing 210012, China; 3. State Key Laboratory of Networking and Switching Technology, Beijing University of Posts and Telecommunications, Beijing 100876, China1)
  • Online:2023-04-10 Published:2023-04-10

摘要: 电力通信网是电力系统不可缺少的重要组成部分,是电网调度自动化和生产管理现代化的基础,是确保电网安全、经济、稳定运行的重要技术手段。传统的通信缺陷研判工作依赖人工经验,难以满足日益庞大、复杂的通信网络安全生产需求。而基于规则引擎或神经网络等方法在生产环境应用中逐渐遇到瓶颈,样本较少难以训练,且作为黑盒较难在生产环境中独立使用。针对上述问题,本文提出基于改进马尔科夫聚类的告警聚类算法和基于序列相似性计算与OPTICS聚类的缺陷研判算法,以适应当前缺陷数据小样本场景,在上述算法结果基础上,利用少量的缺陷单标签构建缺陷研判知识库及其迭代学习机制,通过实际生产积累的数据进行验证,结果表明相关算法及其知识库在应对实际生产问题时效果良好。

关键词: 电力通信, 缺陷研判, 知识库, 半监督聚类

Abstract: Power communication network is an indispensable and important part of power system, it’s the basis of power grid dispatching automation and production management modernization, and an important technical means to ensure the safe, economic and stable operation of power grid. The diagnosis of communication faults still depends on manual experience, which is difficult to meet the safety production needs of increasingly large and complex communication network. Methods based on rule engine or neural network gradually encounter bottlenecks in the application of production environment. It is difficult to train due to less samples, or work independently in production environment as a black box. To solve the above problems, this paper proposes an alarm clustering algorithm based on improved Markov-clustering and a fault diagnosis algorithm based on sequence similarity and OPTICS clustering, adapt to the current small sample scenario of fault data. On the basis of the above algorithm results, the fault diagnosis knowledge base and its iterative learning mechanism are constructed by using a small number of labels. It is verified by the data accumulated in actual production. The results show that the relevant algorithms and knowledge base have a good effect in dealing with actual production problems.

Key words: power communication, fault diagnosis, knowledge base, semi-supervised clustering