计算机与现代化

• 信息系统 • 上一篇    下一篇

基于改进Leaders算子的审计潜在疑点发现

  

  1. (1.广东电网有限责任公司云浮供电局,广东云浮527300;2.广东电网有限责任公司信息中心,广东广州510620;
    3.广东电网有限责任公司揭阳供电局,广东揭阳522000;4.广东电网有限责任公司,广东广州510620)
  • 出版日期:2018-04-28 发布日期:2018-05-02
  • 作者简介:邵锦炜(1991),男,广东云浮人,广东电网有限责任公司云浮供电局助理工程师,本科,研究方向:信息系统管理,项目管理; 林俊(1985),男,福建泉州人,广东电网有限责任公司信息中心工程师,硕士,研究方向:电气工程,项目管理,信息安全; 刘雅婷(1989),女,广东揭阳人,广东电网有限责任公司揭阳供电局助理工程师,本科,研究方向:信息系统管理,信息安全; 肖嘉丽(1980),女,广东广州人,广东电网有限责任公司高级经济师,硕士,研究方向:企业管理。

Detection of Potential Audit Doubts Based on Improved Leaders Operator

  1. (1. Yunfu Power Supply Bureau, Guangdong Power Grid Co. Ltd.〖KG-*4〗, Yunfu 527300, China;
    2. Information Center, Guangdong Power Grid Co. Ltd.〖KG-*4〗, Guangzhou 510620, China;
    3. Jieyang Power Supply Bureau, Guangdong Power Grid Co. Ltd.〖KG-*4〗, Jieyang 522000, China;
    4. Guangdong Power Grid Co. Ltd.〖KG-*4〗, Guangzhou 510620, China) 
     
  • Online:2018-04-28 Published:2018-05-02

摘要: 数据库查询方法审计疑点发现依赖于审计人员先验知识,当经验不足且审计数据量巨大时,难以发挥大数据优势并从海量数据中发现疑点。为解决这一问题,提出基于改进Leaders算子迭代聚类的审计大数据潜在疑点发现方法。该方法在无先验知识的情形下,通过Leaders算法自动完成审计大数据的初始聚类,在此基础上通过随机抽样融合方法对初始聚类结果优化,最后通过多次迭代聚类的方法,对实例数较少或可疑程度易被掩盖的小簇进一步聚类,实现审计大 数据的精确聚类,并将实例较少且行为明显异常的数据聚类识别为潜在疑点,配合审计人员审计经验快速精确定位审计疑点。实验结果验证了算法的有效性,表明算法有助于从海量数据中自主发现审计疑点,缩小疑点筛查范围,提高审计效率。

关键词: 计算机辅助审计, 潜在审计疑点发现, 改进Leaders算子, 抽样融合, 迭代聚类

Abstract:  Database query method audit doubts discovery relies on the prior knowledge of the auditors, but when the auditors have not enough audit experience and the amount of audit data is huge, it is difficult to take advantage of big data and find the audit doubts from the massive data. And so, in order to solve this problem, a method based on improved Leaders operator and iterative clustering is proposed. In the absence of prior knowledge, Leaders algorithm is used for automatic initial clustering of large audit data, and then, the random sampling fusion method is introduced to optimize the clustering results based on that initial clustering center, finally, the multiple iterative clustering method is used to further find the small clusters with fewer or doubtful instances and thus the accurate clustering of large audit data is achieved. The data clusters with fewer instances or obviously abnormal behavior are identified as potential audit doubts, which can cooperate with audit experience to assist auditors to locate audit doubtful points quickly and accurately. Experimental results verify the effectiveness of the proposed algorithm, and show that the proposed algorithm is helpful to find out the audit doubts from the mass data, narrow the scope of doubts screening and improve the audit efficiency.

Key words: computer aided audit, detection of potential audit doubts, improved Leaders operator, data sampling and fusion, iterative clustering

中图分类号: