计算机与现代化 ›› 2022, Vol. 0 ›› Issue (12): 13-17.

• 算法设计与分析 • 上一篇    下一篇

基于半监督学习的学生消费数据异常检测

  

  1. (1.中国石油大学(华东)信息化建设处,山东青岛266580;2.中国石油大学(华东)计算机与通信工程学院,山东青岛266580)
  • 出版日期:2023-01-04 发布日期:2023-01-04
  • 作者简介:宋晓丽(1986—),女,山东潍坊人,工程师,硕士,研究方向:高校信息化,E-mail: songxiaoli@upc.edu.cn; 张勇波(1977—),男,高级工程师,硕士,研究方向:高校信息化,E-mail: zhangyb@upc.edu.cn; 张培颖(1981—),男,副教授,博士,研究方向:语义计算,未来网络架构,E-mail: zhangpeiying@upc.edu.cn。
  • 基金资助:
    山东省自然科学基金资助项目(ZR2020MF006); 中国高校产学研创新基金资助项目(2021FNA01001)

Anomaly Detection of Student Consumption Data Based on Semi-supervised Learning

  1. (1. Information Construction Department, China University of Petroleum (East China), Qingdao 266580, China; 
    2. College of Computer and Communication Engineering, China University of Petroleum (East China), Qingdao 266580, China)
  • Online:2023-01-04 Published:2023-01-04

摘要: 随着校园卡的应用场景越来越广泛,校园卡的资金安全问题日益突出,校园卡欺诈不但给师生和校内商家带来经济损失,还会危害校园的正常秩序。针对传统异常检测方法无法有效提取学生消费数据时序特征的问题,提出一种基于半监督学习的学生消费数据异常检测方法。首先,利用门控循环单元改进自编码器,使得模型可以更准确地进行消费数据的重构;然后,采用马氏距离计算重构误差,计算Fβ-分数确定误差阈值,进行异常数据的检测;最后,利用所提方法对某高校的学生消费数据进行异常检测实验。实验结果表明,所提方法具有更优越的检测性能。

关键词: 深度学习, 数据挖掘, 自编码器, 异常检测

Abstract: With the more and more extensive application scenarios of campus card, the problem of capital security of campus card has become increasingly prominent. Campus card fraud will not only bring economic losses to teachers, students and businesses in the school, but also endanger the normal order of the campus. Aiming at the problem that the traditional anomaly detection method can not effectively extract the temporal feature of student consumption data, this paper proposes an anomaly detection method of student consumption data based on semi-supervised learning. Firstly, the auto-encoder is enhanced with the Gated Recurrent Unit, so that the model can reconstruct the consumption data more accurately. Then, the reconstruction error is calculated by Mahalanobis Distance, and the error threshold is determined by Fβ-Socre to detect abnormal data. Finally, the proposed method is used to detect the anomaly of student consumption data in a university. Experimental results show that the proposed method has better detection performance.

Key words: deep learning, data mining, auto-encoder, anomaly detection