计算机与现代化

• 图像处理 • 上一篇    下一篇

基于改进随机森林的洗钱交易角色识别应用

  

  1. (华北计算技术研究所,北京100083)
  • 收稿日期:2017-06-14 出版日期:2018-03-08 发布日期:2018-03-09
  • 作者简介:张昊(1992-),男,四川成都人,华北计算技术研究所硕士研究生,研究方向:大数据处理,机器学习; 黄蔚(1972-),女,研究员,硕士,研究方向:大数据处理整合与挖掘分析; 胡国超(1982-),男,工程师,硕士,研究方向:大数据处理整合与挖掘分析。

Role Identification of Money Laundering Based on Improved Random Forest

  1. (North China Institute of Computing Technology, Beijing 100083, China)
  • Received:2017-06-14 Online:2018-03-08 Published:2018-03-09

摘要: 对于识别洗钱交易中钱庄账户的方法、现有公安办案方法与现有研究基于机器预警方法存在准确度低且滞后的问题,本文分析洗钱犯罪集团中钱庄与客户的交易行为特点,从主体背景属性、交易统计量、交易网络、交易行为离群4种角度提取一系列特征进行刻画,并通过随机森林进行特征选择与优化,在已经标注过的数据上进行模型训练与验证,形成一个能够对参与洗钱交易者身份进行自动识别的应用。通过实际数据验证可以发现有严重危害的钱庄经营者。

关键词: 洗钱交易, 角色识别, 交易网络, 中心性, 决策树权重, 随机森林

Abstract: Aiming at the problems of low accuracy and delay of the method of identifying the bank account in the money laundering transaction, the method of public security unit handling, and the existing research based on machine warning, this paper analyzes the trading behavior characteristics of the bank and the clients in the money-laundering crime group, extracting a series of features like the personal background attributes, trade statistics, trading platform, trading behavior to depict outliers from four kinds of view. The features are selected and optimized by random forest model. The annotated data is trained and verified. An application is formed for automatic recognition of traders involved in money laundering. Through the actual data validation, those banking operators with serious hazards can be found out.

Key words:  money laundering, role identification, transaction network, centrality, weighted decision tree, random forest

中图分类号: