Computer and Modernization

Previous Articles     Next Articles

SensitiveFileDetectionMethodBasedonCNN

  

  1. (1.NARIGroupCorporation/StateGridElectricPowerResearchInstitute,Nanjing211106,China;
     2.Information&TelecommunicationBranch,StateGridJiangsuElectricPowerCo.Ltd.,Nanjing210024,China)
  • Received:2017-10-30 Online:2018-08-23 Published:2018-08-27

Abstract: Inrecentyears,thepowerindustryinformationconstructionhasmadegreatachievements.Moreandmoreofficedocuments,projectdocuments,projectcontractsandotherdocumentsinvolvingindustrysecrettransmitonInternet,onthetransmissionprocess,enterprise-classsensitivedocumentsmayhavebeenleaked.Traditionalsensitivedatarecognitionmethodbasedonsensitivelexiconforfeaturedetectioncangetdetectionresultquickly,butthereisalowaccuracy,highfalsenegativesrateandfalsepositivesrate.ThispaperproposesasensitivefiledetectionmethodbasedonDeepLearning.Themethodreferstowordembeddingandconvolutionneuralnetworkalgorithmtorealizetheaccurateclassificationofsensitivedocuments.Theapproachinthispapermakesenterprisesensitivefilesdetectionindependentoffeaturekeywords,andreducesthefalsenegativerateandfalsepositiverate.

Key words: sensitivewordtable, wordembedding, convolutionneuralnetwork, deeplearning, sensitivefiledetection

CLC Number: