Computer and Modernization

Previous Articles     Next Articles

 
A Spam Filtering Algorithm Based on Conditional Entropy

  

  1.  
    (1. Bohai University, Jinzhou 121000, China; 2. Shenyang University, Shenyang 110044, China)
  • Received:2013-10-16 Online:2014-02-14 Published:2014-02-14

Abstract: In spam filtering, according to the filter misjudgment for legitimate mails, we put forward an improved spam filtering algorithm, which improves the conditional entropy estimation method of information gain. Combined with the Bayes minimum risk decision method, we analyze the algorithm through the recall and accuracy by carrying out an experiment on the English Corpus. Experimental results show that the improved algorithm can enhance the classification precision and reduce the misjudgment of legitimate emails, which can reduce the loss of users.

Key words: spam, information gain, conditional entropy, minimum risk

CLC Number: