计算机与现代化

• 算法设计与分析 • 上一篇    下一篇

基于近邻边缘检测的支持向量机

  

  1. 晋中学院信息技术与工程学院,山西晋中030619
  • 收稿日期:2014-12-08 出版日期:2015-03-23 发布日期:2015-03-26
  • 作者简介:王秀华(1968-),女,山西榆次人,晋中学院信息技术与工程学院讲师,硕士,研究方向:智能化信息处理与数据库; 武丽芬(1978-),女,山西榆次人,副教授,硕士,研究方向:人工智能与数据挖掘。

Support Vector Machine Based on Neighbor Edge Detection

  1. School of Information Technology & Engineering, Jinzhong University, Jinzhong 030619, China
  • Received:2014-12-08 Online:2015-03-23 Published:2015-03-26

摘要: 针对标准支持向量机方法需要存储、计算和处理核矩阵而学习效率很低,不能有效处理较大规模数据挖掘的问题,提出一种基于近邻边缘检测的支持向量机方法 (SVM Method Based on Neighbor Edge Detection, ED_SVM)。该方法将近邻边缘检测技术引入SVM的训练过程,即首先对数据进行划分,选择混合类样本,通过边缘检测技术提取其中位于近似最优分类边界附近的含有较多重要支持向量信息的样本,构成新的小规模训练集,以在压缩训练集的同时保持原始支持向量信息的分布特性;并在新构成的训练集上训练标准SVM,在提高SVM学习效率的同时得到优秀的泛化性能。实验结果表明,本文提出的ED_SVM方法能够同时获得较高的测试精度和学习效率。

关键词: 支持向量机, 边缘检测, 支持向量, 泛化性能, 学习效率, ED_SVM算法

Abstract: This paper presents a Support Vector Machine (SVM) method based on neighbor edge detection, called Support Vector Machine based on Neighbor Edge Detection (ED_SVM), in order to solve the problem that there is low training efficiency and it can not solve the large scale data mining problems of normal SVM, because it needs save, compute and solve the large kernel matrix. By dividing data and obtaining the mixed clusters, this method extracts the important samples near the approximate optimal hyperplane by introducing neighbor edge detection technology into the SVM training process, which have the most important support vector information. The new training samples set is constructed by these new important samples to keep the distribution feature of original support vectors and compress the size of training dataset. Then the normal SVM is trained on these new training samples and the good generalization performance can be obtained with high learning efficiency synchronously. The experiment results demonstrate that the proposed ED_SVM model can obtain the high learning efficiency and testing accuracy simultaneously.

中图分类号: