Computer and Modernization (计算机与现代化)

• Algorithm Design and Analysis •

A Heuristic Linear Regression Loss Function Selection Method

  

  1. Department of Mechanical, Electrical and Information Engineering, Ya’an Polytechnic College, Ya’an 625000, Sichuan, China
  • Received: 2017-02-28  Online: 2017-08-31  Published: 2017-09-01
  • About the author: ZHANG Yi (张祎) (1975-), female, from Ya’an, Sichuan; lecturer, M.S., Department of Mechanical, Electrical and Information Engineering, Ya’an Polytechnic College; research interests: computer applications and software engineering.




Abstract: The loss function quantifies the degree of information loss and error in regression analysis; it is the objective function that a machine learning algorithm minimizes. This paper addresses loss function selection for linear regression on finite data sets. For a given noise density, an optimal loss function exists under an asymptotic setting, e.g. squared loss is optimal for Gaussian noise. In real-life applications, however, the noise density is usually unknown and the training samples are finite. Robust statistics provides ways to select the loss function from statistical information about the noise density, but it rests on asymptotic assumptions and may not apply well to finite sample sets. For such practical problems, we draw on Vapnik’s ε-insensitive loss function and propose a heuristic method that sets the value of ε as a function of the number of samples and the noise variance. Experimental comparisons on linear regression problems show that the proposed loss function is more robust and yields higher prediction accuracy than the popular squared loss and Huber’s least-modulus loss.
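The abstract contrasts three losses: squared loss, Huber’s least-modulus loss, and Vapnik’s ε-insensitive loss with ε chosen from the sample size and noise variance. A minimal NumPy sketch of the three losses follows; note that `heuristic_eps` implements the well-known Cherkassky–Ma rule, which has the same general shape described in the abstract (ε shrinking with sample size and scaling with noise level), and is not necessarily the exact rule proposed in this paper.

```python
import numpy as np

def squared_loss(r):
    # Classic L2 loss on a residual r: asymptotically optimal for Gaussian noise.
    return r ** 2

def huber_loss(r, delta=1.0):
    # Huber's robust loss: quadratic near zero, linear in the tails,
    # so large outliers contribute less than under squared loss.
    a = np.abs(r)
    return np.where(a <= delta, 0.5 * r ** 2, delta * (a - 0.5 * delta))

def eps_insensitive_loss(r, eps):
    # Vapnik's ε-insensitive loss: residuals within the ±ε tube cost nothing.
    return np.maximum(np.abs(r) - eps, 0.0)

def heuristic_eps(noise_std, n):
    # Cherkassky & Ma's published heuristic of the same general shape as the
    # one described in the abstract: ε grows with the noise level and shrinks
    # as the number of training samples n grows. The paper's exact rule may differ.
    return 3.0 * noise_std * np.sqrt(np.log(n) / n)
```

For example, a residual of 0.5 inside a ±1 tube incurs zero ε-insensitive loss while still incurring positive squared and Huber loss, which is what makes the tube width ε the critical tuning parameter.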

Key words: loss function, support vector machine, squared loss function, parameter selection, VC dimension
