An Image Style Conversion Technology Based on EBGAN

doi:10.3969/j.issn.1006-2475.2020.04.005

Computer and Modernization, 2020, Vol. 0, Issue (04): 24-.

(College of Computer and Information, Hohai University, Nanjing 211100, China)

Received: 2019-10-08  Online: 2020-04-22  Published: 2020-04-24
About the authors: TAO Ying (b. 1994), female, from Nanjing, Jiangsu, master's student, research interests: deep learning and computer vision, E-mail: yingtao18@foxmail.com; LIU Hui-yi (b. 1961), male, from Changzhou, Jiangsu, professor, Ph.D., research interests: computer graphics and computer vision, E-mail: hyliu@hhu.edu.cn.
Funding: Science and Technology Program of the Jiangsu Provincial Department of Water Resources (2017003ZB)

Abstract

Abstract: To address the poor diversity of images generated by traditional image stylization algorithms, this paper proposes a network model based on EBGAN (Energy-Based Generative Adversarial Net). The idea of an energy function is introduced into the discriminator: an autoencoder is designed to reconstruct real and fake inputs differently, and the error between an input image and its reconstruction is treated as an energy used to discriminate the input. In the encoding stage of the autoencoder, an orthogonality constraint is applied to the encoded vectors so that every pair of vectors in the same batch is as close to orthogonal as possible, which pushes the generator to produce images that vary in different directions. Experiments on the Facades and Cityscapes datasets show that the proposed model performs image stylization effectively and generates more diverse images than traditional image stylization network models.

Key words: GAN (generative adversarial network), energy function, image style conversion
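
The abstract gives no formulas, so the following is only a minimal sketch under stated assumptions. It takes the energy to be the mean squared error between an input and its autoencoder reconstruction, and it takes the orthogonality constraint to be the mean squared cosine similarity over all pairs of encoded vectors in a batch, which is the form of the pulling-away term in the original EBGAN paper by Zhao et al. The function names are hypothetical, and plain Python lists stand in for images and encoded vectors; the paper's actual loss may differ.

```python
import math


def reconstruction_energy(x, x_rec):
    """Energy of one input: mean squared error between the input and its
    autoencoder reconstruction (assumed form; both are flat lists)."""
    if len(x) != len(x_rec) or not x:
        raise ValueError("input and reconstruction must be non-empty and equal length")
    return sum((a - b) ** 2 for a, b in zip(x, x_rec)) / len(x)


def pairwise_orthogonality_penalty(codes):
    """Mean squared cosine similarity over all ordered pairs i != j of
    encoded vectors in a batch (assumed form of the orthogonality constraint).

    The value is 0 when all codes are mutually orthogonal and 1 when all
    codes are parallel, so minimizing it pushes codes toward orthogonality.
    """
    n = len(codes)
    if n < 2:
        raise ValueError("need at least two encoded vectors in the batch")
    norms = [math.sqrt(sum(v * v for v in c)) for c in codes]
    if any(norm == 0.0 for norm in norms):
        raise ValueError("encoded vectors must be non-zero")
    total = 0.0
    for i in range(n):
        for j in range(n):
            if i == j:
                continue
            dot = sum(a * b for a, b in zip(codes[i], codes[j]))
            cos = dot / (norms[i] * norms[j])
            total += cos ** 2
    return total / (n * (n - 1))
```

In a training loop, a penalty of this kind would typically be added to the generator loss with a small weight, while the reconstruction energy drives the discriminator; both choices are assumptions here, not details stated in the abstract.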

CLC number: TP183

Cite this article: TAO Ying, LIU Hui-yi. An Image Style Conversion Technology Based on EBGAN[J]. Computer and Modernization, 2020, 0(04): 24-.

