基于NS-StyleGAN2的鱼类图像扩充方法

计算机与现代化 ›› 2023, Vol. 0 ›› Issue (01): 13-17.

基于NS-StyleGAN2的鱼类图像扩充方法

（青岛科技大学信息科学与技术学院，山东青岛 266061）

出版日期:2023-03-02 发布日期:2023-03-02
作者简介:李海涛（1978—），男，山东菏泽人，副教授，博士，研究方向：地理信息系统，北斗定位导航，移动物联网，E-mail： taohaili@sina.com；胡泽涛（1997—），男，山东菏泽人，硕士研究生，研究方向：智慧海洋，智慧渔业，E-mail： huzetao_123@163.com；张俊虎（1974—），男，副教授，博士，研究方向：分布式数据处理，E-mail：jzhang@qust.edu.cn。
基金资助:
山东省重点研发计划（科技示范工程）项目（2021SFGC0701）

Method of Fish Image Expansion Based on NS-StyleGAN2 Network

（Information Science and Technology Academy， Qingdao University of Science and Technology， Qingdao 266061， China）

Online:2023-03-02 Published:2023-03-02

摘要/Abstract

摘要： 图像多分类领域中经常出现类别不平衡问题，这会对分类模型的学习训练产生负面影响。通过对样本数量较少的类别进行扩充可以有效解决类别不平衡问题。生成对抗网络作为近年来新兴的一种神经网络，输入真实图像样本训练可以输出与真实样本非常相似的生成样本。根据此特性，本文结合第二代样式生成对抗网络（StyleGAN2）的设计思想与鱼类图像的特点，设计一种噪声抑制样式生成对抗网络NS-StyleGAN2 （Noise-Suppressed Style Generative Adversarial Networks 2）。NS-StyleGN2去除了StyleGAN2合成网络中低分辨率层的噪声输入，从而抑制低分辨率层的噪声权重，使StyleGAN2生成样本细节特征更逼近真实样本特征。采用202张鲢鱼图像进行训练，本文提出的方法在起始分数、弗雷歇起始距离、内核起始距离得分等方面均优于DCGAN、WGAN、StyleGAN2，表明该方法可以有效进行图像扩充。

关键词: 样式生成对抗网络, 图像扩充, 噪声抑制, 起始分数, 弗雷歇起始距离

Abstract: Category imbalance often occurs in the field of image multi-classification， which has a negative impact on the learning and training of the classification model. It can be effectively solved by expanding the category with fewer samples. Generative adversarial network， as a newly developed neural network in recent years， can output generated samples that are very similar to real samples when trained by real image samples. According to this characteristic， this paper designs a noise-suppressed second generation style generation adversarial network 2（NS-StyleGAN2） by combining the design philosophy of the second generation style generation adversarial network （StyleGAN2） and the characteristics of fish image. NS-StyleGAN2 removes the noise input of the low-resolution layer in the StyleGAN2’s synthetic network， so as to suppress the noise weight of the low-resolution layer and make the StyleGAN2-generated samples’ detail features more close to the real samples’. 202 images of silver carp are used for training. The method proposed in this paper is superior to DCGAN， WGAN and StyleGAN2 in inception score， Frechet inception distance and kernel inception distance， which shows this method can be used for image expansion effectively.

Key words: StyleGAN （Style Generative Adversarial Networks）, image augmentation, noise suppression, IS, FID

李海涛, 胡泽涛, 张俊虎. 基于NS-StyleGAN2的鱼类图像扩充方法[J]. 计算机与现代化, 2023, 0(01): 13-17.

LI Hai-tao, HU Ze-tao, ZHANG Jun-hu. Method of Fish Image Expansion Based on NS-StyleGAN2 Network[J]. Computer and Modernization, 2023, 0(01): 13-17.

参考文献

［1］赵楠，谭惠文. 人工智能技术的发展及应用分析［J］. 中国电子科学研究院学报， 2021，16（7）:737-740.
［2］邱根，王锂，白利兵. 基于生成对抗网络数据扩充的缺陷识别方法［J］. 电子测量与仪器学报， 2021，35（2）:212-220.
［3］范毅晟，谢保川，李雪青. 计算机图形学的发展及应用探讨［J］. 电子技术与软件工程， 2013（22）:103.
［4］陈佛计，朱枫，吴清潇，等. 生成对抗网络及其在图像生成中的应用研究综述［J］. 计算机学报， 2021，44（2）:347-369.
［5］魏富强，古兰拜尔·吐尔洪，买日旦·吾守尔. 生成对抗网络及其应用研究综述［J］. 计算机工程与应用， 2021，57（19）:18-31.
［6］吴少乾，李西明. 生成对抗网络的研究进展综述［J］. 计算机科学与探索， 2020，14（3）:377-388.
［7］ GOODFELLOW I， Pouget-Abadie J， Mirza M， et al. Generative adversarial networks［J］. Communications of the ACM， 2020，63（11）:139-144.
［8］马永杰，徐小冬，张茹，等. 生成式对抗网络及其在图像生成中的研究进展［J］. 计算机科学与探索， 2021，15（10）:1795-1811.
［9］ SHORTEN C， KHOSHGOFTAAR T M. A survey on image data augmentation for deep learning［J］. Journal of Big Data， 2019，6（1）:1-48.
［10］黄琼男，朱卫纲，李永刚. 于GAN的SAR数据扩充研究综述［J］. 兵器装备工程学报， 2021，42（11）:31-38.
［11］于晓明，黄铧. 改进GAN网络在生成短视频的应用研究［J］. 计算机科学， 2021，48（S2）:625-629.
［12］孙雄风，黄珍，陈志军，等. 基于改进GAN的端到端自动驾驶图像生成方法［J］. 交通信息与安全， 2021，39（5）:50-58.
［13］徐永士，贲可荣，王天雨，等. DCGAN模型改进与SAR图像生成研究［J］. 计算机科学， 2020，47（12）:93-99.
［14］ RADFORD A， METZ L， CHINTALA S. Unsupervised representation learning with deep convolutional generative adversarial networks［J］. arXiv preprint arXiv：1511. 06434，
2015.
［15］裴莉莉，孙朝云，孙静，等. 基于DCGAN的路面裂缝图像生成方法［J］. 中南大学学报（自然科学版）， 2021，52（11）:3899-3906.
［16］刘宇飞，周源，刘欣，等. 基于Wasserstein GAN的新一代人工智能小样本数据增强方法——以生物领域癌症分期数据为例［J］. Engineering， 2019，5（1）:338-354.
［17］胡龙辉，王朝立，孙占全，等. 基于WGAN的图像识别方法［J］. 控制工程， 2020，27（12）:2168-2175.
［18］李健宁，路阳，陶贤鹏，等. 基于WGAN-GP的水稻病害图像生成方法研究［J］.信息记录材料， 2021，22（8）:235-238.
［19］ KARRAS T， LAINE S， AILA T. A style-based generator architecture for generative adversarial networks［C］// 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition （CVPR）. 2019:4396-4405.
［20］赵宸，帅仁俊，马力，等. 基于Self-Attention-StyleGAN的皮肤癌图像生成与分类［J/OL］.计算机工程与应用:1-16［2022-01-05］.http://kns.cnki.net/kcms/detail/11.2127.tp.
20210419.1440.067.html.
［21］高志军，王新勃，王健. 基于样式生成对抗网络的视网膜OCT图像生成方法［J］. 黑龙江科技大学学报， 2021，31（2）:247-253.
［22］俞彬. 基于生成对抗网络的图像类别不平衡问题数据扩充方法［D］. 广州:华南理工大学， 2018.
［23］林志鹏，曾立波，吴琼水. 基于生成对抗网络的宫颈细胞图像数据增强［J］. 科学技术与工程， 2020，20（28）:11672-11677.
［24］ KARRAS T， LAINE S， AITTALA M， et al. Analyzing and improving the image quality of stylegan［C］// Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2020:8107-8116.
［25］ ABDAL R， ZHU P， MITRA N J， et al. Styleflow：Attribute-
conditioned exploration of stylegan-generated images using conditional continuous normalizing flows［J］. ACM Transactions on Graphics （TOG）， 2021，40（3）: 1-21.
［26］ TEPEC D， SKOCAJ D. Image synthesis as a pretext for unsupervised histopathological diagnosis［C］// Proceedings of the 5th International Workshop on Simulation and Synthesis in Medical Imaging. 2020:174-183.
［27］ KARRAS T， AITTALA M， HELLSTEN J， et al. Training generative adversarial networks with limited data［C］// Proceedings of the 34th International Conference on Neural Information Processing Systems. 2020:12104-12114.

[1]	王闻博,穆向阳,汤楠. #br# 低照度图像增强算法[J]. 计算机与现代化, 2014, 0(1): 27-31.
[2]	孙清文;张金锋. 基于双耦合改进型混沌振子系统的弱信号检测[J]. 计算机与现代化, 2012, 1(1): 17-21.