计算机与现代化 ›› 2023, Vol. 0 ›› Issue (01): 13-17.

• 图像处理 • 上一篇    下一篇

基于NS-StyleGAN2的鱼类图像扩充方法

  

  1. (青岛科技大学信息科学与技术学院,山东 青岛 266061)
  • 出版日期:2023-03-02 发布日期:2023-03-02
  • 作者简介:李海涛(1978—),男,山东菏泽人,副教授,博士,研究方向:地理信息系统,北斗定位导航,移动物联网,E-mail: taohaili@sina.com; 胡泽涛(1997—),男,山东菏泽人,硕士研究生,研究方向:智慧海洋,智慧渔业,E-mail: huzetao_123@163.com; 张俊虎(1974—),男,副教授,博士,研究方向:分布式数据处理,E-mail:jzhang@qust.edu.cn。
  • 基金资助:
    山东省重点研发计划(科技示范工程)项目(2021SFGC0701)

Method of Fish Image Expansion Based on NS-StyleGAN2 Network

  1. (Information Science and Technology Academy, Qingdao University of Science and Technology, Qingdao 266061, China)
  • Online:2023-03-02 Published:2023-03-02

摘要: 图像多分类领域中经常出现类别不平衡问题,这会对分类模型的学习训练产生负面影响。通过对样本数量较少的类别进行扩充可以有效解决类别不平衡问题。生成对抗网络作为近年来新兴的一种神经网络,输入真实图像样本训练可以输出与真实样本非常相似的生成样本。根据此特性,本文结合第二代样式生成对抗网络(StyleGAN2)的设计思想与鱼类图像的特点,设计一种噪声抑制样式生成对抗网络NS-StyleGAN2 (Noise-Suppressed Style Generative Adversarial Networks 2)。NS-StyleGN2去除了StyleGAN2合成网络中低分辨率层的噪声输入,从而抑制低分辨率层的噪声权重,使StyleGAN2生成样本细节特征更逼近真实样本特征。采用202张鲢鱼图像进行训练,本文提出的方法在起始分数、弗雷歇起始距离、内核起始距离得分等方面均优于DCGAN、WGAN、StyleGAN2,表明该方法可以有效进行图像扩充。

关键词: 样式生成对抗网络, 图像扩充, 噪声抑制, 起始分数, 弗雷歇起始距离

Abstract: Category imbalance often occurs in the field of image multi-classification, which has a negative impact on the learning and training of the classification model. It can be effectively solved by expanding the category with fewer samples. Generative adversarial network, as a newly developed neural network in recent years, can output generated samples that are very similar to real samples when trained by real image samples. According to this characteristic, this paper designs a noise-suppressed second generation style generation adversarial network 2(NS-StyleGAN2) by combining the design philosophy of the second generation style generation adversarial network (StyleGAN2) and the characteristics of fish image. NS-StyleGAN2 removes the noise input of the low-resolution layer in the StyleGAN2’s synthetic network, so as to suppress the noise weight of the low-resolution layer and make the StyleGAN2-generated samples’ detail features more close to the real samples’. 202 images of silver carp are used for training. The method proposed in this paper is superior to DCGAN, WGAN and StyleGAN2 in inception score, Frechet inception distance and kernel inception distance, which shows this method can be used for image expansion effectively.

Key words: StyleGAN (Style Generative Adversarial Networks), image augmentation, noise suppression, IS, FID