Computer and Modernization ›› 2022, Vol. 0 ›› Issue (07): 121-126.

• Artificial Intelligence •

GRU Adversarial Network Text Generation Model with Reward

  

  1. (Department of Command Information System, College of Electronic Engineering, Naval University of Engineering, Wuhan 430030, China)
  • Online: 2022-07-25  Published: 2022-07-25
  • About the authors: PENG Pengfei (b. 1977), male, native of Wuhan, Hubei; associate professor, Ph.D.; research interest: command and control; E-mail: pengpengfei@126.com. Corresponding author: ZHOU Linru (b. 1997), female, native of Cangzhou, Hebei; master's student; research interests: artificial intelligence and pattern recognition, task analysis; E-mail: 1078591399@qq.com.


Abstract: To address the error accumulation caused by the supervised training of current generative-adversarial-network text generation models, as well as the limited diversity of the text they produce, a text generation model based on a GRU generative adversarial network is proposed. The GRU generator updates its parameters by policy gradient, and Monte Carlo search is added to the model to roll out generated sample sequences. GRU neural networks, which have relatively few parameters, serve as both the generator and the discriminator; the discriminator's output loss guides parameter optimization during generation, and Monte Carlo rollouts complete the partial sequences produced during generation, reducing error accumulation and enriching the generated text. A gate truncation mechanism is introduced: a custom function replaces the sigmoid function in the GRU network, improving the activation of the current hidden state and mitigating the original function's slow convergence and vanishing-gradient problems so that it better fits the proposed model. Simulation results show that the model enriches the diversity of the generated text and converges faster, verifying its effectiveness. The model has good applicability.
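The reward scheme the abstract describes — completing each partial sequence with Monte Carlo rollouts and averaging the discriminator's scores — can be illustrated with a toy sketch. Everything below (the two-token vocabulary, `toy_discriminator`, `rollout_reward`) is an illustrative assumption in the style of SeqGAN-like training, not the paper's actual implementation:

```python
import random

VOCAB = [0, 1]   # tiny token vocabulary for illustration
SEQ_LEN = 4      # length of a complete sequence

def toy_discriminator(seq):
    """Stand-in for the GRU discriminator: probability that the
    complete sequence is 'real'. Here, more 1s look more 'real'."""
    return sum(seq) / len(seq)

def rollout_reward(prefix, policy, n_rollouts=64, rng=random):
    """Estimate the reward of a partial sequence: complete it
    n_rollouts times by sampling from the generator's current policy,
    then average the discriminator's score over the completions."""
    total = 0.0
    for _ in range(n_rollouts):
        seq = list(prefix)
        while len(seq) < SEQ_LEN:
            # sample the next token from the generator's policy
            seq.append(rng.choices(VOCAB, weights=policy)[0])
        total += toy_discriminator(seq)
    return total / n_rollouts

random.seed(0)
policy = [0.5, 0.5]                # uniform toy generator policy
r = rollout_reward([1], policy)    # reward estimate for the prefix [1]
```

In policy-gradient (REINFORCE-style) training, this per-prefix reward would then scale the log-probability gradient of the token the generator chose, so that the discriminator's judgment of *completed* sequences supervises each intermediate generation step.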

Keywords: generative adversarial network (GAN), text generation, GRU (gated recurrent unit) neural network, Monte Carlo strategy
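The abstract's gate truncation idea — replacing the GRU's sigmoid gate with a custom function that converges faster and avoids vanishing gradients — can be sketched as follows. The paper's exact custom function is not given in the abstract, so a hard (piecewise-linear, truncated) sigmoid is assumed here purely as one common choice; the scalar `gru_update` step is likewise a simplified illustration, not the paper's network:

```python
import math

def sigmoid(x):
    """The standard GRU gate activation: saturates for large |x|,
    where its gradient vanishes."""
    return 1.0 / (1.0 + math.exp(-x))

def hard_sigmoid(x):
    """Truncated gate (assumed stand-in for the paper's custom function):
    clip the linear map 0.2*x + 0.5 into [0, 1]. Linear near zero and
    exactly 0 or 1 outside [-2.5, 2.5], so no saturated gradient tail."""
    return min(1.0, max(0.0, 0.2 * x + 0.5))

def gru_update(h_prev, x, wz, wh, gate=hard_sigmoid):
    """Scalar GRU-style update step: the gate z decides how much of the
    candidate state replaces the previous hidden state."""
    z = gate(wz * x)              # update gate with the truncated activation
    h_tilde = math.tanh(wh * x)   # candidate hidden state
    return (1.0 - z) * h_prev + z * h_tilde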
