深度卷积神经网络的多GPU并行框架

doi:10.3969/j.issn.1006-2475.2016.11.017

计算机与现代化 ›› 2016, Vol. 0 ›› Issue (11): 95-98.doi: 10.3969/j.issn.1006-2475.2016.11.017

深度卷积神经网络的多GPU并行框架

南京晓庄学院信息工程学院,江苏南京211171

收稿日期:2016-03-11 出版日期:2016-11-15 发布日期:2016-11-23
作者简介:杨宁(1972-),男,江苏泰州人,南京晓庄学院信息工程学院讲师,硕士,研究方向:数据库,数据挖掘,机器学习。
基金资助:
国家自然科学基金青年基金资助项目(61202136)

Multi-GPU Parallel Framework of Deep Convolutional Neural Networks

College of Information Engineering, Nanjing Xiaozhuang University, Nanjing 211171, China

Received:2016-03-11 Online:2016-11-15 Published:2016-11-23

摘要/Abstract

摘要： 近年来，深度卷积神经网络在图像识别和语音识别等领域被广泛运用，取得了很好的效果。深度卷积神经网络是层数较多的卷积神经网络，有数千万参数需要学习，计算开销大，导致训练非常耗时。针对这种情况，本文提出深度卷积神经网络的多GPU并行框架,设计并实现模型并行引擎，依托多GPU的强大协同并行计算能力，结合深度卷积神经网络在训练中的并行特点，实现快速高效的深度卷积神经网络训练。


关键词: 深度卷积神经网络, GPU, 并行框架, 图像识别, 大数据

Abstract: In recent years, deep convolutional neural network is widely used in the fields of image recognition and speech recognition, and achieves good results. Deep convolutional neural networks are the convolutional neural networks with multiple layers, tens of millions of parameters need to be learned, and computational overhead is large, so the training is very time-consuming. In view of this situation, we propose a multi-GPU parallel framework of deep convolutional neural networks, design and implement model parallel engine, relying on the powerful collaborative parallel computing ability of multi-GPU, combined with the parallel characteristics of deep convolutional neural networks in training, to achieve fast and efficient deep convolution neural networks training.


Key words: deep convolutional neural networks, graphic processing unit, parallel framework, image recognition, big data

中图分类号:

TP183

杨宁. 深度卷积神经网络的多GPU并行框架[J]. 计算机与现代化, 2016, 0(11): 95-98.

YANG Ning. Multi-GPU Parallel Framework of Deep Convolutional Neural Networks[J]. Computer and Modernization, 2016, 0(11): 95-98.

参考文献

［1］ LeCun Y， Boser B， Denker J S， et al. Back propagation applied to handwritten Zip code recognition［J］. Neural Computation， 1989,1(4):541-551.
［2］ Hinton G E， Osindero S， Teh Y W. A fast learning algorithm for deep belief nets［J］. Neural Computation, 2006,18(7):1527-1554.
［3］ Krizhevsky A， Sutskever I， Hinton G E. ImageNet classification with deep convolutional neural networks［C］// Neural Information Processing Systems Conference and Workshop(NIPS). 2012:1106-1114.
［4］ Chen Xueyun, Xiang Shiming, Liu Cheng-lin, et al. Vehicle detection in satellite images by hybrid deep convolutional neural networks［J］. IEEE Geoscience and Remote Sensing Letters, 2014,11(10):1797-1801.
［5］ Abdel-Hamid O， Mohamed A, Hui Jiang, et al. Applying convolutional neural networks concepts to hybrid NN-HMM model for speech recognition［C］// 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2012:4277-4280.
［6］ Zhang Cha, Zhang Zhengyou. Improving multi-view face detection with multi-task deep convolutional neural networks［C］// 2014 IEEE Winter Conference on Applications of Computer Vision (WACV). 2014:1036-1041.
［7］ Dahl G E, Dong Yu, Li Deng, et al. Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition［J］. IEEE Transactions on Audio, Speech, and Language Processing, 2012,20(1):30-42.
［8］ LeCun Y, Bottou L, Bengio Y, et al. Gradient-based learning applied to document recognition［J］. Proceedings of the IEEE, 1998,86(11):2278-2324.
［9］许可. 卷积神经网络在图像识别上的应用的研究［D］. 杭州:浙江大学， 2012.
［10］凡保磊. 卷积神经网络的并行化研究［D］. 河南:郑州大学， 2013.
［11］段宝彬,韩立新. 改进的深度卷积网络及在碎纸片拼接中的应用［J］. 计算机工程与应用, 2014,50(9):176-183.
［12］邢健飞,罗志增,席旭刚. 基于深度神经网络的实时人脸识别［J］. 杭州电子科技大学学报, 2013,33(6):107-110.
［13］陈先昌. 基于卷积神经网络的深度学习算法与应用研究［D］. 杭州:浙江工商大学， 2014.
［14］Lee K F, Reddy R. Automatic Speech Recognition: The Development of the Sphinx Recognition System［M］. Norwell, MA, USA:Kluwer Academic Publishers, 1988.
［15］Frank Seide, Gang Li, Dong Yu. Conversational speech transcription using context-dependent deep neural networks［C］// Proceedings of the 29th International Conference on Machine Learning. 2012:437-440.
［16］王伟臻. 基于神经网络的语音识别研究［D］. 杭州:浙江大学, 2008.
［17］包亚萍,郑骏,武晓光. 基于HMM和遗传神经网络的语音识别系统［J］. 计算机工程与科学, 2011(4):139-144.

[1]	杨波, 王宏杰, 徐胜超, 毛明扬, 蒋金陵, 蒋大锐. 一种面向生物基因多序列比对算法的任务调度方法[J]. 计算机与现代化, 2024, 0(07): 7-12.
[2]	邱玲1, 2, 宋智1, 2, 吕爽1, 2, 杨雪1, 2. 数据同步技术在气象大数据云平台对外服务中的应用[J]. 计算机与现代化, 2024, 0(07): 76-81.
[3]	张高义1, 徐杨1, 2, 曹斌1, 2, 石进1. 全局跨层交互网络学习细粒度图像特征表示[J]. 计算机与现代化, 2024, 0(03): 97-104.
[4]	黎世达, 项剑文. 一种提高图像识别模型鲁棒性的弱化强化方法[J]. 计算机与现代化, 2023, 0(10): 70-76.
[5]	李实秋. 一种基于协作表示的判别局部保持投影方法[J]. 计算机与现代化, 2023, 0(06): 43-47.
[6]	周明升, 张雯. 一种面向多源数据的智慧园区管理平台[J]. 计算机与现代化, 2023, 0(05): 68-74.
[7]	朱理清, 李祥, . 改进YOLOv5算法的遥感图像车辆检测[J]. 计算机与现代化, 2023, 0(05): 117-121.
[8]	邱金水, 庄会富, 金涛. 面向海量植物图像的智能检索系统设计[J]. 计算机与现代化, 2022, 0(10): 62-67.
[9]	单珂, 张一鸣, 刘瑞霞, . 面向中原城市群的科技服务资源池研究与设计[J]. 计算机与现代化, 2022, 0(07): 91-96.
[10]	黄安琪, 苗放, 杨文晖, 倪雅婷, 蒋媛. 基于数据架构的结构化数据注册引擎设计[J]. 计算机与现代化, 2022, 0(05): 82-89.
[11]	曹禹, 李晓辉, 刘忠麟, 贾贺, 费志伟. 云环境大数据工作流编排管理系统研究综述[J]. 计算机与现代化, 2022, 0(01): 41-53.
[12]	张小芳, 冯慧芳. 基于轨迹大数据的动态最优路径规划[J]. 计算机与现代化, 2021, 0(11): 82-88.
[13]	李明, 陈积富, 易小荣, 刘书铭. 基于JFinal框架的洞庭湖环境监测系统[J]. 计算机与现代化, 2021, 0(10): 41-48.
[14]	魏云东. 基于大数据技术的人才智能推荐方法[J]. 计算机与现代化, 2021, 0(07): 60-64.
[15]	雷鸣, 姜罕盛, 武国良, 赵玉娟, 梁健. 基于HBase的大数据架构下负载平衡技术[J]. 计算机与现代化, 2021, 0(06): 91-95.

深度卷积神经网络的多GPU并行框架

Multi-GPU Parallel Framework of Deep Convolutional Neural Networks

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价