[1] Bergstra J, Breuleux O, Bastien F, et al. Theano: a CPU and GPU math expression compiler[C]. Proceedings of the Python for Scientific Computing Conference (SciPy). 2010, 4(3).
[2] Jia Y, Shelhamer E, Donahue J, et al. Caffe: Convolutional architecture for fast feature embedding[C]. Proceedings of the 22nd ACM International Conference on Multimedia. ACM, 2014: 675-678.
[3] Abadi M, Barham P, Chen J, et al. TensorFlow: A system for large-scale machine learning[C]. 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 16). 2016: 265-283.
[4] Paszke A, Gross S, Chintala S, et al. PyTorch: Tensors and dynamic neural networks in Python with strong GPU acceleration[J]. 2017, 6.
[5] PaddlePaddle: An open-source deep learning platform originating from industrial practice[EB/OL]. https://www.paddlepaddle.org.cn/.
[6] Sun Y, Wang S, Li Y, et al. ERNIE: Enhanced representation through knowledge integration[J]. arXiv preprint arXiv:1904.09223, 2019.
[7] Devlin J, Chang M W, Lee K, et al. BERT: Pre-training of deep bidirectional transformers for language understanding[J]. arXiv preprint arXiv:1810.04805, 2018.
[8] Mnih V, Kavukcuoglu K, Silver D, et al. Human-level control through deep reinforcement learning[J]. Nature, 2015, 518(7540): 529.
[9] Lillicrap T P, Hunt J J, Pritzel A, et al. Continuous control with deep reinforcement learning[J]. arXiv preprint arXiv:1509.02971, 2015.
[10] Schulman J, Wolski F, Dhariwal P, et al. Proximal policy optimization algorithms[J]. arXiv preprint arXiv:1707.06347, 2017.
[11] Espeholt L, Soyer H, Munos R, et al. IMPALA: Scalable distributed deep-RL with importance weighted actor-learner architectures[J]. arXiv preprint arXiv:1802.01561, 2018.
[12] Babaeizadeh M, Frosio I, Tyree S, et al. GA3C: GPU-based A3C for deep reinforcement learning[J]. arXiv preprint arXiv:1611.06256, 2016.
[13] OpenAI Baselines: ACKTR & A2C[EB/OL]. https://openai.com/blog/baselines-acktr-a2c/.
[14] Xbyak: x86/x64 JIT assembler[EB/OL]. http://herumi.in.coocan.jp/soft/xbyak_e.html.
[15] OpenBLAS: An optimized BLAS library[EB/OL]. https://www.openblas.net/.
[16] Intel(R) Math Kernel Library (Intel(R) MKL)[EB/OL]. https://software.intel.com/en-us/mkl.
[17] Intel(R) Math Kernel Library for Deep Neural Networks (Intel(R) MKL-DNN)[EB/OL]. https://github.com/intel/mkl-dnn.
[18] nGraph Compiler stack (Beta)[EB/OL]. https://github.com/NervanaSystems/ngraph.
[19] NVIDIA cuBLAS: Dense Linear Algebra on GPUs[EB/OL]. https://developer.nvidia.com/cublas.
[20] NVIDIA CUDA(R) Deep Neural Network library (cuDNN)[EB/OL]. https://developer.nvidia.com/cudnn.
[21] Sun Y, Wang S, Li Y, et al. ERNIE 2.0: A continual pre-training framework for language understanding[J]. arXiv preprint arXiv:1907.12412v1, 2019.
[22] Yang Z, Dai Z, Yang Y, et al. XLNet: Generalized autoregressive pretraining for language understanding[J]. arXiv preprint arXiv:1906.08237, 2019.