[1] Abadi M, Barham P, Chen J, et al. TensorFlow: A system for large-scale machine learning[C]// 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 16), 2016: 265-283.
[2] Paszke A, Gross S, Massa F, et al. PyTorch: An imperative style, high-performance deep learning library[C]// Proceedings of the 33rd International Conference on Neural Information Processing Systems, 2019: 8026-8037.
[3] Chen T, Li M, Li Y, et al. MXNet: A flexible and efficient machine learning library for heterogeneous distributed systems[J]. arXiv preprint arXiv:1512.01274, 2015.
[4] Cheng J. CUDA by Example: An Introduction to General-Purpose GPU Programming[J]. Scalable Computing: Practice and Experience, 2010, 11(4): 401-401.
[5] AMD ROCm™ Documents[EB/OL]. [2022-02-17]. https://rocmdocs.amd.com/en/latest/index.html.
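[6] Ding Lide. Research on deep learning acceleration technology supporting domestic computing platforms[D]. China Academy of Electronics and Information Technology, China Electronics Technology Group Corporation, 2020. DOI: 10.27728/d.cnki.gdzkx.2020.000010.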
[7] Munshi A. The OpenCL specification[C]// 2009 IEEE Hot Chips 21 Symposium (HCS), IEEE, 2009: 1-314.
[8] Jung K H. A study on machine learning for steganalysis[C]// Proceedings of the 3rd International Conference on Machine Learning and Soft Computing, 2019: 12-15.
[9] Guennebaud G, Jacob B. Eigen[EB/OL]. [2022-02-15]. http://eigen.tuxfamily.org.
[10] Chetlur S, Woolley C, Vandermersch P, et al. cuDNN: Efficient primitives for deep learning[J]. arXiv preprint arXiv:1410.0759, 2014.
[11] Goli M, Iwanski L, Richards A. Accelerated machine learning using TensorFlow and SYCL on OpenCL Devices[C]// Proceedings of the 5th International Workshop on OpenCL, 2017: 1-4.
[12] Fang J, Huang C, Tang T, et al. Parallel programming models for heterogeneous many-cores: A comprehensive survey[J]. CCF Transactions on High Performance Computing, 2020, 2(4): 382-400.
[13] Python Software Foundation. unittest introduction[EB/OL]. [2022-02-17]. https://docs.python.org/zh-cn/3.7/library/unittest.html.
[14] TensorFlow. Public API for tf.raw_ops namespace[EB/OL]. [2022-02-17]. https://www.tensorflow.org/api_docs/python/tf/raw_ops.