Frontiers of Data and Computing ›› 2022, Vol. 4 ›› Issue (1): 5-19.

doi: 10.11871/jfdc.issn.2096-742X.2022.01.001

• Special Issue: Union of National Scientific Data Center

Data Engineering Discipline Construction and Practice

ZHANG Yaonan1,2,3,*

    1. National Cryosphere Desert Scientific Data Center, Lanzhou, Gansu 730000, China
    2. Northwest Institute of Eco-Environment and Resources, Chinese Academy of Sciences, Lanzhou, Gansu 730000, China
    3. Gansu Data Engineering and Technology Research Center for Resource and Environment, Lanzhou, Gansu 730000, China
  • Received: 2021-09-20 Online: 2022-02-20 Published: 2022-03-04
  • Contact: ZHANG Yaonan E-mail: yaonan@lzb.ac.cn

Abstract:

[Objective] Data science can handle large volumes of data and solve many problems, and it is changing the models of scientific research, enterprise operation, and social governance. Because putting data science into engineering practice is difficult, a data engineering discipline is needed to convert data assets and their intrinsic value into effective services, decision making, and data products that enable the digital economy. [Methods] This paper introduces the idea of engineering, extends the concept of narrow data engineering to broad data engineering, discusses the necessity of establishing the discipline of data engineering, and analyzes the characteristics of data engineering knowledge, which rests on data as its material basis, by reference to the characteristics of the civil engineering discipline and its construction. It presents the concept, theoretical basis, research content, research framework, and main technical system of the data engineering discipline, and illustrates the necessity of establishing a new methodology of data engineering through two data engineering application cases. [Conclusions] The data engineering discipline has a unique knowledge system based on data as its material, together with special research methods that integrate mathematics, electronics, information science, computer science, data science, and other disciplines. The material, theoretical, technical, and demand bases for constructing data engineering have been established. It is urgent to establish data engineering support that transforms data assets into engineering applications and enables the digital economy.

Key words: narrow data engineering, generalized data engineering, data engineering discipline, data science, digital economy