数据与计算发展前沿 ›› 2022, Vol. 4 ›› Issue (1): 53-68.

doi: 10.11871/jfdc.issn.2096-742X.2022.01.005

• 专刊:“国家科学数据中心联合”专刊 • 上一篇    下一篇

新一代“生态网络云”大数据平台的设计与实现

唐新斋1,5(),陈昕2(),何洪林1,3,5,*(),郭学兵1,5(),苏文1,5(),谢传节4(),沈志宏2(),张黎1,3,5(),任小丽1,3,5(),侯艳飞1,5(),刘峰2()   

  1. 1.中国科学院地理科学与资源研究所,生态系统网络观测与模拟重点实验室,北京 100101
    2.中国科学院计算机网络信息中心,北京 100083
    3.中国科学院大学资源与环境学院,北京 100049
    4.中国科学院地理科学与资源研究所,资源与环境信息系统国家重点实验室,北京 100101
    5.国家生态科学数据中心,北京 100101
  • 收稿日期:2021-10-07 出版日期:2022-02-20 发布日期:2022-03-04
  • 通讯作者: 何洪林
  • 作者简介:唐新斋,中国科学院地理科学与资源研究所,生态系统网络观测与模拟重点实验室,工程师,主要研究方向为生态信息学,长期从事生态科学数据管理、质控和共享服务,开展生态网络信息化建设与实践。
    本文中负责总体统稿、数据中心业务需求分析、总体架构设计。
    TANG Xinzhai is an engineer at the Key Laboratory of Eco-system Network Observation and Modeling, Institute of Geo-graphic Sciences and Natural Resources Research, Chinese Academy of Sciences. His current research interests include ecoin-formatics.
    In this paper, he is responsible for the overall draft, business requirements analysis and platform architecture design. E-mail: tangxz@igsnrr.ac.cn|陈昕,中国科学院计算机网络信息中心,大数据技术与应用发展部,高级工程师,博士,硕士生导师,目前主要从事科学数据中心体系架构设计、科学大数据管理与共享以及领域应用等工作。
    本文中负责新平台总体架构设计与技术路线。
    CHEN Xin, Ph.D, is a senior engineer of Computer Network Information Center of CAS. Her current research interests include scientific data integration and sharing, the framework of data infrastructure, and data visual analytics.
    In this paper, she is responsible for platform architecture design and technical roadmap. E-mail: chx@cnic.cn|何洪林,中国科学院地理科学与资源研究所,研究员,中国科学院大学岗位教授,博士生导师,中国科学院现有关键技术人才,国家重点研发项目首席科学家,现任科技部国家生态科学数据中心主任。发表相关学术论文100余篇,其中SCI 50余篇,编写专著3部。获得国家科技进步二等奖 2 项,环保部科技进步一等奖 1 项。主要研究方向为生态信息学、生态系统模型数据融合、生态遥感研究。
    本文中负责整体把关、数据中心业务指导。
    HE Honglin is a professor at the Institute of Geographic Sci-ences and Natural Resources Research, Chinese Academy of Sciences and the University of Chinese Academy of Sciences. He is currently the Director of National Ecosystem Science Data Center, Ministry of Science and Technology. He has completed 3 monographs and published more than 100 papers of which more than 50 are indexed by SCI. His research has won two second-class National Science and Technology Pro-gress Award and one first-class Science and Technology Pro-gress Award of the Ministry of Environmental Protection. His main research interests include ecoinformatics, ecosystem mo-deldata fusion, and ecological remote sensing.
    In this paper, he is responsible for overall control and business guidance for the National Ecosystem Science Data Center. E-mail: hehl@igsnrr.ac.cn|郭学兵,中国科学院地理科学与资源研究所,生态系统网络观测与模拟重点实验室,高级工程师,主要研究方向为生态信息学。
    本文中负责数据中心业务需求分析,平台数据内容建设。
    GUO Xuebing is a senior engineer at the Key Laboratory of Ecosystem Network Observation and Modeling, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences. Her main research interest covers Eco-Informatics.
    In this paper, she is responsible for business requirements ana-lysis and construction of ecosystem data on Eco-Cloud. E-mail: guoxb@igsnrr.ac.cn|苏文,中国科学院地理科学与资源研究所,生态系统网络观测与模拟重点实验室,高级工程师,主要研究方向为生态信息学,长期从事生态网络科学数据的整合、加工、管理和共享服务,依托生态信息科学和技术,服务于中国生态系统研究网络(CERN)、国家生态系统观测研究网络(CNERN)的信息化研究与实践。
    本文中负责数据中心业务需求分析,平台数据内容建设。
    SU Wen is a senior engineer at the Key Laboratory of Eco-system Network Observation and Modeling, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences. Her research interests include ecoinfor-matics. She has been engaged in the integration, processing, mana-gement and sharing services of ecological network scientific data, relying on ecological information science and technology, serving the informatization research and practice of CERN and CNERN.
    In this paper, she is responsible for business requirements analysis and ecosystem data publishing on Eco-Cloud. E-mail: suw@igsnrr.ac.cn|谢传节,中国科学院地理科学与资源研究所,资源与环境信息系统国家重点实验室,副研究员,硕士生导师,主要从事分布式地理信息系统研究,在空间数据并行计算、地理大数据分析、生态物联网等方面取得了技术发明专利和软件著作权。
    本文中负责生态站综合信息管理系统与全国生态站定位监测业务动态可视化系统设计。
    XIE Chuanjie ia an associate researcher and master tutor of the State Key Laboratory of Resources and Environmental Information System, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences. His research interests include distributed geographic infor-mation systems. He obtained technical invention patents and software copyrights in spatial data parallel computing, geo-graphic big data analysis, ecological Internet of Things, etc.
    In this paper, he is responsible for design of Synthesis information management system and Ecosystem network visua-lization system for ecological stations. E-mail: xiecj@lreis.ac.cn|沈志宏,中国科学院计算机网络信息中心,大数据技术与应用发展部主任,正研级高工,博士,博士生导师,大数据分析与计算技术国家地方联合工程实验室总工程师,主要研究领域为大数据、图数据管理、语义网等。
    本文中负责新平台总体架构设计与技术路线。
    SHEN Zhihong is a professor and supervisor of Big Data Tech-nology and Application Development Department, CNIC, CAS. His research interests include big data, graph data management and semantic web, etc.
    In this paper, he is responsible for platform architecture design and technical roadmap. E-mail: bluejoe@cnic.cn|张黎,中国科学院地理科学与资源研究所,副研究员,中国科学院大学岗位教师,硕士生导师,主要研究方向为陆地生态系统碳氮水循环过程模拟、模型数据融合和生态系统评估。在国内外学术期刊上发表论文80余篇,参与编写专著6部,获得软件著作权8项。
    本文中负责数据中心业务指导、生态模型数据融合研究。
    ZHANG Li is an associate professor at the Key Laboratory of Ecosystem Network Observation and Modeling, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences. Her research interests include process-based modelling of terrestrial carbon, nitrogen, and water cycles, model-data fusion, and ecosystem assessment.
    In this paper, she is responsible for business guidance and eco-system model data fusion research. E-mail: li.zhang@igsnrr.ac.cn|任小丽,中国科学院地理科学与资源研究所,副研究员,硕士生导师,中国科学院地理资源研究所“秉维优秀青年人才”获得者,在国内外重要学术刊物上发表论文50余篇,参编专著2部、译著1部。主要研究方向为碳循环模型数据融合和生态信息学。
    本文中负责数据同化与生态预测系统架构与设计。
    REN Xiaoli, is an associate professor and master’s supervisor at the Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, winner of “Bingwei excellent young talents”. She has published more than 50 papers in academic journals at home and abroad, and parti-cipated in the compilation of 2 monographs and 1 translation. Her main research interests include carbon cycle model data fusion and ecological informatics.
    In this paper, she is responsible for design of ecosystem data assimilation and prediction system. E-mail: renxl@igsnrr.ac.cn|侯艳飞,中国科学院地球科学与资源研究所,生态系统网络观测与模拟重点实验室,高级工程师,主要研究方向为科学数据组织管理与开放共享。
    本文中负责数据中心业务需求分析,平台数据内容建设。
    HOU Yanfei is a senior engineer at Key Laboratory of Eco-system Network Observation and Modeling, Institute of Geo-graphic Sciences and Natural Resources Research, Chinese Academy of Sciences. Her research interests include scientific data management and sharing.
    In this paper, she is responsible for business requirements an-alysis and ecosystem data publishing on Eco-Cloud. E-mail: houyf@cern.ac.cn|刘峰,中国科学院计算机网络信息中心,大数据技术与应用发展部,项目研究员,博士,硕士生导师。主要从事科学数据组织管理、数据发布共享、数据资源服务体系等方面的研究及集成软件平台的研发。
    本文中负责新平台整体集成与实现。
    LIU Feng is a project professor of Big Data Technology and Application Development Department, CNIC, CAS. His main research interests include software development of scientific data organization, management, publishing and sharing.
    In this paper, he is responsible for platform integration and implementation. E-mail: liufeng@cnic.cn
  • 基金资助:
    国家重点研发计划(2019YFE0126500)

Design and Implementation of a New Eco-Cloud Platform for National Ecosystem Science Data Center

TANG Xinzhai1,5(),CHEN Xin2(),HE Honglin1,3,5,*(),GUO Xuebing1,5(),SU Wen1,5(),XIE Chuanjie4(),SHEN Zhihong2(),ZHANG Li1,3,5(),REN Xiaoli1,3,5(),HOU Yanfei1,5(),LIU Feng2()   

  1. 1. Key Laboratory of Ecosystem Network Observation and Modeling, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China
    2. Computer Network Information Center, Chinese Academy of Sciences, Beijing 100083, China
    3. College of Resources and Environment, University of Chinese Academy of Sciences, Beijing 100049, China
    4. State Key Laboratory of Resources and Environmental Information System, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China
    5. National Ecosystem Science Data Center, Beijing 100101, China
  • Received:2021-10-07 Online:2022-02-20 Published:2022-03-04
  • Contact: HE Honglin

摘要:

【目的】构建高效的信息化云平台,是实现多源异构生态大数据仓储与开放共享的重要支撑手段。【方法】结合生态大数据的演变、国家科学数据政策的影响,按照科学数据全生命周期管理过程,分析了国家生态科学数据中心现有信息化平台面临的问题和挑战,尝试采用领域驱动设计方法,开展生态科学数据汇聚微服务拆分。【结果】基于“开放汇聚、协同管理、智慧服务”理念,提出了新一代“生态网络云”大数据平台(Eco-Cloud)的总体架构设计,结合当前需求从多源数据汇交、统一存储管理、数据加工与挖掘分析、服务与展现四个层次给出了主要系统组成及其应用场景。【结论】新平台有助于推动生态科学数据多源开放汇聚、资产化管理,提升生态科学数据分析能力与共享服务水平。

关键词: 生态网络云, 科学数据中心, 全生命周期管理, 数据汇聚, 资源服务

Abstract:

[Objective] Information cloud platform plays an important role supporting multi-source heterogeneous ecosystem scientific data storage and open sharing. [Methods] We analyze the problems and challenges faced by the current information platform of the National Ecosystem Science Data Center in view of the evolution of ecosystem scientific data and the impact of national scientific data policies. A domain-driven design method is adopted to carry out microservice recognition. [Results] Based on the concept of “Open convergence, Collaborative management, and intelligent services”, we propose the design of a new ecosystem network cloud platform (Eco-Cloud), and present main system composition and application scenarios from four levels: multi-source data convergence, unified storage management, data processing and mining, sharing and presentation. [Conclusions] The new platform will help promote the level of ecological scientific data convergence, management, analysis, and sharing.

Key words: eco-cloud platform, science data center, life cycle management, data convergence, resources sharing