Frontiers of Data and Computing ›› 2024, Vol. 6 ›› Issue (4): 22-33.

CSTR: 32002.14.jfdc.CN10-1649/TP.2024.04.002

doi: 10.11871/jfdc.issn.2096-742X.2024.04.002

• Special Issue: Fundamental Software Stack and Systems for National Scientific Data Centers • Previous Articles     Next Articles

A Technical Framework of Cross-Center Trusted Sharing of Scientific Data for the New Paradigm of Convergence Science

YANG Jingru1,2(),CAI Huaqian1,2,*(),YANG Yong1,2,LI Ying3,LIU Jia4   

  1. 1. National Key Laboratory of Dataspace Technology and System, Beijing 100091, China
    2. School of Computer Science, Peking University, Beijing 100871, China
    3. School of Software and Microelectronics, Peking University, Beijing 100871, China
    4. Computer Network Information Center, Chinese Academy of Sciences, Beijing 100083, China
  • Received:2024-02-05 Online:2024-08-20 Published:2024-08-20

Abstract:

[Objective] The advent of big data has given rise to a new research paradigm, termed the "Convergence Science" paradigm, which addresses significant technological challenges through the fusion of multidisciplinary data. Collaborative analysis and application of scientific data across disciplines have become crucial for maximizing its value, making cross-center trustworthy data sharing a key issue in constructing scientific data centers. [Methods] Considering the characteristics and challenges of scientific data centers, including heterogeneous data from multiple sources, large volumes, dispersed resources, strong expertise, and clear intellectual property rights, this paper proposes a technical framework for cross-center trustworthy sharing of scientific data. This framework includes key technologies such as scientific data modeling and interoperability methods, dual-identifier fusion resolution, trustworthy storage and certification, and data ownership and circulation traceability. [Results] This framework's effectiveness has been confirmed in the context of data sharing spanning five scientific data centers. [Conclusions] This framework provides a feasible technical path for cross-center trusted sharing of scientific data in the new paradigm of convergence science.

Key words: scientific data, convergence science, trusted sharing, interoperability, identifier resolution