Frontiers of Data and Computing ›› 2024, Vol. 6 ›› Issue (4): 77-86.

CSTR: 32002.14.jfdc.CN10-1649/TP.2024.04.006

doi: 10.11871/jfdc.issn.2096-742X.2024.04.006

• Special Issue: Fundamental Software Stack and Systems for National Scientific Data Centers • Previous Articles     Next Articles

Large Scale Dynamic Graph Version Management: Requirements, Technologies, and Challenges

ZENG Chenglin1,2(),WANG Huajin1,2,ZHU Xiaojie1,2,SHEN Zhihong1,2,*()   

  1. 1. Computer Network Information Center, The Chinese Academy of Sciences, Beijing 100083, China
    2. University of Chinese Academy of Sciences, Beijing 100049, China
  • Received:2024-02-05 Online:2024-08-20 Published:2024-08-20

Abstract:

[Objective] In the era of big data, from daily life and production to scientific research, a massive amount of dynamic graph data has been generated. Managing and analyzing this data can effectively assist people in process design, intelligent decision-making, and scientific research. [Coverage] This article used keywords such as dynamic graph, evolution graph, and version graph management to search on CNKI and Google Scholar, and collected dozens of relevant literature. This article classifies and summarizes relevant research literature based on three categories: data models, management systems, and mining analysis methods, and analyzes the current research status at home and abroad. [Results] A theoretical study was conducted on the spatial consumption of three mainstream storage strategies for dynamic graph data, and preliminary conclusions were obtained. Secondly, a deeper summary of existing dynamic graph query requirements was conducted from the perspective of sets. Finally, based on the number of classifications in the paper, it was found that current research on dynamic graphs focuses more on mining and analysis. [Limitations] The relevant literature collected in this article mainly includes attribute graphs, and RDF related literature has not been covered. [Conclusions] After analyzing the relevant requirements and technologies for large-scale dynamic graph version management, this article also proposes some challenges, including the high spatial inflation brought by multi version management of dynamic graphs, efficient random retrieval of specified versions, and precise characterization of evolution relationships between versions.

Key words: dynamic graph, version management, graph data