Frontiers of Data and Computing ›› 2019, Vol. 1 ›› Issue (1): 73-81.doi: 10.11871/jfdc.issn.2096.742X.2019.01.008

Special Issue: “数据与计算平台”专刊

Previous Articles     Next Articles

Survey on Federated RDF Systems

Peng Peng1,Lei Zou2   

  1. 1.College of Computer Science and Electronic Engineering, Hunan University, Changsha, Hunan 410082, China
    2.Wangxuan Institute of Computer Technology, Peking University, Beijing 100080, China
  • Received:2019-08-30 Online:2019-01-20 Published:2019-10-09

Abstract:

[Objective] Resource Description Framework (RDF), a standard model for knowledge representation, has been widely used in various scientific data management applications to represent the scientific data as a knowledge graph. Meanwhile, Simple Protocol And RDF Query Language (SPARQL) is a structured query language to access RDF repository. As more and more data publishers release their datasets in the model of RDF, how to integrate the RDF datasets provided by different data publishers into a federated RDF system becomes a challenge. [Coverage] In this paper we provide an overview of the studies of federated RDF systems. [Methods]The major differences among different federated RDF systems are different strategies for source selection guided query decomposition and query processing optimization. [Results] Existing query decomposition and source selection strategies in federated RDF systems can be divided into two categories: metadata-based and ASK-based strategies; Query optimization strategies in existing federated RDF systems are some joint optimizations based on System-R style dynamic programming. [Limitations] Existing federated RDF systems still do not discuss how to support SPARQL 1.1. [Conclusions] Federated RDF systems can integrate distributed RDF graphs among different sources, which means that it is an important future research direction.

Key words: Federated RDF Systems, SPARQL query evaluation, query optimization