Frontiers of Data and Computing ›› 2023, Vol. 5 ›› Issue (5): 74-97.

CSTR: 32002.14.jfdc.CN10-1649/TP.2023.05.007

doi: 10.11871/jfdc.issn.2096-742X.2023.05.007

• Special Issue: Key Technologies for Safe and Efficient Circulation of Data Elements • Previous Articles     Next Articles

A Survey on Local Differential Privacy

SUN Yifan1,2(),ZHANG Rui1,2,*(),TAO Yang1,2,GAO Birou1,2,QIN Shihan1,2,AN Chao1,2   

  1. 1. SKLOIS, Institute of Information Engineering, Beijing 100085, China
    2. School of Cyber Security, University of Chinese Academy of Sciences, Beijing 100049, China
  • Received:2023-05-04 Online:2023-10-20 Published:2023-10-31

Abstract:

[Objective] This paper systematically introduces local differential privacy and provides a reference for the protection of personal data privacy under data sharing and publishing. [Coverage] This paper investigates and summarizes papers from mainstream conferences and journals in the field of local differential privacy. [Methods] This paper takes the type of statistical data analysis task as a clue and conducts research based on local differential privacy from four aspects, which concludes frequency estimation, mean estimation, multidimensional data statistical analysis, and machine learning. This paper makes a comparative analysis of relevant studies, summarizes key issues, discusses the shortcomings of existing work, and looks forward to future research directions. [Results] The local differential privacy model can provide strong privacy protection for users' personal data privacy when user data is collected and analyzed. [Limitations] This paper takes the type of statistical data analysis task as a clue and does not summarize the research related to graph data. [Conclusions] Local differential privacy, as an excellent privacy-preserving model, has developed rapidly after gaining the attention of scholars. But it still faces many problems and challenges, which are worthy of further research and exploration.

Key words: local differential privacy, frequency estimation, mean estimation, multidimensional data statistical analysis, machine learning