Streaming Data Collection With a Private Sketch-Based Protocol

Ying Li, Xiaodong Lee,Botao Peng,Themis Palpanas,Jingan Xue

IEEE Internet of Things Journal(2024)

Cited 0|Views3
No score
Abstract
Data stream collection is critical to analyze service conditions and detect anomalies in time, especially in Internet of Things. However, it may undermine the individual privacy. Local differential privacy (LDP) has recently become a popular privacy-preserving technique protecting users’ privacy. However, most of them are still limited to the assumption of one-item collection, resulting in poor utility when extended to the multi-item collection from a very large domain. This paper proposes a private streaming data collection framework, PSF, which takes advantage of sketches. Combining the proposed background information and a decode-first collection-side workflow, the framework improves the utility by reducing the errors introduced by the sketching algorithm and the privacy budget utilization when collecting multiple items. We analytically prove the superior accuracy and privacy characteristics of PSF. In order to support specific computing tasks, we build two private protocols based on PSF, PrivSketch and PrivSketch+, aiming at frequency estimation and mean estimation, respectively. We demonstrate the utility of PrivSketch and PrivSketch+ theoretically, and also evaluate them experimentally. Our evaluation, with several diverse synthetic and real datasets, demonstrates that PrivSketch is 1-3 orders of magnitude better than the competitors in terms of utility in both frequency estimation and frequent item estimation, while being up to 100x faster. PrivSketch+ performs 4 orders of magnitude better than advanced solutions, such as Piecewise Mechanism (PM) and Hybrid Mechanism (HM), under a limited privacy budget.
More
Translated text
Key words
LDP,Sketch,Frequency estimation,Mean estimation
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined