Abstract
We consider the problem of computing differentially private approximate histograms and heavy hitters in a stream of elements. In the non-private setting, this is often done using the sketch of Misra and Gries [Science of Computer Programming, 1982]. Chan, Li, Shi, and Xu [PETS 2012] describe a differentially private version of the Misra-Gries sketch, but the amount of noise it adds can be large and scales linearly with the size of the sketch: the more accurate the sketch is, the more noise this approach has to add. We present a better mechanism for releasing a Misra-Gries sketch under (ε,δ)-differential privacy. It adds noise with magnitude independent of the size of the sketch size, in fact, the maximum error coming from the noise is the same as the best known in the private non-streaming setting, up to a constant factor. Our mechanism is simple and likely to be practical. We also give a simple post-processing step of the Misra-Gries sketch that does not increase the worst-case error guarantee. It is sufficient to add noise to this new sketch with less than twice the magnitude of the non-streaming setting. This improves on the previous result for ε-differential privacy where the noise scales linearly to the size of the sketch.
Original language | English |
---|---|
Title of host publication | Proceedings of the 42nd ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems, PODS 2023 |
Place of Publication | New York |
Publisher | Association for Computing Machinery |
Publication date | 18 Jun 2023 |
Pages | 79-88 |
ISBN (Print) | 9798400701276 |
DOIs | |
Publication status | Published - 18 Jun 2023 |
Event | SIGMOD/PODS '23: International Conference on Management of Data - Seattle, United States Duration: 18 Jun 2023 → 23 Jun 2023 Conference number: 42 https://2023.sigmod.org/ |
Conference
Conference | SIGMOD/PODS '23: International Conference on Management of Data |
---|---|
Number | 42 |
Country/Territory | United States |
City | Seattle |
Period | 18/06/2023 → 23/06/2023 |
Internet address |
Keywords
- Differential Privacy
- Approximate Histograms
- Heavy Hitters
- Misra-Gries Sketch
- Streaming Algorithms