16.01.2014 Views

Privacy with Prior Information

Privacy with Prior Information

Privacy with Prior Information

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

• assume every secret pair is of the form (s, ¬s) and Pr(s|θ)/ Pr(¬s|θ) ∈ [1/c, c] for some constant c.<br />

This is reasonable in that, secret s is only worthy to protect if the adversary doesn’t know much about<br />

it in advance.<br />

• ɛ and δ are parameters<br />

The algorithm works as follows:<br />

1. set α to some initial noise level α 0<br />

2. sample O((c + 1)k) = O(k) databases independently from θ<br />

3. compute the output for each sample and add noise Lap(α)<br />

4. For every pair of secrets (s, ¬s)<br />

5. find O s and O ¬s : outputs for samples <strong>with</strong> s and ¬s true<br />

6. for every possible output w<br />

7. denote p ws = Pr(w|s, θ) and p w¬s = Pr(w|¬s, θ)<br />

8. estimate ¯p ws and ¯p ws ′ using fraction of w in O s and O ¬s .<br />

9. if ¯p ws > e ɛ ¯p ws ′ + δ/2<br />

10. set α = 2α and go to step 3<br />

11. end for<br />

12. end for<br />

13. use α as the noise level for any future query.<br />

Before discussing the running time of the above procedure, we first show the accuracy of the estimation.<br />

Claim 2. Set the size of the sample k = O( (1+eɛ ) 2<br />

δ<br />

log U), where U is the upper bound of the size of the<br />

2<br />

output range and the number of pairs of secrets. Let α ɛ and α ɛ,δ be the minimum noise levels that guarantee<br />

ɛ-Pufferfish and (ɛ, δ)-Pufferfish privacy. With high probability, our algorithm will output a noise level α<br />

such that α ɛ,δ ≤ α ≤ α ɛ .<br />

Proof. Denote S as the set of databases that satisfy secret s and θ and ¯S as θ − S. Firstly, given 2(c + 1)k<br />

independent samples from θ, <strong>with</strong> probability at least 1 − 1 , at least k samples are from each of S or ¯S.<br />

U O(1)<br />

Fix some output w and some secret s. Let X i be the indicator random variable of the event that the i-th<br />

sample outputs w. Note that E[X i ] = p ws . Let X = ∑ k<br />

i=1 X i. We have<br />

Pr(|¯p ws − p ws | ≥ ∆) = Pr(|X/k − p ws | ≥ ∆)<br />

≤ 2e −2k∆2 Set ∆ =<br />

≤ 1<br />

U O(1)<br />

δ<br />

2(1 + e ɛ )<br />

Therefore, the probability that the estimated ¯p ws is more than ∆ away from its true value for any w and s<br />

is at most 1/U. In other words, <strong>with</strong> high probability, all estimates are <strong>with</strong>in ∆ distance away from their<br />

true values.<br />

6

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!