trademark
2c2kIhh
2c2kIhh
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
6.0<br />
Scalable and Composable Privacy-<br />
Preserving Analytics<br />
Studies show that anonymizing data for analytics is insufficient for ensuring user privacy.<br />
Below are the best techniques to ensure privacy in a big data environment.<br />
6.1 Implement differential privacy<br />
6.1.1 Why?<br />
To protect privacy even when data is linked with external data sources. Anonymizing<br />
public records have failed when researchers manage to identify personal information by<br />
linking two or more separately innocuous databases<br />
6.1.2 How?<br />
Differential privacy [Dwo06] aims to provide a means to maximize the accuracy of<br />
queries from statistical databases while minimizing the chances of identifying its<br />
records. Differential privacy is the mathematical concept to measure how much (or how<br />
little) anonymity is preserved on a database. For example, adding random noise is a<br />
method to achieve some level of differential privacy. Users are encouraged to use the<br />
appropriate mechanism for a given use.<br />
6.2 Implement Utilize homomorphic encryption<br />
6.2.1 Why?<br />
To enable encrypted data to be stored and processed on the cloud. Data stored in<br />
plaintext on the cloud may be compromised and cause privacy risks. On the other hand,<br />
when only encrypted data is stored on the cloud, utility of data is significantly limited.<br />
6.2.2 How?<br />
Homomorphic encryption is a form of encryption that allows specific types of<br />
computations to be carried out on ciphertext. The method allows users to obtain an<br />
encrypted result that, when decrypted, matches the result of operations performed on<br />
6.1.1 Why?<br />
CLOUD SECURITY ALLIANCE Big Data Working Group Guidance<br />
© Copyright 2016, Cloud Security Alliance. All rights reserved.<br />
32