TY - GEN
T1 - OptRR
T2 - 2008 IEEE 24th International Conference on Data Engineering, ICDE'08
AU - Huang, Zhengli
AU - Du, Wenliang
PY - 2008
Y1 - 2008
N2 - The randomized response (RR) technique is a promising technique to disguise private categorical data in Privacy-Preserving Data Mining (PPDM). Although a number of RR-based methods have been proposed for various data mining computations, no study has systematically compared them to find optimal RR schemes. The difficulty of comparison lies in the fact that to compare two PPDM schemes, one needs to consider two conflicting metrics: privacy and utility. An optimal scheme based on one metric is usually the worst based on the other metric. In this paper, we first describe a method to quantify privacy and utility. We formulate the quantification as estimate problems, and use estimate theories to derive quantification. We then use an evolutionary multi-objective optimization method to find optimal disguise matrices for the randomized response technique. The experimental results have shown that our scheme has a much better performance than the existing RR schemes.
AB - The randomized response (RR) technique is a promising technique to disguise private categorical data in Privacy-Preserving Data Mining (PPDM). Although a number of RR-based methods have been proposed for various data mining computations, no study has systematically compared them to find optimal RR schemes. The difficulty of comparison lies in the fact that to compare two PPDM schemes, one needs to consider two conflicting metrics: privacy and utility. An optimal scheme based on one metric is usually the worst based on the other metric. In this paper, we first describe a method to quantify privacy and utility. We formulate the quantification as estimate problems, and use estimate theories to derive quantification. We then use an evolutionary multi-objective optimization method to find optimal disguise matrices for the randomized response technique. The experimental results have shown that our scheme has a much better performance than the existing RR schemes.
UR - http://www.scopus.com/inward/record.url?scp=52649154023&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=52649154023&partnerID=8YFLogxK
U2 - 10.1109/ICDE.2008.4497479
DO - 10.1109/ICDE.2008.4497479
M3 - Conference contribution
AN - SCOPUS:52649154023
SN - 9781424418374
T3 - Proceedings - International Conference on Data Engineering
SP - 705
EP - 714
BT - Proceedings of the 2008 IEEE 24th International Conference on Data Engineering, ICDE'08
Y2 - 7 April 2008 through 12 April 2008
ER -