TY - GEN
T1 - Searching for better randomized response schemes for privacy-preserving data mining
AU - Huang, Zhengli
AU - Du, Wenliang
AU - Teng, Zhouxuan
PY - 2007
Y1 - 2007
N2 - To preserve user privacy in Privacy-Preserving Data Mining (PPDM), the randomized response (RR) technique is widely used for categorical data. Although various RR schemes have been proposed, there is no study to systematically compare them in order to find optimal RR schemes. In the paper, we choose the R-U (Risk-Utility) confidentiality map to compare different randomization schemes. Using the R-U map as our metric, we present an optimal RR scheme for binary data, which helps us find an optimal class of RR matrices. From this optimal scheme, we have discovered several heuristic rules among the elements in the optimal class. We generalize these rules to find optimal class of RR matrices for categorical data. Based on these rules, we propose an RR scheme to find a class of RR matrices for categorical data. Our experimental results have shown that our scheme has much better performance than the existing RR schemes.
AB - To preserve user privacy in Privacy-Preserving Data Mining (PPDM), the randomized response (RR) technique is widely used for categorical data. Although various RR schemes have been proposed, there is no study to systematically compare them in order to find optimal RR schemes. In the paper, we choose the R-U (Risk-Utility) confidentiality map to compare different randomization schemes. Using the R-U map as our metric, we present an optimal RR scheme for binary data, which helps us find an optimal class of RR matrices. From this optimal scheme, we have discovered several heuristic rules among the elements in the optimal class. We generalize these rules to find optimal class of RR matrices for categorical data. Based on these rules, we propose an RR scheme to find a class of RR matrices for categorical data. Our experimental results have shown that our scheme has much better performance than the existing RR schemes.
UR - http://www.scopus.com/inward/record.url?scp=38049173463&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=38049173463&partnerID=8YFLogxK
U2 - 10.1007/978-3-540-74976-9_50
DO - 10.1007/978-3-540-74976-9_50
M3 - Conference contribution
AN - SCOPUS:38049173463
SN - 9783540749752
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 487
EP - 497
BT - Knowledge Discovery in Database
PB - Springer Verlag
T2 - 11th European Conference on Principles and Practice of Knowledge Discovery in Databases, PKDD 2007
Y2 - 17 September 2007 through 21 September 2007
ER -