A hybrid multi-group privacy-preserving approach for building decision trees

Zhouxuan Teng, Wenliang Du

Research output: Chapter in Book/Entry/PoemConference contribution

14 Scopus citations

Abstract

In this paper, we study the privacy-preserving decision tree building problem on vertically partitioned data. We made two contributions. First, we propose a novel hybrid approach, which takes advantage of the strength of the two existing approaches, randomization and the secure multi-party computation (SMC), to balance the accuracy and efficiency constraints. Compared to these two existing approaches, our proposed approach can achieve much better accuracy than randomization approach and much reduced computation cost than SMC approach. We also propose a multi-group scheme that makes it flexible for data miners to control the balance between data mining accuracy and privacy. We partition attributes into groups, and develop a scheme to conduct group-based randomization to achieve better data mining accuracy. We have implemented and evaluated the proposed schemes for the ID3 decision tree algorithm.

Original languageEnglish (US)
Title of host publicationAdvances in Knowledge Discovery and Data Mining - 11th Pacific-Asia Conference, PAKDD 2007, Proceedings
PublisherSpringer Verlag
Pages296-307
Number of pages12
ISBN (Print)9783540717003
DOIs
StatePublished - 2007
Event11th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2007 - Nanjing, China
Duration: May 22 2007May 25 2007

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume4426 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Other

Other11th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2007
Country/TerritoryChina
CityNanjing
Period5/22/075/25/07

Keywords

  • Privacy
  • Randomization
  • SMC

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science

Fingerprint

Dive into the research topics of 'A hybrid multi-group privacy-preserving approach for building decision trees'. Together they form a unique fingerprint.

Cite this