Privacy-preserving collaborative filtering

Huseyin Polat, Wenliang Du

Research output: Contribution to journalArticlepeer-review

75 Scopus citations

Abstract

Collaborative filtering (CF) techniques are becoming very popular on the Internet and are widely used in several domains to cope with information overload. E-commerce sites use filtering systems to recommend products to customers based on the preferences of like-minded customers, but their systems do not protect user privacy. Because users concerned about privacy may give false information, it is not easy to collect high-quality user data for collaborative filtering, and recommendation systems using poor data produce inaccurate recommendations. This means that privacy measures are key to the success of collecting high-quality data and providing accurate recommendations. This article discusses col laborative filtering with privacy based on both correlation and singular-value decomposition (SVD) and proposes the use of randomized perturbation techniques to protect user privacy while producing reasonably accurate recommendations. Such techniques add randomness to the original data, preventing the data collector (the server) from learning private user data, but this scheme can still provide accurate recommendations. Experiments were conducted with real datasets to evaluate the overall performance of the proposed scheme. The results were used for analysis of how different parameters affect accuracy. Collaborative filtering systems using randomized perturbation techniques were found to provide accurate recommendations while preserving user privacy.

Original languageEnglish (US)
Pages (from-to)9-35
Number of pages27
JournalInternational Journal of Electronic Commerce
Volume9
Issue number4
DOIs
StatePublished - 2005

Keywords

  • Accuracy
  • Collaborative filtering
  • Privacy
  • Randomized perturbation
  • SVD

ASJC Scopus subject areas

  • Business and International Management
  • Economics and Econometrics

Fingerprint

Dive into the research topics of 'Privacy-preserving collaborative filtering'. Together they form a unique fingerprint.

Cite this