User k-anonymity for privacy preserving data mining of query logs

Guillermo Navarro-Arribas, Vicen Torra, Arnau Erola, Jordi Castellà-Roca

Research output: Contribution to journalArticleResearchpeer-review

43 Citations (Scopus)

Abstract

The anonymization of query logs is an important process that needs to be performed prior to the publication of such sensitive data. This ensures the anonymity of the users in the logs, a problem that has been already found in released logs from well known companies. This paper presents the anonymization of query logs using microaggregation. Our proposal ensures the k-anonymity of the users in the query log, while preserving its utility. We provide the evaluation of our proposal in real query logs, showing the privacy and utility achieved, as well as providing estimations for the use of such data in data mining processes based on clustering. © 2011 Elsevier Ltd. All rights reserved.
Original languageEnglish
Pages (from-to)476-487
JournalInformation Processing and Management
Volume48
Issue number3
DOIs
Publication statusPublished - 1 May 2012

Keywords

  • Clustering
  • k-Anonymity
  • Microaggregation
  • Privacy
  • Query log
  • Web search

Fingerprint Dive into the research topics of 'User k-anonymity for privacy preserving data mining of query logs'. Together they form a unique fingerprint.

Cite this