k-Degree anonymity and edge selection: improving data utility in large networks

Jordi Casas-Roma, Jordi Herrera-Joancomartí, Vicenç Torra

Research output: Contribution to journal › Article › Research › peer-review

35 Citations (Scopus)

Abstract

© 2016, Springer-Verlag London. This paper considers the problem of anonymization in large networks and the utility of the released data. Although several anonymization methods for networks exist, most of them cannot be applied to large networks because of their complexity. We devise a simple and efficient algorithm for k-degree anonymity in large networks. Our algorithm constructs a k-degree anonymous network using the minimum number of edge modifications. We compare our algorithm with other well-known k-degree anonymity algorithms and demonstrate that it lowers information loss in real networks. Moreover, we consider edge relevance in order to improve data utility on anonymized networks. By considering the neighbourhood centrality score of each edge, we preserve the most important edges of the network, reducing information loss and increasing data utility. We evaluate clustering processes on the output of our algorithm, showing that edge neighbourhood centrality increases data utility. Lastly, we apply our algorithm to several large real datasets and demonstrate its efficiency and practical utility.
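For readers unfamiliar with the property named in the abstract: a network is k-degree anonymous when every degree value that occurs in it is shared by at least k vertices. The following is a minimal sketch of a check for that property only, not the authors' anonymization algorithm. The function name `is_k_degree_anonymous` and the edge-list input format are assumptions made for illustration, and vertices that appear in no edge are ignored.

```python
from collections import Counter


def is_k_degree_anonymous(edges, k):
    """Return True if every degree value present is shared by at least k vertices.

    `edges` is an iterable of (u, v) pairs for an undirected graph.
    Hypothetical helper for illustration; isolated vertices are not counted.
    """
    # Compute the degree of every vertex that appears in some edge.
    degree = Counter()
    for u, v in edges:
        degree[u] += 1
        degree[v] += 1
    # Count how many vertices share each degree value.
    vertices_per_degree = Counter(degree.values())
    return all(count >= k for count in vertices_per_degree.values())


# A 4-cycle: all four vertices have degree 2.
cycle = [(0, 1), (1, 2), (2, 3), (3, 0)]
# A path on 4 vertices: degrees are 1, 2, 2, 1.
path = [(0, 1), (1, 2), (2, 3)]
# A star with 3 leaves: the centre has degree 3, each leaf has degree 1.
star = [(0, 1), (0, 2), (0, 3)]
```

Under this definition the cycle is 4-degree anonymous, the path is 2-degree anonymous but not 3-degree anonymous, and the star is not even 2-degree anonymous because its centre has a unique degree.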
Original language: English
Pages (from-to): 447-474
Journal: Knowledge and Information Systems
Volume: 50
Issue number: 2
Publication status: Published - 1 Feb 2017

Keywords

  • Data utility
  • Edge measures
  • Information loss
  • k-Anonymity
  • Privacy
  • Social networks
