Downloads provided by UsageCounts
handle: 10261/160003 , 2117/12855
Microaggregation is a protection method used by statistical agencies to limit the disclosure risk of confidential information. Formally, microaggregation assigns each original datum to a small cluster and then replaces the original data with the centroid of such cluster. As clusters contain at least k records, microaggregation can be considered as preserving k-anonymity. Nevertheless, this is only so when multivariate microaggregation is applied and, moreover, when all variables are microaggregated at the same time. When different variables are protected using univariate microaggregation, k-anonymity is only ensured at the variable level. Therefore, the real k-anonymity decreases for most of the records and it is then possible to cause a leakage of privacy. Due to this, the analysis of the disclosure risk is still meaningful in microaggregation. This paper proposes a new record linkage method for univariate microaggregation based on finding the optimal alignment between the original and the protected sorted variables. We show that our method, which uses a DTW distance to compute the optimal alignment, provides the intruder with enough information in many cases to to decide if the link is correct or not. Note that, standard record linkage methods never ensure the correctness of the linkage. Furthermore, we present some experiments using two well-known data sets, which show that our method has better results (larger number of correct links) than the best standard record linkage method. © 2009 Ohmsha and Springer Japan jointly hold copyright of the journal.
Partial support by the Spanish MEC (projects ARES – CONSOLIDER INGENIO 2010 CSD2007-00004 – and eAEGIS – TSI2007-65406-C03-02) and by the Government of Catalunya (grant 2005-SGR-00093) is acknowledged. Jordi Nin thanks the Spanish National Research Council (CSIC) for his I3P grant.
Peer Reviewed
privacy on statistical databases, Database theory, Microaggregation, privacy preserving data mining, DTW Distance, Privac Preserving Data Mining, Àrees temàtiques de la UPC::Informàtica::Seguretat informàtica, Protecció de dades, Record linkage, :Informàtica::Seguretat informàtica [Àrees temàtiques de la UPC], Privacy preserving data mining, DTW distance, record linkage, Privacy on statistical databases, Record Linkage, microaggregation, Data protection
privacy on statistical databases, Database theory, Microaggregation, privacy preserving data mining, DTW Distance, Privac Preserving Data Mining, Àrees temàtiques de la UPC::Informàtica::Seguretat informàtica, Protecció de dades, Record linkage, :Informàtica::Seguretat informàtica [Àrees temàtiques de la UPC], Privacy preserving data mining, DTW distance, record linkage, Privacy on statistical databases, Record Linkage, microaggregation, Data protection
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 11 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
| views | 55 | |
| downloads | 17 |

Views provided by UsageCounts
Downloads provided by UsageCounts