Correlation inference attacks against machine learning models

descriptionPublicationkeyboard_double_arrow_right Article , Other literature type , Preprint 12 Jul 2024Embargo end date: 01 Jan 2021 United Kingdom English Publisher:American Association for the Advancement of Science (AAAS)Journal:Science Advances, volume 10 (eissn: 2375-2548,

Copyright policy )Funded by:UKRI | PETRAS 2

Authors: Ana-Maria Creţu; Florent Guépin; Yves-Alexandre de Montjoye;

doi: 10.1126/sciadv.adj9260 , 10.48550/arxiv.2112.08806

pmid: 38985874

arXiv: 2112.08806

handle: 10044/1/112597

Correlation inference attacks against machine learning models

- Summary
- Subjects
- Related research
  (1)
- Metrics

Abstract

Despite machine learning models being widely used today, the relationship between a model and its training dataset is not well understood. We explore correlation inference attacks, whether and when a model leaks information about the correlations between the input variables of its training dataset. We first propose a model-less attack, where an adversary exploits the spherical parameterization of correlation matrices alone to make an informed guess. Second, we propose a model-based attack, where an adversary exploits black-box model access to infer the correlations using minimal and realistic assumptions. Third, we evaluate our attacks against logistic regression and multilayer perceptron models on three tabular datasets and show the models to leak correlations. We lastly show how extracted correlations can be used as building blocks for attribute inference attacks and enable weaker adversaries. Our results raise fundamental questions on what a model does and should remember from its training set.

Country

United Kingdom

Related Organizations

View all View all

Keywords

FOS: Computer and information sciences, Computer Science - Machine Learning, Computer Science - Cryptography and Security, Social and Interdisciplinary Sciences and Public Health, Cryptography and Security (cs.CR), 004, Machine Learning (cs.LG)

1 Research products, page 1 of 1

ml-correlation-inference software on GitHub
IsRelatedTo

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	2
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average