Attribute inference attack risk for AI

Last updated: Dec 12, 2024

Risks associated with input

Inference

Privacy

Amplified by generative AI

Description

An attribute inference attack repeatedly queries a model to determine whether sensitive attributes can be inferred about individuals whose data was used to train the model. Such an attack is possible when an adversary has some prior knowledge about the training data and uses that knowledge to infer the sensitive data.
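The querying pattern described above can be illustrated with a minimal sketch. It assumes the adversary already knows a person's non-sensitive features and the model's expected label for that person, and can query the model for a probability. The function `query_model`, its weights, and the feature names (`age_bucket`, `income_bucket`, `has_condition`) are hypothetical and invented for this example; they do not come from any real system.

```python
import math

# Hypothetical stand-in for a deployed model: returns P(label = 1 | record).
# The weights are made up for illustration; a real adversary would only
# observe the outputs, not the formula.
def query_model(record):
    score = (0.8 * record["age_bucket"]
             - 0.5 * record["income_bucket"]
             + 2.0 * record["has_condition"]
             - 1.0)
    return 1.0 / (1.0 + math.exp(-score))

def infer_attribute(known_features, known_label, attribute, candidates):
    """Try each candidate value for the sensitive attribute and keep the one
    under which the model is most confident in the label the adversary
    already knows."""
    best_value, best_conf = None, -1.0
    for value in candidates:
        # Complete the partial record with the candidate sensitive value.
        record = dict(known_features)
        record[attribute] = value
        p_one = query_model(record)
        conf = p_one if known_label == 1 else 1.0 - p_one
        if conf > best_conf:
            best_value, best_conf = value, conf
    return best_value, best_conf

# Prior knowledge assumed by this sketch: two non-sensitive features and the label.
known = {"age_bucket": 1, "income_bucket": 2}
guess, confidence = infer_attribute(
    known, known_label=1, attribute="has_condition", candidates=[0, 1]
)
print(guess, round(confidence, 2))
```

In this toy setup the sensitive attribute strongly influences the model's output, so a handful of queries is enough to make a guess. Against a real model the signal is typically weaker, and an adversary would need many more queries and some way to calibrate the result.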

Why is an attribute inference attack a concern for foundation models?

A successful attack can give the attacker valuable information, such as sensitive personal information or intellectual property.


We provide examples covered by the press to help explain many of the risks of foundation models. Many of these events are either still evolving or have been resolved, and referencing them can help the reader understand the potential risks and work toward mitigations. These examples are highlighted for illustrative purposes only.