Prompt priming risk for AI
Last updated: Dec 12, 2024
Risks associated with input
Inference
Multi-category
New to generative AI

Description

Because generative models tend to produce output that resembles the input they are given, a model can be prompted to reveal specific kinds of information. For example, including personal information in the prompt increases the likelihood that the model generates similar kinds of personal information in its output. If personal data was included in the model’s training data, that data could be revealed.
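As an illustration of the kind of input that can prime a model this way, the following minimal sketch flags and redacts personal-information-like strings in a prompt before it is sent to a model. The pattern set and the function names (`find_priming_candidates`, `redact`) are assumptions made for this example and are not part of any product API; a real deployment would likely need much broader detection than three regular expressions.

```python
import re
from typing import Dict, List

# Hypothetical, deliberately small pattern set for illustration only.
# Real PII detection needs far broader coverage than this.
PII_PATTERNS = {
    "email": re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}"),
    "us_phone": re.compile(r"\b\d{3}[-.\s]\d{3}[-.\s]\d{4}\b"),
    "us_ssn": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
}


def find_priming_candidates(prompt: str) -> Dict[str, List[str]]:
    """Return PII-like substrings found in the prompt, grouped by pattern name."""
    hits: Dict[str, List[str]] = {}
    for name, pattern in PII_PATTERNS.items():
        matches = pattern.findall(prompt)
        if matches:
            hits[name] = matches
    return hits


def redact(prompt: str) -> str:
    """Replace each PII-like substring with a placeholder such as [EMAIL]."""
    for name, pattern in PII_PATTERNS.items():
        prompt = pattern.sub(f"[{name.upper()}]", prompt)
    return prompt


if __name__ == "__main__":
    example = "Contact jane.doe@example.com or 555-123-4567 about the account."
    print(find_priming_candidates(example))
    print(redact(example))
```

Redacting before inference removes the personal details that would otherwise steer the model toward producing similar details, at the cost of losing that context in the prompt. Whether that trade-off is acceptable depends on the application.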

Why is prompt priming a concern for foundation models?

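Like jailbreaking attacks, prompt priming can be used to alter model behavior and benefit the attacker, for example by steering the model toward revealing personal data from its training.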


We provide examples covered by the press to help explain many of the risks of foundation models. Many of these events are either still evolving or have been resolved, and referencing them can help the reader understand the potential risks and work toward mitigations. These examples are highlighted for illustrative purposes only.
