0 / 0
Harmful output risk for AI

Harmful output risk for AI

Alignment Icon representing alignment risks.
Risks associated with output
Value alignment
New to generative AI

Description

A model might generate language that leads to physical harm The language might include overtly violent, covertly dangerous, or otherwise indirectly unsafe statements.

Why is harmful output a concern for foundation models?

A model generating harmful output can cause immediate physical harm or create prejudices that might lead to future harm.

Parent topic: AI risk atlas

We provide examples covered by the press to help explain many of the foundation models' risks. Many of these events covered by the press are either still evolving or have been resolved, and referencing them can help the reader understand the potential risks and work towards mitigations. Highlighting these examples are for illustrative purposes only.

Generative AI search and answer
These answers are generated by a large language model in watsonx.ai based on content from the product documentation. Learn more