0 / 0
Data provenance risk for AI

Data provenance risk for AI

Transparency Icon representing transparency risks.
Risks associated with input
Training and tuning phase
Transparency
Amplified by generative AI

Description

Without standardized and established methods for verifying where data came from, there are no guarantees that available data is what it claims to be.

Why is data provenance a concern for foundation models?

Not all data sources are trustworthy. Data might have been unethically collected, manipulated, or falsified. Using such data can result in undesirable behaviors in the model. Business entities could face fines, reputational harms, and other legal consequences.

Parent topic: AI risk atlas

Generative AI search and answer
These answers are generated by a large language model in watsonx.ai based on content from the product documentation. Learn more