Quality stages in DataStage
Use the QualityStage stages in DataStage to investigate, cleanse, and manage your data.
With the QualityStage stages, also known as data quality
stages, you can manipulate your data in the following ways:
- Resolve data conflicts and ambiguities.
- Uncover new or hidden attributes from free-form or loosely controlled source columns.
- Conform data by transforming data types into a standard format.
Stage functions
The following table lists the available stages and gives details on their functions:
Stage | Icon | Function |
---|---|---|
Data rules | Checks data quality anywhere in the flow of a job. | |
Investigate | The character investigation type of Investigate stage analyzes and classifies data, parsing it into a single-pattern report. The word investigation type of Investigate stage uses a set of rules for classifying data such as personal names, business names, and addresses. | |
Match Frequency | Generates the frequency distribution of values for columns in the input data. You use the frequency distribution and the input data in match jobs. | |
One-source Match | Matches records from a single source file. | |
Standardize | Makes source data internally consistent, so each data type has the same kind of content and format. | |
Two-source Match | Compares two sources of input data (reference records and data records) for matches. |