Rule sets that are used by the Standardize stage
You can apply rule sets in the Standardize stage to create output columns that are consistent, meet industry standards, and that you can use in a variety of ways for data matching.
Rule sets check and normalize input data. The following categories
of rule sets are available:
- Country or region identifier rule sets read area information and attempt to identify the associated country or region.
- Domain preprocessor rule sets evaluate mixed-domain input, such as free-form name and address information, and categorize the data into domain-specific column sets.
- Domain-specific rule sets process free-form data from a single domain such as name, address, or area information.
- Validation rule sets generate business intelligence and reporting fields, and are applied to common business data such as dates, email addresses, and phone numbers.
The provided rule sets are designed for optimal results. However, if the results are not satisfactory, or if you want to create rule sets for other data domains, you can create a new rule set, copy an existing rule set, or modify an existing rule set. You can modify rule set behavior by enhancing the rule set in DataStage®, adding user overrides, or editing the rule set files directly.