Managing data quality rules
You can create and manage data quality rules for assessing the quality of the data in your project.
If you have the required permissions, you can manage data quality rules in these ways:
- Create data quality rules
- Edit a data quality rule
- Delete data quality rules that you don't need anymore
You can also complete these tasks with APIs instead of the user interface. The links to these APIs are listed in the Learn more section.
To view data quality rules, you must have at least the Viewer role in the project.
To create, edit, or delete data quality rules, you must have the Admin or the Editor role in the project.
Project settings for rule execution
Project-level settings determine certain aspects of data quality rule execution, for example, whether trailing spaces in string values are ignored in equality checks. These settings apply to all data quality rules for a given project. You can check or update these settings for each project by using the Watson Data API Get project settings for data quality rules and Replace project settings for data quality rules.
Create data quality rules
You can create different types of data quality rules:
Editing data quality rules
You can edit a data quality rule to update its description, the selected data quality dimension, any business term assignments, or the rule configuration.
To edit a data quality rule, open the asset and perform the appropriate actions:
To update the description or the data quality dimensions, click the edit icon () next to the property.
To manage business terms, go to the Governance artifacts section of the asset and add or remove terms as needed.
To assign or delete governance rules, go to the Governance artifacts section of the asset, and add or remove governance rules as needed.
To update the rule configuration, click Edit rule. When you edit the rule configuration, you can also change the way the rule is built: from using data quality definitions to using SQL statements and vice versa. However, when you do that, all existing rule configuration is discarded and you basically must start from scratch.
You can also change the output type. Depending on your new selection, any configured output settings are reset or overwritten. Rule output that was written before the change remains untouched.
When you view a data quality rule, you can click the icon to view more details such as output settings or related assets.
Deleting data quality rules
You can delete a data quality rule in one of these ways:
- In the project, select the data quality rule and click Delete.
- Open the data quality rule and select Delete from the overflow menu next to the name of the data quality rule.
When you delete a data quality rule, its run history, any associated DataStage flow and jobs are also deleted from the project. Output tables in the project and in the database are kept. The issues that were returned by this data quality rule are removed, and the data quality and dimension scores are recalculated.
Parent topic: Managing data quality