0 / 0
Quick start: Protect data
Quick start: Protect data

Quick start: Protect data

You can protect data with Watson Knowledge Catalog by creating data protection rules that define the type of data to protect and the protection method. Data protection rules are enforced for assets in governed catalogs. Read about data protection rules, then watch a video and take a tutorial that’s suitable for users with some knowledge of data masking, but does not require coding.

Required service Watson Knowledge Catalog

Required permission You must have the Manage data protection rules permission that comes with any of the Watson Knowledge Catalog user roles.

Your basic workflow is to create a data protection rule. It goes into effect immediately after you create it.

Read about protecting data

You create data protection rules to identify the data to control and to specify the method of control. Within data protection rules, you can include classifications, data classes, business terms, or tags to identify the data to control. You can choose to deny access to data, to mask sensitive data values, or to filter rows.

Data masking helps you protect sensitive data, such as personally identifiable information or restricted business data to avoid the risk of compromising confidential information. It is defined in data protection rules that are enforced for an asset. Depending on the method of data masking, data is redacted, substituted, or obfuscated with retained formatting in the asset preview.

Read more about data masking

Watch a video about masking data

Watch Video Watch this video to see how to create data protection rules to mask data using different masking types.

This video provides a visual method as an alternative to following the written steps in this documentation.

Tip: Start the video, then as you scroll through the tutorial, the video moves to picture-in-picture mode. You can use picture-in-picture mode so you can follow the video as you complete the tasks in this tutorial. Click the timestamps for each task to follow along.

Try a tutorial to mask data

In this tutorial, you will complete these tasks:

This tutorial will take approximately 20 minutes to complete.

Prerequisites

  • A previously created catalog with data polices enforced or access to create a catalog.

Task 1: Add the data set to your catalog

The data set you will use in this tutorial includes several columns with sensitive data. Follow these steps to add the data set to your catalog:

  1. Access the Auto Insurance Customers data set in the Gallery.

  2. Click the download icon, and save the file locally.

  3. From the navigation menu, click Catalogs > View all catalogs.

  4. Open your catalog, or create one if needed.

  5. From the Assets page of a catalog, click Add to Catalog > Local files.

  6. On the Add data assets from local files page, click browse, select the AutoInsuranceCustomers.csv from your computer, and click Open.

  7. Click Add. Stay on the page until the load completes.

  8. When the file loads, open the AutoInsuranceCustomers.csv data asset.

  9. Click the Asset tab to preview the data.

  10. Scroll to the right to see all of the columns in the data set.

  11. Look at the National_ID, CreditCard_Number, and Marital_Status columns. These columns contain sensitive data that needs to be masked.

Checkpoint for The following image shows the AutoInsuranceCustomers.csv file in the catalog. Check your progress

The following image shows the AutoInsuranceCustomers.csv file in the catalog.

The following image shows the AutoInsuranceCustomers.csv file in the catalog.

Task 2: Create a data protection rule to obfuscate data

preview tutorial video To preview this task, watch the video beginning at 03:36.

The first data protection rule will obfuscate government identities such as US Social Security Number. Follow these steps to create a data protection rule to obfuscate data:

  1. Click the Navigation Menu icon, and click Governance > Rules.

  2. From the rules page, click Add rule > New rule. You will first create a data protection rule that obfuscates government identity data which hides the values of the data but preserves the format.

  3. Select Data protection rule, and click Next.

  4. In the New data protection rule page that opens, complete the basic fields.

  5. In the Name section type Obfuscate government identity.

  6. In the Business definition section type Rule to mask sensitive information.

  7. For Condition 1, in the If field, select Data class.

  8. In the Search for a data class field, type US Social Security Number, and select US Social Security Number from the list.

  9. Click Add new condition.

  10. Change And to Or.

  11. For Condition 2, specify Data class and Canadian Social Insurance Number.

  12. Click Add new condition.

  13. For Condition 3, specify Data class and UK National Insurance Number.

  14. For the Action, select mask columns.

  15. For the in columns containing field, select Data class. The same three data classes previously selected are filled in for you.

  16. For the masking method, select Obfuscate. You can hover over each masking option to see an example of masked data with the selected option.

  17. Click Create.

Checkpoint for The following image shows the obfuscate data protection rule. Check your progress

The following image shows the obfuscate data protection rule.

The following image shows the obfuscate data protection rule.

Task 3: Create a data protection rule to redact data

The second rule will redact personal demographic information such as ethnicity. Follow these steps to create a data protection rule to redact data:

Tip: You will need the Watson Knowledge Catalog Standard or Professional plan to create more than one data protection rule.
  1. From the rules page, click Add rule > New rule. You will next create a data protection rule that redacts personal demographic data which hides the value of the data by replacing it with ten X characters.

  2. Select Data protection rule, and click Next.

  3. In the New data protection rule page that opens, complete the basic fields.

  4. In the Name section type Redact personal demographic information.

  5. In the Business definition section type Rule to mask sensitive information.

  6. For Condition 1, in the If field, select Data class.

  7. In the Search for a data class field, type Religion, and select Religion from the list.

  8. Click Add new condition.

  9. Change And to Or.

  10. For Condition 2, specify Data class and Ethnicity.

  11. Click Add new condition.

  12. For Condition 3, specify Data class and Legal Marital/Civil Status.

  13. Click Add new condition.

  14. For Condition 4, specify Data class and Political Party.

  15. For the Action, select mask columns.

  16. For the in columns containing field, select Data class. The same three data classes previously selected are filled in for you.

  17. For the masking method, select Redact. You can hover over each masking option to see an example of masked data with the selected option.

  18. Click Create.

Checkpoint for The following image shows the redact data protection rule. Check your progress

The following image shows the redact data protection rule.

The following image shows the redact data protection rule.

Task 4: Create a data protection rule to substitute data

The third rule will substitute financial account data such as credit card numbers.

Tip: You will need the Watson Knowledge Catalog Standard or Professional plan to create more than one data protection rule.
  1. From the rules page, click Add rule > New rule. You will next create a data protection rule that substitutes data which hides the values of the data by replacing them with hash values.

  2. From the rules page, click Add rule > New rule.

  3. Select Data protection rule and click Next.

  4. In the New data protection rule page that opens, complete the basic fields.

  5. In the Name section type Substitute financial account data.

  6. In the Business definition section type Rule to mask sensitive information.

  7. For Condition 1, in the If field, select Data class.

  8. In the Search for a data class field, type Credit Card Number, and select Credit Card Number from the list.

  9. Click Add new condition.

  10. Change And to Or.

  11. For Condition 2, specify Data class and Account Number.

  12. For the Action, select mask columns.

  13. For the in columns containing field, select Data class. The same three data classes previously selected are filled in for you.

  14. For the masking method, select Substitute. You can hover over each masking option to see an example of masked data with the selected option.

  15. Click Create.

Checkpoint for The following image shows the substitute data protection rule. Check your progress

The following image shows the substitute data protection rule.

The following image shows the substitute data protection rule.

(Optional) Task 5: View the masked data

Watch Video Watch the video beginning at 02:28 to see the effect of masked columns in a data set.

Tip: If you are the owner of AutoInsuranceCustomers.csv, you must log in as a different user to view the masked data.

Now that the rules are in place, you view the masked data from the perspective of a different user.

  1. From the navigation menu Navigation menu, click Catalogs > View all catalogs.

  2. Select the catalog that contains the AutoInsuranceCustomers.csv data asset, and click AutoInsuranceCustomers.csv to view the asset.

  3. Select the Asset tab to preview the data.

  4. Horizontally scroll through the columns of data and move your mouse cursor over the Lock icons Lock icon above each data column to view the data protection rule that is masking the data.

Checkpoint for The following image shows the AutoInsuranceCustomers asset with masked columns. Check your progress

The following image shows the AutoInsuranceCustomers asset with masked columns.

The following image shows the AutoInsuranceCustomers asset with masked columns.

Next steps

Now the data is ready to be used. For example, you or other users can do any of these tasks:

Additional resources

  • View more videos for catalog governance.
  • Find sample data sets to gain hands-on experience with protecting data assets in a catalog in the Gallery.

Parent topic: Quick start tutorials

Generative AI search and answer
These answers are generated by a large language model in watsonx.ai based on content from the product documentation. Learn more