0 / 0
Mimic and mask your data
Last updated: Oct 09, 2024
Mimic and mask your data

Using the Synthetic Data Generator graphical editor flow tool, you can generate a structured synthetic data set based on your production data. You can import data, anonymize, mimic (to generate synthetic data), export, and review your data.

Before you can use mimic and mask to create synthetic data, you need to create a task.

1. The Generate synthetic tabular data flow window opens. Select use case Leverage your existing data. Click Next. Generate synthetic tabular data flow window

2. Select Import data. You can also drag-and-drop a data file into your project. You can also select data from a project. For more information, see Importing data. Import data

3. Once you have imported your data, you can use the Synthetic Data Generator graphical flow editor tool to anonymize your production data, masking the data. You can disguise column names, column values, or both, when working with data that is to be included in a model downstream of the node. For example, you can use bank customer data and hide marital status. Anonymize data

4. You can then use the Synthetic Data Generator tool to mimic your production data. This will generate synthetic data, based on your production data, using a set of candidate statistical distributions to modify each column in your data. Mimic data

5. You can export your synthetic data and review it. For more information, see Exporting data. Export data

Learn more

If you choose the Generate flow, you can learn more Generate flow.

Generative AI search and answer
These answers are generated by a large language model in watsonx.ai based on content from the product documentation. Learn more