Using the Synthetic Data Generator graphical flow editor tool, you can generate a structured synthetic data set based on your production data. You can import data, anonymize it, mimic it (to generate synthetic data), export the result, and review it.
Before you can use mimic and mask to create synthetic data, you need to create a task.
1. The Generate synthetic tabular data flow window opens. Select the use case Leverage your existing data, then click Next.
2. Select Import data. Alternatively, drag and drop a data file into your project, or select data that is already in a project. For more information, see Importing data.
3. Once you have imported your data, you can use the Synthetic Data Generator graphical flow editor tool to anonymize your production data by masking it. You can disguise column names, column values, or both, when working with data that is to be included in a model downstream of the node. For example, in bank customer data you can hide marital status.
4. You can then use the Synthetic Data Generator tool to mimic your production data. This generates synthetic data based on your production data, using a set of candidate statistical distributions to modify each column in your data.
5. You can export your synthetic data and review it. For more information, see Exporting data.
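To make the anonymize and mimic steps more concrete, the following is a minimal Python sketch of the general ideas. It is not how the Synthetic Data Generator tool is implemented. The function names (`mask_column`, `mimic_column`), the three candidate distributions, and the use of log-likelihood to pick the best fit are all assumptions made for illustration only.

```python
import math
import random
import statistics


def mask_column(values, prefix="value"):
    """Replace each distinct value with an opaque label (hypothetical masking scheme)."""
    mapping = {}
    masked = []
    for v in values:
        if v not in mapping:
            mapping[v] = f"{prefix}_{len(mapping) + 1}"
        masked.append(mapping[v])
    return masked


def _fit_normal(xs):
    mu = statistics.fmean(xs)
    sigma = statistics.pstdev(xs)
    if sigma == 0:
        return -math.inf, None
    var = sigma ** 2
    ll = sum(-0.5 * math.log(2 * math.pi * var) - (x - mu) ** 2 / (2 * var) for x in xs)
    return ll, lambda rng: rng.gauss(mu, sigma)


def _fit_uniform(xs):
    lo, hi = min(xs), max(xs)
    if hi == lo:
        return -math.inf, None
    ll = -len(xs) * math.log(hi - lo)
    return ll, lambda rng: rng.uniform(lo, hi)


def _fit_exponential(xs):
    mean = statistics.fmean(xs)
    if min(xs) < 0 or mean <= 0:
        return -math.inf, None
    rate = 1.0 / mean
    ll = len(xs) * math.log(rate) - rate * sum(xs)
    return ll, lambda rng: rng.expovariate(rate)


# Assumed candidate set; a real tool may consider many more distributions.
CANDIDATES = {
    "normal": _fit_normal,
    "uniform": _fit_uniform,
    "exponential": _fit_exponential,
}


def mimic_column(values, n, seed=0):
    """Fit each candidate to a numeric column, keep the one with the highest
    log-likelihood, and draw n synthetic values from it."""
    rng = random.Random(seed)
    best_name, best_ll, best_sampler = None, -math.inf, None
    for name, fit in CANDIDATES.items():
        ll, sampler = fit(values)
        if sampler is not None and ll > best_ll:
            best_name, best_ll, best_sampler = name, ll, sampler
    if best_sampler is None:
        raise ValueError("no candidate distribution could be fitted")
    return best_name, [best_sampler(rng) for _ in range(n)]


# Mask a categorical column, then mimic a numeric one.
masked_status = mask_column(["married", "single", "married"], prefix="marital")
chosen, synthetic_ages = mimic_column([23, 35, 41, 52, 29, 47, 38, 60], n=5)
```

In this sketch, masking keeps the structure of a column (equal values stay equal) while hiding the original labels, and mimicking produces new values drawn from the fitted distribution rather than copies of the production rows.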
Learn more
If you choose the Generate flow, you can learn more in Generate flow.