Divide data assets into smaller, more targeted data assets by segmenting them based on column values.
To add data assets for segmented data:
- From the project page, click the Assets tab, and then click New asset > Create a dynamic view of data.
- Select the data selection method Segment column data.
- Select the column that you want to use for segmenting the data. You can select a column from a data asset that is available in your project, or you can select a column from a connection in your project without first creating a connected data asset. In either case, the connection to the data source must support SQL queries.
However, you cannot select any data assets that were added to the project as output tables for data quality rules. If you need to segment such tables, you must explicitly import them from the data source.
Also consider that some data types are not supported in WHERE clauses. Columns with such data types are excluded from the list of columns from which you can choose.
If a data profile exists for the data asset, the number of distinct values in the selected column is shown based on the profiled sample. This number might not match the actual count if the data asset contains more records than the sample. The actual count becomes available when the values are retrieved from the source.
The default maximum number of distinct values that are retrieved is 100.
Depending on the query performance of the data source, retrieval of the values might take some time.
If assets are added to the project while you are viewing the list of data assets to pick the column for segmentation, these new assets are listed, but you cannot select them. If you want to work with one of these assets, cancel the creation process and start anew.
- Review the column values and select the ones for which you want to create separate data assets.
- Click Create.
A data asset for each selected value is added to the project Assets page. Each asset name is composed of the name of the source data asset, the column name, and the distinct value, in that order.
To differentiate such dynamic views from other data assets, the assets are marked with the reference data icon. If folders are enabled in the project, a dynamic view is always created in the root folder, but you can move it to any subdirectory to organize your dynamic views.
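Conceptually, segmenting by column values resembles retrieving the distinct values of a column and then filtering the source table with one WHERE clause per selected value. The following Python sketch illustrates that idea with an in-memory SQLite database as a stand-in for the data source. The table name `orders`, the column `region`, and the helper functions are hypothetical and exist only for illustration; the queries that the product actually issues are not documented here. The limit of 100 mirrors the default maximum described above.

```python
import sqlite3

# Default maximum number of distinct values retrieved, as stated in the steps above.
MAX_DISTINCT_VALUES = 100


def build_demo():
    """Create a small in-memory table that stands in for the source data asset."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE orders (id INTEGER, region TEXT, amount REAL)")
    conn.executemany(
        "INSERT INTO orders VALUES (?, ?, ?)",
        [(1, "EMEA", 10.0), (2, "APAC", 20.0), (3, "EMEA", 30.0), (4, "AMER", 40.0)],
    )
    return conn


def distinct_values(conn, table, column, limit=MAX_DISTINCT_VALUES):
    """Return up to `limit` distinct values of `column`, sorted ascending.

    Identifiers are interpolated directly, so pass only trusted names.
    """
    query = f'SELECT DISTINCT "{column}" FROM "{table}" ORDER BY "{column}" LIMIT ?'
    return [row[0] for row in conn.execute(query, (limit,))]


def segment_rows(conn, table, column, value):
    """Return the rows of one segment: all rows where `column` equals `value`."""
    query = f'SELECT * FROM "{table}" WHERE "{column}" = ?'
    return conn.execute(query, (value,)).fetchall()


def create_segments(conn, table, column, selected_values):
    """Build one segment per selected value, keyed by that value."""
    return {
        value: segment_rows(conn, table, column, value)
        for value in selected_values
    }


if __name__ == "__main__":
    conn = build_demo()
    print(distinct_values(conn, "orders", "region"))
    for value, rows in create_segments(conn, "orders", "region", ["EMEA", "APAC"]).items():
        print(value, rows)
```

The filter step also shows why the column's data type must be usable in a WHERE clause: each segment is defined by an equality comparison against one distinct value.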
Parent topic: Adding a dynamic view of data from a connection