Adding data and mapping it to your data model
Each data source or asset must be mapped and loaded into the data model before it can be used in Master Data Management functions such as matching.
Master Data Management includes a powerful automapping capability that removes the need for data engineers to manually map each column of data into the model. The automapping feature detects, analyzes, and categorizes each column of data to the corresponding attributes or fields in the data model. Before you can run automapping, you must profile your data. Profiling analyzes and classifies your data to enable the automapping process to take place.
Publishing sample data
If you don’t have your own data assets ready to go but want to get started using Master Data Management, load the provided sample data and model instead.
- Go to the master data home page.
- From the Master data tile, click Publish sample model.
- After the sample model publish is complete, click Publish sample data.
- Optionally, go to the Jobs tab to watch the progress of your sample load jobs. If you don’t want to watch the progress, you can navigate to another screen and the jobs will continue working in the background.
Adding data from a flat data file
- From the master data home page, click Configuration to open the Data setup screen. Click Start with data assets.
- Click the Find and add data icon in the action bar at the top of the screen.
- From the Data panel that opens, choose whether to add data by upload, from the project, or from the catalog. For this tutorial, choose Load to upload a data file.
- On your local machine, select a flat data file in CSV or TSV format and drag it into the Data panel. When the file finishes uploading, it is added to your assets summary list.
- Review the details of your newly added asset. If your asset does not have any information in the Asset record type column, you must define the record type.
- Select your asset in the assets summary list.
- Click Assign record type and select the correct record type from the list. If the appropriate record type is not in the list, then you might have to customize your data model.
Adding data or sources through your project
You can also add data assets, sources, or connections to your project through the Cloud Pak for Data as a Service dashboard.
Watch this video to see how to create a connection and add connected data to a project.
For more information about adding data directly to your project, see Adding data to a project.
Mapping your data into the data model
- On the Data setup screen, click the Mapping tab.
From the Asset list in the left panel, select the data asset that you want to map into the system. The data from the asset displays in tabular format with a number of rows and columns. Each column represents an attribute that must be mapped to a corresponding attribute type in the data model. When you first open a data source or asset, each column is marked with a Not Mapped tag.
You can manually map each column if you choose, but you can greatly speed up the mapping process by taking advantage of the automapping feature.
To enable automapping for this source or asset, you must first profile the data. Click Profile. Profiling analyzes and classifies your data to enable the automapping process to take place. Profiling can take some time to complete, so it runs in the background to allow you to continue working. You might want to start reviewing and manually mapping some columns.
Automapping will never overwrite any manual mapping that you have done.
- When profiling completes, click Auto map. Master Data Management analyzes your data and automatically maps as many columns as possible into the data model. Even if it cannot map a given column, the automap function can suggest some of the most likely mapping selections.
- Review the automapping results. If any of the mappings are incorrect, or if a column remains unmapped, then manually map it correctly. Alternately, if a given column is not required, you can exclude it from your Master Data Management data load.
To manually map a column, select it, then use the Mapping targets panel on the right to search for and select the appropriate attribute or field from the data model. Click Map and save to data model.
Scroll right and left through the columns to ensure that every column in your data source or asset is mapped.
- When you’ve finished mapping the data source, you’re ready to publish the data into the system.
- If your data model is new or has changed, you’ll need to publish your model first by clicking the publish model icon in the action bar. Wait for the publish job to complete.
- To publish your data, click the publish data icon in the action bar. Wait for the publish job to complete.
- Return to the configuration overview page by clicking Data setup and selecting Overview from the navigation menu.
- On the Overview page, confirm that you have at least one data source or asset added and mapped.
Set up and run matching to create master data entities and records from your mapped data sources. For more information, see Matching your data to create master data entities.