Matching your data to create master data entities (IBM Match 360)
Run matching on your data to let IBM Match 360 determine whether your records can be collected into master data entities. Each master data entity contains one or more records that were found to be matches.
- Required permissions
- To configure and run matching in your master data instance, you must be a member of the DataEngineer user group for the IBM Match 360 service.
There are two common reasons to run matching on your data:
- For record deduplication and entity resolution, the matching process analyzes your data to determine whether any duplicate records exist in your data. Suspected duplicate records are merged into master data entities to establish a single, trusted, 360-degree view of your data.
- To create other types of entity associations, the matching process analyzes your data to collect records into entities that represent different kinds of groupings, such as a household or family.
For more information about matching algorithms and how they work to match data and create entities, see IBM Match 360 matching algorithms.
Preparing to run matching
Before you run the matching algorithm, ensure that you publish your data model and assets into the IBM Match 360 with Watson service. For more information, see Adding data and mapping it to your data model.
Each entity type defined in your data model has its own separate matching algorithm to create entities of that type. For example, a Person record type can have more than one entity type, such as Person and Household.
- A Person entity type could be used for deduplication and entity resolution of your person record data.
- A Household entity type could be used to create associations of person records that all belong to the same household.
Setting up your matching algorithm
To set up your matching algorithm:
-
Click the navigation menu and select Matching setup to open the matching setup page.
-
From the entity type menu, select the type of entity whose matching algorithm you want to configure. The list of entity types is taken from your data model and follows the format Record type - Entity type.
It is possible for this list to include entity types that have not yet been published into the system. In this case, matching will not be able to start. Always ensure that you publish your data model after making any changes, such as adding a new entity type.
-
Go to the Match settings tab to select the attributes to use in matching data. The first time that you open this tab, the IBM Match 360 service automatically generates some suggested attributes from your data model to use in matching.
-
Review the list of matching attributes. These attributes will be used as the basis of comparison to match records and create master data entities. To add or remove attributes from the list, click Edit attributes then select or clear attributes and their component fields as needed.
As you choose your matching attributes, use the Match strength indicator to see an estimate of how your changes affect the matching algorithm.
If you have added any custom attributes to the data model, they are not selected for consideration in matching by default. If you want to use a custom attribute type in matching, you must select it and then specify which of its fields to consider. If you do not specify any fields, then the matching algorithim cannot use the attribute.
For predefined (non-custom) attribute types, if you do not specify which fields to consider, the matching algorithm uses a default set of fields.
-
When you are satisfied with your matching attributes, click Save.
Selecting which records get considered for matching
Optionally, you can configure conditions that define what records IBM Match 360 selects for matching consideration.
By default, all records associated with the selected entity type can participate in matching. To narrow the scope of records available to match, you can specify filter conditions that include or exclude records that have certain attribute values.
To define what records get selected for matching:
-
Click the navigation menu and select Matching setup to open the matching setup page.
-
Select Record selection in the sidebar to access the record selection settings.
-
Ensure that the correct matching algorithm is selected. The default matching algorithm names are Person - Person entity and Organization - Organization entity.
-
If you want to enable record selection filters for this entity type, click Enable record selection.
To later disable record selection filters, select Disable record selection.
-
Define record selection conditions:
a. Click +, then select the attribute that you want to consider.
b. Click +, then select the corresponding attribute value that you want to select for.
c. Repeat these steps to add conditions.
After you define a condition, the Action setting becomes available.
-
Define the action associated with the condition. Click +, then select either Use in matching or Do not use in matching.
- If you select Use in matching, then only records that meet the conditions are able to participate in matching for this entity type.
- If you select Do not use in matching, then any records that meet the conditions are unable to participate in matching for this entity type.
Matching your data and reviewing results
To run matching on your record data to create master data entities:
-
Click the navigation menu and select Matching setup to open the matching setup page.
-
From the entity type menu, select the type of entity whose matching algorithm you want to run. The list of entity types is taken from your data model and follows the format Record type - Entity type.
-
Click the run matching icon in the action bar.
The matching process takes a while to complete. It runs in the background so that you can continue working. You'll be notified when it's complete.
If the matching process ends before it completes, either due to a failure or because you canceled the process, then the next time you run matching, you are prompted to choose between resuming matching from the stage that it stopped or restarting the process from the beginning. Choose Restart matching if you have made changes to the data being matched. Otherwise, choose Resume matching to save time by skipping the matching stages that completed successfully.
When matching is complete, go to the Match results tab to see a dashboard of statistics and visualizations to provide insight about your master data.
Tip: If your record data changes after matching is complete, the statistics on the match results screen can become out of date. To update the statistics, click Refresh match results or run matching again. Refreshing the match results can take some time to complete.
You can adjust your matching algorithm at any time by choosing different matching attributes or changing your autolink threshold
As you add more data sources and assets to your IBM Match 360 instance and rerun matching, the new data is matched both within itself and against the existing data in the system. In this way, you can build a unified, single, 360-degree view of your customers across your entire enterprise.
Next steps
Learn more
Parent topic: Configuring master data