Two-source Match stage workflow
The Two-source Match stage requires standardized data and reference data as source data, a two-source match specification, and frequency information for both sources.
A typical workflow for using the Two-source Match stage includes the following tasks.
- Standardize the source data for the data source and the reference source.
- Prepare representative sample data sets from the source data.
- Use the Match Frequency stage to generate frequency information.
- Optional. If you want to reduce the amount of frequency data that will be used in the Two-source Match job, you can run the Frequency Match stage job again. However, for this job run, select the two-source match specification that you created. Selecting the two-source match specification in the Frequency Match stage job limits the frequency data to only the columns that will participate in the match job.
- Create a DataStage® asset that includes the Two-source Match stage, with data source, reference source, and the frequency information for each source as inputs.
- Configure the Two-source Match stage, which includes selecting the two-source match specification that you created.