0 / 0
Creating Two-source Match stage jobs in DataStage

Creating Two-source Match stage jobs

A Two-source Match stage job requires that you add the Two-source Match stage to the job and link it to data and reference sources and output stages.

  1. Click New asset +.
  2. Choose DataStage.
  3. From the palette, select the Two-source Match stage and drag it on the middle of the canvas.
  4. From the palette, add four input stages. Add one for each of the following data sources.
    1. Data source.
    2. Frequency information for the data source.
    3. Reference source.
    4. Frequency information for the reference source.
    Typically, your source data comes from a file or database but you can also use other stages to preprocess it before you input it to the Two-source Match stage.
  5. Link the input stages in the following order:
    1. Data
    2. Reference
    3. Data Frequency
    4. Reference Frequency
  6. From the palette, add up to seven output stages, one for each of the output options that you intend to use in the Two-source Match stage.
    The following list shows the seven output options.
    • Match
    • Clerical
    • Data Duplicate
    • Reference Duplicate
    • Data Nonmatched
    • Reference Nonmatched
    • Match Statistics
    For the output stages, you can use any file, database, or processing stage.
  7. For the match output options that you intend to use, link the stages in the following order.
    1. Match
    2. Clerical
    3. Data Duplicate
    4. Reference Duplicate
    5. Data Nonmatched
    6. Reference Nonmatched
    7. Match Statistics
    Note: Verify in the Link ordering section on the Input tab and the Output tab that the link labels and link names are correct.
  8. Double-click an input stage.
    1. In the Properties section of the Output tab, in the text box for File, type the file name, including the path, which the input data will read.
      If you used a stage other than a file stage for the input, select the appropriate input and define any additional properties that the source requires.
    2. In the Columns section of the Output tab, click Edit, then choose Importing existing data definitions.
      When you select the data definition for the frequency input, use the table definition that was created when the Frequency Match job was run.
    3. Click Apply and return, then click Save.
    4. Repeat these steps until all the input stages are configured.
  9. Double-click an output stage.
    1. In the Properties section of the Input tab, in the text box for File, enter the file name to write the output data to.
      If you used a stage other than a file stage for the output, select the appropriate output and define the properties that the target requires.
    2. Click Apply and return, then click Save.
    3. Repeat these steps until all the output stages are configured.
  10. Optional: Rename the stages and links with meaningful names that reflect their functions in the job or project.
Generative AI search and answer
These answers are generated by a large language model in watsonx.ai based on content from the product documentation. Learn more