Difference stage: fast path (DataStage)
Last updated: Mar 12, 2025
This section describes the minimum steps required to get a Difference stage working.
About this task
To use a Difference stage:
Procedure
- Open the stage and expand Properties.
- Specify the key column. You can repeat this property to specify a composite key. Before and after rows are considered to be the same if they have the same value in the key column or columns.
- Optionally specify one or more Difference Value columns. This enables you to determine whether an after row is an edited version of a before row. Note: You can also set the All non-Key columns are Values property to have IBM DataStage treat all columns that are not defined as keys as values.
- Specify whether the stage will output the changed row or drop it. You can specify this individually for each type of change (copy, delete, edit, or insert).
- In the Link Ordering section on the Stage tab, specify which of the two links carries the before data set and which carries the after data set.
- If the two incoming data sets aren't already hash partitioned on the key columns and sorted, configure IBM® DataStage® to hash partition and sort them in the partitioning section on the Input tab.
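The comparison that the key, value, and output properties configure can be illustrated outside DataStage. The following Python sketch is not the stage's implementation. It only mirrors the rules described in the procedure: rows match when their key columns are equal, a matched row counts as an edit when any value column differs, and each change type can be dropped. The function name `classify_differences`, its parameters, and the string labels are hypothetical and chosen for illustration. Treating matched rows as copies when no value columns are given is also an assumption of this sketch.

```python
def classify_differences(before, after, key_cols, value_cols=(),
                         all_non_key_values=False, drop=()):
    """Label each row as copy, delete, edit, or insert (illustrative only).

    before, after: lists of dicts, one dict per row.
    key_cols: columns that identify a row (several columns form a composite key).
    value_cols: columns compared to detect edits.
    all_non_key_values: if True, compare every non-key column instead.
    drop: change types to leave out of the result.
    """
    def key_of(row):
        return tuple(row[c] for c in key_cols)

    before_by_key = {key_of(r): r for r in before}
    after_by_key = {key_of(r): r for r in after}

    results = []
    # Visit keys in sorted order, similar to comparing two key-sorted inputs.
    for k in sorted(before_by_key.keys() | after_by_key.keys()):
        b = before_by_key.get(k)
        a = after_by_key.get(k)
        if b is None:
            change, row = "insert", a      # key exists only in the after data
        elif a is None:
            change, row = "delete", b      # key exists only in the before data
        else:
            if all_non_key_values:
                cols = [c for c in a if c not in key_cols]
            else:
                cols = list(value_cols)
            edited = any(b.get(c) != a.get(c) for c in cols)
            change, row = ("edit" if edited else "copy"), a
        if change not in drop:
            results.append((change, row))
    return results


before_rows = [{"id": 1, "name": "Ann"}, {"id": 2, "name": "Bob"}, {"id": 3, "name": "Cy"}]
after_rows = [{"id": 1, "name": "Ann"}, {"id": 2, "name": "Rob"}, {"id": 4, "name": "Dee"}]

# Drop unchanged rows, keep the other three change types.
changes = classify_differences(before_rows, after_rows, ["id"], ["name"], drop=("copy",))
for change, row in changes:
    print(change, row)
```

With this sample data, the row with `id` 2 is reported as an edit, `id` 3 as a delete, and `id` 4 as an insert. The unchanged row with `id` 1 is dropped because `copy` is listed in `drop`.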