About cookies on this site Our websites require some cookies to function properly (required). In addition, other cookies may be used with your consent to analyze site usage, improve the user experience and for advertising. For more information, please review your options. By visiting our website, you agree to our processing of information as described in IBM’sprivacy statement. To provide a smooth navigation, your cookie preferences will be shared across the IBM web domains listed here.
Last updated: Feb 11, 2025
With the Anonymize node, you can disguise field names, field values, or both when working with data that's to be included in a model downstream of the node. In this way, the generated model can be freely distributed (for example, to Technical Support) with no danger that unauthorized users will be able to view confidential data, such as employee records or patients' medical records.
Depending on where you place the Anonymize node in your flow, you may need to make changes to other nodes. For example, if you insert an Anonymize node upstream from a Select node, the selection criteria in the Select node will need to be changed if they are acting on values that have now become anonymized.
The method to be used for anonymizing depends on various factors. For field names and all field values except Continuous measurement levels, the data is replaced by a string of the form:
prefix_Sn
where
is either a user-specified string or the default string
prefix_
, and anon_
is an integer value that starts at 0 and is
incremented for each unique value (for example, n
, anon_S0
,
etc.).anon_S1
Note: If you want to anonymize data that exports to a table with limits on valid field entries (such
as field size), check that the new string (for example,
anon_S1
) fits within these
limits. If the new string does not fit these limits, then you might have issues writing data to
tables downstream. Field values of type Continuous must be transformed because numeric ranges deal with integer or
real values rather than strings. As such, they can be anonymized only by transforming the range into
a different range, thus disguising the original data. Transformation of a value
in the range is performed in the following way:x
A*(x + B)
where:
is a scale factor, which must be greater than 0.A
is a translation offset to be added to the values.B
Example
In the case of a field
where the scale factor AGE
is set to 7
and the translation offset A
is set to 3, the values for B
are
transformed into:AGE
7*(AGE + 3)