The TwoStep node uses a two-step clustering method. The first step makes a
single pass through the data to compress the raw input data into a manageable set of subclusters.
The second step uses a hierarchical clustering method to progressively merge the subclusters into
larger and larger clusters. TwoStep has the advantage of automatically estimating the optimal number
of clusters for the training data. It can handle mixed field types and large data sets
efficiently.
TwoStep models use a list of input fields, but no target. Weight and frequency fields are not
recognized. See Common modeling node properties
for more information.
About cookies on this siteOur websites require some cookies to function properly (required). In addition, other cookies may be used with your consent to analyze site usage, improve the user experience and for advertising.For more information, please review your cookie preferences options. By visiting our website, you agree to our processing of information as described in IBM’sprivacy statement. To provide a smooth navigation, your cookie preferences will be shared across the IBM web domains listed here.