About cookies on this site Our websites require some cookies to function properly (required). In addition, other cookies may be used with your consent to analyze site usage, improve the user experience and for advertising. For more information, please review your options. By visiting our website, you agree to our processing of information as described in IBM’sprivacy statement. To provide a smooth navigation, your cookie preferences will be shared across the IBM web domains listed here.
Last updated: Feb 11, 2025
TwoStep Cluster is an exploratory tool that's designed to
reveal natural groupings (or clusters) within a data set that would otherwise not be apparent. The
algorithm that's employed by this procedure has several desirable features that differentiate it
from traditional clustering techniques, such as handling of categorical and continuous variables,
automatic selection of number of clusters, and scalability.
Properties |
Values | Property description |
---|---|---|
|
[f1 ... fN] | TwoStepAS models use a list of input fields, but no target. Weight and frequency fields are not recognized. |
|
Boolean | Default=
|
|
Boolean | Default=
|
|
Boolean | Default=
|
|
integer | Default=
|
|
integer | Default=
|
|
integer | Default=
|
|
|
|
|
|
|
|
|
|
|
Boolean | |
|
integer | |
|
|
|
|
Boolean | Default=
|
|
integer | Default=
|
|
integer | Default=
|
|
integer | Default=
|
|
integer | Default=
|
|
integer | Default=
|
|
integer | Default=
|
|
integer | Default=
|
|
number | Default=
|
|
Boolean | Default=
|
|
[f1 ... fN] | |
|
Boolean | Default=
|
|
integer | Default=
|
|
number | Default=
|
|
integer | Default=
|
|
integer | Default=
|
|
Boolean | Default=
|
|
Boolean | Default=
|
|
Boolean | Default=
|
|
Boolean | Default=
|
|
Boolean | Default=
|
|
Boolean | Default=
|
|
Boolean | Default=
|
|
Boolean | Default=
|
|
Boolean | Default=True |
|
Boolean | Default=
|
|
Boolean | Default=
|
|
Boolean | Default=
|
|
Boolean | Default=
|
|
|
|
|
|
|
|
integer | The maximum number of outliers to display in the output. If there are more than twenty outlier clusters, a pivot table will be displayed instead. |
|
Boolean | Table and charts of feature importance and cluster centers for each input (field) used in the cluster solution. Selecting different rows in the table displays a different chart. For categorical fields, a bar chart is displayed. For continuous fields, a chart of means and standard deviations is displayed. |
Was the topic helpful?
0/1000