About cookies on this site Our websites require some cookies to function properly (required). In addition, other cookies may be used with your consent to analyze site usage, improve the user experience and for advertising. For more information, please review your options. By visiting our website, you agree to our processing of information as described in IBM’sprivacy statement. To provide a smooth navigation, your cookie preferences will be shared across the IBM web domains listed here.
Last updated: Feb 11, 2025
The Random Forest node uses an advanced implementation of a bagging algorithm
with a tree model as the base model. This Random Forest modeling node in SPSS Modeler is implemented
in Python and requires the scikit-learn© Python library.
properties |
Data type | Property description |
---|---|---|
|
boolean | This option tells the node to use field information specified here instead of that given in any upstream Type node(s). After selecting this option, specify the following fields as required. |
|
field | List of the field names for input. |
|
field | One field name for target. |
|
boolean | Utilize multiple CPU cores to improve model building. |
|
string | Specify to use predefined roles or to use
custom field assignments. Default is predefined. |
|
field | List of the field names for split. |
|
integer | Number of trees to build. Default is . |
|
Boolean | Specify custom max depth. If , nodes are expanded until all leaves are
pure or until all leaves contain less than samples. Default is
. |
|
integer | The maximum depth of the tree. Default is . |
|
integer | Minimum leaf node size. Default is . |
|
string | The number of features to consider when looking for the best split:
. |
|
Boolean | Use bootstrap samples when building trees. Default is . |
|
Boolean | Use out-of-bag samples to estimate the generalization accuracy. Default value is
. |
|
Boolean | Use extremely randomized trees. Default is . |
|
Boolean | Specify this to get replicated results. Default is . |
|
integer | The random number seed to use when build trees. Specify any integer. |
|
float | The size of the kernel cache in MB. Default is . |
|
Boolean | Enables the parameter. Specify true or false. Default is
. |
|
Boolean | Specify or to enable or disable the HPO options.
If set to , Rbfopt will be applied to determine the "best" Random Forest model
automatically, which reaches the target objective value defined by the user with the following
parameter. |
|
float | The objective function value (error rate of the model on the samples) you want to reach (for
example, the value of the unknown optimum). Set this parameter to the appropriate value if the
optimum is unknown (for example, ). |
|
integer | Maximum number of iterations for trying the model. Default is . |
|
integer | Maximum number of function evaluations for trying the model, where the focus is accuracy over
speed. Default is . |
Was the topic helpful?
0/1000