About cookies on this site Our websites require some cookies to function properly (required). In addition, other cookies may be used with your consent to analyze site usage, improve the user experience and for advertising. For more information, please review your options. By visiting our website, you agree to our processing of information as described in IBM’sprivacy statement. To provide a smooth navigation, your cookie preferences will be shared across the IBM web domains listed here.
Last updated: Feb 11, 2025
XGBoost is an advanced implementation of a gradient boosting
algorithm. Boosting algorithms iteratively learn weak classifiers and then add them to a final
strong classifier. XGBoost is very flexible and provides many parameters that can be overwhelming to
most users, so the XGBoost-AS node in SPSS Modeler exposes the core features and commonly used
parameters. The XGBoost-AS node is implemented in Spark.
properties |
Data type | Property description |
---|---|---|
|
field | List of the field names for target. |
|
field | List of the field names for inputs. |
|
integer | The number of workers used to train the XGBoost model. Default is . |
|
integer | The number of threads used per worker. Default is . |
|
Boolean | Whether to use external memory as cache. Default is false. |
|
string | The booster type to use. Available options are ,
, or . Default is . |
|
integer | The number of rounds for boosting. Specify a value of or higher. Default
is . |
|
Double | Control the balance of positive and negative weights. Default is . |
|
integer | The seed used by the random number generator. Default is 0. |
|
string | The learning objective. Possible values are ,
, , ,
, , or . Note
that for flag targets, only or can be used.
If is used, the score result will show the and
XGBoost objective types. Default is
. |
|
string | Evaluation metrics for validation data. A default metric will be assigned according to the
objective. Possible values are , ,
, , ,
, , , , or
. Default is . |
|
Double | L2 regularization term on weights. Increasing this value will make the model more
conservative. Specify any number or greater. Default is
. |
|
Double | L1 regularization term on weights. Increasing this value will make the model more
conservative. Specify any number or greater. Default is
. |
|
Double | L2 regularization term on bias. If the booster type is used, this
lambda bias linear booster parameter is available. Specify any number or greater.
Default is . |
|
string | If the or booster type is used, this tree
method parameter for tree growth (and the other tree parameters that follow) is available. It
specifies the XGBoost tree construction algorithm to use. Available options are
, , or . Default is
. |
|
integer | The maximum depth for trees. Specify a value of or higher. Default is
. |
|
Double | The minimum sum of instance weight (hessian) needed in a child. Specify a value of
or higher. Default is . |
|
Double | The maximum delta step to allow for each tree's weight estimation. Specify a value of
or higher. Default is . |
|
Double | The sub sample for is the ratio of the training instance. Specify a value between
and . Default is . |
|
Double | The step size shrinkage used during the update step to prevent overfitting. Specify a value
between and . Default is . |
|
Double | The minimum loss reduction required to make a further partition on a leaf node of the tree.
Specify any number or greater. Default is . |
|
Double | The sub sample ratio of columns when constructing each tree. Specify a value between
and . Default is . |
|
Double | The sub sample ratio of columns for each split, in each level. Specify a value between
and . Default is . |
|
string | If the dart booster type is used, this dart parameter and the following three dart parameters
are available. This parameter sets the normalization algorithm. Specify or
. Default is . |
|
string | The sampling algorithm type. Specify or .
Default is . |
|
Double | The dropout rate dart booster parameter. Specify a value between and
. Default is . |
|
Double | The dart booster parameter for the probability of skip dropout. Specify a value between
and . Default is . |
Was the topic helpful?
0/1000