About cookies on this site Our websites require some cookies to function properly (required). In addition, other cookies may be used with your consent to analyze site usage, improve the user experience and for advertising. For more information, please review your options. By visiting our website, you agree to our processing of information as described in IBM’sprivacy statement. To provide a smooth navigation, your cookie preferences will be shared across the IBM web domains listed here.
Last updated: Feb 11, 2025
The GLE model identifies the dependent variable that is linearly related to the factors and covariates via a specified link function. Moreover, the model allows for the dependent variable to have a non-normal distribution. It covers widely used statistical models, such as linear regression for normally distributed responses, logistic models for binary data, loglinear models for count data, complementary log-log models for interval-censored survival data, plus many other statistical models through its very general model formulation.
Examples. A shipping company can use generalized linear models to fit a Poisson regression to damage counts for several types of ships constructed in different time periods, and the resulting model can help determine which ship types are most prone to damage.
A car insurance company can use generalized linear models to fit a gamma regression to damage claims for cars, and the resulting model can help determine the factors that contribute the most to claim size.
Medical researchers can use generalized linear models to fit a complementary log-log regression to interval-censored survival data to predict the time to recurrence for a medical condition.
GLE models work by building an equation that relates the input field values to the output field values. After the model is generated, you can use it to estimate values for new data.
For a categorical target, for each record, a probability of membership is computed for each possible output category. The target category with the highest probability is assigned as the predicted output value for that record.
Requirements. You need one or more input fields and
exactly one target field (which can have a measurement level of
,
Continuous
, or Categorical
) with two or more categories. Fields used in
the model must have their types fully instantiated. Flag
Note: When first creating a flow, you select which runtime to use. By default,
flows use the IBM SPSS Modeler runtime. If you want to use native Spark
algorithms instead of SPSS algorithms, select the Spark runtime. Properties
for this node will vary depending on which runtime option you choose.
Was the topic helpful?
0/1000