- OutputTable
- Specify the name for the output table of coefficients. This table must not already exist.
- InputColumns
- [Optional] Specify the name of the column that contains the dependent variable (Y) followed by the names of the columns that contain the predictor variables (Xi), in this format: 'Y,X1,X2,...,Xp'.
- CategoricalColumns
- [Optional] Specify columnname-value pairs, each of which contains the name of a categorical input column and the category values in that column that the function is to include in the model that it creates.
Each columnname_value_pair has one of these forms:
- 'columnname:max_cardinality': Limits the categories in the column to the max_cardinality most common ones and groups all others together as 'others'. For example, 'column_a:3' specifies that, for column_a, the function uses the 3 most common categories and sets the category of rows that do not belong to those 3 categories to 'others'.
- 'columnname:(category [,...])': Limits the categories in the column to those that you specify and groups all others together as 'others'. For example, 'column_a : (red, yellow, blue)' specifies that, for column_a, the function uses the categories red, yellow, and blue, and sets the category of rows that do not belong to those categories to 'others'.
- 'columnname': All category values appear in the model.
If you specify the InputColumns argument, the columns that you specify in the CategoricalColumns argument must also appear in the InputColumns argument.
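The effect of the 'columnname:max_cardinality' form can be illustrated outside the database. The following is a minimal Python sketch, not the function's implementation. The helper name limit_categories and the tie-breaking rule (first-seen order among equally frequent values) are assumptions made for illustration.

```python
from collections import Counter

def limit_categories(values, max_cardinality):
    """Keep the max_cardinality most common values; map the rest to 'others'.

    Hypothetical helper for illustration only. Ties are broken by
    first-seen order, which is an assumption, not documented behavior.
    """
    counts = Counter(values)
    keep = {value for value, _ in counts.most_common(max_cardinality)}
    return [value if value in keep else "others" for value in values]

# Equivalent in spirit to 'column_a:3' applied to a small sample column.
colors = ["red", "red", "red", "blue", "blue", "yellow", "yellow", "green", "pink"]
grouped = limit_categories(colors, 3)
print(grouped)
```

Here green and pink each occur once, so they fall outside the 3 most common categories and are relabeled 'others'.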
For information about columns that you must identify as categorical, see Identification of Categorical Columns.
- Family
- [Optional] Specify the exponential-family distribution of the response, which is one of the following:
- 'BINOMIAL' (Default)
- 'LOGISTIC' (equivalent to 'BINOMIAL')
- 'POISSON'
- 'GAUSSIAN'
- 'GAMMA'
- 'INVERSE_GAUSSIAN'
- 'NEGATIVE_BINOMIAL'
- LinkFunction
- [Optional] Specify the link function.
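As background, a link function g relates the mean mu of the response to the linear predictor, so that g(mu) equals the weighted sum of the predictors. The sketch below lists the conventional canonical links from standard GLM theory for several of the families named above. It is not a list of values that LinkFunction accepts, and it makes no claim about the function's defaults. The dictionary name CANONICAL_LINKS is hypothetical.

```python
import math

# Conventional canonical links from GLM theory, keyed by family name.
# This is an illustration, not the set of values the argument accepts.
CANONICAL_LINKS = {
    "BINOMIAL": lambda mu: math.log(mu / (1.0 - mu)),  # logit
    "POISSON": lambda mu: math.log(mu),                # log
    "GAUSSIAN": lambda mu: mu,                         # identity
    "GAMMA": lambda mu: 1.0 / mu,                      # inverse
    "INVERSE_GAUSSIAN": lambda mu: 1.0 / (mu * mu),    # inverse squared
}

# A mean of 0.5 under the logit link maps to a linear predictor of 0.
eta = CANONICAL_LINKS["BINOMIAL"](0.5)
print(eta)
```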
- WeightColumn
- [Optional] Specify the name of an input table column that contains the weights to assign to responses.
- StopThreshold
- [Optional] Specify the convergence threshold.
- MaxIterNum
- [Optional] Specify the maximum number of iterations that the algorithm runs before stopping if the convergence threshold has not been met. The value, max_iterations, must be a positive INTEGER.
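How StopThreshold and MaxIterNum interact can be sketched with a toy iterative fit. The example below is a minimal Python sketch, assuming the threshold is compared with the absolute change in the coefficient between iterations (the text above does not say which quantity is tested). It fits an intercept-only Poisson model by Newton's method. The names fit_poisson_intercept, stop_threshold, and max_iter_num are hypothetical.

```python
import math

def fit_poisson_intercept(y, stop_threshold=1e-8, max_iter_num=25):
    """Newton iterations for b in the model E[y] = exp(b).

    Returns (b, iterations_used, converged).
    """
    if max_iter_num < 1:
        raise ValueError("max_iter_num must be a positive integer")
    mean_y = sum(y) / len(y)
    b = 0.0
    for iteration in range(1, max_iter_num + 1):
        # Newton step for the Poisson log-likelihood: score / information.
        step = mean_y / math.exp(b) - 1.0
        b += step
        if abs(step) < stop_threshold:
            return b, iteration, True   # threshold met before the cap
    return b, max_iter_num, False       # cap reached without convergence

counts = [2, 3, 4, 1, 5]
b, used, converged = fit_poisson_intercept(counts)
b_capped, used_capped, converged_capped = fit_poisson_intercept(counts, max_iter_num=2)
print(b, used, converged)
print(b_capped, used_capped, converged_capped)
```

The first call stops as soon as the step falls below the threshold. The second call hits the iteration cap first and reports that it did not converge.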
- Intercept
- [Optional] Specify whether the function uses an intercept. For example, in β0 + β1*X1 + β2*X2 + ... + βp*Xp, the intercept is β0.
- Step
- [Optional] Specify whether the function uses a step. If it does, the function runs with the GLM model that has the lowest Akaike information criterion (AIC) score, drops one predictor from the current predictor group, and repeats this process until no predictor remains.
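One reading of the Step description is a backward elimination: in each round, every model with exactly one predictor removed is scored, the one with the lowest AIC is kept, and the search continues until no predictor remains. The Python sketch below follows that reading and is not the function's implementation. The function name backward_step, the column names, and the AIC scores are all made up for illustration. A real run would obtain each score by fitting a GLM.

```python
def backward_step(predictors, aic):
    """Drop one predictor per round, keeping the lowest-AIC candidate.

    predictors: list of column names (hypothetical).
    aic: callable that takes a tuple of predictors and returns an AIC score.
    Returns the (predictors, score) pairs visited, from full model to empty.
    """
    current = tuple(predictors)
    path = [(current, aic(current))]
    while current:
        # Candidate models: the current set with exactly one predictor removed.
        candidates = [current[:i] + current[i + 1:] for i in range(len(current))]
        current = min(candidates, key=aic)
        path.append((current, aic(current)))
    return path

# Made-up AIC scores, for illustration only (not output of any real fit).
toy_aic = {
    ("x1", "x2", "x3"): 110.0,
    ("x1", "x2"): 104.0,
    ("x1", "x3"): 109.0,
    ("x2", "x3"): 125.0,
    ("x1",): 107.0,
    ("x2",): 130.0,
    ("x3",): 131.0,
    (): 140.0,
}
path = backward_step(["x1", "x2", "x3"], toy_aic.__getitem__)
best_model, best_score = min(path, key=lambda item: item[1])
print(path)
print(best_model, best_score)
```

Recording the whole path makes it possible to pick the lowest-AIC model after the search ends. Whether the function reports the full path or only a final model is not stated above.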