- FactorTable
- [Optional] Specify the name for the FactorTable. The function encodes categorical predictors as integer values in the FactorTable and copies numeric predictors to the FactorTable unchanged.
- TargetColumns
- Specify the names of the InputTable columns that contain the variables to use as predictors (independent variables) in the model.
- CategoricalColumns
- [Optional] Specify columnname-value pairs, each of which contains the name of a categorical input column and the category values in that column that the function is to include in the model that it creates.
columnname_value_pair Description 'columnname:max_cardinality' Limits categories in column to max_cardinality to most common ones and groups others together as 'others'. For example, 'column_a:3' specifies that for column_a, function uses 3 most common categories and sets category of rows that do not belong to those 3 categories to 'others'.
'columnname:(category [,...])' Limits categories in column to those that you specify and groups others together as 'others'. For example, 'column_a : (red, yellow, blue)' specifies that for column_a, function uses categories red, yellow, and blue, and sets category of rows that do not belong to those categories to 'others'.
'columnname' All category values appear in model. - ResponseColumn
- Specify the name of the InputTable column that contains the responses.
- Family
- [Optional] Specify the distribution exponential family:
Option Model-Training Type 'GAUSSIAN' (default) Regression 'BINOMIAL' Classification - Alpha
- [Optional] Specify the mixing parameter for penalty computation (see the following table). The alpha must be in [0, 1]. If alpha is in (0,1), it represents α in the elastic net regularization formula in Generalized Linear Model (GLM) Functions (ML Engine).
alpha Regularization Type Parameter Description 0 Ridge (0,1) Elastic net 1 LASSO - RegularizationLambda
- [Optional] Specify the parameter that controls the magnitude of the regularization term. The value lambda must be in the range [0, 100]. The value 0 disables regularization.
- StopThreshold
- [Optional] Specify the convergence threshold. The threshold must be a nonnegative DOUBLE PRECISION value.
- MaxIterNum
- [Optional] Specify the maximum number of iterations over the data. The parameter max_iterations must be a positive INTEGER value in the range [1, 100000].
- Randomization
- [Optional. Unnecessary if input is randomized.] Specify whether to randomize the input.