GLML1L2 Syntax Elements - Teradata Vantage

Machine Learning Engine Analytic Function Reference

Product

Teradata Vantage

Release Number

9.02

9.01

2.0

1.3

Published

February 2022

Language

English (United States)

Last Update

2022-02-10

dita:mapPath

rnn1580259159235.ditamap

dita:ditavalPath

ybt1582220416951.ditaval

dita:id

B700-4003

lifecycle

Product Category

Teradata Vantage™

FactorTable

[Optional] Specify the name for the FactorTable. The function encodes categorical predictors as integer values in the FactorTable and copies numeric predictors to the FactorTable unchanged.

If you specify FactorTable, you must either specify CategoricalColumns or Randomization ('true').

You can use factor_table as InputTable for future GLML1L2 function calls, thereby saving the function from repeating the categorical-to-numerical conversion.

TargetColumns

Specify the names of the InputTable columns that contain the variables to use as predictors (independent variables) in the model.

Every target_column is numerical unless you specify it with CategoricalColumns.

CategoricalColumns

[Optional] Specify columnname-value pairs, each of which contains the name of a categorical input column and the category values in that column that the function is to include in the model that it creates.

columnname_value_pair	Description
'columnname:max_cardinality'	Limits categories in column to max_cardinality to most common ones and groups others together as 'others'. For example, 'column_a:3' specifies that for column_a, function uses 3 most common categories and sets category of rows that do not belong to those 3 categories to 'others'.
'columnname:(category [,...])'	Limits categories in column to those that you specify and groups others together as 'others'. For example, 'column_a : (red, yellow, blue)' specifies that for column_a, function uses categories red, yellow, and blue, and sets category of rows that do not belong to those categories to 'others'.
'columnname'	All category values appear in model.

For information about columns that you must identify as numeric or categorical, see Identification of Numeric and Categorical Columns.

ResponseColumn

Specify the name of the InputTable column that contains the responses.

Family

[Optional] Specify the distribution exponential family:

Option	Model-Training Type
'GAUSSIAN' (default)	Regression
'BINOMIAL'	Classification

Alpha

[Optional] Specify the mixing parameter for penalty computation (see the following table). The alpha must be in [0, 1]. If alpha is in (0,1), it represents α in the elastic net regularization formula in Generalized Linear Model (GLM) Functions (ML Engine).

alpha	Regularization Type	Parameter Description
0	Ridge
(0,1)	Elastic net
1	LASSO

Default: 0

RegularizationLambda

[Optional] Specify the parameter that controls the magnitude of the regularization term. The value lambda must be in the range [0, 100]. The value 0 disables regularization.

Default: 0

StopThreshold

[Optional] Specify the convergence threshold. The threshold must be a nonnegative DOUBLE PRECISION value.

Default: 1.0e-7

MaxIterNum

[Optional] Specify the maximum number of iterations over the data. The parameter max_iterations must be a positive INTEGER value in the range [1, 100000].

Default: 10000

Randomization

[Optional. Unnecessary if input is randomized.] Specify whether to randomize the input.

Default: 'false'