Optional TD_GLM Syntax Elements - Analytics Database

Database Analytic Functions

Analytics Database
Release Number
June 2022
English (United States)
Last Update
Product Category
Teradata Vantageā„¢
Name of the InputTable columns on which to partition the input. The name must be consistent with the partition_by_column in the ON clause. If the partition_by_column is unicode with foreign language characters, then it is necessary to specify PartitionColumn argument.
Specify the distribution exponential family. Options are Gaussian and Binomial. Default value is Gaussian.
Specify the number of observations (training samples) processed in a single minibatch per AMP. A value of 0 or higher than the number of rows on an AMP processes all rows on the AMP, such that the entire dataset is processed in a single iteration, and the algorithm becomes Gradient Descent. Specify a non-negative integer value. The default value is 10.
Specify the maximum number of iterations (minibatches) over the training data batches. Value is a positive integer less than 10,000,000. Default value is 300.
Specify the regularization amount. The higher the value, stronger the regularization. It is also used to compute learning rate when learning rate is set to optimal. Must be a non-negative float value. A value of 0 means no regularization. Default value is 0.02.
Specify the Elasticnet parameter for penalty computation. It is only effective when RegularizationLambda is greater than 0. The value represents the contribution ratio of L1 in the penalty. A value of 1.0 indicates L1 (LASSO) only, a value of 0 indicates L2 (Ridge) only, and a value between is a combination of L1 and L2. Value is a float value between 0 and 1. The default value is 0.15 (15% L1, 85% L2).
Specify the number of iterations (minibatches) with no improvement in loss including the tolerance to stop training. A value of 0 indicates no early stopping and the algorithm continues until MaxIterNum iterations are reached. Specify a non-negative integer value. The default value is 50.
Specify the stopping criteria in terms of loss function improvement. Applicable when IterNumNoChange is greater than 0. Specify a positive integer value. The default value is 0.001.
Specify whether to estimate intercept based on whether the data is already centered. The default value is true.
Specify weights associated with classes. Only applicable for Binomial Family. The format is 0:weight,1:weight. For example, 0:1.0,1:0.5 gives twice the weight to each observation in class 0. If the weight of a class is omitted, it is assumed to be 1.0. The default value is 0:1.0,1:1.0.
Specify one of the learning rate algorithms:
  • Constant
  • InvTime
  • Optimal
  • Adaptive

The default value is invtime for Gaussian, and optimal for Binomial.

Specify the initial learning rate eta value. If you specify the learning rate as constant, the eta value is applicable for all iterations. The default value is 0.05.
Specify the decay rate for the learning rate. Only applicable for invtime and adaptive learning rates. The default value is 0.25.
Specify the number of iterations without decay for the adaptive learning rate. The learning rate changes by decay rate after the specified number of iterations are completed. The default value is 5.
Specify the value to use for momentum learning rate optimizer. A larger value indicates higher momentum contribution. A value of 0 means momentum optimizer is disabled. For a good momentum contribution, a value between 0.6-0.95 is recommended. Value is a non-negative float between 0 and 1. The default value is 0.
Specify whether to use Nesterov optimization for the Momentum optimizer. Only applicable when the Momentum optimizer value is greater than 0. The default value is True.
Specify the number of local iterations for the Local SGD algorithm. A value of 0 implies that the algorithm is disabled. A value greater than 0 enables the algorithm and specifies the number of iterations for the algorithm. The recommended values for the arguments are as follows:
  • LocalSGDIterations: 10
  • MaxIterNum: 100
  • BatchSize: 50
  • IterNumNoChange: 5

The default value is 0.

Specify the type of algorithm to be used. The algorithm stops when the addition or deletion of variables in the model does not improve the score. The following arguments are available:
  • forward

    Forward selection starts with an empty model and adds features one at a time, keeping the variable that provides the best model. Once a feature has been added to the model it cannot be removed.

  • backward

    Backward elimination starts with a full model with all the features to be tested. In each step, a single feature is removed from the model keeping only those that improve the selected criterion. Once a feature has been removed it cannot be added to the model.

  • both or bidirectional

    Both and bidirectional behave the same way. The function starts from a given model, which can be full, empty, or any combination of variables. The features inside the model are deleted and those outside the model are added one at a time, selecting in each step the combination that improves the most the score of the selected criterion.

Specify the maximum number of steps to be used for the StepwiseDirection algorithm. If the value is set to zero, then the algorithm runs until convergence. Default is 5.
Specify the names of the initial state model columns that need to be used as starting point for the stepwise regression algorithm, such as predictors, features or independent variables.
The iteration mode. Values are Batch and Epoch. Default is Batch.
  • Batch: One iteration per batch. After processing rows in a batch, update the weight of the parameters and proceed to the next iteration.
  • Epoch: One iteration per epoch. After processing all rows in a partition (with one or more batches), update the weight of the parameters and proceed to the next epoch. See BatchSize argument for batch size.