Optional Syntax Elements for TD_DecisionForest - Teradata Vantage

Teradata® VantageCloud Lake

Deployment
VantageCloud
Edition
Lake
Product
Teradata Vantage
Published
January 2023
Language
English (United States)
Last Update
2024-04-03
dita:mapPath
phg1621910019905.ditamap
dita:ditavalPath
pny1626732985837.ditaval
dita:id
phg1621910019905

The following elements are optional when using TD_DecisionForest:

MaxDepth
Specify the maximum depth of a tree. The algorithm stops splitting a node beyond this depth. Decision trees can grow to 2(max_depth+1)-1 nodes. The default value is 5. You must specify a non-negative integer value.
NumTrees
Specify the number of trees for the forest model. You must specify a value greater than or equal to the number of data AMPs. By default, the function builds the minimum number of trees that provides the specified coverage level in the CoverageFactor argument for the input dataset. The default value is -1.
Maximum number of supported trees is 65536.
MinNodeSize
Specify the minimum number of observations in a tree node. The algorithm stops splitting a node if the number of observations in the node is equal to or smaller than this value. You must specify a non-negative integer value. The default value is 1.
Mtry
Specify the number of features from input columns for evaluating the best split of a node. A higher value improves the splitting and performance of a tree. A smaller value improves the robustness of the forest and prevents it from overfitting. When the value is -1, all variables are used for each split. The default value is -1.
MtrySeed
Specify the random seed that the algorithm uses for the Mtry argument. The default value is 1.
Seed
Specify the random seed the algorithm uses for repeatable results. The default value is 1.
ModelType
Specify whether the analysis is a regression (continuous response variable) or a multiple-class classification (predicting result from the number of classes).
Allowed Values: Regression, Classification. The default value is Regression.
A maximum of 500 classes is supported for classification.
TreeSize
Specify the number of rows that each tree uses as its input dataset. The function builds a tree using either the number of rows on an AMP, the number of rows that fit into the AMP’s memory (whichever is less), or the number of rows given by the TreeSize argument. By default, this value is the minimum number of rows on an AMP and the number of rows that fit into the AMP’s memory. The default value is -1.
MinImpurity
Specify the minimum impurity of a tree node. The algorithm stops splitting a node if the value is equal to or smaller than the specified value. The default value is 0.0.