Syntax | Gain Ratio Extreme Decision Tree | Vantage Analytics Library - Syntax - Vantage Analytics Library

Vantage Analytics Library User Guide

Deployment
VantageCloud
VantageCore
Edition
Enterprise
IntelliFlex
Lake
VMware
Product
Vantage Analytics Library
Release Number
2.2.0
Published
March 2023
Language
English (United States)
Last Update
2024-01-02
dita:mapPath
ibw1595473364329.ditamap
dita:ditavalPath
iup1603985291876.ditaval
dita:id
zyl1473786378775
Product Category
Teradata Vantage
CALL td_analyze (
  'decisiontree',
  'required_parameter_list [ optional_parameter; [...] ]'
);
required_parameter_list
database = input_database_name;
tablename = input_table_name;
columns = { all | column_name [,...] };
dependent = column_name;
optional_parameter
{ algorithm = gainratio |
  binning = { true | false } |
  columnstoexclude = column_name [,...] |
  max_depth = max_depth |
  min_records = min_records |
  operatordatabase = operator_database_name |
  outputdatabase = output_database_name |
  outputtablename = output_table_name |
  override = { true | false } |
  overwrite = { true | false } |
  pruning = { gainratio | none }
}

Syntax Elements

database
The database containing the input table.
tablename
The input table from which to build a predictive model.
columns
The columns to analyze.
keyword Description
all All columns.
allnumeric All numeric columns.
allcharacter All character columns.
dependent
The name of an input table column whose values are to be predicted.
algorithm
[Optional] The algorithm the decision tree uses during building.
binning
[Optional] Whether to separate continuous data into 100 bins.
If the variable has fewer than 100 distinct values, the function ignores this option.
Default: false
columnstoexclude
[Optional] The columns to exclude when columns specifies a keyword.
Any groupby columns are automatically excluded.
max_depth
[Optional] The maximum number of levels the tree can grow.
Default: 100
min_records
[Optional] How far the decision tree can split. Unless a node is pure (meaning it has only observations with the same dependent value), it splits if each branch that can come off this node contains at least this many observations.
Default: minimum of two cases for each branch
operatordatabase
[Optional] The database where the table operators that td_analyze calls reside.
Default behavior: The function searches the standard search path for table operators.
outputdatabase
[Optional] The name of the database containing the output table.
outputtablename
[Optional] The name of the output table representing the decision tree model.
override
[Optional] An error occurs if the dependent variable has more than 100 distinct values. You can override this limitation and build a decision tree by adding the override parameter and setting it to true. Note that if you have too many distinct values, a Segmentation Violation may occur.
Default: false
overwrite
[Optional] Whether to drop the output tables before creating new ones.
Default: true
pruning
[Optional] The style of pruning to use after the tree is built.
Default: gainratio