DecisionTreePredict Input - Teradata® Database

Database Analytic Functions

Product
Teradata® Database
Release Number
17.10
Published
July 2021
Language
English (United States)
Last Update
2021-07-28
dita:mapPath
Teradata_Vantage™___Advanced_SQL_Engine_Analytic_Functions.withLogo_upload_July2021/wnd1589838592459.ditamap
dita:ditavalPath
Teradata_Vantage™___Advanced_SQL_Engine_Analytic_Functions.withLogo_upload_July2021/ayr1485454803741.ditaval
dita:id
B035-1206
lifecycle
previous
Product Category
Teradata Vantage™
Table Description
AttributeTable Contains test data. Has same schema as ML Engine DecisionTree InputTable.
Model Model output by ML Engine DecisionTree function.

AttributeTable Schema

See Teradata Vantage™ Machine Learning Engine Analytic Function Reference, B700-4003.

Model Schema

For CHARACTER and VARCHAR columns, CHARACTER SET must be either UNICODE or LATIN.

Double quotation marks around some column names are required because the names contain special characters.

Column Data Type Description
node_id INTEGER, SMALLINT, or BIGINT Node identifier.
node_size INTEGER, SMALLINT, or BIGINT Number of objects in node.
"node_gini(p)" or node_gini INTEGER, SMALLINT, BIGINT, NUMBER, or DOUBLE PRECISION GINI impurity value for information in node. For ImpurityMeasurement ('gini'), column name is node_gini(p); otherwise, it is node_gini.
"node_entropy(p)" or node_entropy INTEGER, SMALLINT, BIGINT, NUMBER, or DOUBLE PRECISION Entropy impurity value for the information in the node. For ImpurityMeasurement ('entropy'), column name is node_entropy(p); otherwise, it is node_entropy.
"node_chisq_pv(p)" or node_chisq_pv INTEGER, SMALLINT, BIGINT, NUMBER, or DOUBLE PRECISION Chi-square impurity value for the information in the node. For ImpurityMeasurement ('chisquare'), column name is node_chisq_pv(p); otherwise, it is node_chisq_pv.
node_label CHARACTER or VARCHAR Output category for node.
node_majorvotes INTEGER, SMALLINT, or BIGINT Number of objects that belong to category identified by node_label.
split_value INTEGER, SMALLINT, BIGINT, NUMBER, or DOUBLE PRECISION Numeric split value.
"split_gini(p)" or split_gini INTEGER, SMALLINT, BIGINT, NUMBER, or DOUBLE PRECISION GINI impurity measurement for information in node after splitting. For ImpurityMeasurement ('gini'), column name is split_gini(p); otherwise, it is split_gini.
"split_entropy(p)" or split_entropy INTEGER, SMALLINT, BIGINT, NUMBER, or DOUBLE PRECISION Entropy impurity measurement for the information in node after splitting. For ImpurityMeasurement ('entropy'), column name is split_entropy(p); otherwise, it is split_entropy.
"split_chisq_pv(p)" or split_chisq_pv INTEGER, SMALLINT, BIGINT, NUMBER, or DOUBLE PRECISION Chi-square impurity measurement for information in node after splitting. For ImpurityMeasurement ('chisquare'), column name is split_chisq_pv(p); otherwise, it is split_chisq_pv.
left_id INTEGER, SMALLINT, or BIGINT Identifier of left child of node.
left_size INTEGER, SMALLINT, or BIGINT Number of objects in left child of node.
left_label CHARACTER or VARCHAR Output category for left child of node.
left_majorvotes INTEGER, SMALLINT, or BIGINT Number of objects that belong to category identified by left_label.
right_id INTEGER, SMALLINT, or BIGINT Identifier of right child of node.
right_size INTEGER, SMALLINT, or BIGINT Number of objects in right child of node.
right_label CHARACTER or VARCHAR Output category for right child of node.
right_majorvotes INTEGER, SMALLINT, or BIGINT Number of objects that belong to category identified by right_label.
left_bucket CHARACTER or VARCHAR When split value is categorical attribute, value in left child of node.
right_bucket CHARACTER or VARCHAR When split value is categorical attribute, value in right child of node.
left_label_prob_list CHARACTER or VARCHAR [Column appears only with OutputResponseProbList ('true').] Probability of each label for left child of node.
right_label_prob_list CHARACTER or VARCHAR [Column appears only with OutputResponseProbList ('true').] Probability of each label for right child of node.
prob_label_order CHARACTER or VARCHAR [Column appears only with OutputResponseProbList ('true').] Label order probability for left and right children of node.
attribute CHARACTER or VARCHAR Split attribute.
node_majorfreq INTEGER, SMALLINT, BIGINT, NUMBER, or DOUBLE PRECISION [Column appears only with Weighted ('true').] Weighted objects that belong to category identified by node_label.
left_majorfreq INTEGER, SMALLINT, BIGINT, NUMBER, or DOUBLE PRECISION [Column appears only with Weighted ('true').] Weighted objects that belong to category identified by left_label.
right_majorfreq INTEGER, SMALLINT, BIGINT, NUMBER, or DOUBLE PRECISION [Column appears only with Weighted ('true').] Weighted objects that belong to category identified by right_label.