DecisionTreePredict Input - Teradata® Database

Database Analytic Functions

Product

Teradata® Database

Release Number

17.10

Published

July 2021

Language

English (United States)

Last Update

2021-07-28

dita:mapPath

Teradata_Vantage™___Advanced_SQL_Engine_Analytic_Functions.withLogo_upload_July2021/wnd1589838592459.ditamap

dita:ditavalPath

Teradata_Vantage™___Advanced_SQL_Engine_Analytic_Functions.withLogo_upload_July2021/ayr1485454803741.ditaval

dita:id

B035-1206

lifecycle

Product Category

Teradata Vantage™

Table	Description
AttributeTable	Contains test data. Has same schema as ML Engine DecisionTree InputTable.
Model	Model output by ML Engine DecisionTree function.

AttributeTable Schema

See Teradata Vantage™ Machine Learning Engine Analytic Function Reference, B700-4003.

Model Schema

For CHARACTER and VARCHAR columns, CHARACTER SET must be either UNICODE or LATIN.

Double quotation marks around some column names are required because the names contain special characters.

Column	Data Type	Description
node_id	INTEGER, SMALLINT, or BIGINT	Node identifier.
node_size	INTEGER, SMALLINT, or BIGINT	Number of objects in node.
"node_gini(p)" or node_gini	INTEGER, SMALLINT, BIGINT, NUMBER, or DOUBLE PRECISION	GINI impurity value for information in node. For ImpurityMeasurement ('gini'), column name is node_gini(p); otherwise, it is node_gini.
"node_entropy(p)" or node_entropy	INTEGER, SMALLINT, BIGINT, NUMBER, or DOUBLE PRECISION	Entropy impurity value for the information in the node. For ImpurityMeasurement ('entropy'), column name is node_entropy(p); otherwise, it is node_entropy.
"node_chisq_pv(p)" or node_chisq_pv	INTEGER, SMALLINT, BIGINT, NUMBER, or DOUBLE PRECISION	Chi-square impurity value for the information in the node. For ImpurityMeasurement ('chisquare'), column name is node_chisq_pv(p); otherwise, it is node_chisq_pv.
node_label	CHARACTER or VARCHAR	Output category for node.
node_majorvotes	INTEGER, SMALLINT, or BIGINT	Number of objects that belong to category identified by node_label.
split_value	INTEGER, SMALLINT, BIGINT, NUMBER, or DOUBLE PRECISION	Numeric split value.
"split_gini(p)" or split_gini	INTEGER, SMALLINT, BIGINT, NUMBER, or DOUBLE PRECISION	GINI impurity measurement for information in node after splitting. For ImpurityMeasurement ('gini'), column name is split_gini(p); otherwise, it is split_gini.
"split_entropy(p)" or split_entropy	INTEGER, SMALLINT, BIGINT, NUMBER, or DOUBLE PRECISION	Entropy impurity measurement for the information in node after splitting. For ImpurityMeasurement ('entropy'), column name is split_entropy(p); otherwise, it is split_entropy.
"split_chisq_pv(p)" or split_chisq_pv	INTEGER, SMALLINT, BIGINT, NUMBER, or DOUBLE PRECISION	Chi-square impurity measurement for information in node after splitting. For ImpurityMeasurement ('chisquare'), column name is split_chisq_pv(p); otherwise, it is split_chisq_pv.
left_id	INTEGER, SMALLINT, or BIGINT	Identifier of left child of node.
left_size	INTEGER, SMALLINT, or BIGINT	Number of objects in left child of node.
left_label	CHARACTER or VARCHAR	Output category for left child of node.
left_majorvotes	INTEGER, SMALLINT, or BIGINT	Number of objects that belong to category identified by left_label.
right_id	INTEGER, SMALLINT, or BIGINT	Identifier of right child of node.
right_size	INTEGER, SMALLINT, or BIGINT	Number of objects in right child of node.
right_label	CHARACTER or VARCHAR	Output category for right child of node.
right_majorvotes	INTEGER, SMALLINT, or BIGINT	Number of objects that belong to category identified by right_label.
left_bucket	CHARACTER or VARCHAR	When split value is categorical attribute, value in left child of node.
right_bucket	CHARACTER or VARCHAR	When split value is categorical attribute, value in right child of node.
left_label_prob_list	CHARACTER or VARCHAR	[Column appears only with OutputResponseProbList ('true').] Probability of each label for left child of node.
right_label_prob_list	CHARACTER or VARCHAR	[Column appears only with OutputResponseProbList ('true').] Probability of each label for right child of node.
prob_label_order	CHARACTER or VARCHAR	[Column appears only with OutputResponseProbList ('true').] Label order probability for left and right children of node.
attribute	CHARACTER or VARCHAR	Split attribute.
node_majorfreq	INTEGER, SMALLINT, BIGINT, NUMBER, or DOUBLE PRECISION	[Column appears only with Weighted ('true').] Weighted objects that belong to category identified by node_label.
left_majorfreq	INTEGER, SMALLINT, BIGINT, NUMBER, or DOUBLE PRECISION	[Column appears only with Weighted ('true').] Weighted objects that belong to category identified by left_label.
right_majorfreq	INTEGER, SMALLINT, BIGINT, NUMBER, or DOUBLE PRECISION	[Column appears only with Weighted ('true').] Weighted objects that belong to category identified by right_label.