Table | Description |
---|---|
AttributeTable | Contains test data. Has same schema as ML Engine DecisionTree InputTable. |
Model | Model output by ML Engine DecisionTree function. |
AttributeTable Schema
See Teradata Vantage™ Machine Learning Engine Analytic Function Reference, B700-4003.
Model Schema
For CHARACTER and VARCHAR columns, CHARACTER SET must be either UNICODE or LATIN.
Double quotation marks around some column names are required because the names contain special characters.
Column | Data Type | Description |
---|---|---|
node_id | INTEGER, SMALLINT, or BIGINT | Node identifier. |
node_size | INTEGER, SMALLINT, or BIGINT | Number of objects in node. |
"node_gini(p)" or node_gini | INTEGER, SMALLINT, BIGINT, NUMBER, or DOUBLE PRECISION | GINI impurity value for information in node. For ImpurityMeasurement ('gini'), column name is node_gini(p); otherwise, it is node_gini. |
"node_entropy(p)" or node_entropy | INTEGER, SMALLINT, BIGINT, NUMBER, or DOUBLE PRECISION | Entropy impurity value for the information in the node. For ImpurityMeasurement ('entropy'), column name is node_entropy(p); otherwise, it is node_entropy. |
"node_chisq_pv(p)" or node_chisq_pv | INTEGER, SMALLINT, BIGINT, NUMBER, or DOUBLE PRECISION | Chi-square impurity value for the information in the node. For ImpurityMeasurement ('chisquare'), column name is node_chisq_pv(p); otherwise, it is node_chisq_pv. |
node_label | CHARACTER or VARCHAR | Output category for node. |
node_majorvotes | INTEGER, SMALLINT, or BIGINT | Number of objects that belong to category identified by node_label. |
split_value | INTEGER, SMALLINT, BIGINT, NUMBER, or DOUBLE PRECISION | Numeric split value. |
"split_gini(p)" or split_gini | INTEGER, SMALLINT, BIGINT, NUMBER, or DOUBLE PRECISION | GINI impurity measurement for information in node after splitting. For ImpurityMeasurement ('gini'), column name is split_gini(p); otherwise, it is split_gini. |
"split_entropy(p)" or split_entropy | INTEGER, SMALLINT, BIGINT, NUMBER, or DOUBLE PRECISION | Entropy impurity measurement for the information in node after splitting. For ImpurityMeasurement ('entropy'), column name is split_entropy(p); otherwise, it is split_entropy. |
"split_chisq_pv(p)" or split_chisq_pv | INTEGER, SMALLINT, BIGINT, NUMBER, or DOUBLE PRECISION | Chi-square impurity measurement for information in node after splitting. For ImpurityMeasurement ('chisquare'), column name is split_chisq_pv(p); otherwise, it is split_chisq_pv. |
left_id | INTEGER, SMALLINT, or BIGINT | Identifier of left child of node. |
left_size | INTEGER, SMALLINT, or BIGINT | Number of objects in left child of node. |
left_label | CHARACTER or VARCHAR | Output category for left child of node. |
left_majorvotes | INTEGER, SMALLINT, or BIGINT | Number of objects that belong to category identified by left_label. |
right_id | INTEGER, SMALLINT, or BIGINT | Identifier of right child of node. |
right_size | INTEGER, SMALLINT, or BIGINT | Number of objects in right child of node. |
right_label | CHARACTER or VARCHAR | Output category for right child of node. |
right_majorvotes | INTEGER, SMALLINT, or BIGINT | Number of objects that belong to category identified by right_label. |
left_bucket | CHARACTER or VARCHAR | When split value is categorical attribute, value in left child of node. |
right_bucket | CHARACTER or VARCHAR | When split value is categorical attribute, value in right child of node. |
left_label_prob_list | CHARACTER or VARCHAR | [Column appears only with OutputResponseProbList ('true').] Probability of each label for left child of node. |
right_label_prob_list | CHARACTER or VARCHAR | [Column appears only with OutputResponseProbList ('true').] Probability of each label for right child of node. |
prob_label_order | CHARACTER or VARCHAR | [Column appears only with OutputResponseProbList ('true').] Label order probability for left and right children of node. |
attribute | CHARACTER or VARCHAR | Split attribute. |
node_majorfreq | INTEGER, SMALLINT, BIGINT, NUMBER, or DOUBLE PRECISION | [Column appears only with Weighted ('true').] Weighted objects that belong to category identified by node_label. |
left_majorfreq | INTEGER, SMALLINT, BIGINT, NUMBER, or DOUBLE PRECISION | [Column appears only with Weighted ('true').] Weighted objects that belong to category identified by left_label. |
right_majorfreq | INTEGER, SMALLINT, BIGINT, NUMBER, or DOUBLE PRECISION | [Column appears only with Weighted ('true').] Weighted objects that belong to category identified by right_label. |