Input - Aster Analytics

Teradata AsterĀ® Analytics Foundation User GuideUpdate 2

Product
Aster Analytics
Release Number
7.00.02
Published
September 2017
Language
English (United States)
Last Update
2018-04-17
dita:mapPath
uce1497542673292.ditamap
dita:ditavalPath
AA-notempfilter_pdf_output.ditaval
dita:id
B700-1022
lifecycle
previous
Product Category
Software

Single decision trees support millions of attributes. Because the database cannot have millions of columns, you must spread the attributes across rows in the form of key-value pairs, where key is the name of the attribute and value is the value of the attribute. The Unpivot function is useful for this purpose (see Example 1).

The Single_Tree_Drive function requires either an input table or both an attribute table and a response table. The function has two optional input tables, the splits table and the categorical splits table.

If the function encounters an input table row with missing values, it issues an error message.
Single_Tree_Drive Input Table Schema
Column Name Data Type Description
id_column Any Data point identifier. Cannot be NULL.
attribute_column VARCHAR Attribute name. Cannot be NULL.

Every attribute in the attribute table must be given a non-empty partition in the splits table.

node_column

Numeric attribute:

NUMERIC, INTEGER, BIGINT, or DOUBLE PRECISION

Categorical attribute:

Any

Attribute value. Can be NULL, in which case the function estimates its value by arithmetic means on an attribute basis. If this value is out of range, the function cannot use it to partition the training data; therefore, it is useless.
response_column NUMERIC, INTEGER, BIGINT, or DOUBLE PRECISION Response value for the data point. Can be NULL.
weight_column DOUBLE PRECISION Weight of the data point. Cannot be NULL. This column appears only if the decision tree is weighted.
actual_label VARCHAR Actual label of data point.
Single_Tree_Drive Attribute Table Schema
Column Name Data Type Description
id_column Any Data point identifier. Cannot be NULL.
attribute_column VARCHAR Attribute name. Cannot be NULL.

Every attribute in the attribute table must be given a non-empty partition in the splits table.

node_column

Numeric attribute:

NUMERIC, INTEGER, BIGINT, or DOUBLE PRECISION

Categorical attribute:

Any

Attribute value. Can be NULL, in which case the function estimates its value by arithmetic means on an attribute basis. If this value is out of range, the function cannot use it to partition the training data; therefore, it is useless.
actual_label VARCHAR Actual label of data point.
Single_Tree_Drive Response Table Schema
Column Name Data Type Description
id_column Any Data point identifier. Cannot be NULL.
response_column NUMERIC, INTEGER, BIGINT, or DOUBLE PRECISION Response value for the data point. Can be NULL.
weight_column DOUBLE PRECISION Weight of the data point. Cannot be NULL. This column appears only if the decision tree is weighted.
The response table must not have a column named node_id.
Single_Tree_Drive Splits Table Schema
Column Name Data Type Description
attribute_column VARCHAR Attribute name. Cannot be NULL.

Every attribute in the attribute table must be given a non-empty partition in the splits table.

split_id INTEGER Split identifier. Cannot be NULL.
splits_valcol NUMERIC, INTEGER, BIGINT, or DOUBLE PRECISION Split value. Cannot be NULL.
Single_Tree_Drive Categorical Splits Table Schema
Column Name Data Type Description
attribute VARCHAR Categorical attribute name.