Cluster - OUTPUT - Teradata Warehouse Miner

Teradata Warehouse Miner User Guide - Volume 3 Analytic Functions

Product: Teradata Warehouse Miner
Release Number: 5.4.4
Published: July 2017
Language: English (United States)
Last Update: 2018-05-03
dita:id: B035-2302
Product Category: Software

This screen does not apply to the Fast K-Means algorithm.

  1. On the Clustering dialog box, click OUTPUT.
    Cluster > OUTPUT

  2. On this screen, select:
    • Store the variables table of this analysis in the database: Select this check box to store the variables table of this analysis as two tables in the database, one for cluster columns and one for cluster results.
    • Database Name: The name of the database in which to create the output tables.
    • Output Table Prefix: The prefix for the output table names. For example, if test is entered here, the tables test_ClusterColumns and test_ClusterResults are created.
    • Advertise Output: Select this option to “advertise” the output by inserting information into one or more of the Advertise Output metadata tables, according to the type of analysis and the options selected in the analysis.
    • Advertise Note: An optional note that can be specified when the Advertise Output option is selected, or when the Always Advertise option is selected on the Connection Properties dialog box. It is a free-form text field of up to 30 characters that can be used to categorize or describe the output.

      For example, running the tutorial example with the prefix test yields the table test_ClusterResults:

      test_ClusterResults
      column_ix  cluster_id  priors              m                  v
      1          1           0.0692162138434691  -2231.95933518596  7306685.95957656
      1          2           0.403625379654599   -947.132576882845  846532.221977884
      1          3           0.527158406501931   -231.599917701351  105775.923364194
      2          1           0.0692162138434691  3733.31923440023   18669805.3968291
      2          2           0.403625379654599   1293.34863525092   1440668.11504453
      2          3           0.527158406501931   231.817911577847   102307.594966697
      3          1           0.0692162138434691  3725.87257974281   18930649.6488828
      3          2           0.403625379654599   632.603945909026   499736.882919713
      3          3           0.527158406501931   163.869611182736   57426.9984808451

      It also yields the table test_ClusterColumns:

      test_ClusterColumns
      table_name             column_name  column_alias  column_order  index_flag  variable_type
      twm_customer_analysis  avg_cc_bal   avg_cc_bal    1             0           1
      twm_customer_analysis  avg_ck_bal   avg_ck_bal    2             0           1
      twm_customer_analysis  avg_sv_bal   avg_sv_bal    3             0           1
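
      In the sample test_ClusterResults rows, the priors value depends only on cluster_id, and the three values add up to approximately 1. A minimal query sketch along these lines can check whether another run shows the same pattern. It assumes the output tables were created with the prefix test in the current default database; the aliases cluster_count and priors_total are illustrative names only.

      ```sql
      -- Sketch only: assumes the default database holds test_ClusterResults.
      -- One output row per clustered variable (column_ix).
      SELECT column_ix,
             COUNT(*)    AS cluster_count,  -- number of clusters stored for this variable
             SUM(priors) AS priors_total    -- sum of the priors values across clusters
      FROM test_ClusterResults
      GROUP BY column_ix
      ORDER BY column_ix;
      ```

      For the sample rows shown, each column_ix group would return a cluster_count of 3 and a priors_total of approximately 1.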

      If Database Name is twm_results and Output Table Prefix is test, these tables are defined respectively as:

      CREATE SET TABLE twm_results.test_ClusterResults
           (
            column_ix INTEGER,
            cluster_id INTEGER,
            priors FLOAT,
            m FLOAT,
            v FLOAT)
      UNIQUE PRIMARY INDEX ( column_ix ,cluster_id );
      
      
      CREATE SET TABLE twm_results.test_ClusterColumns
           (
            table_name VARCHAR(30) CHARACTER SET UNICODE NOT CASESPECIFIC,
            column_name VARCHAR(30) CHARACTER SET UNICODE NOT CASESPECIFIC,
            column_alias VARCHAR(100) CHARACTER SET UNICODE NOT CASESPECIFIC,
            column_order SMALLINT,
            index_flag SMALLINT,
            variable_type INTEGER)
      UNIQUE PRIMARY INDEX ( table_name ,column_name );
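
      To read the results by column name rather than by index number, the two tables can be joined. The sketch below assumes that column_ix in test_ClusterResults corresponds to column_order in test_ClusterColumns. This matches the sample rows (both run from 1 to 3) but is not stated explicitly here, so verify it against your own output before relying on it.

      ```sql
      -- Sketch only: the join condition is an assumption, not a documented key.
      SELECT c.column_name,
             r.cluster_id,
             r.priors,
             r.m,
             r.v
      FROM twm_results.test_ClusterResults AS r
      INNER JOIN twm_results.test_ClusterColumns AS c
              ON c.column_order = r.column_ix   -- assumed correspondence
      ORDER BY r.cluster_id, c.column_order;
      ```

      If the assumption holds for the tutorial data, this returns nine rows, one per combination of the three clustered columns and three clusters.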