5.4.5 - Chi Square Test - INPUT - Data Selection - Teradata Warehouse Miner

Teradata Warehouse Miner User Guide - Volume 3 Analytic Functions

Product
Teradata Warehouse Miner
Release Number
5.4.5
Published
February 2018
Language
English (United States)
Last Update
2018-05-04
  1. On the Tests Based on Contingency Tables dialog box, click INPUT.
  2. Click Data Selection.
    Chi Square Test > Input > Data Selection

  3. On this screen, select:
    • Select Input Source — Users may select between different sources of input.

      Selecting Table as the Input Source lets the user choose from the available databases, tables (or views), and columns in the usual manner. Selecting Analysis as the Input Source instead lets the user choose directly from the output of another analysis of a qualifying type in the current project. Qualifying analyses include all of the Analytic Data Set (ADS) and Reorganization analyses (except Refresh). With Analysis selected, Available Analyses takes the place of Available Databases, and Available Tables lists all of the output tables that the selected analysis will produce. If the analysis ordinarily produces its output with a Select statement, Available Tables instead contains a single entry, under the label Volatile Table, with the name of the analysis.

    • Select Columns From a Single Table
      • Available Databases (or Analyses) — These are the databases (or analyses) available to be processed.
      • Available Tables — These are the tables and views that are available to be processed.
      • Available Columns — These are the columns within the table/view that are available for processing.
    • Select Statistical Test Style — These are the Tests Based on Contingency Tables available (Chi Square, Median). Select “Chi Square”.
    • Select Optional Columns
      • Selected Columns — Select columns by highlighting them and then either dragging and dropping them into the Selected Columns window or clicking the arrow button to move them there.
      • First Columns/Second Columns — Note that the Selected Columns window is a split window; you can insert columns as First Columns or Second Columns. Make sure you have the correct portion of the window highlighted.
      • First Columns — The set of columns that specifies the first of a pair of variables for Chi Square analysis.
      • Second Columns — The set of columns that specifies the second of a pair of variables for Chi Square analysis.
        Each combination of the first and second variables generates a separate Chi Square test.
        To avoid excessively long execution, the number of combinations is limited to 100. In addition, the calculation for a pair is skipped unless the product of the numbers of distinct values in its two columns is 2000 or less.
        Group-By Columns are not available in the Chi Square Test.
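
The pairing and limit rules above can be sketched in a few lines of Python. This is only an illustration of the arithmetic, not the product's implementation: the function name plan_tests, the sample column names, and the in-memory rows are all hypothetical, and raising an error when more than 100 combinations are requested is this sketch's own choice, since the text does not say how the product reacts.

```python
from itertools import product

MAX_COMBINATIONS = 100       # limit on First/Second column combinations (from the text)
MAX_DISTINCT_PRODUCT = 2000  # limit on the product of distinct-value counts per pair


def plan_tests(rows, first_columns, second_columns):
    """Return (eligible_pairs, skipped_pairs) for the requested column pairs.

    rows is a list of dicts standing in for table rows (hypothetical data).
    """
    # Every First Column is paired with every Second Column.
    pairs = list(product(first_columns, second_columns))
    if len(pairs) > MAX_COMBINATIONS:
        # Assumption: the sketch rejects the request outright.
        raise ValueError(
            f"{len(pairs)} combinations requested; the limit is {MAX_COMBINATIONS}"
        )

    eligible, skipped = [], []
    for first, second in pairs:
        n_first = len({row[first] for row in rows})    # distinct values, first column
        n_second = len({row[second] for row in rows})  # distinct values, second column
        if n_first * n_second <= MAX_DISTINCT_PRODUCT:
            eligible.append((first, second))
        else:
            skipped.append((first, second))
    return eligible, skipped


# Hypothetical sample: customer_id has 1001 distinct values, so any pair
# that includes it exceeds the 2000 limit (2 * 1001 = 2002, 4 * 1001 = 4004).
rows = [
    {
        "gender": "F" if i % 2 else "M",
        "marital_status": i % 4,
        "income_band": i % 5,
        "customer_id": i,
    }
    for i in range(1001)
]

eligible, skipped = plan_tests(
    rows, ["gender", "marital_status"], ["income_band", "customer_id"]
)
print("tested:", eligible)
print("skipped:", skipped)
```

In this sample, two First Columns and two Second Columns yield four combinations. The two that involve the high-cardinality customer_id column are skipped, which is why identifier-like columns are usually poor candidates for either side of the pair.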