Histogram - INPUT - Data Selection

Teradata Warehouse Miner User Guide - Volume 1: Introduction and Profiling

brand: Software
prodname: Teradata Warehouse Miner
vrm_release: 5.4.4
category: User Guide
featnum: B035-2300-077K

  1. On the Histogram dialog box, click on INPUT.
  2. Click on Data Selection.
    Histogram > Input > Data Selection

  3. On this screen select:
    • Select Input Source — Users who are not using the Teradata Profiler program can choose between the Table and Analysis input sources. With the Table option, the user selects from the available databases, tables (or views), and columns in the usual manner. With the Analysis option, the user selects directly from the output of another analysis of a qualifying type in the current project. Qualifying analyses include all of the Analytic Data Set (ADS) and Reorganization analyses (except Refresh). With the Analysis option, Available Analyses takes the place of Available Databases, and Available Tables contains either a list of all the output tables that the selected analysis will eventually produce or a single entry, under the label Volatile Table, with the name of the analysis. That single entry represents the output of an analysis that is ordinarily produced by a Select statement. For more information, see INPUT Tab.
      View is only available when a single column is selected or when crosstab is selected.
    • Select Columns From a Single Table
      • Available Databases (or Analyses) — Choose the database (or analysis) from which you will select data tables.
      • Available Tables — Select the table from which you will select columns.
      • Available Columns — Select columns by highlighting them and then either dragging and dropping them into the Selected Columns window or clicking the arrow button to move them into the Selected Columns window.
    • Select Histogram Style
      • Histogram Style
        • Basic — Option to create a histogram for individual columns.
        • Crosstab — Option to create a multidimensional histogram by combining columns.
      • Select Optional Columns
        • Overlay Columns — Expand the Overlay Columns selector by clicking on the “double-up-arrow”. This is a list of overlay columns used to subdivide each bin. An overlay column is typically a categorical variable with only a few values. If an overlay column is specified, frequencies within each bin are calculated for each value of that overlay column (if more than one overlay column is requested, frequencies are given for each combination of their values). A specific column can be used in either Overlay Columns or Statistics Columns, but not both. Select columns by highlighting them and then either dragging and dropping them into the Overlay Columns window or clicking the arrow button to move them into the Overlay Columns window.
        • Statistics Columns — Expand the Statistics Columns selector by clicking on the “double-up-arrow”. This is a list of numeric columns or aliases for which simple statistics (minimum, maximum, mean, and standard deviation) are calculated in each bin. Not available for DATE columns. A specific column can be used in either Statistics Columns or Overlay Columns, but not both. Select columns by highlighting them and then either dragging and dropping them into the Statistics Columns window or clicking the arrow button to move them into the Statistics Columns window.
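
To make the effect of the optional columns concrete, the following Python sketch computes a basic histogram on one column, subdivides each bin by an overlay column, and computes simple statistics for a statistics column within each bin. It is an illustration only, not the SQL that Teradata Warehouse Miner generates. The function names (`bin_index`, `histogram`), the sample columns (`age`, `gender`, `income`), the equal-width binning, and the use of the population form of the standard deviation are all assumptions made for the example.

```python
import math
from collections import Counter, defaultdict


def bin_index(value, lo, hi, nbins):
    """Return the 0-based equal-width bin number for value in [lo, hi]."""
    if hi == lo:
        return 0
    idx = int((value - lo) / (hi - lo) * nbins)
    return min(idx, nbins - 1)  # the maximum value falls in the last bin


def histogram(rows, column, nbins, overlay=None, stats=None):
    """Bin `column`; optionally split bins by `overlay` and summarize `stats`."""
    values = [r[column] for r in rows]
    lo, hi = min(values), max(values)
    counts = Counter()                    # bin -> frequency
    overlay_counts = defaultdict(Counter)  # bin -> overlay value -> frequency
    stat_values = defaultdict(list)        # bin -> values of the stats column
    for r in rows:
        b = bin_index(r[column], lo, hi, nbins)
        counts[b] += 1
        if overlay is not None:
            overlay_counts[b][r[overlay]] += 1
        if stats is not None:
            stat_values[b].append(r[stats])
    result = []
    for b in range(nbins):
        entry = {"bin": b, "count": counts[b]}
        if overlay is not None:
            entry["overlay"] = dict(overlay_counts[b])
        if stats is not None and stat_values[b]:
            xs = stat_values[b]
            mean = sum(xs) / len(xs)
            # Assumption: population standard deviation (divide by n).
            std = math.sqrt(sum((x - mean) ** 2 for x in xs) / len(xs))
            entry["stats"] = {"min": min(xs), "max": max(xs),
                              "mean": mean, "std": std}
        result.append(entry)
    return result


# Hypothetical sample data: bin on age, overlay by gender, statistics on income.
rows = [
    {"age": 20, "gender": "F", "income": 30000},
    {"age": 25, "gender": "M", "income": 40000},
    {"age": 40, "gender": "F", "income": 50000},
    {"age": 60, "gender": "M", "income": 70000},
]
result = histogram(rows, "age", nbins=2, overlay="gender", stats="income")
for entry in result:
    print(entry)
```

In this sketch each bin reports its total frequency, the frequency for each overlay value, and the minimum, maximum, mean, and standard deviation of the statistics column. Note that `gender` and `income` play different roles, in line with the rule that a column can be an overlay column or a statistics column, but not both.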