Histogram - INPUT - Analysis Parameters

Teradata Warehouse Miner User Guide - Volume 1Introduction and Profiling

brand
Software
prodname
Teradata Warehouse Miner
vrm_release
5.4.4
category
User Guide
featnum
B035-2300-077K
  1. On the Histogram dialog box, click on INPUT.
  2. Click on analysis parameters.
    Histogram > Input > Analysis Parameters

  3. On this screen, select:
    • Bin Style
      • Bins — Specify a number of equal sized data bins. By default 10 bins are derived for each column.
      • Widths — Specify the desired width of each bin.
      • Quantiles — Specify a number of bins with a nearly equal number of values. By default, 10 bins are derived for each column.
      • Boundaries — Specify a list of the desired boundaries for each bin to start, with the final value indicating the end of the last bin. Note that bin 0 will be generated if necessary to contain data values less than the first boundary specified, and bin N+1 will be generated if necessary for those data values greater than the final boundary value.
      • Bins with Boundaries — Specify the number of desired equal sized data bins, along with minimum and maximum values. By default, 10 bins are derived for each column. Note that bin 0 will be generated to contain data values less than the minimum specified, and bin N+1 will be generated for those data values greater than the maximum specified.
    • Bin Values for Selected Columns

      Each column selected for the Histogram analysis appears in this list, along with the default bin values, depending upon the Bin Style selected. Next to Column Name, one of the following appears:

      • Bins — If Bins is selected, 10 appears as the number of bins to generate next to the column selected for the Histogram analysis. Highlight the Number of Bins to change the desired number of equal sized data bins. Entry must be an integer greater than 0.
      • Widths — If Widths is selected, 0 appears next to the column selected for the histogram analysis. Highlight the Bin Width to enter the desired number of equal sized data bins. The values specified must be greater than 0.
      • Quantiles — If Quantiles is selected, 10 appears as the number of bins to generate next to the column selected for the Histogram analysis. Highlight the Number of Quantiles to change the desired number of bins with a nearly equal number of values. Entry must be an integer greater than 0.
      • Boundaries — If Boundaries is selected, enter for each requested column a list of numeric values corresponding to the starting values of each bin, plus one final value indicating the closing boundary of the final bin. A boundary list must contain two or more increasing numeric values, with dates entered as integer values in YYYYMMDD format.
      • Bins with Boundaries — If Bins with Boundaries is selected, 10 appears by default as the number of bins to generate next to each column selected for the Histogram analysis. Highlight the Number of Bins to change the number of equal sized data bins if desired, and then enter a minimum and maximum value.