Adaptive Histogram - INPUT - Analysis Parameters - Teradata Warehouse Miner

Teradata® Warehouse Miner™ User Guide - Volume 1Introduction and Profiling

Product
Teradata Warehouse Miner
Release Number
5.4.6
Published
November 2018
Language
English (United States)
Last Update
2018-12-07
dita:mapPath
rfc1538171534881.ditamap
dita:ditavalPath
ft:empty
dita:id
B035-2300
Product Category
Software
  1. On Adaptive Histogram, select INPUT.
  2. Select analysis parameters.
    Adaptive Histogram > Input > Analysis Parameters

  3. On this screen select:
    • Adaptive Histogram Options
      • Spike Threshold — A percentage of rows, expressed as an integer (1 to 100), above which an individual value of a variable is identified as a separate bin. The default percentage is 10, (that is, 10% of the total number of rows). Values with this or a larger percentage of rows are identified as a Spike.
      • Subdivision Threshold — A percentage of rows, expressed as an integer (0 to 100), above which a bin is subdivided into sub-bins. The default percentage is 30, (that is, 30% of the total number of rows). Bins with this or a larger percentage of rows are subdivided into sub-bins using an algorithm that uses means and standard deviations.
    • Subdivision Method
      • Means — Option to subdivide overpopulated bins using means and standard deviations.
      • Quantiles — Option to subdivide overpopulated bins using quantiles.
    • Bin Values for Selected Columns — Each column selected for the Adaptive Histogram analysis appears in this list, along with the default bin values, depending upon the Bin Style selected. Next to Column Name, the following appear:
      • Bins — If Bins is selected, 10 appears as the number of bins to generate next to the column selected for the Histogram analysis. Select Change… to change the number of equal sized data bins. Entry must be an integer greater than 0.