Adaptive Histogram - INPUT - Analysis Parameters - Teradata Warehouse Miner
Teradata® Warehouse Miner™ User Guide - Volume 1Introduction and Profiling
- Product
- Teradata Warehouse Miner
- Release Number
- 5.4.6
- Published
- November 2018
- Language
- English (United States)
- Last Update
- 2018-12-07
- dita:mapPath
- rfc1538171534881.ditamap
- dita:ditavalPath
- ft:empty
- dita:id
- B035-2300
- Product Category
- Software
-
On Adaptive Histogram, select INPUT.
-
Select analysis parameters.
Adaptive Histogram > Input > Analysis Parameters
-
On this screen select:
-
Adaptive Histogram Options
-
Spike Threshold — A percentage of rows, expressed as an integer (1 to 100), above which an individual value of a variable is identified as a separate bin. The default percentage is 10, (that is, 10% of the total number of rows). Values with this or a larger percentage of rows are identified as a Spike.
-
Subdivision Threshold — A percentage of rows, expressed as an integer (0 to 100), above which a bin is subdivided into sub-bins. The default percentage is 30, (that is, 30% of the total number of rows). Bins with this or a larger percentage of rows are subdivided into sub-bins using an algorithm that uses means and standard deviations.
-
Subdivision Method
-
Means — Option to subdivide overpopulated bins using means and standard deviations.
-
Quantiles — Option to subdivide overpopulated bins using quantiles.
-
Bin Values for Selected Columns — Each column selected for the Adaptive Histogram analysis appears in this list, along with the default bin values, depending upon the Bin Style selected. Next to Column Name, the following appear:
-
Bins — If Bins is selected, 10 appears as the number of bins to generate next to the column selected for the Histogram analysis. Select Change… to change the number of equal sized data bins. Entry must be an integer greater than 0.