Large Input Tables - Teradata Warehouse Miner

Teradata® Warehouse Miner™ User Guide - Volume 1: Introduction and Profiling

Product: Teradata Warehouse Miner
Release Number: 5.4.6
Published: November 2018
Language: English (United States)
Last Update: 2018-12-07
dita:id: B035-2300
Product Category: Software
The following types of analysis may be impractical when run against billions of rows of input data, because of exceedingly long run times or excessive spool file usage:
  • Values analysis with count of unique values requested
  • Statistics analysis with extended options
  • Histogram analysis with quantiles option
  • Histogram analysis with overlay and stats options
  • Adaptive Histogram with quantiles option
  • Scatter Plot (long load time; for example, 15 minutes)
  • Variable Transformation Bincoding with quantiles option
  • Logistic Regression/Scoring Reports (lift, and so on)
  • Cluster and Cluster Scoring (all varieties except In-Database Fast KMeans)
  • Various Statistical Tests
    • Median Test
    • 2-Way F-Test with Unequal Cell Counts
    • F-Test N-Way
    • Kolmogorov-Smirnov Test
    • Lilliefors Test
    • Shapiro-Wilk Test
    • D’Agostino and Pearson Test
    • Smirnov Test
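
Many of the options listed above (unique value counts, quantiles, medians, rank-based tests) share a trait that may help explain the cost: they generally cannot be computed as simple running totals. The following is a minimal Python sketch of that difference, offered as an illustration only. It is not Teradata Warehouse Miner code, it does not describe how the product implements these analyses, and the function names are hypothetical. It assumes that the expensive options behave like the exact unique count and exact median shown here, where the working set grows with the input, while basic statistics need only a few accumulators.

```python
# Illustrative only: plain Python, not Teradata Warehouse Miner code.
# All function names here are hypothetical.
from typing import Iterable, Tuple


def streaming_stats(values: Iterable[float]) -> Tuple[int, float, float, float]:
    """Count, mean, min and max in one pass with constant working memory."""
    count = 0
    total = 0.0
    lo = float("inf")
    hi = float("-inf")
    for v in values:
        count += 1
        total += v
        lo = min(lo, v)
        hi = max(hi, v)
    mean = total / count if count else float("nan")
    return count, mean, lo, hi


def exact_unique_count(values: Iterable[float]) -> int:
    """Exact distinct count: every distinct value seen must be remembered."""
    seen = set()
    for v in values:
        seen.add(v)
    return len(seen)


def exact_median(values: Iterable[float]) -> float:
    """Exact median: all values must be materialized and sorted."""
    ordered = sorted(values)
    n = len(ordered)
    if n == 0:
        raise ValueError("median of empty input")
    mid = n // 2
    if n % 2:
        return ordered[mid]
    return (ordered[mid - 1] + ordered[mid]) / 2
```

In this sketch, `streaming_stats` keeps four accumulators regardless of input size, whereas `exact_unique_count` and `exact_median` hold intermediate data proportional to the number of distinct values or rows. On a table with billions of rows, intermediate results of that kind are a plausible source of long run times and heavy spool usage.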