Data Requirements - Teradata Warehouse Miner

Teradata Warehouse Miner User Guide - Volume 3Analytic Functions

Product
Teradata Warehouse Miner
Release Number
5.4.5
Published
February 2018
Language
English (United States)
Last Update
2018-05-04
dita:mapPath
yuy1504291362546.ditamap
dita:ditavalPath
ft:empty
dita:id
B035-2302
Product Category
Software

The following chart summarizes how the Statistical Test functions handle various types of input. Those cases with the note “should be normal numeric” will give warnings for any type of input that is not standard numeric (i.e., for character data, dates, big integers or decimals, etc.).

In the table below, cat is an abbreviation for categorical, num for numeric and bignum for big integers or decimals:

Statistical Test Functions Handling of Input
Test Input Columns Tests Return Results With Note
Median column of interest cat, num, date, bignum can be anything
Median columns cat, num, date, bignum can be anything
Median group by columns cat, num, date, bignum can be anything
Chi Square 1st columns cat, num, date, bignum can be anything (limit of 2000 distinct value pairs)
Chi Square 2nd columns cat, num, date, bignum can be anything
Mann Whitney column of interest cat, num, date, bignum can be anything
Mann Whitney columns cat, num, date, bignum can be anything
Mann Whitney group by columns cat, num, date, bignum can be anything
Wilcoxon 1st column num, date, bignum should be normal numeric
Wilcoxon 2nd column num, date, bignum should be normal numeric
Wilcoxon group by columns cat, num, date, bignum can be anything
Friedman column of interest num should be normal numeric
Friedman treatment column   special count requirements
Friedman block column   special count requirements
Friedman group by columns cat, num, date, bignum can be anything
F(n)way column of interest num should be normal numeric
F(n)way columns cat, num, date, bignum can be anything
F(n)way group by columns cat, num, date, bignum can be anything
F(2)way ucc column of interest num should be normal numeric
F(2)way ucc columns cat, num, date, bignum can be anything
F(2)way ucc group by columns cat, num, date, bignum can be anything
T Paired 1st column num should be normal numeric
T Paired 2nd column num, date, bignum should be normal numeric
T Paired group by columns cat, num, date, bignum can be anything
T Unpaired 1st column num should be normal numeric
T Unpaired 2nd column num, date, bignum should be normal numeric
T Unpaired group by columns cat, num, date, bignum can be anything
T Unpaired w ind 1st column num should be normal numeric
T Unpaired w ind indicator column cat, num, date, bignum can be anything
T Unpaired w ind group by columns cat, num, date, bignum can be anything
Kolmogorov-Smirnov column of interest num, date, bignum should be normal numeric
Kolmogorov-Smirnov group by columns cat, num, date, bignum can be anything
Lilliefors column of interest num, date, bignum should be normal numeric
Lilliefors group by columns cat, num, bignum can be anything but date
Shapiro-Wilk column of interest num, date, bignum should be normal numeric
Shapiro-Wilk group by columns cat, num, date, bignum can be anything
D'Agostino-Pearson column of interest num should be normal numeric
D'Agostino-Pearson group by columns cat, num, bignum can be anything but date
Smirnov column of interest cat, num, date, bignum should be normal numeric
Smirnov columns must be 2 distinct values must be 2 distinct values
Smirnov group by columns cat, num, bignum can be anything but date
Binomial 1st column num, date, bignum should be normal numeric
Binomial 2nd column num, date, bignum should be normal numeric
Binomial group by columns cat, num, date, bignum can be anything
Sign 1st column num, bignum should be normal numeric
Sign group by columns cat, num, date, bignum can be anything