Tutorial - Smirnov Test Analysis - Teradata Warehouse Miner

Teradata® Warehouse Miner™ User Guide - Volume 3: Analytic Functions

Product
Teradata Warehouse Miner
Release Number
5.4.6
Published
November 2018
Language
English (United States)
Last Update
2018-12-07
dita:id
B035-2302
Product Category
Software
In this example, a Smirnov test analysis is performed on the fictitious banking data to analyze account usage.
  1. Parameterize a Smirnov Test analysis as follows:
    • Available Tables — twm_customer
    • Column of Interest — income
    • Columns — gender
    • Group By Columns — years_with_bank
    • Analysis Parameters
      • Threshold Probability — 0.05
  2. Run the analysis.
  3. Click Results when complete.
    For this example, the Smirnov Test analysis generated the following table. The Smirnov Test was computed separately for each distinct value of the group by variable “years_with_bank”, comparing the income distributions of the two gender groups. Results were sorted by years_with_bank. The tests show that the income distributions of males and females were significantly different for every value of years_with_bank shown. In the last column, ‘p’ means the two distributions differ significantly, and ‘a’ means accept the null hypothesis that the two distributions are the same.
    The SQL is available for viewing but not listed below.
    Smirnov Test
    years_with_bank  M   N   D            SmirnovPValue  SmirnovPText  SmirnovCallP_0.01
    0                37  51  1.422949567  0.000101                     p
    1                39  48  1.371667516  0.000103                     p
    2                37  57  1.465841724  0.000101                     p
    3                40  46  1.409836326  0.000105                     p
    4                39  39  1.397308541  0.000146                     p
    5                36  46  1.309704108  0.000105                     p
    6                37  46  1.287964978  0.000104                     p
    7                29  36  1.336945293  0.000112                     p
    8                19  26  1.448297864  0.00011                      p
    9                16  23  1.403341724  0.000101                     p
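The per-group comparison described above can be illustrated outside the product with a minimal Python sketch. It is not the SQL that Warehouse Miner generates. It assumes the textbook two-sample Kolmogorov-Smirnov statistic (the largest gap between the two empirical distribution functions) and the asymptotic Kolmogorov series for the p-value, so its D and p-values need not match the table. The function names `ks_two_sample` and `smirnov_by_group` and the sample rows are hypothetical; the rows are not taken from twm_customer.

```python
from bisect import bisect_right
from collections import defaultdict
from math import exp, sqrt


def ks_two_sample(xs, ys):
    """Return (D, p) for a textbook two-sample Kolmogorov-Smirnov test.

    D is the largest absolute gap between the two empirical CDFs.
    p comes from the asymptotic Kolmogorov series, which is only an
    approximation for small samples (an assumption of this sketch).
    """
    xs, ys = sorted(xs), sorted(ys)
    m, n = len(xs), len(ys)
    # Evaluate both empirical CDFs at every observed value.
    d = max(
        abs(bisect_right(xs, v) / m - bisect_right(ys, v) / n)
        for v in xs + ys
    )
    lam = sqrt(m * n / (m + n)) * d
    if lam < 0.2:
        # The series converges slowly here and the true value is ~1.
        return d, 1.0
    p = 2 * sum((-1) ** (k - 1) * exp(-2 * k * k * lam * lam)
                for k in range(1, 101))
    return d, min(max(p, 0.0), 1.0)


def smirnov_by_group(rows, threshold=0.05):
    """Run the test once per group value.

    rows: iterable of (group_value, category, value) tuples, mirroring
    Group By Columns, Columns, and Column of Interest respectively.
    Returns tuples (group, size_a, size_b, D, p, call), where call is
    'p' (distributions differ) or 'a' (accept the null hypothesis).
    """
    groups = defaultdict(lambda: defaultdict(list))
    for group, category, value in rows:
        groups[group][category].append(value)
    results = []
    for group in sorted(groups):
        categories = sorted(groups[group])
        if len(categories) != 2:
            continue  # the test compares exactly two samples
        a = groups[group][categories[0]]
        b = groups[group][categories[1]]
        d, p = ks_two_sample(a, b)
        results.append((group, len(a), len(b), d, p,
                        "p" if p < threshold else "a"))
    return results


# Hypothetical data: (years_with_bank, gender, income).
sample_rows = (
    [(0, "F", v) for v in (10, 12, 14, 16, 18)]
    + [(0, "M", v) for v in (30, 32, 34, 36, 38)]
    + [(1, "F", v) for v in (10, 20, 30, 40)]
    + [(1, "M", v) for v in (10, 20, 30, 40)]
)
for row in smirnov_by_group(sample_rows, threshold=0.05):
    print(row)
```

In this made-up data, the two gender samples for group 0 do not overlap, so that group is called ‘p’, while the identical samples in group 1 are called ‘a’.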