5.4.5 - Tutorial - Smirnov Test Analysis - Teradata Warehouse Miner

Teradata Warehouse Miner User Guide - Volume 3Analytic Functions

Product
Teradata Warehouse Miner
Release Number
5.4.5
Published
February 2018
Language
English (United States)
Last Update
2018-05-04
dita:mapPath
yuy1504291362546.ditamap
dita:ditavalPath
ft:empty

In this example, a Smirnov test analysis is performed on the fictitious banking data to analyze account usage.

  1. Parameterize a Smirnov Test analysis as follows:
    • Available Tables — twm_customer
    • Column of Interest — income
    • Columns — gender
    • Group By Columns — years_with_bank
    • Analysis Parameters
      • Threshold Probability — 0.05
  2. Run the analysis.
  3. Click Results when it completes.
    For this example, the Smirnov Test analysis generated the following table. The Smirnov Test was computed for each distinct value of the group by variable “years_with_bank”. Results were sorted by years_with_bank. The tests show distributions of incomes of males and females were different for all values of years_with_bank. ‘p’ means significantly nonnormal and an ‘a’ means accept the null hypothesis of normality.
    The SQL is available for viewing but not listed below.
    Smirnov Test
    years_with_bank M N D SmirnovPValue SmirnovPText SmirnovCallP_0.01
    0 37 51 1.422949567 0.000101   p
    1 39 48 1.371667516 0.000103   p
    2 37 57 1.465841724 0.000101   p
    3 40 46 1.409836326 0.000105   p
    4 39 39 1.397308541 0.000146   p
    5 36 46 1.309704108 0.000105   p
    6 37 46 1.287964978 0.000104   p
    7 29 36 1.336945293 0.000112   p
    8 19 26 1.448297864 0.00011   p
    9 16 23 1.403341724 0.000101   p