Tutorial - Shapiro-Wilk Test Analysis - Teradata Warehouse Miner

Teradata Warehouse Miner User Guide - Volume 3Analytic Functions

Product
Teradata Warehouse Miner
Release Number
5.4.4
Published
July 2017
Language
English (United States)
Last Update
2018-05-03
dita:mapPath
lov1499730320967.ditamap
dita:ditavalPath
ft:empty
dita:id
B035-2302
Product Category
Software

In this example, a Shapiro-Wilk test analysis is performed on the fictitious banking data to analyze account usage.

  1. Parameterize a Shapiro-Wilk Test analysis as follows:
    • Available Tables — twm_customer_analysis
    • Column of Interest — income
    • Group By Columns — years_with_bank
    • Analysis Parameters
      • Threshold Probability — 0.05
  2. Run the analysis.
  3. Click Results when it completes.

    For this example, the Shapiro-Wilk Test analysis generated the following table. The Shapiro-Wilk Test was computed for each distinct value of the group by variable “years_with_bank”. Results were sorted by years_with_bank. The tests show the user that all incomes were not normally distributed.

    ‘p’ means significantly nonnormal and an ‘a’ means accept the null hypothesis of normality.
    The SQL is available for viewing but not listed below.
    Shapiro-Wilk Test
    years_with_bank Shw N ShapiroWilkPValue ShapiroWilkPText ShapiroWilkCallP_0.05
    0 0.84919004 88 0.000001   p
    1 0.843099681 87 0.000001   p
    2 0.831069533 94 0.000001   p
    3 0.838965439 86 0.000001   p
    4 0.707924134 78 0.000001   p
    5 0.768444329 82 0.000001   p
    6 0.855276885 83 0.000001   p
    7 0.827399691 65 0.000001   p
    8 0.863932178 45 0.01 <0.01 p
    9 0.930834522 39 0.029586304   p