Chaid Trees - Teradata Warehouse Miner

Teradata® Warehouse Miner™ User Guide - Volume 3Analytic Functions

Product
Teradata Warehouse Miner
Release Number
5.4.6
Published
November 2018
Language
English (United States)
Last Update
2018-12-07
dita:mapPath
yor1538171534879.ditamap
dita:ditavalPath
ft:empty
dita:id
B035-2302
Product Category
Software

CHAID trees utilize the chi squared significance test as a means of partitioning data. Independent variables are tested by looping through the values and merging categories that have the least significant difference from one another and also are still below the merging significance level parameter (default .05). Once all independent variables have been optimally merged the one with the highest significance is chosen for the split, the data is subdivided, and the process is repeated on the subsets of the data. The splitting stops when the significance goes above the splitting significance level (default .05).

For a detailed description of this type of tree, see [Kass].