Distribution Matching Function | Teradata Vantage - Distribution Matching (ML Engine)

Distribution Matching Function | Teradata Vantage - Distribution Matching (ML Engine) - Teradata Vantage

Machine Learning Engine Analytic Function Reference

Product

Teradata Vantage

Release Number

9.02

9.01

2.0

1.3

Published

February 2022

Language

English (United States)

Last Update

2022-02-10

dita:mapPath

rnn1580259159235.ditamap

dita:ditavalPath

ybt1582220416951.ditaval

dita:id

B700-4003

lifecycle

Product Category

Teradata Vantage™

Given sample data and reference distributions, the function tests the hypothesis that the sample data comes from the distributions (Hypothesis-Test Mode (ML Engine)). Given the test results, the function finds the distribution that best matches the sample data (Best-Match Mode (ML Engine)).

The Distribution Matching function is composed of the functions DistributionMatchReduce and DistributionMatchMultiInput. DistributionMatchReduce supports these distributions:

For continuous variables:
- Beta
- Cauchy
- ChiSq
- Exponential
- F
- Gamma
- Lognormal
- Normal
- T
- Triangular
- Uniform
- Weibull
For discrete variables:
- Binomial
- Geometric
- Negative binomial
- Poisson
- Uniform

For evaluating the fit of the distribution to the data, the function supports these tests:

Anderson-Darling test
Kolmogorov-Smirnov test
Cramér-von Mises criterion (hypothesis testing only)
Pearson’s Chi-squared test