Distribution Matching (ML Engine) | Teradata Vantage

Machine Learning Engine Analytic Function Reference

Product: Teradata Vantage
Release Number: 9.02, 9.01, 2.0, 1.3
Published: February 2022
Language: English (United States)
Last Update: 2022-02-10

The Distribution Matching function has two modes. In hypothesis-test mode (see Hypothesis-Test Mode (ML Engine)), given sample data and reference distributions, the function tests the hypothesis that the sample data come from each reference distribution. In best-match mode (see Best-Match Mode (ML Engine)), the function uses the test results to find the reference distribution that best matches the sample data.
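
The two modes can be illustrated outside ML Engine with a minimal Python sketch. It is not the Teradata implementation: the helper names (ks_statistic, ks_critical_value, hypothesis_test, best_match) are hypothetical, the Kolmogorov-Smirnov statistic stands in for whichever test is chosen, the reference parameters are assumed to be fully specified rather than estimated, and the critical value uses the standard asymptotic approximation. The sample is built from evenly spaced normal quantiles so that the run is reproducible.

```python
import math
from statistics import NormalDist


def exponential_cdf(x, rate):
    """CDF of an exponential distribution with the given rate."""
    return 1.0 - math.exp(-rate * x) if x > 0 else 0.0


def uniform_cdf(x, low, high):
    """CDF of a continuous uniform distribution on [low, high]."""
    return min(1.0, max(0.0, (x - low) / (high - low)))


def ks_statistic(sample, cdf):
    """Largest gap between the sample's empirical CDF and a reference CDF."""
    xs = sorted(sample)
    n = len(xs)
    d = 0.0
    for i, x in enumerate(xs):
        f = cdf(x)
        # The empirical CDF jumps from i/n to (i+1)/n at x; check both sides.
        d = max(d, (i + 1) / n - f, f - i / n)
    return d


def ks_critical_value(n, alpha=0.05):
    """Asymptotic critical value, valid for a fully specified reference."""
    return math.sqrt(-0.5 * math.log(alpha / 2.0)) / math.sqrt(n)


def hypothesis_test(sample, references, alpha=0.05):
    """Hypothesis-test mode: map each reference to (statistic, rejected)."""
    crit = ks_critical_value(len(sample), alpha)
    results = {}
    for name, cdf in references.items():
        d = ks_statistic(sample, cdf)
        results[name] = (d, d > crit)
    return results


def best_match(results):
    """Best-match mode: the reference with the smallest test statistic."""
    return min(results, key=lambda name: results[name][0])


# Synthetic sample: 200 evenly spaced quantiles of Normal(mean=10, sd=2).
n = 200
true_dist = NormalDist(mu=10.0, sigma=2.0)
sample = [true_dist.inv_cdf((i + 0.5) / n) for i in range(n)]

# Reference distributions, each given as a CDF with fixed parameters.
references = {
    "normal(10, 2)": true_dist.cdf,
    "exponential(rate=0.1)": lambda x: exponential_cdf(x, 0.1),
    "uniform(0, 20)": lambda x: uniform_cdf(x, 0.0, 20.0),
}

results = hypothesis_test(sample, references)
for name, (d, rejected) in results.items():
    print(f"{name}: D = {d:.4f}, rejected = {rejected}")
print("best match:", best_match(results))
```

In this sketch, best-match mode simply reuses the statistics computed in hypothesis-test mode and picks the smallest one, mirroring the description above that the best match is derived from the test results.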

The Distribution Matching function is composed of the functions DistributionMatchReduce and DistributionMatchMultiInput. DistributionMatchReduce supports these distributions:

  • For continuous variables:
    • Beta
    • Cauchy
    • ChiSq
    • Exponential
    • F
    • Gamma
    • Lognormal
    • Normal
    • T
    • Triangular
    • Uniform
    • Weibull
  • For discrete variables:
    • Binomial
    • Geometric
    • Negative binomial
    • Poisson
    • Uniform

To evaluate how well a distribution fits the data, the function supports these tests:

  • Anderson-Darling test
  • Kolmogorov-Smirnov test
  • Cramér-von Mises criterion (hypothesis testing only)
  • Pearson’s Chi-squared test
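
For discrete data, Pearson's Chi-squared test compares observed category counts with the counts expected under the reference distribution. The following is a minimal Python sketch under stated assumptions, not the ML Engine implementation: the function names are hypothetical, the reference is a discrete uniform distribution over six categories, and the critical value 11.070 is the standard table value for 5 degrees of freedom at a 0.05 significance level.

```python
def pearson_chi_squared(observed, expected):
    """Sum of (observed - expected)^2 / expected over all categories."""
    if len(observed) != len(expected):
        raise ValueError("observed and expected must have the same length")
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


# Table value for 5 degrees of freedom at alpha = 0.05 (assumption of this
# sketch: six categories, no parameters estimated from the data).
CHI2_CRITICAL_DF5_ALPHA_05 = 11.070


def check_uniform_six(counts):
    """Test six category counts against a discrete uniform reference.

    Returns (statistic, rejected).
    """
    if len(counts) != 6:
        raise ValueError("this sketch assumes exactly six categories")
    total = sum(counts)
    expected = [total / 6] * 6
    stat = pearson_chi_squared(counts, expected)
    return stat, stat > CHI2_CRITICAL_DF5_ALPHA_05


# Hypothetical counts from 120 observations in each case.
balanced_counts = [18, 22, 20, 19, 21, 20]
skewed_counts = [5, 10, 15, 20, 30, 40]

balanced_stat, balanced_rejected = check_uniform_six(balanced_counts)
skewed_stat, skewed_rejected = check_uniform_six(skewed_counts)
print(f"balanced: statistic = {balanced_stat:.2f}, rejected = {balanced_rejected}")
print(f"skewed:   statistic = {skewed_stat:.2f}, rejected = {skewed_rejected}")
```

A small statistic means the observed counts stay close to the expected counts, so the uniform hypothesis is not rejected; a statistic above the critical value leads to rejection.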