Distribution Matching Best-Match Mode Syntax Elements | Teradata Vantage

Machine Learning Engine Analytic Function Reference

Product
Teradata Vantage
Release Number
9.02
9.01
2.0
1.3
Published
February 2022
Language
English (United States)
Last Update
2022-02-10
dita:id
B700-4003
Product Category
Teradata Vantage™
TargetColumn
Specify the name of the InputTable column that contains the values of the sample data set.
Tests
[Optional] Specify one to three tests to perform:
Test       Description
'KS'       Kolmogorov-Smirnov test.
'AD'       Anderson-Darling test.
'CHISQ'    Pearson's Chi-squared test.
Default: All tests
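The Tests rule above (one to three values drawn from 'KS', 'AD', and 'CHISQ', with all three as the default) can be sketched as a small client-side check. This is an illustrative Python helper, not part of the Teradata function; the name normalize_tests is hypothetical, and treating values as case-insensitive and rejecting duplicates are assumptions.

```python
VALID_TESTS = ("KS", "AD", "CHISQ")

def normalize_tests(tests=None):
    """Return the tests to run, applying the documented default (all tests)."""
    if tests is None:
        return list(VALID_TESTS)
    # Assumption: test names are matched case-insensitively.
    chosen = [t.upper() for t in tests]
    if not 1 <= len(chosen) <= 3:
        raise ValueError("specify one to three tests")
    unknown = [t for t in chosen if t not in VALID_TESTS]
    if unknown:
        raise ValueError(f"unknown tests: {unknown}")
    # Assumption: listing the same test twice is treated as an error.
    if len(set(chosen)) != len(chosen):
        raise ValueError("duplicate tests")
    return chosen
```

For example, normalize_tests(["ks", "chisq"]) yields ['KS', 'CHISQ'], and normalize_tests() yields all three tests.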
Distributions
[Optional] Specify the reference distributions (which must be continuous) and their parameters. The possible distributions and parameter values for continuous distributions are listed in the table Continuous Distributions and Parameters in Hypothesis-Test Mode Syntax Elements.
Default: All of these distributions:
  • Beta
  • Cauchy
  • CHISQ
  • Exponential
  • F
  • Gamma
  • Lognormal
  • Normal
  • T
  • Triangular
  • Uniformcontinuous
  • Weibull
GroupByColumns
[Optional] Specify the names of the InputTable columns that identify the groups over which to run the tests. The function can run multiple tests for different partitions of the data in parallel. If you omit this syntax element, specify PARTITION BY 1 and omit the GROUP BY clause in the second ON clause.
MinGroupSize
[Optional] Specify the minimum group size. The function ignores groups smaller than the minimum size when calculating statistics.
Default: 50
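The effect of MinGroupSize can be illustrated with a short sketch that counts rows per group and keeps only groups that meet the minimum. This is a plain Python illustration of the rule, not the function's implementation; the name groups_to_keep is hypothetical.

```python
from collections import Counter

def groups_to_keep(group_keys, min_group_size=50):
    """Return the set of groups with at least min_group_size rows.

    group_keys holds one group identifier per input row.
    Groups smaller than the minimum are ignored.
    """
    counts = Counter(group_keys)
    return {group for group, n in counts.items() if n >= min_group_size}
```

With the default of 50, a group with exactly 50 rows is kept and a group with 49 rows is ignored.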
NumCell
[Optional] Specify the number of cells into which to discretize a continuous distribution. The value, cell_size, must be greater than 3 if the distribution is NORMAL; otherwise, it must be greater than 1. The quotient min_group_size/cell_size (the MinGroupSize value divided by the NumCell value) cannot be less than 5.
If you specify NumCell, you must specify 'CHISQ' in the Tests syntax element.
Default: 10
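The NumCell constraints can be checked before submitting a query. The following is a minimal sketch in Python, assuming that cell_size refers to the NumCell value and min_group_size to the MinGroupSize value; the helper name validate_num_cell is hypothetical and not part of the product.

```python
def validate_num_cell(num_cell=10, min_group_size=50, distribution="NORMAL"):
    """Raise ValueError if num_cell breaks the documented NumCell rules."""
    # NumCell must exceed 3 for NORMAL and exceed 1 for other distributions.
    lower_bound = 3 if distribution.upper() == "NORMAL" else 1
    if num_cell <= lower_bound:
        raise ValueError(
            f"NumCell must be greater than {lower_bound} for {distribution}"
        )
    # The quotient min_group_size / num_cell cannot be less than 5.
    if min_group_size / num_cell < 5:
        raise ValueError("MinGroupSize / NumCell cannot be less than 5")
    return True
```

The documented defaults (MinGroupSize 50, NumCell 10) give a quotient of exactly 5, so they satisfy the rule; raising NumCell without also raising MinGroupSize does not.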
NumTopMatches
[Optional] Specify the number of the top matching distributions for the function to output.
Default: 1
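As a rough illustration of what NumTopMatches controls, the sketch below selects the best few distributions from a set of per-distribution scores. The ranking criterion is an assumption: this page does not state how the function orders matches, so the example uses a hypothetical score where larger means a better fit (for example, a p-value). The name top_matches and the sample scores are invented for illustration.

```python
import heapq

def top_matches(scores, num_top_matches=1):
    """Return the num_top_matches (distribution, score) pairs with the highest scores.

    scores maps a distribution name to a hypothetical goodness-of-fit
    score where larger is better (an assumption, not documented behavior).
    """
    return heapq.nlargest(num_top_matches, scores.items(), key=lambda kv: kv[1])

# Invented example scores, not real function output.
example_scores = {"NORMAL": 0.8, "GAMMA": 0.3, "WEIBULL": 0.5}
best_two = top_matches(example_scores, num_top_matches=2)
```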