Arguments - Aster Analytics

Teradata Aster Analytics Foundation User Guide

Product
Aster Analytics
Release Number
6.21
Published
November 2016
Language
English (United States)
Last Update
2018-04-14
Product Category
Software
Argument Category Description
ModelColumn Optional The name of the column that contains the model attribute. If you specify this argument, then model_attribute must match a model_key in the PARTITION BY clause. The values in the column can be integers or strings.
SeqColumn Required The name of the column that contains the sequence attribute. The sequence_attribute must be a sequence attribute in the PARTITION BY clause. A sequence must contain more than two observation symbols.
ObsColumn Required The name of the column that contains the observed symbols. The function scans the input table to find all possible observed symbols.
Observed symbols are case-sensitive.
HiddenStateNum Required The number of hidden states.
The number of hidden states can influence model quality and performance, so choose the number appropriately.
MaxIterNum Optional The maximum number of iterations that the training process runs before the function completes. The default is 10.
Epsilon Optional The threshold value for determining the convergence of HMM training. If the parameter value difference is less than the threshold, the training process converges. There is no default value. If you do not specify Epsilon, only MaxIterNum determines when the training process stops.
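
The interaction between MaxIterNum and Epsilon can be sketched as a stopping test. This is an illustration only, not the function's implementation: the name should_stop is hypothetical, and the sketch assumes that the "parameter value difference" is the largest absolute change in any parameter between two iterations, which this guide does not specify.

```python
def should_stop(iteration, max_iter_num=10, param_delta=None, epsilon=None):
    """Return True when training should stop (hypothetical helper).

    iteration    -- number of iterations completed so far
    max_iter_num -- iteration limit (the documented default is 10)
    param_delta  -- assumed: largest absolute parameter change in the last iteration
    epsilon      -- optional convergence threshold; None means "not specified"
    """
    # With Epsilon specified, a small enough change counts as convergence.
    if epsilon is not None and param_delta is not None and param_delta < epsilon:
        return True
    # Otherwise only the iteration limit ends training.
    return iteration >= max_iter_num
```

With Epsilon omitted, should_stop(3, 10, 1e-6) is False because only the iteration limit applies. With epsilon=1e-4, the same call returns True.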
SkipColumn Optional The name of the column whose values determine whether the function skips the row. The function skips the row if the value is “true”, “yes”, “y”, or “1”. The function does not skip the row if the value is “false”, “f”, “no”, “n”, “0”, or NULL.
InitMethods Optional The method that the function uses to generate the initial parameters for the initial state probabilities, state transition probabilities, and emission probabilities. The possibilities are:
  • random (default): The initial parameters are based on a uniform distribution.
  • flat: The probabilities are equal. Each cell holds the same probability in the matrix or vector.
  • input: The function takes the initial parameters from the InitParams argument.

The names of the preceding methods are case-insensitive.

The seed number is meaningful only when the specified method is random.
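
The difference between the random and flat methods can be sketched as follows. This is a minimal illustration under stated assumptions, not the function's implementation: the names flat_row, random_row, and init_params are hypothetical, and normalizing uniform draws so that each row sums to 1 is an assumed reading of "based on a uniform distribution". How the seed is passed to the function is not shown in this section, so the seed parameter here is also hypothetical.

```python
import random


def flat_row(n):
    """Every cell holds the same probability, 1/n."""
    return [1.0 / n] * n


def random_row(n, rng):
    """Assumed scheme: draw n uniform values, then normalize to sum to 1."""
    draws = [rng.random() for _ in range(n)]
    total = sum(draws)
    return [d / total for d in draws]


def init_params(num_states, num_symbols, method="random", seed=None):
    """Build (initial vector, transition matrix, emission matrix)."""
    method = method.lower()  # method names are case-insensitive
    if method == "flat":
        make_row = flat_row  # the seed has no effect here
    elif method == "random":
        rng = random.Random(seed)  # the seed matters only for 'random'
        make_row = lambda n: random_row(n, rng)
    else:
        # 'input' takes its values from InitParams and is not generated here.
        raise ValueError("unsupported method in this sketch: " + method)
    pi = make_row(num_states)
    a = [make_row(num_states) for _ in range(num_states)]
    b = [make_row(num_symbols) for _ in range(num_states)]
    return pi, a, b
```

Calling init_params(3, 2, method="flat") gives an initial vector in which every entry is 1/3 and emission rows in which every entry is 0.5. Calling the random method twice with the same seed gives identical parameters.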

InitParams Required when InitMethods has the value 'input' When InitMethods has the value 'input', this argument specifies the initial parameters for the models. The first parameter specifies the initial state probabilities, the second parameter specifies the state transition probabilities, and the third parameter specifies the emission probabilities.

For example, if the HiddenStateNum argument specifies three hidden states and the input table contains two observed symbols ('yes' and 'no'), then the InitParams values are:

  • init_state_probability_vector (the initial state probabilities):

    '0.3333333333 0.3333333333 0.3333333333'

  • state_transition_probability_matrix (the state transition probabilities):

    '0.3333333333 0.3333333333 0.3333333333;
    0.3333333333 0.3333333333 0.3333333333;
    0.3333333333 0.3333333333 0.3333333333'

  • observation_emission_probability_matrix (the emission probabilities):

    'no:0.25 yes:0.75; no:0.35 yes:0.65; no:0.45 yes:0.55'

In the initial state probabilities, the state transition probabilities, and the emission probabilities, the probabilities in each row must sum to 1.0 after rounding. The observed symbols are case-sensitive. The number of states and the observed symbols must be consistent with the HiddenStateNum argument and with the observed symbols in the input table; otherwise, the function displays error messages.
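
The consistency rules above can be checked before you run the function. The following sketch parses the three InitParams strings in the format shown in the example and validates them. It is an illustration, not part of the product: the helper names are hypothetical, and the tolerance of 1e-6 is an assumption, because this guide does not state how closely a row must round to 1.0.

```python
def parse_vector(text):
    """'0.3 0.3 0.4' -> [0.3, 0.3, 0.4]"""
    return [float(tok) for tok in text.split()]


def parse_matrix(text):
    """Rows are separated by ';', cells by whitespace."""
    return [parse_vector(row) for row in text.split(";") if row.strip()]


def parse_emissions(text):
    """'no:0.25 yes:0.75; ...' -> [{'no': 0.25, 'yes': 0.75}, ...]"""
    rows = []
    for row in text.split(";"):
        if not row.strip():
            continue
        probs = {}
        for cell in row.split():
            symbol, _, value = cell.rpartition(":")
            probs[symbol] = float(value)  # symbols stay case-sensitive
        rows.append(probs)
    return rows


def check_init_params(pi, a, b, num_states, symbols, tol=1e-6):
    """Raise ValueError if the parameters are inconsistent (assumed tolerance)."""
    def sums_to_one(values):
        return abs(sum(values) - 1.0) <= tol

    if len(pi) != num_states or not sums_to_one(pi):
        raise ValueError("bad initial state probabilities")
    if len(a) != num_states:
        raise ValueError("transition matrix needs one row per state")
    for row in a:
        if len(row) != num_states or not sums_to_one(row):
            raise ValueError("bad transition row")
    if len(b) != num_states:
        raise ValueError("emission matrix needs one row per state")
    for row in b:
        if set(row) != set(symbols) or not sums_to_one(row.values()):
            raise ValueError("bad emission row")
```

For the example values above, check_init_params accepts the parameters with num_states=3 and symbols {'yes', 'no'}. It rejects them if a symbol is spelled 'Yes', because observed symbols are case-sensitive.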

OutputTables Optional The names of the output tables:
  • init_state_prob

    Initial state probability table (default name is Pi).

  • state_transition_prob

    State transition probabilities table (default name is A).

  • emit_prob

    Emission probability table (default name is B).