Arguments - Aster Analytics

Teradata Aster Analytics Foundation User Guide

Product
Aster Analytics
Release Number
6.21
Published
November 2016
Language
English (United States)
Last Update
2018-04-14
dita:mapPath
kiu1466024880662.ditamap
dita:ditavalPath
AA-notempfilter_pdf_output.ditaval
dita:id
B700-1021
lifecycle
previous
Product Category
Software
Argument Category Description
ComparisonColumnPairs Required Specifies pairs of input table columns that contain strings to be compared (column1 and column2), how to compare them (comparison_type), and (optionally) a constant and the name of the output column for their similarity (output_column). The similarity is a value in the range [0, 1].
For comparison_type, use one of these values:
  • 'jaro' (Jaro distance)
  • 'jaro_winkler' (Jaro-Winkler distance: 1 for an exact match, 0 otherwise)
  • 'n-gram' (N-gram similarity)

    If you specify this comparison type, you can specify the value of N with constant.

  • 'LD' (Levenshtein distance)

    The Levenshtein distance is the number of edits needed to transform one string into the other, where edits include insertions, deletions, or substitutions of individual characters.

You can specify a different comparison_type for every pair of columns.

The default output_column is 'sim_i', where i is the sequence number of the column pair.

CaseSensitive Optional Specifies whether string comparison is case-sensitive. The default value is 'false'.

You can specify either one value for all pairs or one value for each pair. If you specify one value for each pair, then the ith value applies to the ith pair.

Accumulate Optional Specifies the names of input table columns to be copied to the output table.