Use this syntax when one input (SourceTable or ReferenceTable) fits in memory. For best performance, identify the smaller table as ReferenceTable. Function results are the same regardless of how you identify the inputs.
The function compares each record from SourceTable and each record from ReferenceTable. The number of comparisons is |SourceTable|*|ReferenceTable|. In the syntax, a and b refer to the SourceTable and ReferenceTable, respectively.
Version 1.13
SELECT * FROM IdentityMatch ( ON source_input_table AS SourceTable PARTITION BY ANY ON reference_input_table AS ReferenceTable DIMENSION USING IDColumn ('a.id_column: b.id_column') { NominalMatchColumns ('a.columnX: b.columnY' [,...]) | FuzzyMatchColumns ('a.columnX: b.columnY, match_metric, match_weight [, synonym_file ]' [,...]) } [ NullHandling ({ 'mismatch' | 'match-if-null' | 'match-if-both-null' }) ] [ Accumulate ('{a|b}.accumulate_column' [,...]')] [ ThresholdScore (threshold) ] ) AS alias;