SQL-MapReduce Call - Aster Analytics

Teradata Aster Analytics Foundation User Guide

Product
Aster Analytics
Release Number
6.21
Published
November 2016
Language
English (United States)
Last Update
2018-04-14
dita:mapPath
kiu1466024880662.ditamap
dita:ditavalPath
AA-notempfilter_pdf_output.ditaval
dita:id
B700-1021
lifecycle
previous
Product Category
Software

The objective is to correctly match the information in Input to the applicant from Input and thus accurately identify the applicant’s credit score. Assume a default threshold of 0.5. A higher threshold means that the matching accuracy is higher. Look for exact matches (NominalMatchColumns) to the email address and allow approximate matches (FuzzyMatchColumns) for lastname, firstname, zipcode, city and department columns, with different match metrics and match weights.

SELECT * FROM IdentityMatch (
  ON applicant_reference AS a PARTITION BY ANY
  ON applicant_external AS b DIMENSION
  IDColumn ('a.id: b.id')
  NominalMatchColumns ('a.email: b.email')
  FuzzyMatchColumns ('a.lastname: b.lastname, JARO-WINKLER, 3',
                     'a.firstname: b.firstname, JARO-WINKLER, 2',
                     'a.zipcode: b.zipcode, JD, 2',
                     'a.city: b.city, LD, 2',
                     'a.department: b.department, COSINE, 1')
  Accumulate ('a.firstname','a.lastname' ,'b.lastname', 'a.email',
              'b.email','a.zipcode', 'b.zipcode', 'a.department',
              'b.department','b.creditscore')
  Threshold (0.5)
) ORDER BY "a.id", score DESC;