SQL-MapReduce Call - Aster Analytics

Teradata Aster Analytics Foundation User Guide

Product

Aster Analytics

Release Number

6.21

Published

November 2016

Language

English (United States)

Last Update

2018-04-14

dita:id

B700-1021

Product Category

Software

The objective is to match each record in the applicant_external table (alias b) to the corresponding applicant in the applicant_reference table (alias a), and thus accurately identify the applicant’s credit score. The call uses a threshold of 0.5, which is the default. A higher threshold requires a higher similarity score before two records are reported as a match, so matching is stricter. The call looks for exact matches (NominalMatchColumns) on the email address and allows approximate matches (FuzzyMatchColumns) on the lastname, firstname, zipcode, city, and department columns, each with its own match metric and match weight.

SELECT * FROM IdentityMatch (
  ON applicant_reference AS a PARTITION BY ANY
  ON applicant_external AS b DIMENSION
  IDColumn ('a.id: b.id')
  NominalMatchColumns ('a.email: b.email')
  FuzzyMatchColumns ('a.lastname: b.lastname, JARO-WINKLER, 3',
                     'a.firstname: b.firstname, JARO-WINKLER, 2',
                     'a.zipcode: b.zipcode, JD, 2',
                     'a.city: b.city, LD, 2',
                     'a.department: b.department, COSINE, 1')
  Accumulate ('a.firstname', 'a.lastname', 'b.lastname', 'a.email',
              'b.email', 'a.zipcode', 'b.zipcode', 'a.department',
              'b.department', 'b.creditscore')
  Threshold (0.5)
) ORDER BY "a.id", score DESC;
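
To make the roles of the match weights and the threshold concrete, the following Python sketch combines per-column similarities into one score and compares it to the threshold. It is an illustration only, not the IdentityMatch implementation: the scoring rule (an exact nominal match yields 1.0, otherwise a weight-normalized sum of the fuzzy similarities), the `>=` comparison, the function names, and the sample similarity values are all assumptions. Only the weights (3, 2, 2, 2, 1) and the threshold (0.5) are taken from the call above.

```python
# Hypothetical sketch of weighted match scoring; NOT the Aster implementation.

# Weights copied from the FuzzyMatchColumns arguments in the call above.
FUZZY_WEIGHTS = {
    "lastname": 3,
    "firstname": 2,
    "zipcode": 2,
    "city": 2,
    "department": 1,
}
THRESHOLD = 0.5  # value passed to Threshold in the call above


def match_score(similarities, email_equal, weights=FUZZY_WEIGHTS):
    """Combine per-column similarities (each in [0, 1]) into one score.

    Assumed rule: an exact match on the nominal column (email) gives 1.0;
    otherwise the score is the weight-normalized sum of the similarities.
    """
    if email_equal:
        return 1.0
    total_weight = sum(weights.values())
    weighted_sum = sum(weights[col] * similarities[col] for col in weights)
    return weighted_sum / total_weight


def is_match(score, threshold=THRESHOLD):
    """Assumed rule: a pair is reported when its score reaches the threshold."""
    return score >= threshold


# Made-up similarities for one (a, b) pair, as the metrics might return them.
sims = {
    "lastname": 0.9,
    "firstname": 0.8,
    "zipcode": 0.5,
    "city": 0.6,
    "department": 0.2,
}
score = match_score(sims, email_equal=False)
print(round(score, 2), is_match(score))
```

With these sample values the weighted sum is 6.7 over a total weight of 10, so the score is about 0.67 and the pair passes the 0.5 threshold. Raising the threshold to 0.7 would reject the same pair, which is the sense in which a higher threshold makes matching stricter.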