SQL-MapReduce Call - Aster Analytics

Teradata Aster Analytics Foundation User Guide

Product: Aster Analytics
Release Number: 6.21
Published: November 2016
Language: English (United States)
Last Update: 2018-04-14
dita:id: B700-1021
lifecycle: previous
Product Category: Software
SELECT *
  FROM minhash(
  ON (SELECT 1)
  PARTITION BY 1
  InputTable ('salesdata')
  OutputTable ('minhashoutput')
  IDColumn ('userid')
  ItemsColumn ('itemid')
  HashNum ('1002')
  KeyGroups ('3')
  InputType ('integer')
  MinClusterSize ('3')
  MaxClusterSize ('5')
);

The number of hash functions (HashNum) must be an integer multiple of the number of key groups (KeyGroups); in the call above, 1002 = 334 × 3. Each cluster ID is generated by concatenating KeyGroups hash codes. The larger the KeyGroups value, the fewer clusters are obtained.
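
The relationship between HashNum and KeyGroups can be illustrated with a short Python sketch. This is not the Aster implementation. It is a minimal illustration that assumes integer item IDs, simple linear hash functions of the form (a*x + b) mod p, and an underscore as the separator between concatenated hash codes. The function names (make_hash_funcs, minhash_signature, cluster_ids) are hypothetical and chosen only for this example.

```python
import random

# Assumption: a Mersenne prime (2^31 - 1) as the modulus for the linear hashes.
PRIME = 2_147_483_647


def make_hash_funcs(hash_num, seed=0):
    """Create hash_num random linear hash functions, stored as (a, b) pairs."""
    rng = random.Random(seed)
    return [(rng.randrange(1, PRIME), rng.randrange(0, PRIME))
            for _ in range(hash_num)]


def minhash_signature(items, hash_funcs):
    """For each hash function, keep the minimum hash code over the item set."""
    return [min((a * x + b) % PRIME for x in items) for a, b in hash_funcs]


def cluster_ids(signature, key_groups):
    """Concatenate every key_groups consecutive hash codes into one cluster ID."""
    if len(signature) % key_groups != 0:
        raise ValueError("HashNum must be an integer multiple of KeyGroups")
    return ["_".join(str(h) for h in signature[i:i + key_groups])
            for i in range(0, len(signature), key_groups)]


# Small demonstration: 6 hash functions, 3 key groups -> 2 cluster IDs per user.
hash_funcs = make_hash_funcs(hash_num=6)
user_a = {101, 102, 103}
user_b = {101, 102, 103}   # same items as user_a
ids_a = cluster_ids(minhash_signature(user_a, hash_funcs), key_groups=3)
ids_b = cluster_ids(minhash_signature(user_b, hash_funcs), key_groups=3)
```

In this sketch, two users land in the same cluster only when every hash code in a concatenated key matches. Raising key_groups makes each key longer and therefore harder to match, which is consistent with the statement that more key groups yield fewer clusters.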