MinHash Input - Teradata Vantage

Machine Learning Engine Analytic Function Reference

Product
Teradata Vantage
Release Number
9.02
9.01
2.0
1.3
Published
February 2022
Language
English (United States)
Last Update
2022-02-10
dita:mapPath
rnn1580259159235.ditamap
dita:ditavalPath
ybt1582220416951.ditaval
dita:id
B700-4003
lifecycle
previous
Product Category
Teradata Vantageā„¢
Table Description
InputTable Contains items to cluster.
SeedTable [Optional. Disallowed with OutputSeedsTable syntax element.] Contains seeds to use for hashing. Typically created by earlier MinHash call that specified its name with OutputSeedsTable syntax element.

InputTable Schema

Column Data Type Description
user_id_column Any Identifier to cluster (for example, user identifier).
item_id_column BIGINT, INTEGER, or VARCHAR Identifiers of items on which to base clustering (for example, items that user purchased). Items are separated by delimiter.

SeedTable Schema

Column Data Type Description
index INTEGER Hash identifier. Values are from 0 to number_of_hash_functions - 1.
a INTEGER Seed value that hash function used to create hash values that MinHash algorithm used.
b INTEGER Seed value that hash function used to create hash values that MinHash algorithm used.
p INTEGER Seed value that hash function used to create hash values that MinHash algorithm used.