Cluster Analysis - Teradata Vantage

Machine Learning Engine Analytic Function Reference

Product
Teradata Vantage
Release Number
8.10
1.1
Published
October 2019
Language
English (United States)
Last Update
2019-12-31
dita:mapPath
ima1540829771750.ditamap
dita:ditavalPath
jsj1481748799576.ditaval
dita:id
B700-4003
lifecycle
previous
Product Category
Teradata Vantageā„¢
Function Description
Canopy (ML Engine) Simple, fast, accurate function for grouping objects into preliminary clusters. Often used as an initial step in more rigorous clustering techniques, such as k-means.
MinHash (ML Engine) Probabilistic clustering method that assigns a pair of users to the same cluster with probability proportional to the overlap between the sets of items that these users have bought.
Modularity (ML Engine) Discovers communities (clusters) in input graphs without advance information about the clusters. Detects communities by discovering the strength of relationships among data points.
Gaussian Mixture Model Functions (ML Engine) Fit a Gaussian mixture model (GMM) to input data, using either a basic GMM algorithm with a fixed number of clusters or a Dirichlet Process GMM (DP-GMM) algorithm with a variable number of clusters.
KMeans Functions (ML Engine) Create and use model that is table of cluster centroids. Optionally output clusters themselves.
KModes Functions (ML Engine) Extends KMeans functions to support categorical data.