1.0 - 8.00 - Cluster Analysis - Teradata Vantage

Teradata® Vantage Machine Learning Engine Analytic Function Reference

Product: Teradata Vantage
Release Number
Release Date: May 2019
Content Type: Programming Reference
Publication ID
Language: English (United States)
Function Description
Canopy: Simple, fast, accurate function for grouping objects into preliminary clusters. Often used as an initial step in more rigorous clustering techniques, such as k-means.
Gaussian Mixture Model Functions: Fit a Gaussian mixture model (GMM) to input data, using either a basic GMM algorithm with a fixed number of clusters or a Dirichlet Process GMM (DP-GMM) algorithm with a variable number of clusters.
KMeans Functions: Create and use a model that is a table of cluster centroids. Optionally, output the clusters themselves.
KModes Functions: Extend the KMeans functions to support categorical data.
MinHash: Probabilistic clustering method that assigns a pair of users to the same cluster with probability proportional to the overlap between the sets of items that these users have bought.
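
The MinHash description can be made concrete with a small sketch. The Python below is not the Teradata function or its SQL syntax. It is a minimal illustration of the textbook technique, assuming integer item IDs and using explicit random permutations of the item universe. All names (purchases, make_permutations, minhash_signature, estimated_overlap, clusters_by_first_hash) and the sample data are hypothetical. For each permutation, a user's hash is the smallest rank among the items that user bought. Two users get the same hash exactly when the lowest-ranked item in their combined purchases was bought by both, so the chance of a match equals the Jaccard overlap of their item sets.

```python
import random
from collections import defaultdict


def jaccard(a, b):
    """Exact overlap: size of the intersection divided by size of the union."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def make_permutations(universe, num_perms, seed=0):
    """Build num_perms random permutations, each stored as an item -> rank map."""
    rng = random.Random(seed)
    items = sorted(universe)
    perms = []
    for _ in range(num_perms):
        shuffled = items[:]
        rng.shuffle(shuffled)
        perms.append({item: rank for rank, item in enumerate(shuffled)})
    return perms


def minhash_signature(items, perms):
    """For each permutation, keep the smallest rank seen among the user's items."""
    return tuple(min(perm[item] for item in items) for perm in perms)


# Hypothetical purchase histories: user -> set of integer item IDs.
purchases = {
    "alice": {1, 2, 3, 4, 5, 6, 7, 8},
    "bob": {1, 2, 3, 4, 5, 6, 7, 9},
    "carol": {20, 21, 22, 23},
}

universe = set().union(*purchases.values())
perms = make_permutations(universe, num_perms=512, seed=42)
signatures = {user: minhash_signature(items, perms)
              for user, items in purchases.items()}


def estimated_overlap(u, v):
    """Fraction of signature positions on which two users agree."""
    matches = sum(x == y for x, y in zip(signatures[u], signatures[v]))
    return matches / len(perms)


def clusters_by_first_hash():
    """Group users whose first MinHash value is identical (one-hash cluster key)."""
    groups = defaultdict(list)
    for user, sig in signatures.items():
        groups[sig[0]].append(user)
    return list(groups.values())
```

In this sketch, estimated_overlap("alice", "bob") approximates their exact Jaccard overlap of 7/9, while users with no items in common, such as alice and carol, never share a hash value. Using a single hash as the cluster key, as in clusters_by_first_hash, puts two users in the same cluster with probability equal to their overlap. Concatenating several hashes into one key would make clusters stricter.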