Canopy
|
Simple, fast, accurate function for grouping objects into preliminary clusters. Often used as an initial step in more rigorous clustering techniques, such as k-means. |
Gaussian Mixture Model Functions
|
Fit a Gaussian mixture model (GMM) to input data, using either a basic GMM algorithm with a fixed number of clusters or a Dirichlet Process GMM (DP-GMM) algorithm with a variable number of clusters. |
KMeans Functions
|
Create and use model that is table of cluster centroids. Optionally output clusters themselves. |
KModes Functions
|
Extends KMeans functions to support categorical data. |
MinHash
|
Probabilistic clustering method that assigns a pair of users to the same cluster with probability proportional to the overlap between the sets of items that these users have bought. |