KModes Output - Teradata Vantage

Machine Learning Engine Analytic Function Reference

Product
Teradata Vantage
Release Number
8.00
1.0
Published
May 2019
Language
English (United States)
Last Update
2019-11-22
dita:mapPath
blj1506016597986.ditamap
dita:ditavalPath
blj1506016597986.ditaval
dita:id
B700-4003
lifecycle
previous
Product Category
Teradata Vantageā„¢

Output Summary Schema

This table is displayed on the screen.

Column Data Type Description
model_id VARCHAR [Column appears only if multiple models are trained.] Integer, starting with 0, identifying model.
summary VARCHAR Model data:
  • Number of Clusters (found in model)
  • Number of Iterations (required to create model)
  • Model Converged (whether algorithm converged)
  • Number of Data Points (number of input rows used to build model)
between_cluster_error DOUBLE PRECISION Sum of squared distances of centroids to global mean, where squared distance of each mean to global mean is multiplied by number of data points in cluster.
total_within_cluster_error DOUBLE PRECISION Sum of all within_cluster_ss values.
pseudo_f DOUBLE PRECISION Value given by this formula:

(between_cluster_errror / (K - 1)) /

(total_within_cluster_error / (N - K))

where N is total number of data points, or total weight if points are weighted, and K is number of clusters.

OutputTable Schema

Column Data Type Description
model_id INTEGER [Column appears only if multiple models are trained.] Model identifier.
cluster_id INTEGER Cluster identifier assigned by function.
numerical_attribute DOUBLE PRECISION [Column appears once for each numerical attribute.] Name of numerical attribute.
categorical_attribute VARCHAR [Column appears once for each categorical attribute and for each numerical attribute specified by NumericAsCategorical.] Name of attribute.
within_cluster_ss DOUBLE PRECISION Total distance summed over all points in cluster, between point and cluster center, as calculated by distance metric.
cluster_weight DOUBLE PRECISION Total weight of data points assigned to cluster.
distance_metric VARCHAR Value of Distance argument in function call (copied to output table so that you need not specify it again when calling KModesPredict).
category_weights VARCHAR Value of CategoryWeights argument in function call (copied to output table so that you need not specify it again when calling KModesPredict).