1.1 - 8.10 - KModes Output - Teradata Vantage

Teradata Vantage™ - Machine Learning Engine Analytic Function Reference

Product
Teradata Vantage
Release Number
1.1
8.10
Release Date
October 2019
Content Type
Programming Reference
Publication ID
B700-4003-079K
Language
English (United States)

Output Summary Schema

This table is displayed on the screen.

Column Data Type Description
model_id VARCHAR [Column appears only if multiple models are trained.] Integer, starting with 0, identifying model.
summary VARCHAR Model data:
  • Number of Clusters (found in model)
  • Number of Iterations (required to create model)
  • Model Converged (whether algorithm converged)
  • Number of Data Points (number of input rows used to build model)
between_cluster_error DOUBLE PRECISION Sum of squared distances of centroids to global mean, where squared distance of each mean to global mean is multiplied by number of data points in cluster.
total_within_cluster_error DOUBLE PRECISION Sum of all within_cluster_ss values.
pseudo_f DOUBLE PRECISION Value given by this formula:

(between_cluster_errror / (K - 1)) /

(total_within_cluster_error / (N - K))

where N is total number of data points, or total weight if points are weighted, and K is number of clusters.

OutputTable Schema

Column Data Type Description
model_id INTEGER [Column appears only if multiple models are trained.] Model identifier.
cluster_id INTEGER Cluster identifier assigned by function.
numerical_attribute DOUBLE PRECISION [Column appears once for each numerical attribute.] Name of numerical attribute.
categorical_attribute VARCHAR [Column appears once for each categorical attribute and for each numerical attribute specified by NumericAsCategorical.] Name of attribute.
within_cluster_ss DOUBLE PRECISION Total distance summed over all points in cluster, between point and cluster center, as calculated by distance metric.
cluster_weight DOUBLE PRECISION Total weight of data points assigned to cluster.
distance_metric VARCHAR Value of Distance syntax element in function call (copied to output table so that you need not specify it again when calling KModesPredict).
category_weights VARCHAR Value of CategoryWeights syntax element in function call (copied to output table so that you need not specify it again when calling KModesPredict).