TD_KMeans Output - Analytics Database

Database Analytic Functions

Deployment
VantageCloud
VantageCore
Edition
Enterprise
IntelliFlex
VMware
Product
Analytics Database
Release Number
17.20
Published
June 2022
Language
English (United States)
Last Update
2024-10-04
dita:mapPath
gjn1627595495337.ditamap
dita:ditavalPath
ayr1485454803741.ditaval
dita:id
jmh1512506877710
lifecycle
latest
Product Category
Teradata Vantageā„¢

Output Table Schema

If the OutputClusterAssignment value is set to False:

Column Data Type Description
TD_CLUSTERID_KMEANS BIGINT The unique identifier of the cluster.
TargetColumns REAL The columns that contain the centroid value for each feature.
TD_SIZE_KMEANS BIGINT The number of points in the cluster.
TD_WITHINSS_KMEANS REAL The within-cluster-sum-of-squares, that is, the sum of squared differences of each point from its cluster centroid.
Id_Column BYTEINT The unique identifier column name copied from the InputTable. This column contains only NULL values in the output.
TD_MODELINFO_KMEANS VARCHAR(128) CHARACTER SET LATIN The following information related to the model is saved:
  • Converged: True or False
  • Number of Iterations: The number of iterations performed by the function.
  • Number of Clusters: The number of clusters produced.
  • Total_WithinSS: The total within cluster sum of squares.
  • Between_SS: Between sum of squares, that is, the sum of squared distances of centroids to global mean, where squared distance of each mean to global mean is multiplied by the number of data points it represents.
  • Method for InitialCentroids: 'Random' or 'KMeans++' or 'Externally supplied InitialCentroidsTable'.

If the OutputClusterAssignment value is set to True:

Column Data Type Description
Id_Column Any The unique identifier of input rows copied from the input table.
TD_CLUSTERID_KMEANS BIGINT The ClusterId assigned to the input row.