Results Data | Fast K-Means Clustering | Vantage Analytics Library

Vantage Analytics Library User Guide

Deployment: VantageCloud, VantageCore
Edition: Enterprise, IntelliFlex, Lake, VMware
Product: Vantage Analytics Library
Release Number: 2.2.0
Published: March 2023
Language: English (United States)
Last Update: 2024-01-02
Product Category: Teradata Vantage

The function returns two result sets:
  • A progress report that shows how the algorithm converged and why it stopped processing.
  • The cluster means and variances.

Progress Report

Column   Data Type   Description
ctime    TIMESTAMP   Time at which the progress message was logged.
v        VARCHAR     Progress message.

Cluster Means and Variances

Rows associated with positive cluster IDs contain the average values of each of the clustered columns and the count for each cluster ID.

Rows associated with negative cluster IDs contain the variance of each clustered column for each cluster ID.

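Column       Data Type          Description
clusterid    INTEGER            Cluster identifier. Positive IDs mark rows of means; negative IDs mark rows of variances.
cnt          INTEGER            Count of items in cluster.
avg_cc_bal   DOUBLE PRECISION   Input column: mean for positive cluster IDs, variance for negative cluster IDs.
avg_ck_bal   DOUBLE PRECISION   Input column: mean for positive cluster IDs, variance for negative cluster IDs.
avg_sv_bal   DOUBLE PRECISION   Input column: mean for positive cluster IDs, variance for negative cluster IDs.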
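
Because means and variances share one result set, client code usually has to separate them by the sign of clusterid. The following Python sketch shows one way to do that. It is not part of the Vantage Analytics Library: the function name `split_means_and_variances` is hypothetical, the rows are copied from the example in this topic, and the pairing of a negative ID -k with cluster k is an assumption (suggested by the matching counts in the example, not stated explicitly here).

```python
from typing import Dict, Iterable, Sequence

# Rows as (clusterid, cnt, avg_cc_bal, avg_ck_bal, avg_sv_bal), copied from
# the example result set in this topic.
ROWS = [
    (-3, 512, 231857.172417, 301876.198169, 339777.941176),
    (-2, 25, 14851425.760280, 65128867.597238, 42746526.179346),
    (-1, 210, 3424898.292542, 3992928.509731, 3409787.968982),
    (1, 210, -1371.9022857, 1593.53185714, 952.924238),
    (2, 25, -2311.104230, 6918.833076, 3453.093846),
    (3, 512, -281.768082, 312.603170, 309.22684),
]
COLUMNS = ("avg_cc_bal", "avg_ck_bal", "avg_sv_bal")


def split_means_and_variances(
    rows: Iterable[Sequence[float]], columns: Sequence[str]
) -> Dict[int, dict]:
    """Group rows by cluster: positive IDs carry means, negative IDs variances.

    ASSUMPTION: the row with clusterid -k holds the variances of cluster k.
    """
    clusters: Dict[int, dict] = {}
    for clusterid, cnt, *values in rows:
        entry = clusters.setdefault(
            abs(clusterid), {"count": None, "mean": {}, "variance": {}}
        )
        if clusterid > 0:
            entry["count"] = cnt
            entry["mean"] = dict(zip(columns, values))
        else:
            entry["variance"] = dict(zip(columns, values))
    return clusters


summary = split_means_and_variances(ROWS, COLUMNS)
for cluster in sorted(summary):
    print(cluster, summary[cluster]["count"], summary[cluster]["mean"])
```

Keying the result on the absolute value of clusterid keeps each cluster's count, means, and variances together, which is convenient when deriving per-cluster standard deviations or reporting clusters side by side.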

Example

Progress Report

ctime                  v
12/4/2020 9:57:07 AM   Starting Procedure
12/4/2020 9:57:07 AM   Data randomly assigned to clusters initially
12/4/2020 9:57:07 AM   Starting K-means loop 1
12/4/2020 9:57:07 AM   Completed K-means loop 1
12/4/2020 9:57:08 AM   Comparison complete, cluster aggregate difference is 8.44511468105135E 002
12/4/2020 9:57:08 AM   Starting K-means loop 2
12/4/2020 9:57:08 AM   Completed K-means loop 2
12/4/2020 9:57:08 AM   Comparison complete, cluster aggregate difference is 6.28369025740172E 002
12/4/2020 9:57:08 AM   Starting K-means loop 3
12/4/2020 9:57:08 AM   Completed K-means loop 3
12/4/2020 9:57:08 AM   Comparison complete, cluster aggregate difference is 6.15229501574827E 002
12/4/2020 9:57:08 AM   Kmeans reached maximum iterations (3) so terminating procedure
12/4/2020 9:57:09 AM   Procedure Complete

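
The progress messages can also be read programmatically to check how the run converged and why it stopped. The following Python sketch is one possible approach, not a library feature: `parse_progress` is a hypothetical helper, the message patterns are taken only from the example above, and the blank in values such as `E 002` is assumed to stand for a positive exponent sign (as in Teradata's default FLOAT display format). If the blank instead replaced a lost minus sign, the parsed magnitudes would differ.

```python
import re
from typing import List, Optional, Tuple

# Values of the v column, copied from the example progress report.
MESSAGES = [
    "Starting Procedure",
    "Data randomly assigned to clusters initially",
    "Starting K-means loop 1",
    "Completed K-means loop 1",
    "Comparison complete, cluster aggregate difference is 8.44511468105135E 002",
    "Starting K-means loop 2",
    "Completed K-means loop 2",
    "Comparison complete, cluster aggregate difference is 6.28369025740172E 002",
    "Starting K-means loop 3",
    "Completed K-means loop 3",
    "Comparison complete, cluster aggregate difference is 6.15229501574827E 002",
    "Kmeans reached maximum iterations (3) so terminating procedure",
    "Procedure Complete",
]

# Mantissa, exponent sign (blank, + or -), and exponent digits.
DIFF_RE = re.compile(r"cluster aggregate difference is (-?\d+\.\d+)E([ +-])(\d+)")
MAX_ITER_RE = re.compile(r"reached maximum iterations \((\d+)\)")


def parse_progress(messages: List[str]) -> Tuple[List[float], Optional[int]]:
    """Return the per-loop aggregate differences and, if the run stopped at
    the iteration limit, that limit (otherwise None)."""
    differences: List[float] = []
    max_iterations: Optional[int] = None
    for message in messages:
        diff = DIFF_RE.search(message)
        if diff:
            mantissa, sign, exponent = diff.groups()
            # ASSUMPTION: a blank sign position means a positive exponent.
            sign = "+" if sign == " " else sign
            differences.append(float(f"{mantissa}E{sign}{exponent}"))
            continue
        limit = MAX_ITER_RE.search(message)
        if limit:
            max_iterations = int(limit.group(1))
    return differences, max_iterations


differences, max_iterations = parse_progress(MESSAGES)
print(differences, max_iterations)
```

In the example run, the aggregate difference shrinks from one loop to the next, and the procedure stops because it reached the iteration limit of 3 rather than because a convergence criterion was reported as met.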
Cluster Means and Variances

clusterid   cnt   avg_cc_bal        avg_ck_bal        avg_sv_bal
-3          512   231857.172417     301876.198169     339777.941176
-2          25    14851425.760280   65128867.597238   42746526.179346
-1          210   3424898.292542    3992928.509731    3409787.968982
1           210   -1371.9022857     1593.53185714     952.924238
2           25    -2311.104230      6918.833076       3453.093846
3           512   -281.768082       312.603170        309.22684