Input - Aster Analytics

Teradata Aster Analytics Foundation User Guide

Product
Aster Analytics
Release Number
6.21
Published
November 2016
Language
English (United States)
Last Update
2018-04-14
dita:mapPath
kiu1466024880662.ditamap
dita:ditavalPath
AA-notempfilter_pdf_output.ditaval
dita:id
B700-1021
lifecycle
previous
Product Category
Software

The KMeans function has one required input table (specified by the InputTable argument) and one optional input table (specified by the CentroidsTable argument).

The required input table contains the features by which to cluster the data.

KMeans Input Table Schema
Column Name Data Type Description
id INTEGER Contains the identifier of the user or item.
dimension_i DOUBLE PRECISION Contains the data in dimension i. The table has columns dimension_1 through dimension_n, where n is the number of dimensions. Each dimension is a feature by which to cluster the data.

For example, if the required application is clustering points by latitude and longitude on the surface of the earth, then the input table has three columns: point-id, latitude, and longitude. Clustering is performed on the latitude and longitude columns. The dimensionality n of the data is not specified as an argument, but is implicitly derived from the data.

The optional input table contains the contains the initial seed means for the clusters. This table has the same schema as the table of cluster centroids (specified by the OutputTable argument), which is affected by the UnpackColumns argument and is described by KMeans Results Messages and KMeans Output Table Schema for UnpackColumns('true').