Example 2: KMeans++ Sampling - Aster Analytics

Teradata Aster Analytics Foundation User Guide

Product
Aster Analytics
Release Number
6.21
Published
November 2016
Language
English (United States)
Last Update
2018-04-14
dita:mapPath
kiu1466024880662.ditamap
dita:ditavalPath
AA-notempfilter_pdf_output.ditaval
dita:id
B700-1021
lifecycle
previous
Product Category
Software

This example uses KMeans++ sampling with the Manhattan distance metric, and treats the numeric variables cyl, gear, and carb as categorical variables (and the categorical variables vs and am). The category weights are assigned in the order that the columns appear in the input table: 1000 to cyl, 10 to vs, 100 to am, 100 to gear, and 100 to carb.