SQL-MapReduce Call - Aster Analytics

Teradata Aster Analytics Foundation User Guide

Product
Aster Analytics
Release Number
6.21
Published
November 2016
Language
English (United States)
Last Update
2018-04-14
dita:mapPath
kiu1466024880662.ditamap
dita:ditavalPath
AA-notempfilter_pdf_output.ditaval
dita:id
B700-1021
lifecycle
previous
Product Category
Software

We use default values for the MaxDepth, MinNodeSize, Variance and NumSurrogates, and build 50 trees on two worker nodes. Both seed values are set to 100 for repeatability and mtry is assigned a value of 3 (sqrt(12)= 3.4), as it is a classification type decision tree.

A good starting point for mtry is sqrt(p) for classification and p/3 for regression, where p is number of variables used for prediction.

SELECT * FROM Forest_Drive (
  ON (SELECT 1)
  PARTITION BY 1
  InputTable ('housing_train ')
  OutputTable ('rft_model')
  TreeType ('classification')
  ResponseColumn ('homestyle')
  NumericInputs ('price ', 'lotsize ', 'bedrooms ', 'bathrms ',
                 'stories ', 'garagepl')
  CategoricalInputs ('driveway ', 'recroom ', 'fullbase ', 'gashw ',
                     'airco ', 'prefarea')
  MaxDepth (12)
  MinNodeSize (1)
  NumTrees (50)
  NumSurrogates (0)
  Variance (0.0)
  Mtry ('3')
  MtrySeed ('100')
  Seed ('100')
);