- NumParallelTrees
- Specifies how many boosted trees to use to make predictions.
- NumBoostRounds
- Specifies how many iterations to load for each boosted tree to make predictions.
- ModelType
- For classification, output the prediction column as integers. These integral values represent different categories, and so are better observed as an integer column. To make the output schema for prediction column as an integer, set ModelType as Classification.
- OutputProb
- Specifies whether to output the probability for each response.
- Responses
- Specifies the classes for which to output probabilities.
- Accumulate
- Specifies the input columns names to copy to the output table.
- The processing time is controlled by (proportional to):
- The number of boosted trees used for prediction from the model (controlled by NumParallelTrees).
- The number of iterations (sub-trees) used for prediction from the model in each boosted tree (controlled by IterNum).
A careful choice of these parameters can be used to control the processing time. When the boosted trees size grows more than what can fit in memory, the trees are cached in a local spool space, which may impact the performance of the function compared to the case when all trees fit in memory.