PySpark API Supportability Matrix | DecisionTreeClassifier | pyspark2teradataml - DecisionTreeClassifier - Teradata Package for Python

Teradata® pyspark2teradataml User Guide

Deployment
VantageCloud
VantageCore
Edition
Enterprise
IntelliFlex
VMware
Product
Teradata Package for Python
Release Number
20.00
Published
December 2024
ft:locale
en-US
ft:lastEdition
2024-12-18
dita:mapPath
oeg1710443196055.ditamap
dita:ditavalPath
ayr1485454803741.ditaval
dita:id
oeg1710443196055
Product Category
Teradata Vantage

Arguments

This function internally uses scikit-learn function DecisionTreeClassifier through teradataml Open source ML functions.

Transformed data won’t have features, rawPrediction, probability columns.

PySpark Argument Name Open Source Function Argument Name Notes
predictionCol Not yet available.  
probabilityCol Not yet available.  
rawPredictionCol Not yet available.  
maxDepth max_depth  
maxBins Not yet available.  
minInstancesPerNode min_samples_split PySpark takes int value (>= 1) default value is 1, teradatamlspk supports int value (>1) or float in (0.0, 1.0].

Default value is 1.0.

cacheNodeIds Not yet available.  
checkpointInterval Not yet available.  
impurity criterion  
seed random_state  
weightCol class_weight  
leafCol Not yet available.  
minWeightFractionPerNode min_weight_fraction_leaf