PySpark API Supportability Matrix | VarianceThresholdSelector | pyspark2teradataml - VarianceThresholdSelector - Teradata Package for Python

Teradata® pyspark2teradataml User Guide

Deployment
VantageCloud
VantageCore
Edition
Enterprise
IntelliFlex
VMware
Product
Teradata Package for Python
Release Number
20.00
Published
December 2024
ft:locale
en-US
ft:lastEdition
2024-12-18
dita:mapPath
oeg1710443196055.ditamap
dita:ditavalPath
ayr1485454803741.ditaval
dita:id
oeg1710443196055
Product Category
Teradata Vantage

Arguments

This function internally uses regular Dataframe manipulation through teradataml.

outputCol is ignored. All the scaled columns will have same names as they have before.

Only scaled columns will be listed; the remaining columns won’t be present in transformed data.

PySpark Argument Name Open Source Function Argument Name Notes
featuresCol Not yet available. Features columns should be passed as a list. VectorAssembler won’t work to merge multiple columns and pass them as features during fit.
outputCol Not yet available. Argument is ignored. Transformed DF will contain the column whose variance is greater than the threshold.
varianceThreshold threshold  

Attributes/Methods

Attribute/Method Name Supported Notes
clear  
copy  
explainParam  
explainParams  
extractParamMap  
fit  
fitMultiple  
getFeaturesCol  
getOrDefault  
getOutputCol  
getParam  
getVarianceThreshold  
hasDefault  
hasParam  
isDefined  
isSet  
load  
read  
save  
set  
setFeaturesCol  
setOutputCol  
setParams  
setVarianceThreshold  
write  
featuresCol  
outputCol  
params  
varianceThreshold