TD_TargetEncodingFit Function

Database Analytic Functions

Deployment
VantageCloud
VantageCore
Edition
Enterprise
IntelliFlex
VMware
Product
Analytics Database
Release Number
17.20
Published
June 2022
Language
English (United States)
Last Update
2024-04-06
Product Category
Teradata Vantage™

Target encoding replaces each category of a categorical variable with the likelihood or expected value of the target variable for that category. The technique works for both binary classification and regression. For multiclass classification, a similar technique is applied, which encodes the categorical variable with k new variables, where k is the number of classes.
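The idea can be illustrated outside the database with a minimal Python sketch. It assumes the simplest form of target encoding, an unsmoothed per-category mean; the encoder methods that TD_TargetEncodingFit actually applies are not described here and may differ. The function names and sample data are hypothetical.

```python
from collections import defaultdict


def mean_target_by_category(categories, targets):
    """Return {category: mean target} for a binary (0/1) or numeric target.

    Assumption: plain unsmoothed mean, used only to illustrate the concept.
    """
    sums = defaultdict(float)
    counts = defaultdict(int)
    for cat, y in zip(categories, targets):
        sums[cat] += y
        counts[cat] += 1
    return {cat: sums[cat] / counts[cat] for cat in sums}


def class_likelihoods_by_category(categories, labels):
    """Return {category: {class: fraction}} for a multiclass target.

    Each category maps to k values, one per class, so one categorical
    column becomes k encoded columns.
    """
    classes = sorted(set(labels))
    counts = defaultdict(lambda: defaultdict(int))
    totals = defaultdict(int)
    for cat, label in zip(categories, labels):
        counts[cat][label] += 1
        totals[cat] += 1
    return {
        cat: {c: counts[cat][c] / totals[cat] for c in classes}
        for cat in totals
    }


# Hypothetical sample data: a categorical column and a binary target.
colors = ["red", "red", "blue", "blue", "blue", "green"]
clicked = [1, 0, 1, 1, 0, 0]

encoding = mean_target_by_category(colors, clicked)
encoded_column = [encoding[c] for c in colors]

# Hypothetical multiclass target with k = 3 classes.
segment = ["a", "b", "a", "c", "a", "b"]
multiclass_encoding = class_likelihoods_by_category(colors, segment)
```

With a 0/1 target the per-category mean is the observed likelihood of the positive class, and with a numeric target it is the expected value, which is why one routine covers both binary classification and regression.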

The TD_TargetEncodingFit function takes an InputTable and a CategoricalTable as input and generates the hyperparameters that the TD_TargetEncodingTransform function then uses to encode the categorical values.

  • This function requires the UTF8 client character set.
  • This function does not support Pass-Through Characters (PTCs).
  • This function does not support KanjiSJIS or Graphic data types.
  • The maximum number of unique categories in a single column is 4000.
  • The maximum category length is 128 characters.
  • Columns with a large number of distinct categories can increase query execution time.
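The two numeric limits above can be checked on a sample of a column before the function is run. The following is a minimal client-side sketch in Python, not part of the database function; the helper name, the handling of None values, and the message wording are assumptions made for illustration.

```python
# Limits taken from the list above.
MAX_UNIQUE_CATEGORIES = 4000
MAX_CATEGORY_LENGTH = 128


def check_category_limits(values):
    """Return a list of human-readable problems found in one column.

    Hypothetical helper: `values` is an iterable of category strings.
    None entries are skipped (an assumption, not documented behavior).
    """
    problems = []
    distinct = {v for v in values if v is not None}

    if len(distinct) > MAX_UNIQUE_CATEGORIES:
        problems.append(
            f"{len(distinct)} unique categories exceed the limit of "
            f"{MAX_UNIQUE_CATEGORIES}"
        )

    too_long = [v for v in distinct if len(v) > MAX_CATEGORY_LENGTH]
    if too_long:
        problems.append(
            f"{len(too_long)} categories are longer than "
            f"{MAX_CATEGORY_LENGTH} characters"
        )

    return problems
```

An empty result means the sampled values stay within both limits. Python's len counts Unicode code points, which may not match how the database measures character length in every case.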