Normalized Input - Teradata Vantage

Machine Learning Engine Analytic Function Reference

Product
Teradata Vantage
Release Number
8.10
1.1
Published
October 2019
Language
English (United States)
Last Update
2019-12-31
dita:mapPath
ima1540829771750.ditamap
dita:ditavalPath
jsj1481748799576.ditaval
dita:id
B700-4003
lifecycle
previous
Product Category
Teradata Vantageā„¢

For some predictive modeling functions, it is very important to normalize the numeric input variables; that is, to rescale them so they have a similar mean and standard deviation. If you do not normalize input variables, the effect of variables with a large magnitude or a large standard deviation may dominate the model and reduce the accuracy of its predictions.

Normalize input variables before calling the following functions:
  • Canopy
  • KMeans Functions
    • KMeans
    • KMeansPredict
  • KNN
  • Generalized Linear Model (GLM) Functions
    • GLM
    • GLMPredict_MLE
    • GLML1L2
    • GLML1L2Predict
  • Least Angle Regression (LAR) Functions
    • LAR
    • LARPredict
  • Linear Regression Functions
    • Linear Regression
    • LinRegPredict
  • Principal Component Analysis (PCA) Functions
    • PCA
    • PCAScore
  • Support Vector Machine (SVM) Functions
    • SVMSparse
    • SVMSparsePredict_MLE
    • SVMSparseSummary
    • SVMDense
    • SVMDensePredict
    • SVMDenseSummary

The MLE Scale functions are designed to make normalization easy. For an example of using Scale functions to normalize input variables, see PCA Example.