TD_RegressionEvaluator Function | RegressionEvaluator - TD_RegressionEvaluator - Analytics Database

Database Analytic Functions

Deployment
VantageCloud
VantageCore
Edition
Enterprise
IntelliFlex
VMware
Product
Analytics Database
Release Number
17.20
Published
June 2022
Language
English (United States)
Last Update
2024-01-31
dita:mapPath
gjn1627595495337.ditamap
dita:ditavalPath
ayr1485454803741.ditaval
dita:id
jmh1512506877710
Product Category
Teradata Vantageā„¢

The TD_RegressionEvaluator function computes metrics to evaluate and compare multiple models and summarizes how close predictions are to their expected values. It takes the actual and predicted values of the dependent variables to calculate specified metrics, you choose which metrics you want to calculate from a list of supported metrics.

Regression is a type of machine learning algorithm that aims to establish a relationship between a dependent variable and one or more independent variables. Use regression to predict the value of a dependent variable based on the values of independent variables. The most common type of regression is linear regression, where the relationship between variables is assumed to be linear.

In the regression process, a model is trained on a dataset consisting of input variables and corresponding output variables. The model tries to find the best fit line or curve that passes through the data points minimizing the difference between the actual and predicted values.

Use metrics to evaluate the performance of a regression model. The most commonly used metrics are:
  • Mean squared error (MSE).
  • Mean absolute error (MAE).
  • R-squared.
These metrics measure the accuracy of the model's predictions by comparing the predicted values with the actual values.

Selecting the appropriate features or independent variables is one crucial aspect of regression. Too many features can lead to overfitting, where the model performs well on the training data but poorly on the test data. Alternatively, too few features can lead to underfitting, where the model fails to capture the underlying patterns in the data.