Deviance - Teradata Vantage

Machine Learning Engine Analytic Function Reference

Product

Teradata Vantage

Release Number

8.00

1.0

Published

May 2019

Language

English (United States)

Last Update

2019-11-22

dita:mapPath

blj1506016597986.ditamap

dita:ditavalPath

blj1506016597986.ditaval

dita:id

B700-4003

lifecycle

Product Category

Teradata Vantage™

The deviance for a model M 0, based on a data set y, is defined as follows:

In the preceding equation:

denotes the fitted values of the parameters in the model M 0.

denotes the fitted parameters for the full model (or saturated model).

Both sets of fitted values are implicitly functions of the observations y. In this case, the full model is a model with a parameter for every observation so that the data are fitted exactly.

The deviance is used to compare two models—in particular in the case of generalized linear models where it has a similar role to residual variance from ANOVA in linear models (RSS).

Suppose in the framework of the GLM that there are two nested models, M 1 and M 2. In particular, suppose that M 1 contains the parameters in M 2, and k additional parameters. Then, under the null hypothesis that M 2 is the true model, the difference between the deviances for the two models follows an approximate chi-squared distribution with k-degrees of freedom. This provides us an alternative way for computing the log-likelihood ratio of two models.

Deviance is implemented in the GLM function. It computes residual deviance from model deviance and saturated deviance. The function does not compute null deviance.