Statistics | Vantage Analytics Library

Vantage Analytics Library User Guide

Deployment: VantageCloud, VantageCore
Edition: VMware, Enterprise, IntelliFlex, Lake
Product: Vantage Analytics Library
Release Number: 2.2.0
Published: June 2025
Product Category: Teradata Vantage

Statistics analysis helps determine the characteristics of numeric data, assess its quality, and find outlying values and other anomalies.

You can apply Statistics analysis to columns of any numeric type, including date types.

Basic Statistics

Given a table name and the names of one or more numeric columns, Statistics analysis provides the following for each column:
  • Count (number of rows with values) *
  • Minimum value *
  • Maximum value *
  • Mean *
  • Standard deviation *
  • Skewness
  • Kurtosis
  • Standard error
  • Coefficient of variation
  • Variance
  • Sum
  • Uncorrected sum of squares
  • Corrected sum of squares

* Selected by default.
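
The statistics listed above can be illustrated with a minimal Python sketch. It is not the Vantage Analytics Library implementation. The function name basic_statistics and the result keys are hypothetical, and the formulas are assumptions: sample (n - 1) variance and standard deviation, coefficient of variation as a plain ratio rather than a percentage, and moment-based skewness and excess kurtosis. The library may define these differently.

```python
import math


def basic_statistics(values):
    """Return basic statistics for a list of numbers (None stands for NULL)."""
    xs = [float(v) for v in values if v is not None]
    n = len(xs)                              # count: rows that hold a value
    if n < 2:
        raise ValueError("this sketch needs at least two non-NULL values")

    total = sum(xs)
    mean = total / n
    uss = sum(x * x for x in xs)             # uncorrected sum of squares
    css = sum((x - mean) ** 2 for x in xs)   # corrected sum of squares

    variance = css / (n - 1)                 # assumption: sample (n - 1) form
    stddev = math.sqrt(variance)
    stderr = stddev / math.sqrt(n)           # standard error of the mean
    cv = stddev / mean if mean != 0 else math.nan  # assumption: plain ratio

    # Assumption: moment-based skewness and excess kurtosis (population form).
    m2 = css / n
    if m2 > 0:
        skewness = sum((x - mean) ** 3 for x in xs) / n / m2 ** 1.5
        kurtosis = sum((x - mean) ** 4 for x in xs) / n / m2 ** 2 - 3.0
    else:
        skewness = kurtosis = math.nan       # undefined for constant data

    return {
        "count": n, "minimum": min(xs), "maximum": max(xs), "mean": mean,
        "stddev": stddev, "skewness": skewness, "kurtosis": kurtosis,
        "stderr": stderr, "cv": cv, "variance": variance, "sum": total,
        "uss": uss, "css": css,
    }


if __name__ == "__main__":
    for name, value in basic_statistics([2, 4, 4, None, 4, 5, 5, 7, 9]).items():
        print(f"{name}: {value}")
```

The corrected sum of squares equals the uncorrected sum of squares minus count times the squared mean, which gives a quick consistency check on any set of results.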

For every statistic except count, minimum, and maximum, Statistics analysis first converts DATE columns to the number of days since 1900.
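
As a rough illustration of the date conversion, the Python sketch below maps dates to day numbers before averaging them. The text says only "days since 1900", so the exact epoch of January 1, 1900 is an assumption. The helper names date_to_day_number and day_number_to_date are hypothetical.

```python
from datetime import date, timedelta

EPOCH = date(1900, 1, 1)  # assumption: day 0 is January 1, 1900


def date_to_day_number(d):
    """Convert a date to the number of days elapsed since EPOCH."""
    return (d - EPOCH).days


def day_number_to_date(n):
    """Convert a whole day number back to a date (for display only)."""
    return EPOCH + timedelta(days=n)


if __name__ == "__main__":
    dates = [date(2025, 6, 1), date(2025, 6, 3)]
    day_numbers = [date_to_day_number(d) for d in dates]
    mean_days = sum(day_numbers) / len(day_numbers)
    # The mean is a day number; round it before turning it back into a date.
    print(mean_days, day_number_to_date(round(mean_days)))
```

Under this assumption, statistics such as the mean or standard deviation of a DATE column are expressed in days, not as calendar dates.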

Extended Statistics

You can request the following additional statistics:

Statistic    Description
modes        Modal value (the most frequently occurring value in the column).
quantiles    Bottom 10 percentiles, deciles, quartiles, tertiles, and top 10 percentiles of the values in the column.
values         • Number of rows in column with non-NULL values.
               • Number of rows in column with NULL values.
               • Number of rows in column with unique values.
               • Number of rows in column with value 0.
               • Number of rows in column with positive values.
               • Number of rows in column with negative values.
               • Number of rows in column with blank values.
                 • Appears in the results data as xblank.
             Values analysis provides the same values (see Values).
rank         The five smallest values in the column, the five largest values in the column, and the number of times that each value appears in the column.
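
The extended statistics can be pictured with the following Python sketch, which uses only the standard library. It is not the library's implementation. The function name extended_statistics and the result keys are hypothetical. The sketch assumes that "unique values" means the count of distinct values, that the bottom and top 10 percentiles are the 1st to 10th and 90th to 99th cut points, and that the default interpolation of Python's statistics.quantiles is acceptable. It omits the blank count because it handles numeric input only.

```python
from collections import Counter
from statistics import quantiles


def extended_statistics(values, top_n=5):
    """Illustrate the modes, quantiles, values, and rank outputs.

    `values` is a list of numbers in which None stands for NULL.
    At least two non-NULL values are needed for the quantiles.
    """
    non_null = [v for v in values if v is not None]
    counts = Counter(non_null)       # value -> number of occurrences
    ordered = sorted(counts)         # distinct values, ascending

    # modes: every value that shares the highest frequency
    top_freq = max(counts.values(), default=0)
    modes = [v for v in ordered if counts[v] == top_freq]

    # quantiles: cut points computed by the standard library
    percentiles = quantiles(non_null, n=100)
    quantile_stats = {
        "bottom_percentiles": percentiles[:10],   # assumed 1st..10th
        "deciles": quantiles(non_null, n=10),
        "quartiles": quantiles(non_null, n=4),
        "tertiles": quantiles(non_null, n=3),
        "top_percentiles": percentiles[-10:],     # assumed 90th..99th
    }

    # values: row counts by kind of value
    value_counts = {
        "non_null": len(non_null),
        "null": len(values) - len(non_null),
        "unique": len(counts),                    # assumed: distinct values
        "zero": sum(1 for v in non_null if v == 0),
        "positive": sum(1 for v in non_null if v > 0),
        "negative": sum(1 for v in non_null if v < 0),
    }

    # rank: smallest and largest distinct values with their frequencies
    rank = {
        "smallest": [(v, counts[v]) for v in ordered[:top_n]],
        "largest": [(v, counts[v]) for v in ordered[::-1][:top_n]],
    }
    return {"modes": modes, "quantiles": quantile_stats,
            "values": value_counts, "rank": rank}


if __name__ == "__main__":
    result = extended_statistics([3, 1, 2, 2, None, 0, -4, 7, 7, 9])
    for name, value in result.items():
        print(name, value)
```

The sketch returns every value tied for the highest frequency as a mode. How the library reports ties is not stated in the text above.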