TD_ScaleFit Output - Teradata Vantage

Teradata® VantageCloud Lake

Deployment
VantageCloud
Edition
Lake
Product
Teradata Vantage
Published
January 2023
Language
English (United States)
Last Update
2024-04-03
dita:mapPath
phg1621910019905.ditamap
dita:ditavalPath
pny1626732985837.ditaval
dita:id
phg1621910019905

Dense Input

Output Table Schema
Column Data Type Description
PartitionColumns Same as InputTable Columns on which input is partitioned. This appears only when the PartitionColumns argument is specified.
TD_STATTYPE_SCLFIT VARCHAR

(CHARACTER SET LATIN)

Statistic names and parameters—see following table.
TargetColumns REAL Statistics values for TargetColumns argument.

Sparse Input

Output Table Schema
Column Data Type Description
PartitionColumns Same as InputTable Columns on which input is partitioned. This appears only when the PartitionColumns argument is specified.
TD_STATTYPE_SCLFIT VARCHAR CHARACTER SET LATIN Statistic names and parameters.
AttributeNameColumn VARCHAR CHARACTER SET UNICODE Name of the attributes.
AttributeValueColumn REAL Statistics values for the attributes.

Stattypes the function outputs are:

Statistic Name Description
min Minimum value in the corresponding column.
max Maximum value in the corresponding column.
sum Sum value in the corresponding column.
count Count of valid values in the corresponding column.
null Count of NULL values in the corresponding column.
avg Average of valid values in corresponding column.
variance Variance of values in corresponding column. Variance is calculated according to N-1 degrees of freedom. (number of valid values minus one).
ustd Unbiased Standard deviation of values in corresponding column. Standard deviation is calculated according to N-1 degrees of freedom (number of valid values minus one).
std Standard deviation of values in corresponding column. Standard deviation is calculated according to N degrees of freedom (number of valid values).
missvalue_* * is the value of the MissValue argument: KEEP, ZERO, or LOCATION.
globalscale_* * is the value of the GlobalScale argument : true or false.
unusedattributes_* * is the value of the UnusedAttributes argument: unscaled or nullify.

The values for location and scale parameters for different ScaleMethods are:

Method Description Location Scale
mean Mean Xmean 1
sum Sum 0


ustd Unbiased Standard Deviation (Z-Score using Unbiased Standard Deviation) Xmean Standard deviation, calculated according to the unbiased estimator of the variance.

ustd = sqrt ( sum( Xi - Xmean)^2 ) / N - 1)

where N is the count of valid values

std Standard Deviation (Z-Score using Standard Deviation) Xmean Standard deviation, calculated according to the biased estimator of the variance.

std = sqrt ( sum( Xi - Xmean)^2 ) / N )

where N is the count of valid values

range Range Xmin Xmax-Xmin
midrange Midrange (Xmax+Xmin)/2 (Xmax+Xmin)/2
maxabs Maximum Absolute Value 0 Maximum of the absolute value of X.
rescale Rescale using specified lower bound, upper bound, or both. Lower_bound only:

(Xmin - lb*)

Upper_bound only:

(Xmax - ub*)

Lower_bound and Upper_bound:

Xmin - (lb / (ub-lb) )

*where
  • lb is lower_bound
  • ub is upper_bound
Lower_bound only:

1

Upper_bound only:

1

Lower_bound and Upper_bound:

(Xmax - Xmin) / (ub-lb) )