Syntax | Adaptive Histogram | Vantage Analytics Library - Syntax - Vantage Analytics Library

Vantage Analytics Library User Guide

Deployment
VantageCloud
VantageCore
Edition
Enterprise
IntelliFlex
Lake
VMware
Product
Vantage Analytics Library
Release Number
2.2.0
Published
March 2023
Language
English (United States)
Last Update
2024-01-02
dita:mapPath
ibw1595473364329.ditamap
dita:ditavalPath
iup1603985291876.ditaval
dita:id
zyl1473786378775
Product Category
Teradata Vantage
CALL td_analyze (
  'adaptivehistogram',
  'required_parameter_list [ optional_parameter; [...] ]'
);
required_parameter_list
database = input_database_name;
tablename = input_table_name;
columns = { column_name [,...] | keyword };
optional_parameter
{ bins = bins [,...] |
  columnstoexclude = column_name [,...] |
  gensqlonly = { true | false } |
  outputdatabase = output_database_name |
  outputtablename = output_table_name |
  overlaycolumns = column_name [,...] |
  overwrite = { true | false } |
  spikethreshold = spike_threshold |
  subdivisionmethod = { means | quantiles } |
  subdivisionthreshold = subdiv_threshold |
  where = expression
}

Syntax Elements

database
The database containing the table to analyze.
tablename
The table containing the columns to analyze.
columns
The columns to analyze.
keyword Description
all All columns.
allnumeric All numeric columns.
allnumericanddate All numeric and date columns.
bins
[Optional] The number of equal width bins to create (for example, bins=5). If you specify multiple columns, you can specify multiple bin sizes (for example, bins=5, 10). If you specify fewer bin sizes than columns, leftover columns have the default size, 10.
For each bin, beginning range values are inclusive and ending range values are exclusive, with these exceptions:
  • The last ending range value is inclusive.
  • The ending range value of a spike is inclusive (because the beginning and ending values of a spike are the same).
  • The beginning range value of a bin that follows and adjoins a spike is exclusive (because this value is the same as the spike value).
  • The ending range value of a quantile sub-bin is inclusive.
columnstoexclude
[Optional] The columns to exclude when columns specifies a keyword.
gensqlonly
[Optional] True returns the SQL for the function as a result set but does not run it.
False runs the SQL for the function but does not return it as a result set.
Default: false
outputdatabase
[Optional] The name of the database to contain the analysis results table.
outputtablename
[Optional] The name of the table to store the analysis results.
Default behavior: Function returns results as a result set but does not create an output table.
overwrite
[Optional] Whether to drop the output tables before creating new ones.
Default: true
spikethreshold
[Optional] The percentage of rows, an integer in the range [1, 100], above which an individual value of a column is identified as a separate bin. Values with this or a larger percentage of rows are identified as a Spike.
Default: 10% of the total number of rows
subdivisionmethod
[Optional] The method for subdividing bins with too many values:
Subdivision Method Description
means Use range of +/- standard deviation around mean value in bin.
quantiles Use quantiles, producing approximately equally distributed bins.
subdivisionthreshold
[Optional] The percentage of rows, an integer in the range [1, 100], above which a bin is subdivided into sub-bins.
Default: 30% of the total number of rows
where
[Optional] The expression in the SQL WHERE clause to include in the generated SQL to filter rows selected for analysis.
Examples:
  • where = cust_id > 0 includes this WHERE clause in the generated SQL:
    WHERE cust_id > 0
  • where = gender = ''F'' includes this WHERE clause in the generated SQL:
    WHERE gender='F'