Advanced SQL Engine | Teradata Vantage - Advanced SQL Engine Analytic Functions Overview - Teradata® Database

Database Analytic Functions

Product
Teradata® Database
Release Number
17.10
Published
July 2021
Language
English (United States)
Last Update
2021-07-28
dita:mapPath
Teradata_Vantage™___Advanced_SQL_Engine_Analytic_Functions.withLogo_upload_July2021/wnd1589838592459.ditamap
dita:ditavalPath
Teradata_Vantage™___Advanced_SQL_Engine_Analytic_Functions.withLogo_upload_July2021/ayr1485454803741.ditaval
dita:id
B035-1206
lifecycle
previous
Product Category
Teradata Vantage™

Advanced SQL Engine (was NewSQL Engine) analytic functions are specifically for analyzing data. Data can include clickstreams, financial transaction data, and user interaction data.

Advanced SQL Engine Analytical Functions
Function Name Description
Antiselect (SQL Engine) AntiSelect returns all columns except those specified.
Attribution (SQL Engine) Calculates attributions with a wide range of distribution models. Often used in web-page analysis.
DecisionForestPredict (SQL Engine) Uses the model file output by Machine Learning Engine (ML Engine) DecisionForest function to analyze the input data and make predictions.
DecisionTreePredict (SQL Engine) Uses the model file output by ML Engine DecisionTree function to analyze the input data and make predictions.
GLMPredict (SQL Engine) Uses the model file output by ML Engine GLM function to analyze the input data and make predictions.
MovingAverage (SQL Engine) Computes average values in a series.
NaiveBayesPredict (SQL Engine) Uses the model file output by ML Engine Naive Bayes Classifier function to analyze the input data and make predictions.
NaiveBayesTextClassifierPredict (SQL Engine) Uses the model file output by ML Engine NaiveBayesTextClassifierTrainer function to analyze the input data and make predictions.
NGramSplitter (SQL Engine) Tokenizes (splits) an input stream and emits n multigrams, based on specified delimiter and reset parameters. Useful for sentiment analysis, topic identification, and document classification.
nPath® (SQL Engine) Performs regular pattern matching over a sequence of rows from one or more inputs.
Pack (SQL Engine) Compresses data in multiple columns into a single packed data column.
Sessionize (SQL Engine) Maps each click in a clickstream to a unique session identifier.
StringSimilarity (SQL Engine) Calculates the similarity between two strings, using the specified comparison method.
SVMSparsePredict (SQL Engine) Uses the model file output by ML Engine SVMSparse function to analyze the input data and make predictions.
Unpack (SQL Engine) Expands data from a single packed column to multiple unpacked columns.

Advanced SQL Engine analytic functions have corresponding ML Engine functions. The syntax of corresponding functions may differ, but given the same inputs and syntax element values, they produce the same results (with the minor exceptions noted in specific functions).

To execute ML Engine functions on Teradata Vantage™, contact your Teradata Support representative.

Data Cleaning Functions

Function Name Description
TD_ConvertTo Converts the specified input table columns to specified data types.
TD_GetRowsWithoutMissingValues Displays the rows that have non-NULL values in the specified input table columns.
TD_OutlierFilterFit Calculates the lower_percentile, upper_percentile, count of rows, and median for the specified input table columns.
TD_OutlierFilterTransform Filters outliers from the input table
TD_SimpleImputeFit Outputs a table of values to substitute for missing values in the input table.
TD_SimpleImputeTransform Substitutes specified values for missing values in the input table.

Data Exploration Functions

Function Name Description
TD_CategoricalSummary Displays the distinct values and their counts for each specified input table column.
TD_ColumnSummary Displays a summary of each specified input table column.
TD_GetRowsWithMissingValues Displays the rows that have NULL values in the specified input table columns.
TD_Histogram Calculates the frequency distribution of a data set.
TD_QQNorm Checks whether the values in the specified input table columns are normally distributed.
TD_UnivariateStatistics Displays descriptive statistics for each specified numeric input table column.
TD_WhichMax Displays all rows that have the maximum value in a specified input table column.
TD_WhichMin Displays all rows that have the minimum value in specified input table column.

Feature Engineering Transform Functions

Function Name Description
TD_BinCodeFit Converts numeric data to categorical data by binning the numeric data into multiple numeric bins (intervals).
TD_BinCodeTransform Transforms input table columns from the BinCodeFit function output.
TD_FunctionFit Determines whether specified numeric transformations can be applied to specified input columns.
TD_FunctionTransform Applies numeric transformations to input columns to the FunctionFit output.
TD_OneHotEncodingFit Outputs a table of attributes and categorical values to the TD_OneHotEncodingTransform function.
TD_OneHotEncodingTransform Encodes specified attributes and categorical values as one-hot numeric vectors using the output from the TD_OneHotEncodingFit function.
TD_PolynomialFeaturesFit Stores all the specified values in the argument in a tabular format.
TD_PolynomialFeaturesTransform Extracts values of arguments from the output of the TD_PolynomialFeaturesFit function and generates a feature matrix of all polynomial combinations of the features.
TD_RowNormalizeFit Outputs a table of parameters and specified input columns to TD_RowNormalizeTransform which normalizes the input columns row-wise.
TD_RowNormalizeTransform Normalizes the input columns row-wise using the output of the TD_RowNormalizeFit function.
TD_ScaleFit Outputs a table of statistics to the TD_ScaleTransform function.
TD_ScaleTransform Scales the specified input table columns using the output of the TD_ScaleFit function.

Feature Engineering Utility Functions

Function Name Description
TD_FillRowID Adds a column of unique row identifiers to the input table.
TD_NumApply Applies a specified numeric operator to the specified input table columns.
TD_RoundColumns Rounds the values of each specified input table column to a specified number of decimal places
TD_StrApply Applies a specified string operator to the specified input table columns.

Hypothesis Testing Functions

Function Name Description
TD_ChiSq Performs Pearson's chi-squared test for independence.
TD_FTest Performs an F-test, for which the test statistic has an F-distribution under the null hypothesis.
TD_ZTest Performs a Z-test, for which the distribution of the test statistic under the null hypothesis can be approximated by normal distribution.

More Information

Topic Reference
ML Engine functions Teradata Vantage™ Machine Learning Engine Analytic Function Reference, B700-4003
Aster Analytics functions Teradata Aster® Analytics Foundation User Guide, B700-1022
Installing model files output by ML Engine functions on Advanced SQL Engine Teradata Vantage™ User Guide, B700-4002