5.4.6 - In-Database Analytic Functions - Teradata Warehouse Miner

In-Database Analytic Functions User Guide

Teradata Warehouse Miner
November 2018
User Guide

To invoke an in-database function, use Teradata Studio or Teradata BTEQ. The controlling program for these analytic functions is a custom Teradata External Stored Procedure called TD_Analyze. Some of the functions also use custom Teradata Table Operators. To use the Teradata Warehouse Miner (TWM) In-Database Analytic Functions, these user-defined functions (UDF) must be installed in the Teradata database using a console application supplied with TWM, accessible from the Start menu in the TWM program group. Once these UDFs are installed, they are run without invoking the TWM client application.

Although independent of the TWM client application, the in-database analytic functions are based on functions available in TWM. All of the in-database functions and more are provided in the TWM client application, which has three principal function categories.

Function Category Description
Data Profiling Descriptive statistics provided to generate reports and graphics with drill down capabilities, pointing out potential data quality issues. The In-Database Descriptive Statistics functions are derived from the corresponding functions in the Data Profiling function category.
Analytic Data Set (ADS) Generation Functions used to build and transform analytic data sets. The In-Database Variable Transformation function is derived from the Variable Transformation function in the Analytic Data Set Generation function category.
Analytic Functions Functions used to invoke analytic algorithms and scoring along with statistical tests. The In-Database Fast K-Means algorithm and the Gain Ratio Extreme algorithm provide new variations of the Clustering and Decision Tree algorithms and scoring in TWM in the Analytic Functions category.
Each of the in-database functions described in this document has a counterpart that resides in the TWM client application. The client application includes graphing and reporting features not available in the In-Database Analytic Functions. Some features of the in-database functions are more complete than in the client application, such as more fully supporting extended object names (that is, 128 character database, table, and column names).
By default, when an in-database analytic function creates a table, the existing table is automatically dropped first to allow re-execution of the analysis. The automatic dropping of output tables prior to execution may be inhibited if the overwrite parameter is set to false (overwrite=false).
You can copy and use the examples in this document on a system where the UDF is installed and the twm default user name along with the rest of the tutorial environment (including the twm_source, the twm_result database, and the tutorial tables installed in twm_source) is set up.