Arguments - Aster Analytics

Teradata Aster Analytics Foundation User Guide

Product
Aster Analytics
Release Number
6.21
Published
November 2016
Language
English (United States)
Last Update
2018-04-14
dita:mapPath
kiu1466024880662.ditamap
dita:ditavalPath
AA-notempfilter_pdf_output.ditaval
dita:id
B700-1021
lifecycle
previous
Product Category
Software
Argument Category Description
InputColumn Required Specifies the name of the input column that contains the packed data.
OutputColumns Required Specifies the names to give to the output columns, in the order in which the corresponding virtual columns appear in input_column.
OutputDataTypes Required Specifies the datatypes of the unpacked output columns.

If OutputDataTypes specifies only one value and OutputColumns specifies multiple columns, then the specified value applies to every output_column.

If OutputDataTypes specifies multiple values, then it must specify a value for each output_column. The nth datatype corresponds to the nth output_column.

Delimiter Optional Specifies the delimiter (a string) that separates the virtual columns in the packed data. If delimiter contains a character that is a symbol in a regular expression—such as an asterisk (*) or pipe character (|)—precede it with two escape characters. For example, if the delimiter is the pipe character, specify '\\|'. The default delimiter is comma (,).
If the virtual columns are separated by a delimiter, then specify the delimiter with this argument; otherwise, specify the ColumnLength argument.
Do not specify both this argument and the ColumnLength argument.
ColumnLength Optional Specifies the lengths of the virtual columns; therefore, to use this argument, you must know the length of each virtual column.

If ColumnLength specifies only one value and OutputColumns specifies multiple columns, then the specified value applies to every output_column.

If ColumnLength specifies multiple values, then it must specify a value for each output_column. The nth datatype corresponds to the nth output_column. However, the last output_column can be an asterisk (*), which represents a single virtual column that contains the remaining data. For example, if the first three virtual columns have the lengths 2, 1, and 3, and all remaining data belongs to the fourth virtual column, you can specify ColumnLength ('2', '1', '3', *).

Do not specify both this argument and the Delimiter argument.
Regex Optional Specifies a regular expression that describes a row of packed data, enabling the function to find the data values.

A row of packed data contains one data value for each virtual column, but the row might also contain other information (such as the virtual column name). In the regular_expression, each data value is enclosed in parentheses.

For example, suppose that the packed data has two virtual columns, age and sex, and that one row of packed data is age:34,sex:male.

The regular_expression that describes the row is '.*:(.*)'. The '.*:' matches the virtual column names, age and sex, and the '(.*)' matches the values, 34 and male.

The default regular_expression is '(.*)', which matches the whole string (between delimiters, if any). When applied to the preceding sample row, the default regular_expression causes the function to return 'age:34' and 'sex:male' as data values.

To represent multiple data groups in regular_expression, use multiple pairs of parentheses. By default, the last data group in regular_expression represents the data value (other data groups are assumed to be virtual column names or unwanted data). If a different data group represents the data value, specify its group number with the RegexSet argument.

RegexSet Optional Specifies the ordinal number of the data group in regular_expression that represents the data value in a virtual column. By default, the last data group in regular_expression represents the data value.

For example, suppose that regular_expression is '([a-zA-Z]*):(.*)'. If group_number is '1', then '([a-zA-Z]*)' represents the data value. If group_number is '2', then '(.*)' represents the data value.

Exception Optional Specifies whether the function ignores rows that contain invalid data; that is, continues without outputting them. The default value is 'false', which causes the function to fail if it encounters a row with invalid data.