TextParser Syntax - Teradata Vantage

Machine Learning Engine Analytic Function Reference

Product
Teradata Vantage
Release Number
8.10
1.1
Published
October 2019
Language
English (United States)
Last Update
2019-12-31
dita:mapPath
ima1540829771750.ditamap
dita:ditavalPath
jsj1481748799576.ditaval
dita:id
B700-4003
lifecycle
previous
Product Category
Teradata Vantageā„¢

Version 1.14

SELECT * FROM TextParser (
  ON { table | view | (query) } [ PARTITION BY expression [,...] ]
  USING
  TextColumn ('text_column')
  [ ConvertToLowerCase ({'true'|'t'|'yes'|'y'|'1'|'false'|'f'|'no'|'n'|'0'}) ]
  [ StemTokens ({'true'|'t'|'yes'|'y'|'1'|'false'|'f'|'no'|'n'|'0'}) ]
  [ Delimiter ('delimiter_regular_expression') ]
  [ OutputTotalWords ({'true'|'t'|'yes'|'y'|'1'|'false'|'f'|'no'|'n'|'0'}) ]
  [ Punctuation ('punctuation_regular_expression') ]
  [ Accumulate ({ 'accumulate_column' | accumulate_column_range }[,...]) ]
  [ TokenColName ('token_column') ]
  [ FrequencyColName ('frequency_column') ]
  [ TotalColName ('total_column') ]
  [ RemoveStopWords ({'true'|'t'|'yes'|'y'|'1'|'false'|'f'|'no'|'n'|'0'}) ]
  [ PositionColName ('position_column') ]
  [ ListPositions ({'true'|'t'|'yes'|'y'|'1'|'false'|'f'|'no'|'n'|'0'}) ]
  [ OutputByWord ({'true'|'t'|'yes'|'y'|'1'|'false'|'f'|'no'|'n'|'0'}) ]
  [ StemExceptions ('exception_rule_file') ]
  [ StopWordsList ('stop_word_file') ]
) AS alias;

If you include the PARTITION BY clause, the function treats all rows in the same partition as a single document. If you omit the PARTITION BY clause, the function treats each row as a single document.