SQL-MapReduce Call - Aster Analytics

Teradata Aster Analytics Foundation User Guide

Product
Aster Analytics
Release Number
6.21
Published
November 2016
Language
English (United States)
Last Update
2018-04-14
dita:mapPath
kiu1466024880662.ditamap
dita:ditavalPath
AA-notempfilter_pdf_output.ditaval
dita:id
B700-1021
lifecycle
previous
Product Category
Software

TextChunker requires each sentence to have a unique identifier, and the input to TextChunker must be partitioned by that identifier.

SELECT * FROM TextChunker (
  ON  POSTagger  (
    ON (
      SELECT paraid*1000+sentence_sn
      AS  sentence_id, sentence FROM  Sentenizer  (
        ON paragraphs_input
        TextColumn ('paratext')
        Accumulate ('paraid')
      )
    )
    TextColumn ('sentence')
    Accumulate ('sentence_id')
  ) PARTITION BY  sentence_id  ORDER BY word_sn
  WordColumn ('word')
  POSColumn ('pos_tag')
) ORDER BY 1, 2;