7.00.02 - Step 1: Create Tokenized Training Document Set - Aster Analytics

Teradata Aster® Analytics Foundation User GuideUpdate 2

Product
Aster Analytics
Release Number
7.00.02
Published
September 2017
Content Type
Programming Reference
User Guide
Publication ID
B700-1022-700K
Language
English (United States)
Last Update
2018-04-17
CREATE FACT TABLE tfidf_token1 DISTRIBUTE BY HASH(docid) AS
  SELECT * FROM nGram (
    ON tfidf_train
    TextColumn ('content')
    Delimiter (' ')
    Grams ('1')
    Overlapping ('false')
    ToLowerCase ('true')
    Punctuation ('\[.,?\!\]')
    Reset ('\[.,?\!\]')
    Total ('false')
      Accumulate ('docid')
  );