NGramSplitter_MLE Example: Overlapping ('false'), OutputTotalGramCount ('false') - Teradata Vantage

Machine Learning Engine Analytic Function Reference

Product
Teradata Vantage
Release Number
8.10
1.1
Published
October 2019
Language
English (United States)
Last Update
2019-12-31
dita:mapPath
ima1540829771750.ditamap
dita:ditavalPath
jsj1481748799576.ditaval
dita:id
B700-4003
lifecycle
previous
Product Category
Teradata Vantageā„¢

SQL Call

SELECT * FROM NGramSplitter_MLE (
  ON paragraphs_input
  USING
  TextColumn ('paratext')
  Delimiter (' ')
  Grams ('4-6')
  OverLapping ('false')
  ConvertToLowerCase ('true')
  OutputTotalGramCount ('false')
  Accumulate ('paraid', 'paratopic')
) AS dt ORDER BY paraid, paratopic, ngram;

Output

 paraid paratopic                 ngram                                                   n frequency 
 ------ ------------------------- ------------------------------------------------------- - --------- 
      1 decision trees            a decision tree as                                      4         1
      1 decision trees            a finite set of                                         4         1
      1 decision trees            a predictive model which                                4         1
      1 decision trees            an item to conclusions about                            5         1
      1 decision trees            and branches represent conjunctions                     4         1
      1 decision trees            branches represent conjunctions of features             5         1
      1 decision trees            can take a finite set of                                6         1
      1 decision trees            can take continuous values typically real               6         1
      ...
      2 simple regression         a straight line through                                 4         1
      2 simple regression         a way that makes the                                    5         1
      2 simple regression         least squares estimator of a                            5         1
      2 simple regression         line as small as                                        4         1
      2 simple regression         line through the set of n                               6         1
      2 simple regression         linear regression model with a                          5         1
      2 simple regression         makes the sum of squared residuals                      6         1
      2 simple regression         model with a single                                     4         1
      ...

Download a zip file of all examples and a SQL script file that creates their input tables from the attachment in the left sidebar.