- Delimiter
- Specify, with a regular expression, the character or string that separates words in the input text.
Default: ' ' (space).
- OverLapping
- Specify whether the function allows overlapping n-grams.
- ConvertToLowerCase
- Specify whether the function converts all letters in the input text to lowercase.
- Punctuation
- Specify, with a regular expression, the punctuation characters for the function to remove before evaluating the input text.
Default: '`~#^&*()-'
- Reset
- Specify, with a regular expression, the character or string that ends a sentence.
Default: '.,?!'
- OutputTotalGramCount
- Specify whether the function returns the total number of n-grams in the document (that is, in the row) for each length n specified in the Grams syntax element. If you specify 'true', the TotalCountColName syntax element determines the name of the output table column that contains these totals.
- TotalCountColName
- Specify the name of the output table column that appears if the value of the OutputTotalGramCount syntax element is 'true'.
- Accumulate
- Specify the names of the input table columns to copy to the output table for each n-gram. These columns cannot have the same names as those specified by the syntax elements NGramColName, GramLengthColName, and TotalCountColName.
- NGramColName
- Specify the name of the output table column that is to contain the created n-grams.
- GramLengthColName
- Specify the name of the output table column that is to contain the length of each n-gram (in words).
- FrequencyColName
- Specify the name of the output table column that is to contain the count of each unique n-gram (that is, the number of times that each unique n-gram appears in the document).
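
To make the interaction of these syntax elements concrete, here is a minimal Python sketch of the text-processing steps they describe. It is not the function's actual implementation. The name `ngram_counts` and its parameters are hypothetical stand-ins for Delimiter, OverLapping, ConvertToLowerCase, Punctuation, and Reset. The sketch assumes that the Punctuation and Reset values act as sets of single characters, that an n-gram never spans a sentence boundary, and that with overlapping disabled the next n-gram starts right after the previous one ends.

```python
import re
from collections import Counter


def ngram_counts(text, n, delimiter=" ", overlapping=True,
                 to_lower=True, punctuation="`~#^&*()-", reset=".,?!"):
    """Illustrative sketch only: count the n-grams of length n in one document."""
    if to_lower:
        text = text.lower()  # ConvertToLowerCase

    # Assumption: each Reset character ends a sentence.
    sentences = re.split("[" + re.escape(reset) + "]", text)

    counts = Counter()
    for sentence in sentences:
        # Assumption: each Punctuation character is removed before tokenizing.
        cleaned = re.sub("[" + re.escape(punctuation) + "]", "", sentence)
        # Delimiter is applied as a regular expression; empty tokens are dropped.
        words = [w for w in re.split(delimiter, cleaned) if w]
        # Assumption: non-overlapping n-grams advance n words at a time.
        step = 1 if overlapping else n
        for i in range(0, len(words) - n + 1, step):
            counts[" ".join(words[i:i + n])] += 1
    return counts


bigrams = ngram_counts("The quick brown fox. The quick dog!", 2)
total_bigrams = sum(bigrams.values())  # analogous to OutputTotalGramCount
```

In this example the bigram `the quick` is counted twice, and no bigram spans the sentence boundary (there is no `fox the`). Each key of `bigrams` corresponds to a value in the NGramColName column, each count to the FrequencyColName column, and `total_bigrams` to the column named by TotalCountColName.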