Arguments - Aster Analytics

Teradata Aster Analytics Foundation User Guide

Product: Aster Analytics
Release Number: 6.21
Published: November 2016
Language: English (United States)
Last Update: 2018-04-14
dita:mapPath: kiu1466024880662.ditamap
dita:ditavalPath: AA-notempfilter_pdf_output.ditaval
dita:id: B700-1021
lifecycle: previous
Product Category: Software

Argument	Category	Description
TextColumn	Required	Specifies the name of the input table column that contains the text to analyze.
ExtractorJAR	Optional	Specifies the name of the JAR file that contains the Java classes that extract features. You must install this JAR file in Aster Database under the user search path before calling the function. The name jar_file is case-sensitive.
FeatureTemplate	Required	Specifies the name of the file that specifies how to generate features when training the model. You must install this feature template file in Aster Database under the user search path before calling the function. For more information about template_file, refer to Feature Template.
ModelFile	Required	Specifies the name of the model file that the function generates and installs in Aster Database.
Language	Optional	Specifies the language of the input text: 'en' (English, the default) 'zh_CN' (Simplified Chinese) 'zh_TW' (Traditional Chinese)
MaxIterNum	Optional	Specifies the maximum number of iterations. The default value is 1000.
Eta	Optional	Specifies the tolerance of the termination criterion. Defines the differences of the values of the loss function between two sequential epochs. The default value is 0.0001. When training a model, the function performs n-times iterations. At the end of each epoch, the function calculates the loss or cost function on the training samples. If the loss function value change is very small between two sequential epochs, the function considers the training process to have converged. The function defines Eta as: Eta=(f(n)-f(n-1))/f(n-1) where f(n) is the loss function value of the nth epoch.
MinOccurNum	Optional	Specifies the minimum number times that a feature must occur in the input text before the function uses the feature to construct the model. The default value is 0.