Argument | Category | Description |
---|---|---|
InputTable | Required | Specifies the name of the table or view that contains the training samples. |
ModelTable | Required | Specifies the name for the model table that the function creates (which must not exist). |
SampleIDColumn | Required | Specifies the name of the input_table column that contains the identifiers of the training samples. |
AttributeColumn | Required | Specifies the name of the input_table column that contains the attributes of the samples. |
ValueColumn | Optional | Specifies the name of the input_table column that contains the attribute values. By default, each attribute has the value 1. |
LabelColumn | Required | Specifies the name of the input_table column that contains the classes of the samples. |
Cost | Optional | The regularization parameter λ in the SVM soft-margin loss function: Must be greater than 0.0. The default value is 1.0. |
Bias | Optional | A non-negative value. If the value is greater than zero, the function converts each sample in the training set to ( , b); that is, it adds another dimension containing the bias value b. This argument addresses situations where not all samples center at 0. The default value is 0.0. |
Hash | Optional | Specifies whether to use hash projection on attributes. Hash projection can accelerate processing speed but can slightly decrease accuracy. The default value is 'false'. You must use hash projection if the dataset has more features than fit into memory.
|
HashBuckets | Optional | Valid only if Hash is 'true'. Specifies the number of buckets for hash projection. In most cases, the function can determine the appropriate number of buckets from the scale of the input data set. However, if the dataset has a very large number of features, you might have to specify buckets_number to accelerate the function. |
ClassWeights | Optional | Specifies the weights for different classes. The format is: “classlabel m:weight m, classlabel n:weight n”. If weight for a class is given, the cost parameter for this class is weight * cost. A weight larger than 1 often increases the accuracy of the corresponding class; however, it may decrease global accuracy. Classes not assigned a weight in this argument is assigned a weight of 1.0. |
MaxStep | Optional | A positive integer value that specifies the maximum number of iterations of the training process. One step means that each sample is seen once by the trainer. The input value must be in the range (0, 10000]. The default value is 100. |
Epsilon | Optional | Termination criterion. When the difference between the values of the loss function in two sequential iterations is less than this number, the function stops. Must be greater than 0.0. The default value is 0.01. |
Seed | Optional | A long integer value used to order the training set randomly and consistently. You can use this value can to ensure that the same model is generated if the function is run multiple times in a given database with the same arguments. The input value must be in the range [0, 9223372036854775807]. The default value is 0. |