The KmeansRun class defines a wrapper function that uses the Aster Spark API and implements the running phase of the Spark MLlib K-means clustering algorithm. The function uses a model that is typically generated by the KMeansTrain function.
Run Method Signature
run(input: RDD[DataRow], sparkFunctParams: String): RDD[DataRow]
Parameters
String representing the parameters specific to the function you are implementing. The string has this syntax:
'--option_value_pair [,...]'
option_value_pair is:
-
modelLocation model_location
Required. Specifies the file system or HDFS path to the location of the training model.
Returns
The input data and the predicted value (that is, the cluster number).
Side Effects
Function saves model in model_location.
Version
Spark 1.4 and later.