Generate an RDD containing test data for KMeans.
SparkContext to use for creating the RDD
Number of points that will be contained in the RDD
Number of clusters (k)
Number of dimensions (d)
Scaling factor (r) for the distribution of the initial centers
Number of partitions of the generated RDD; default 2
:: DeveloperApi :: Generate test data for KMeans. This class first chooses k cluster centers from a d-dimensional Gaussian distribution scaled by factor r and then creates a Gaussian cluster with scale 1 around each center.
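The two-step procedure described above can be sketched in plain Python, without Spark. This is only an illustration of the described sampling scheme, not the library's implementation: the function name `generate_kmeans_points`, the `seed` argument, and the round-robin assignment of points to centers are all assumptions made for the example, and the result is an ordinary list rather than an RDD.

```python
import random


def generate_kmeans_points(num_points, k, d, r, seed=42):
    """Hypothetical sketch: k Gaussian centers scaled by r, then unit-scale clusters."""
    rng = random.Random(seed)  # fixed seed (an assumption) so runs are repeatable

    # Step 1: draw k centers from a d-dimensional standard Gaussian, scaled by r.
    centers = [[rng.gauss(0.0, 1.0) * r for _ in range(d)] for _ in range(k)]

    # Step 2: build a Gaussian cluster with scale 1 around each center.
    # Points are assigned to centers round-robin (an assumption of this sketch).
    points = []
    for idx in range(num_points):
        center = centers[idx % k]
        points.append([c + rng.gauss(0.0, 1.0) for c in center])
    return centers, points


centers, points = generate_kmeans_points(num_points=1000, k=3, d=2, r=10.0)
print(len(centers), len(points), len(points[0]))
```

A larger r spreads the centers further apart relative to the fixed unit scale of each cluster, so the generated clusters overlap less and are easier to separate.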