Generate an RDD containing test data for LogisticRegression.
Generate an RDD containing test data for LogisticRegression.
SparkContext to use for creating the RDD.
Number of examples that will be contained in the RDD.
Number of features to generate for each example.
Epsilon factor by which positive examples are scaled.
Number of partitions of the generated RDD. Default value is 2.
Probability that a label is 1 (and not 0). Default value is 0.5.
Generate test data for LogisticRegression. This class chooses positive labels with probability
probOne
and scales features for positive examples byeps
.