Data intercept
Weights to be applied.
Number of points in sample.
Random seed
Epsilon scaling factor.
Return a Java List of synthetic data randomly generated according to a multi collinear model.
Return a Java List of synthetic data randomly generated according to a multi collinear model.
Data intercept
Weights to be applied.
Number of points in sample.
Random seed
Java List of input.
Generate an RDD containing sample data for Linear Regression models - including Ridge, Lasso, and uregularized variants.
Generate an RDD containing sample data for Linear Regression models - including Ridge, Lasso, and uregularized variants.
SparkContext to be used for generating the RDD.
Number of examples that will be contained in the RDD.
Number of features to generate for each example.
Epsilon factor by which examples are scaled.
Number of partitions in the RDD. Default value is 2.
RDD of LabeledPoint containing sample data.
Generate sample data used for Linear Data. This class generates uniformly random values for every feature and adds Gaussian noise with mean
eps
to the response variableY
.