when true the distinct operation is performed partially, per partition, without shuffling the data.
the input query plan.
the input query plan.
the input query plan.
Runs this query returning the result as an RDD.
Runs this query returning the result as an array.
Runs this query returning the result as an array.
Specifies how data is partitioned across different nodes in the cluster.
Specifies how data is partitioned across different nodes in the cluster.
when true the distinct operation is performed partially, per partition, without shuffling the data.
Specifies any partition requirements on the input data for this operator.
:: DeveloperApi :: Computes the set of distinct input rows using a HashSet.
when true the distinct operation is performed partially, per partition, without shuffling the data.
the input query plan.