RDD.
pipe
Return an RDD created by piping elements to a forked external process.
New in version 0.7.0.
command to run.
environment variables to set.
whether to check the return value of the shell command.
RDD
a new RDD of strings
Examples
>>> sc.parallelize(['1', '2', '', '3']).pipe('cat').collect() ['1', '2', '', '3']