SparkContext.
setCheckpointDir
Set the directory under which RDDs are going to be checkpointed. The directory must be an HDFS path if running on a cluster.
previous
pyspark.SparkContext.sequenceFile
next
pyspark.SparkContext.setJobDescription