Article Directory
In Spark, the number of partitions to modify the RDD can be specified when creating the RDD, or it can be repartitioned by certain operations after the RDD is created. The following are the specific methods:
1. Specify the number of partitions when creating an RDD
1. parallelize
Specify the number of partitions when using the method to create RDD
When using parallelize
the method Create RDD from an existing collection, you can specify the number of partitions. <