spark variables

  driver variable: there is no variable in the operator of.

    What are the operator

    Transformation:

   

  An input and an output partition one type partition

    1, map operator

    2, flatMap operator

    3, mapPartitions 算 子

    4, glom operator

  Second, many to one input partition and the partition type output 

    5, union operator

    6、cartesian算子

  Third, the input and output partition partition-many type

    7, grouBy operator

  Fourth, the output is the input a subset of partitions partition type

    8, filter operator

    9, distinct operator

    10, subtract operator

    11, sample operator

        12, takeSample Sanko

   Five, Cache type

    13, cache operator  

    14, persist operator

 

2) Key-Value data types Transfromation operator

  First, the input and output section with one partition

    15, mapValues ​​operator

  Second, a single or two RDD RDD aggregation

   RDD single gathering

    16、combineByKey算子

    17、reduceByKey算子

    18, partitionBy operator

   Two RDD gather

    19, Cogroup operator

  Third, the connection

    20, join operator

    21, leftOutJoin and rightOutJoin operator

 

 

 3) Action Operator

  First, no output

    22, foreach operator

  Two, HDFS

    23, saveAsTextFile operator

    24、saveAsObjectFile算子

  Three, Scala set and data type

    25, collect operator

    26, collectAsMap operator

      27、reduceByKeyLocally算子

      28, lookup operator

    29, count operator

    30, top operator

    31, reduce operator

    32, fold operator

    33、aggregate算子

      

 

 

  excutor variable

 

  task variables

  

Guess you like

Origin www.cnblogs.com/BrentBoys/p/11823216.html