RDD: Resilient Distributed Dataset.
The five characteristics of an RDD:
- An RDD is composed of a set of partitions
- Operators (functions) act on each partition of the RDD
- There are dependencies between RDDs
- A partitioner acts on RDDs in key-value (KV) format
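
The characteristics listed above can be illustrated with a small toy model in plain Python. This is only a sketch and not Spark's API: `ToyRDD`, `map_partitions`, `compute`, and `partition_by` are hypothetical names made up for illustration, and the hash-based partitioner is an assumed choice.

```python
class ToyRDD:
    """Toy stand-in for an RDD (hypothetical, not Spark's implementation)."""

    def __init__(self, partitions=None, parent=None, func=None, partitioner=None):
        self._partitions = partitions    # list of partitions (source RDDs only)
        self.parent = parent             # dependency on the parent RDD
        self.func = func                 # function applied to each parent partition
        self.partitioner = partitioner   # only meaningful for key-value data

    def num_partitions(self):
        # A derived RDD has as many partitions as its parent in this sketch.
        if self.parent is None:
            return len(self._partitions)
        return self.parent.num_partitions()

    def map_partitions(self, func):
        # Nothing is computed here; the new RDD only records its dependency.
        return ToyRDD(parent=self, func=func)

    def compute(self, index):
        # Compute one partition by walking back through the dependencies.
        if self.parent is None:
            return list(self._partitions[index])
        return list(self.func(self.parent.compute(index)))

    def collect(self):
        return [x for i in range(self.num_partitions()) for x in self.compute(i)]


def partition_by(pairs, num_partitions):
    """Place (key, value) pairs into partitions by hashing the key."""
    def partitioner(key):
        return hash(key) % num_partitions

    parts = [[] for _ in range(num_partitions)]
    for key, value in pairs:
        parts[partitioner(key)].append((key, value))
    return ToyRDD(partitions=parts, partitioner=partitioner)


# An RDD made of two partitions, with a function applied per partition.
source = ToyRDD(partitions=[[1, 2], [3, 4]])
doubled = source.map_partitions(lambda part: [x * 2 for x in part])
print(doubled.collect())

# Key-value data: pairs with the same key land in the same partition.
pairs = partition_by([(1, "a"), (2, "b"), (3, "c"), (1, "d")], 2)
print(pairs.compute(1))
```

In this sketch `doubled` holds no data of its own. It keeps only a reference to `source` and the function to apply, so any partition can be recomputed from its parent when needed.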