Difference between Hadoop architecture and version
Foreword
First of all, we have to clarify the two core problems solved by the Hadoop framework: storage and calculation of massive data . In storage, Hadoop implements its own distributed file system HDFS (Hadoop Distributed File System); in computing, Hadoop uses a distributed offline computing framework MapReduce .
1. Hadoop 1.x architecture
Hadoop 1.x is composed of Common (auxiliary tools), HDFS (data storage), MapReduce (computing + resource scheduling).
2. Hadoop 2.x architecture
Hadoop 2.x is composed of Common (auxiliary tools), HDFS (data storage), MapReduce (computing), and Yarn (resource scheduling) .
3. Difference
As can be seen from the above, the main difference between 2.x and 1.x is that 2.x refines the resource scheduling, and is assigned to Yarn, while MapReduce is only responsible for calculations. 1.x 2.x more such coupling less.