Big data: 02 Difference between Hadoop architecture and version

Difference between Hadoop architecture and version

Foreword

 First of all, we have to clarify the two core problems solved by the Hadoop framework: storage and calculation of massive data . In storage, Hadoop implements its own distributed file system HDFS (Hadoop Distributed File System); in computing, Hadoop uses a distributed offline computing framework MapReduce .

1. Hadoop 1.x architecture

 Hadoop 1.x is composed of Common (auxiliary tools), HDFS (data storage), MapReduce (computing + resource scheduling).

2. Hadoop 2.x architecture

 Hadoop 2.x is composed of Common (auxiliary tools), HDFS (data storage), MapReduce (computing), and Yarn (resource scheduling) .

3. Difference

 As can be seen from the above, the main difference between 2.x and 1.x is that 2.x refines the resource scheduling, and is assigned to Yarn, while MapReduce is only responsible for calculations. 1.x 2.x more such coupling less.

Published 16 original articles · praised 7 · 20,000+ views

Guess you like

Origin blog.csdn.net/yuecangjiao5151/article/details/104885030