The difference between Yarn and ZooKeeper
The role of Yarn
Yarn is a distributed resource scheduling component. Its main job is, each time a request is received, to check the current status of each child node
and work out a scheduling plan for computing resources so that tasks can execute smoothly.
In practice, the resources Yarn allocates are primarily memory and CPU cores (vcores), although the general idea of resource scheduling also covers disk space and communication bandwidth.
Yarn was introduced in Hadoop 2.0 to correct and supplement design problems exposed by Hadoop 1.0, such as the inefficient use of computing resources.
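The idea of checking node status and then producing a scheduling plan can be illustrated with a small toy model. This is only a sketch under stated assumptions: the `NodeStatus` and `Request` types and the "most free memory first" rule are hypothetical and are not Yarn's actual scheduler or API.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class NodeStatus:
    """Free capacity reported by one child node (hypothetical fields)."""
    name: str
    free_memory_mb: int
    free_vcores: int


@dataclass
class Request:
    """Resources that one task asks for (hypothetical fields)."""
    memory_mb: int
    vcores: int


def schedule(nodes: List[NodeStatus], req: Request) -> Optional[str]:
    """Pick the node with the most free memory that can satisfy the request.

    Returns the chosen node's name, or None if no node has enough capacity.
    The chosen node's free capacity is reduced to reflect the allocation.
    """
    # Step 1: check the current status of every node.
    candidates = [
        n for n in nodes
        if n.free_memory_mb >= req.memory_mb and n.free_vcores >= req.vcores
    ]
    if not candidates:
        return None
    # Step 2: decide on a placement (an assumed, deliberately simple policy).
    best = max(candidates, key=lambda n: n.free_memory_mb)
    best.free_memory_mb -= req.memory_mb
    best.free_vcores -= req.vcores
    return best.name


nodes = [
    NodeStatus("node-a", free_memory_mb=4096, free_vcores=2),
    NodeStatus("node-b", free_memory_mb=8192, free_vcores=4),
]
first = schedule(nodes, Request(memory_mb=6144, vcores=2))   # only node-b fits
second = schedule(nodes, Request(memory_mb=3072, vcores=2))  # node-b is now too full
third = schedule(nodes, Request(memory_mb=3072, vcores=2))   # nothing fits any more
print(first, second, third)
```

A real scheduler also has to handle queues, priorities, and the release of resources when tasks finish; the sketch leaves all of that out.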
The role of ZooKeeper
ZooKeeper is a distributed consistency service component. The biggest difficulty in a distributed system is ensuring that
the data each node server operates on or processes is the same, or that the configuration information on every node is the same.
The value of ZooKeeper lies in providing unified configuration management, naming services, and state synchronization for distributed systems.
ZooKeeper stores its data in a tree-like structure similar to a file system. The directory structure, the file names, and the file contents are kept the same on every child node,
and ZooKeeper uses its internal mechanisms to ensure that additions, deletions, and other operations are synchronized across all child nodes.
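The following toy model sketches why applying the same operations in the same order keeps every copy of the tree identical. It is an illustration only: the `Replica` and `Ensemble` classes are hypothetical, everything runs in one process, and it does not implement ZooKeeper's real replication protocol or client API.

```python
class Replica:
    """One server holding a copy of the tree as path -> content (toy model)."""

    def __init__(self, name):
        self.name = name
        self.tree = {"/": ""}

    def apply(self, op, path, data=""):
        """Validate, then apply, one operation to this replica's copy."""
        if op == "create":
            parent = path.rsplit("/", 1)[0] or "/"
            if parent not in self.tree:
                raise KeyError(f"parent {parent} does not exist")
            if path in self.tree:
                raise KeyError(f"{path} already exists")
            self.tree[path] = data
        elif op == "set":
            if path not in self.tree:
                raise KeyError(f"{path} does not exist")
            self.tree[path] = data
        elif op == "delete":
            if path not in self.tree:
                raise KeyError(f"{path} does not exist")
            if any(p.startswith(path + "/") for p in self.tree):
                raise ValueError(f"{path} still has children")
            del self.tree[path]
        else:
            raise ValueError(f"unknown operation {op}")


class Ensemble:
    """Sends every write to all replicas in one agreed order (assumed design)."""

    def __init__(self, replicas):
        self.replicas = replicas
        self.log = []  # ordered history of applied operations

    def submit(self, op, path, data=""):
        # Every replica starts from the same state, so an invalid operation
        # fails on the first replica before any copy has been changed.
        for replica in self.replicas:
            replica.apply(op, path, data)
        self.log.append((op, path, data))


ensemble = Ensemble([Replica("s1"), Replica("s2"), Replica("s3")])
ensemble.submit("create", "/config")
ensemble.submit("create", "/config/db_url", "db-host-1:3306")
ensemble.submit("set", "/config/db_url", "db-host-2:3306")
ensemble.submit("create", "/tmp")
ensemble.submit("delete", "/tmp")

trees = [replica.tree for replica in ensemble.replicas]
print(trees[0] == trees[1] == trees[2])
```

In a real deployment the hard part is agreeing on that single order when servers or network links fail, which is exactly what ZooKeeper's internal mechanism is responsible for.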
Similarities between the two
Both are distributed components, and both have the ability to monitor and control the operation of child nodes.
The difference between Yarn and ZooKeeper
The core function of Yarn is resource scheduling and management, while the core function of ZooKeeper is the consistency service in the distributed system.
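ZooKeeper and Yarn often work together in the same cluster: Yarn schedules the computing resources, and it can rely on ZooKeeper for coordination, for example to elect the active ResourceManager in a high-availability setup.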
Reference:
https://blog.csdn.net/jiayunfei521/article/details/103520158