Yarn of introduction and infrastructure

Yarn popular Introduction

Apache Hadoop YARN (Yet Another Resource Negotiator , another resource coordinator) is a new Hadoop Explorer, it is a common resource management and scheduling system platform , provides a uniform for applications the resource management and scheduling.

Its introduction is a cluster tremendous benefits in terms of utilization of resources, unified management and data sharing.

The yarn can be understood as the equivalent of a distributed operating system platform, while other operations mapreduce program is equivalent applications running on top of the operating system, Yarn provide the resources needed operation (memory, cpu) for these programs.

 

  1. yarn is not clear operating mechanism of the program submitted by users
  2. yarn only provide scheduling computing resources (user application program resources to the yarn, yarn is responsible for allocating resources)
  3. The role of director of yarn called ResourceManager
  4. yarn specifically provide computing resources role called NodeManager
  5. yarn and run user programs completely decoupled, meaning that the yarn can run various types of distributed computing program, such as mapreduce, storm, spark, tez ......
  6. spark, storm and other operational framework can be integrated in the yarn run, as long as there are resources in line with their respective yarn specification framework can request mechanism
  7. yarn becoming a universal resource scheduling platform for enterprise computing in a variety of pre-existing clusters can be integrated on a single physical cluster, improve resource utilization, and facilitate data sharing

 

Yarn basic architecture

YARN is a resource management, task scheduling framework, mainly includes three modules: ResourceManager (RM), NodeManager (NM), ApplicationMaster (AM).

ResourceManager responsible for monitoring all resources, allocation and management of a cluster only one;

NodeManager responsible for the maintenance of each node, a cluster has more.

ApplicationMaster responsible for scheduling and coordinating each specific application, there are more than a cluster;

 

For all applications, RM has the absolute right to control and allocate resources. And each AM and RM will consult resources, communication and NodeManager to implement and monitor the task.

 

 

Published 176 original articles · won praise 278 · views 80000 +

Guess you like

Origin blog.csdn.net/weixin_43893397/article/details/105048228