The ResourceManager
The NodeManager : management of computing resources on the host Container
responsible for reporting its status information to the MR
MRAppMaster : computing tasks Master, responsible for the application of computing resources, coordinate computing tasks
YarnChild : responsible for doing the actual computing tasks / processes (MapTask / ReduceTask)
Container : is an abstract computing resources represents a group of memory for / cpu / network, whether MRAppMaster or YranChild fortune is a need to consume logic Container
-
-
A MR program is a Job, Job information will ResourceManager, ResourceManager registration Job Information
-
ResouceManager After registering through, Job will copy the relevant resource information (from the HDFS)
-
Job will immediately submit a complete application information (including resource information) to the ResourceManager
-
ResourceManager calculated resources required by the current Job information Job allocated Container (resource unit) of Job
-
This information will be distributed Container NodeManager, NodeManger process creates MRAppMaster
-
At this point MRAppMaster initializes Job
-
Then queries the task of slicing
-
Connecting RM, requesting allocation of resources, to obtain a corresponding resource, the NodeManager corresponding connector, the corresponding starting on YarnChild Container
-
Job resource copy from the Distributed File System
-
MR program execution