Hadoop ecosystem

Hadoop ecosystem

Overview:

  hadoop use a very wide field, in different areas and for different functions, various manufacturers have developed and offers a lot of development tools associated with Hadoop, the open source software, business tools and technical services. Hadoop ecosystem is very rich! The main contents of this paper is to interpret the various components of Hadoop ecosystem.

A, Hadoop ecosystem

1.0: Zookeeper: Distributed Coordination Service, Hbase: distributed database, Ambari: installation deployment tools, Oozie: workflow scheduling system, Hive: data warehousing, Pig: workflow engine, Mahout: data mining library, MapReduce: Distributed Computing framework, HDFS: a distributed storage system, Sqoop: TEL database tools, Flume: log collection.

FIG -1 Hadoop1.0

2.0: Zookeeper: Distributed Coordination Service, Hbase: distributed database, Ambari: installation deployment tools, Oozie: workflow scheduling system, Hive: data warehousing, Pig: workflow engine, Shark: data warehousing, Mahout: data mining library, MapReduce: distributed computing framework, Tez: DAG computing, Spark: memory computing, YARN: resource management system, HDFS: a distributed storage system, Sqoop: TEL database tools, Flume: log collection. 

FIG -2 Hadoop2.0

 

 

 

 

 

 

 

Guess you like

Origin www.cnblogs.com/yinminbo/p/11840517.html