1, hadoop preliminary

HDFS      --------     GFS

MapReduce   --------   Mapreduce(PageRank)

HBase ------- BigTable (column data table)

 

Reference Documentation: Copying Virtual Machines

 

Installation mode: stand-alone mode, pseudo-distribution mode, full distribution mode.

Pseudo-distributed mode: a single node starts at the same time: namenode, datanaode, jobtracker, tasktracker, secondarynode;

Fully distributed mode: at least three nodes

 

Configuration file:

hadoop-env.sh

 

Core configuration file:

core-site.xml

hdfs-site.xml

mapred-site.xml

ssh: connect to other nodes remotely. (password-free login between all nodes)

 

 

 

Guess you like

Origin http://43.154.161.224:23101/article/api/json?id=326443116&siteId=291194637