HDFS -------- GFS
MapReduce -------- Mapreduce(PageRank)
HBase ------- BigTable (column data table)
Reference Documentation: Copying Virtual Machines
Installation mode: stand-alone mode, pseudo-distribution mode, full distribution mode.
Pseudo-distributed mode: a single node starts at the same time: namenode, datanaode, jobtracker, tasktracker, secondarynode;
Fully distributed mode: at least three nodes
Configuration file:
hadoop-env.sh
Core configuration file:
core-site.xml
hdfs-site.xml
mapred-site.xml
ssh: connect to other nodes remotely. (password-free login between all nodes)