1. Hadoop supports cloud storage and cloud computing.
Cloud storage is big data storage, and 100T data can also be stored. It uses HDFS to separate files into many blocks and store them on multiple servers.
Cloud computing is map and reduce. It should be noted here that in the past, all programs were run on one machine, and then big data was transmitted over the network for processing. Hadoop, on the other hand, distributes the program to each server with data, because the program is small, saving network transmission.
2. Hadoop application scenarios:
Massive log analysis
I heard from Brother Nan before that the best position for typesetting is calculated.
3. Hadoop has several important components:
1、HDFS
2 、 DataNode
3 、 NameNode
4 、 Secondary NameNode
5、JobTracker
6、TaskTracker
where 5 and 6 have been replaced by YARN (Yet Another Resource Negotiator)
1. Complete hadoop construction tutorial, follow it, you will understand many concepts
Reference link: http://www.powerxing.com/install-hadoop/
2. Analysis of many key technologies
Reference link: http://www.thebigdata.cn/Hadoop/10722.html
3. Explain the specific application of hadoop in which scenarios
Reference link: http://blog.sina.com.cn/s/blog_687194cd01017lgu.html
4. Hadoop has become the ETL of the poor
Reference link: http://www.ctocio.com/bigdata/12464.html
5. Explained TaskTracker and DataTracker, very good
Reference link: http://my.oschina.net/u/1464779/blog/285801