HDFS is the abbreviation of Hadoop Distribute File System, which is a distributed file system of Hadoop.
HDFS basic architecture diagram
Write file
Read file
Reference link: https://my.oschina.net/shiw019/blog/92771
HDFS data stream (this is a little more detailed)
https://www.jianshu.com/p/cba50cb92c61
How to build HDFS distributed cluster?
1: Create a new virtual machine
2: Mount the mirror
One: Network configuration:
1Network card configuration only needs to be cloned object
2ip configuration-the following command
3 hostname settings
4 Restart to make the configuration take effect
Service network restart or reboot
Two: JDK installation and basic environment configuration-environment variables-domain name mapping:
Three: Cluster deployment start (configuration name node, data node)