What is a distributed file system? Why learn distributed file system? Importance of data storage?

 basic introduction:

  I Course Link: https://edu.51cto.com/lecturer/12750547.html   Author: Zhang Yanfeng, please indicate the source

  Hello everyone, netizens, today I will talk to you about what is a distributed file system? Why study the distributed file system and the importance of data storage.

  At the same time, my pen name is "Yunyan Old Dream", and I will update more practical articles for you later. Everyone pays attention, leaves a message, and we can discuss technical issues together.


What is a distributed file system?

If you have done web site maintenance, you should know that the amount of access logs generated every day is particularly large, then our server must not be enough to store locally, then there are two solutions for you now, the first is to delete these Access log files, the second is to move these access logs to other backup servers. In enterprises, it is often stored in the form of regular cleaning + backup. Then there will be a day when the disk is not enough and the hard disk slot is full, so the network file system appears at this time, which can solve our problem of insufficient capacity.

So what is a distributed file system? Linux servers manage and store data through a file system, and in this era of information explosion, the data that people can obtain has exponentially increased, for example: the amount of data such as log files, data files, and database backup files consumed by the server every day They are all very large. Therefore, simply by increasing the number of hard disks to expand the storage capacity of a computer file system, the performance in terms of capacity size, capacity growth rate, data backup, and data security is not satisfactory.

The distributed file system can effectively solve the problem of data storage and management, and it can form a file system network with numerous server nodes. Each node can be distributed in different locations, through the network for communication and data transmission between nodes. When people use a distributed file system, they don't need to care about which node the data is stored on or from which node the data is obtained from, but only need to manage and store the data in the file system as if using a local file system. The distributed file system is built on the basis of client and server ( C/S model) technology. One or more servers and the client file system cooperate with each other, so that the client can access the files managed by the server.

Distributed file system refers to distributing a large amount of data to different nodes for storage, thereby greatly reducing the risk of data loss. The distributed file system has redundancy. The failure of some nodes will not affect the normal operation of the whole, and even if the data stored by the failed computer has been damaged, the damaged data can be recovered by other nodes. Therefore, security is the most important feature of a distributed file system. The distributed file system connects a large number of scattered computers through the network to form a huge computer cluster, so that each host can give full play to its value. In addition, computers outside the cluster can be added to the distributed file system after a simple configuration, which has extremely strong scalability.

 

Why learn distributed file system?

Every enterprise generates a large amount of critical data, so these critical data need to be "backed up". This means that the data must be in a place that is easy to access and retrieve.

In a business, no one knows what is going to happen. Every company may encounter natural and man-made disasters such as server power failure, fire, theft, and accidental damage. If you do not back up this information, it will lead to catastrophic consequences. If the data is really lost, and you do not have some technical means to restore the data, the cost of hiring experts to restore the data will be high.

Let's talk about where the amount of data storage used in the enterprise is particularly large. The author once worked as a MySQL DBA engineer, and I remember that the binlog (that is, the database log) grew by as much as 100G every day . Think about it, if these data are not backed up, in case the server goes down unexpectedly or the hard disk is damaged, a lot of data will be lost. If there is a record of the information that you just made a deposit today, then your balance may not change, because this record of your deposit is to be written to the server's hard disk. Therefore, the consequences of data loss are unimaginable.

Periodically back up the information to a compact disc ( CD ) or hard disk. If the amount of information is large, network storage solutions should be considered.

 

Importance of data storage

We have talked so much above, everyone should understand the importance of data storage! For an enterprise, the security of network data is extremely important. Once important data is destroyed or lost, it will have a major impact on an enterprise, and even irreparable losses. For example: Bank. So data is the top priority of our company.

Ensuring the security and consistency of data is our responsibility for Linux operation and maintenance. In the working environment, we have no way to ensure that this service has been free of problems. We can only deal with it when a disaster strikes. This is what we should do in operation and maintenance.


Guess you like

Origin blog.51cto.com/12760547/2665479