How to learn big data: the right approach

We live in the era of big data. All kinds of data are generated online, and together they form very large data sets; big data applications already touch our daily lives in real time. So how should you go about learning big data? This article walks through the main steps.

How to learn big data

First, learn the basics:

1. If you have no database background, master the basics of databases before starting on big data: for example, the popular relational database management system MySQL, the non-relational database MongoDB, and Redis, an open-source, network-accessible, in-memory key-value store.

2. Java is one of the most widely used programming languages, and much of the big data ecosystem is written in it, which makes it a natural language for developing big data applications. Java comes in several editions; for big data work, focus on the standard edition, Java SE.

3. Get comfortable with Linux. Most big data software runs on Linux, so being able to use the Linux operating system fluently matters a great deal. It also makes it easier later to understand how Hadoop, HBase, and other big data software work.
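
To make the database basics in item 1 more concrete, here is a minimal sketch of the difference between the relational model and the key-value model. It makes two simplifying assumptions: Python's built-in sqlite3 module stands in for MySQL, and a plain dict stands in for an in-memory store such as Redis. The table, column, and key names are made up for illustration.

```python
import sqlite3

# Relational model: data lives in tables with a fixed schema and is queried with SQL.
# sqlite3 (standard library) stands in for MySQL here; the SQL used is basic and portable.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT, city TEXT)")
conn.executemany(
    "INSERT INTO users (id, name, city) VALUES (?, ?, ?)",
    [(1, "Ann", "Beijing"), (2, "Bo", "Shanghai"), (3, "Cai", "Beijing")],
)
beijing_users = [
    row[0]
    for row in conn.execute(
        "SELECT name FROM users WHERE city = ? ORDER BY id", ("Beijing",)
    )
]
conn.close()

# Key-value model: each value is fetched by its key, with no schema or query language.
# A plain dict stands in for an in-memory key-value store (hypothetical key names).
kv_store = {}
kv_store["user:1:name"] = "Ann"
kv_store["user:2:name"] = "Bo"
name_of_user_1 = kv_store.get("user:1:name")
missing = kv_store.get("user:99:name")  # an absent key simply yields None
```

The relational side can answer questions the data was not specifically organized for (here, filtering by city), while the key-value side only supports lookups by a key you already know. That trade-off is worth keeping in mind when choosing a store.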

Second, Scala basics:

Scala is a multi-paradigm programming language designed to integrate features of object-oriented and functional programming. Because Scala runs on the Java Virtual Machine (JVM) and is compatible with existing Java programs, it integrates well with JVM-based big data systems.

Third, the Hadoop technology module:

Hadoop is an open-source software framework, released under the Apache License 2.0, that supports data-intensive distributed applications. It can be used to build large data warehouses and to store, process, analyze, and aggregate petabyte-scale data. It is highly fault-tolerant and designed to be deployed on inexpensive hardware. The platform has become almost synonymous with big data, so studying Hadoop is a good way to understand and use big data.
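
Hadoop's batch processing is built around the MapReduce model, so it helps to see that model in miniature before touching a cluster. The sketch below is a single-process illustration in plain Python, not the Hadoop API; the function names and the sample input are made up for illustration.

```python
from collections import defaultdict


def map_phase(line):
    """Emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle_phase(pairs):
    """Group emitted values by key, as the framework does between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Combine all values for one key into a single count."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle, and reduce in sequence over a list of lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle_phase(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


counts = word_count(["big data is big", "data is everywhere"])
```

On a real cluster the map and reduce steps run in parallel on many machines and the grouping step moves data across the network, but the three-stage structure is the same.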

Fourth, the Spark technology module:

Hadoop and Spark are both big data frameworks. Hadoop offers features that Spark does not, such as a distributed file system (HDFS), while Spark provides in-memory, near-real-time processing for data sets that need it. Learning Spark is therefore well worth it.
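
The benefit of in-memory processing is easiest to see when the same data set is used for several computations. The sketch below is plain Python, not Spark code, and only illustrates the idea: load the data once, keep it in memory, and reuse it, instead of going back to storage for every pass. The file contents and variable names are made up for illustration.

```python
import os
import tempfile


def load_numbers(path):
    """Read one integer per line from disk."""
    with open(path) as f:
        return [int(line) for line in f]


# Write a small hypothetical data set to a temporary file.
with tempfile.NamedTemporaryFile("w", suffix=".txt", delete=False) as f:
    f.write("\n".join(str(n) for n in range(1, 11)))
    path = f.name

# Disk-oriented style: every pass goes back to storage.
total_from_disk = sum(load_numbers(path))
maximum_from_disk = max(load_numbers(path))

# In-memory style: load once, then reuse the in-memory copy for each pass.
cached = load_numbers(path)
total = sum(cached)
maximum = max(cached)
even_count = sum(1 for n in cached if n % 2 == 0)

os.remove(path)
```

Both styles give the same answers; the difference is how often storage is read, which matters most for iterative and interactive workloads.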

Fifth, hands-on big data projects:

After learning the above, practice on real big data projects: working through modeling, analysis, and computation is how you put what you have learned into practice.

That covers how to learn big data; I hope it helps.

Origin blog.51cto.com/14296550/2420023