0 Big Data learning basic route and the stage of learning recommended books

To learn big data, you should at least know what big data, big data will be used in what areas. By general understanding of big data, you can find out if you are interested in big data.

01

Zero-based learning big data has the following steps:

1, understand the theory of Big Data

2, a computer programming language learning

3, the main data related courses

4, the actual project

02

Stage study recommended books:

A phase 0 study basic foundation: java language, Linux

java can be said that most big data base programming language, developed a large part of the Big Data I contacted were from Jave Web development job transfer over.

Recommended books:

"Effective Java Chinese version of"

If you want to learn the best big data added to a good learning environment, this may be the Q group 251,956,502 so that everyone would be relatively easy to learn, but also to communicate and share information on common

Phase II, Big Data Hadoop system

Hadoop is an open source distributed computing using Java language development platform, suitable for large data storage and distributed computing platforms. Hadoop Big Data platform is widely used, results of the work itself is a big data platform R & D personnel, Hadoop is more common platform to support large data.

Recommended books:

1、《Big Data》

2, "Hadoop The Definitive Guide"

3, "Hive Programming Guide"

Phase III, Scala language and gold Spark

Scala and are very similar in language java jvm running in the development process is seamless to call each other. Spark is designed for large-scale data processing designed for fast general-purpose computing engine. Spark is the alternative to MapReduce, and is compatible with HDFS, Hive, Hadoop can be integrated into the ecosystem, to make up for the lack of MapReduce.

Recommended books:

1、《Learning Spark》

2, "Spark Machine Learning: the core technology and practice"

Phase IV, big data project combat (first-tier real company project)

Data acquisition, data processing, data analysis, data presentation, data applications

Guess you like

Origin blog.51cto.com/14296550/2420974