Big Data development is doing?

Use of the commercial value of big data as hot nowadays IT industry vocabulary, followed by data development, data warehousing, data security, data analysis, data mining, and so on around the big data is becoming the focus of the profits of industry professionals competing sought . With the advent of the era of big data, big data development also came into being.

In fact, the development of big data in two ways. The first is to write some Hadoop, Spark application, the second is for large data processing system itself for development. The first type of work feels more suitable for data analyst of this post it, and now Hive Spark-SQL Such a system also provides SQL interface. The second type of work, then usually only large companies have, in general they will engage in their own systems or re-do the secondary development of open source. This work, then the theory and practice requirements are deeper, and more technical content.
Big Data development is doing?

Big Data technology development which need to learn? Learning Roadmap as follows:

01

Big data base -

java fundamental aspects of language

(1) Java language foundation

Java developers introduced, the familiar Eclipse development tools, Java language foundation, process control Java, Java strings, arrays and Java classes and objects, classes and digital processing core technology, I / O and reflection, multi-threading, Swing programs and collections

In the process of getting started big data have met learning, industry, the lack of systematic learning path, learning systems planning, you are welcome to join my big learning data exchange skirt: 529 867 072, skirt documents have my years of study manual sorting of large data , development tools, PDF document with a book, you can download yourself.

(2) HTML, CSS given Java

PC terminal site layout, HTML5 + CSS3 basis, WebApp page layout, native Java interactive feature development, Ajax asynchronous interaction, jQuery applications

(3) JavaWeb and databases

Database, JavaWeb development of core, JavaWeb development insider

02

Linux & Hadoop ecosystem

Linux system, Hadoop offline computing outline, distributed database Hbase, data warehouse Hive, data migration tool Sqoop, Flume distributed logging framework

03

Distributed computing framework and

Spark & ​​Strom ecosystem

(1) distributed computing frameworks

Python programming language, Scala programming language, Spark big data processing, Spark-Streaming Big Data processing, Spark-Mlib machine learning, Spark-GraphX ​​map calculation, real one: Spark recommendation system (a line of company real project) based combat two : Sina (www.sina.com.cn)

(2) storm system technology architecture

Storm principle and foundation, message queues kafka, Redis tools, zookeeper Detailed, real one: Log warning system project, combat two: you may also like recommendation system combat

04

Big Data project combat

(First-line real company project)

Data acquisition, data processing, data analysis, data presentation, data applications

05

Big Data analysis -AI (Artificial Intelligence)

Data Analyze & prepare the work environment data base analysis, data visualization, Python Machine Learning

Outdoor equipment identification analysis: 2, & neural network image recognition, natural language processing & social network processing, machine learning Python actual project

Big Data is the direction of development of the Internet, big data talent is the future of high-paying nobility. With big data talent shortage, salary big data talent is also rising. Big Data learning development opportunities and technology is essential!

Guess you like

Origin blog.51cto.com/14296550/2413836