Big data refers to data that is large in volume, comes from many sources, and takes many forms (logs, video, audio). Volumes reach the petabyte (PB) level, and the frameworks in use today exist to handle data at that scale.
More formally: big data (also called mega data) refers to massive, fast-growing, and diverse information assets that need new processing models in order to deliver stronger decision-making power, insight, and process optimization capabilities.
The 5V characteristics of big data: Volume (large amounts), Velocity (high speed), Variety (diversity), Value (value density), Veracity (authenticity).
Second, what programming languages do you need to learn for big data?
To begin with, big data work draws on Java, Python, and R.
1) How much Java do you need to learn for big data?
You need to learn Java up to Java SE. Java Web and Java EE are of little use for big data. Once you know Java SE, you can understand the Hadoop framework.
2) Python is the easiest to learn. In order of increasing difficulty: Python, Java, Scala.
Python is more intuitive and easier to understand than Java, but learning Python does not remove the need to learn Java. If you learn Java first, Python is very simple to pick up afterward; about a week can be enough.
3) R can be learned, but it is not recommended, because most people use Java, and Hadoop, the first big data framework, is written in Java underneath. Knowing R will not help you read Hadoop.
Java's role in big data is that of the language big data is built with: Hadoop, the first big data framework, is written in Java, as are many other big data frameworks, so Java is the recommended first language to learn.
Here is an analogy for the division of labor: Java focuses on business logic and big data focuses on data. The front end is the face (the pages users see), Java is the arm (services), big data is the purely logical brain, and artificial intelligence and deep learning are the brain with emotional intelligence.
Third, big data career development
Once you have learned big data, you do not need to start out as a Java developer; you can go straight into a big data development engineer role. With several years of experience, you can move on to algorithm engineer. Here are the jobs open to you after learning big data:
Big Data Development Engineer
Data Analyst
Hadoop Development Engineer
Spark Development Engineer
Data Warehouse Development Engineer
Data Cleansing Engineer (ETL)
Big Data Architect
Algorithm Engineer
Fourth, the advantages of big data
Big data is supported by the state, which has invested heavily in this area; a large data center is located in Guizhou. Artificial intelligence and cloud computing are both built on big data, and big data needs a large number of people.
1) Big data salaries
For general big data positions in first-tier cities, the typical monthly salary is 12-15K; in Beijing the average is 17K. Big data algorithm engineers earn about 300,000 to 500,000 a year.
2) Does studying big data have education or major requirements?
A high school graduate can also find a job, but a college degree is better. An undergraduate may not have learned many practical skills in four years of college, yet tends to understand things more easily on the job.
Fifth, the big data learning route
Big data master class syllabus:
Linux + high concurrency + Hadoop ecosystem + distributed search + Storm stream computing + Spark + machine learning algorithms
Normally, learning big data involves the following steps:
1. Learn the basics of a programming language (Java)
2. Master the basic tooling (Linux operations, database operations, Git operations)
3. Learn the big data frameworks (Hadoop, Hive, HBase, Spark)
These three steps are what learning big data normally requires. If you already have a Java foundation, you are basically halfway there; at least you will not have to struggle with learning a base language.
Really learning big data means not staying at the level of theory. The Spark framework, for example, which is used frequently today, supports development in Java and Scala, and now supports Python as well. Big data is a broad field, and language basics are only one small part of it: programming ultimately implements a programming idea, and with that guiding idea in mind, learning becomes much easier.
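One such programming idea is the map, shuffle, reduce pattern behind Hadoop's MapReduce model. The sketch below imitates it for a word count in plain Python. It uses no Hadoop or Spark API, and the function names (`map_phase`, `shuffle_phase`, `reduce_phase`, `word_count`) are made up for illustration; a real framework would spread each phase across many machines.

```python
from collections import defaultdict


def map_phase(lines):
    """Emit a (word, 1) pair for every word in every input line."""
    for line in lines:
        for word in line.lower().split():
            yield word, 1


def shuffle_phase(pairs):
    """Group emitted values by key, as a framework does between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(grouped):
    """Sum the values collected for each key."""
    return {key: sum(values) for key, values in grouped.items()}


def word_count(lines):
    """Chain the three phases on an in-memory list of lines."""
    return reduce_phase(shuffle_phase(map_phase(lines)))


if __name__ == "__main__":
    sample = ["big data needs java", "Spark supports Java Scala and Python"]
    print(word_count(sample))
```

The same three-step shape carries over whichever language is used, which is why the idea matters more than the syntax.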
Sixth, you can learn big data from zero
Although big data requires a Java foundation, those starting from zero can still learn it.