30 common big data interview questions to raise your salary by one level, let's take a look!

After going through the hot and difficult big data learning, you can finally see the light of day, but you are always one step away from success, that is to get an offer from a big data engineer.

I struggled for countless days and nights at the computer, typing the code countless times, and rectifying the project countless times, just to get a high salary and high treatment Offer that I am satisfied with. But this gain not only requires you to learn skilled big data technology, but also requires careful preparation before the interview, to understand the development of the company you are applying for, the technical requirements of the position you are applying for, etc. In addition, look at some big data. Data interview questions are also necessary to give yourself experience.

Although the editor can't help you investigate the development of your ideal enterprise, the common interview questions of big data have already been prepared for you, and you need to get it in your pocket as soon as possible!

1. What are the characteristics of the scala language and what is functional programming? What are the advantages

2. What is the role of the scala companion object

3. How does scala concurrent programming work, how do you understand the actor model and what are the advantages

4. How does Spark handle structured data and how does Spark handle unstructured data?

5. What are the main methods for Spark performance optimization?

6. For Spark, what do you think are the advantages and disadvantages of the current situation of big data?

7. Have you conducted an independent research design for the algorithm?

8. Briefly describe some data mining algorithms and content you know

9. How to use spark for data cleaning

10. Talk to me about spark applications, advertising in shopping malls, and scalper detection

11. How many Partitions does spark read data? How many partitions are there in several blocks of hdfs?

12. The difference between Mogodb and hbase

13. Problems encountered in development

14. Optimization of HIVE

15. The boot sequence of linux

16. Does the compiled scala program still need the scala environment at runtime?

17.Write a java program to implement Stack in java.

18. Difference between Linkedlist and ArrayList

19. The role of combiner in hadoop

20. Design a grouping row recounting algorithm with mr

21. Use MapReduce to find two people who have common friends

22.hdfs storage mechanism

23. Principle of MapReduce

24. Hadoop operating principle

25. The namenode of hadoop is down, how to solve it

26. The characteristics of Hbase, how do you design rowkey and columnFamily, and how to build a table

27. The difference between Redis, traditional database, hbase, hive (asked very carefully)

28. Talk about some understanding of hadoop, including which components

29. Explain in detail the project deployment of your streaming real-time computing and the results collected

30. Real-time streaming computing framework, how many people, how long, details, including the components of flume, kafka, storm, you are responsible for that piece, can you complete it if you need to build it?

Guess you like

Origin http://43.154.161.224:23101/article/api/json?id=325131597&siteId=291194637