Big data development and analysis study guide

Learning orientation:
big data development Linux 

SQL NoSQL database
offline Hadoop ecosystem
(the Yarn HDFS MapReduce HBase Hive
Flume Sqoop ZooKeeper Impala)

Real-time data processing
(Scala Python Streaming Kafka)


Data Analysis
Python
Excel
(reptiles) Data Collection
Data Analysis (Python numpy, matplotlib)
probability theory and mathematical statistics
data visualization (charts)
mathematical modeling (machine learning and deep learning)
project combat

Guess you like

Origin www.cnblogs.com/vathe/p/11286707.html