Big Data Skill Graph

http://mp.weixin.qq.com/s?__biz=MzA4Nzc4MjI4MQ==&mid=403428818&idx=1&sn=08a505f0204ea2edfb49925903a04a0a#rd

The following is the big data skill map released by StuQ, which is more practical for reference

Big data processing framework

Spark
    - RDD
    - Spark SQL
    - Spark Streaming
    - MLLib

Hadoop
    - HDFS (distributed file system)
    - Mapreduce (computing framework)
    - Yarn (resource management platform)
    - Pig (piglatin statement to mapreduce mapping)
    - Hive (data warehouse, providing SQL)
    - Mahout (mapreduce implementation library for machine learning algorithms)

Kafka
Storm
ELK
    - ElasticSearch
    - Logstash
    - Kibana

database

  - SQL
  - MySQL
  - MongoDB
  - Cassandra
  - Redis
  - SQLite
  - bsddb
  - HBase

Programming Language

  - Python
  - R
  - Ruby

Data Analysis Mining

  - MATLAB
  - SPSS
  - SAS

Data Visualization

  - R
  - D3.js
  - ECharts
  - Excle

Artificial Intelligence

    - Clustering
    - Time Series
    - Recommendation System
    - Regression Analysis
    - Text Mining
    - Decision Tree
    - Support Vector Machine
    - Bayesian Classification
    - Neural Network

Algorithm

  Consistency
    - Paxos
    - Raft
    - Gossip

  Data Structure
    - Stack, Queue, Linked List
    - Hash Table
    - Binary Tree, Red Black Tree, B Tree
    - Graph

Common Algorithms
    - Sorting (Insertion Sort, Bucket Sort, Heap Sort, Quick Sort)
    - Maximum Subarray
    - Longest Common Subsequence
    - Minimum Spanning Tree
    - Shortest Path
    - Matrix Storage and Computing

Cloud Computing

  - Cloud Services (SaaS, PaaS, IaaS)
  - Openstack
  - Docker

Guess you like

Origin http://10.200.1.11:23101/article/api/json?id=327096018&siteId=291194637