Keywords: distributed
Break the whole into parts, then combine the parts back into a whole
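The "break apart, then recombine" idea can be illustrated with a minimal single-machine sketch. The function names (split_into_chunks, count_words, merge_counts, word_count) and the word-count task are assumptions chosen for illustration; they are not taken from any framework listed in these notes. A real distributed engine would run the per-chunk step on separate nodes.

```python
from collections import Counter
from functools import reduce


def split_into_chunks(items, chunk_size):
    """Break the whole into parts: slice a list into fixed-size chunks."""
    return [items[i:i + chunk_size] for i in range(0, len(items), chunk_size)]


def count_words(chunk):
    """Process one chunk independently of all the others."""
    return Counter(word for line in chunk for word in line.split())


def merge_counts(left, right):
    """Combine two partial results into one."""
    return left + right


def word_count(lines, chunk_size=2):
    chunks = split_into_chunks(lines, chunk_size)
    # Hypothetical stand-in for distribution: in a cluster, each chunk
    # would be handled by a different worker.
    partials = [count_words(chunk) for chunk in chunks]
    # Combine the parts back into a whole.
    return reduce(merge_counts, partials, Counter())


if __name__ == "__main__":
    print(word_count(["a b", "b c", "c c"]))
```

Because each chunk is counted without looking at any other chunk, the per-chunk step can be moved to separate machines without changing the final merged result.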
Definition of big data
Data sets that are difficult for traditional databases to handle.
Development path
China Open Source Ecosystem Map 2023
Reference content
China Open Source Ecosystem Map 2023.pdf
Technical component description
Data integration
Sqoop, DataX, Flume
Data storage
HDFS, Kafka
Data processing
MapReduce, Hive, Impala, Spark, Flink
Data analysis
HBase, MySQL, Greenplum (PostgreSQL), ClickHouse
Application scenarios
Data analysis to support decision making
Big data is a solution, but it is not necessarily the most efficient solution.