Illustration of common formats for compression and storage in hadoop

Common compression formats: Snappy, LZO, Gzip, bzip2, deflate
Insert picture description here
Insert picture description here
Common storage formats:
Storage format refers to the format of files stored in Hdfs. Commonly used are SequnceFile, RCFile, Parquet and TextFile.
SequnceFile
Insert picture description here
RCFile:
Insert picture description here
ORCFile:
Insert picture description here
Parquet:
Insert picture description here

Guess you like

Origin blog.csdn.net/xianyu120/article/details/114679968