浅谈HDFS(Hadoop Distributed File System)

Hadoop overview

Hadoop implements a distributed file system,
high fault tolerance,
high throughput,
compatible with low-cost devices to run

High availability is: loss prevention and damage prevention

hive overview

Hive is a data warehouse tool based on Hadoop for data extraction, transformation, and loading. This is a mechanism that can store, query, and analyze large-scale data stored in Hadoop.

hbase overview

Advantages of hbase High throughput
HBase is a distributed, column-oriented open source database based on hdfs.

Phoenix on hbase overview

Apache Phoenix is ​​an open-source Java middle layer based on the BSD license, which allows developers to execute SQL queries on Apache HBase.

Phoenix on hbase on hdfs

Guess you like

Origin blog.csdn.net/weixin_43983411/article/details/111463861