Differences and relationship between Hive and HBase
Differences between the two
Hive: Hive is a data warehouse tool built on Hadoop. It maps structured data files to database tables and provides simple SQL queries over them.
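To make the file-to-table mapping concrete, here is a minimal Python sketch. It does not use Hive: the standard-library `sqlite3` module stands in for the Hive engine, and the `page_views` table, its columns, and the sample rows are all hypothetical. In Hive itself the mapping is declared over files that stay in place (for example with a `CREATE EXTERNAL TABLE` statement), rather than by loading rows as done here.

```python
import csv
import io
import sqlite3

# Hypothetical tab-delimited file content, standing in for a file in HDFS.
RAW_FILE = "u1\t/home\t3\nu2\t/cart\t1\nu1\t/cart\t2\n"


def load_as_table(raw_text):
    """Map delimited lines onto a table schema and return the connection."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE page_views (user_id TEXT, url TEXT, hits INTEGER)")
    rows = csv.reader(io.StringIO(raw_text), delimiter="\t")
    conn.executemany(
        "INSERT INTO page_views VALUES (?, ?, ?)",
        ((user_id, url, int(hits)) for user_id, url, hits in rows),
    )
    return conn


def hits_per_user(conn):
    """Run a simple SQL aggregate, the kind of query Hive is meant for."""
    query = (
        "SELECT user_id, SUM(hits) FROM page_views "
        "GROUP BY user_id ORDER BY user_id"
    )
    return conn.execute(query).fetchall()


conn = load_as_table(RAW_FILE)
print(hits_per_user(conn))  # prints [('u1', 5), ('u2', 1)]
```

The point of the sketch is only the shape of the workflow: a flat file gets a schema, and analysis is then written as SQL instead of hand-written processing code.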
HBase: HBase is the database of the Hadoop ecosystem: a distributed, scalable store for big data.
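As a rough illustration of the access pattern such a store offers, the following Python sketch models a table as rows sorted by row key, each holding `family:qualifier` cells. It is an in-memory toy, not the HBase client API: the `TinyTable` class, its methods, and the sample keys are hypothetical, and distribution, versioning, and persistence are left out.

```python
import bisect


class TinyTable:
    """Toy, single-process model of a row-key-ordered store (hypothetical)."""

    def __init__(self):
        self._keys = []  # row keys, kept sorted so range scans are cheap
        self._rows = {}  # row key -> {"family:qualifier": value}

    def put(self, row_key, column, value):
        """Write one cell, creating the row on first use."""
        if row_key not in self._rows:
            bisect.insort(self._keys, row_key)
            self._rows[row_key] = {}
        self._rows[row_key][column] = value

    def get(self, row_key):
        """Point lookup by row key; returns an empty dict if the row is absent."""
        return dict(self._rows.get(row_key, {}))

    def scan(self, start_key, stop_key):
        """Return rows with start_key <= row key < stop_key, in key order."""
        lo = bisect.bisect_left(self._keys, start_key)
        hi = bisect.bisect_left(self._keys, stop_key)
        return [(key, dict(self._rows[key])) for key in self._keys[lo:hi]]


table = TinyTable()
table.put("user#002", "info:name", "Bo")
table.put("user#001", "info:name", "Ann")
table.put("user#001", "stats:hits", "5")
table.put("order#900", "info:total", "42")

print(table.get("user#001"))
print([key for key, _ in table.scan("user#", "user$")])
```

Every read goes through the row key, either as a single lookup or as a range over sorted keys. That is the contrast with the SQL-over-files style of Hive described above.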
How the two are related
HBase and Hive occupy different positions in a big data architecture. HBase mainly addresses real-time data queries, while Hive mainly addresses data processing and computation. The two are usually used together.
In a big data architecture, Hive and HBase work in cooperation, and the data flow typically runs as follows:
- ETL tools extract data from the source systems and store it in HDFS;
- Hive cleans and processes the raw data;
- If the results cleaned by Hive are intended for random-query scenarios over massive data, they can be stored in HBase;
- Data applications query the data from HBase.
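The four steps above can be sketched end to end in plain Python. This is a toy simulation under stated assumptions, not code for a real cluster: a list stands in for files in HDFS, a function stands in for a Hive cleaning job, and a dict keyed by row key stands in for an HBase table. All function names and sample records are hypothetical.

```python
def extract_to_hdfs(source_records):
    """Step 1: ETL stand-in. Copy raw source lines into 'HDFS' (a list)."""
    return list(source_records)


def clean_with_hive(raw_lines):
    """Step 2: Hive stand-in. Drop malformed lines and total hits per user."""
    totals = {}
    for line in raw_lines:
        parts = line.strip().split(",")
        if len(parts) != 2 or not parts[1].isdigit():
            continue  # skip dirty records
        user_id, hits = parts[0], int(parts[1])
        totals[user_id] = totals.get(user_id, 0) + hits
    return totals


def store_in_hbase(results):
    """Step 3: HBase stand-in. Key each result by row key for random reads."""
    return {
        f"user#{user_id}": {"stats:hits": hits}
        for user_id, hits in results.items()
    }


def query_from_app(hbase_table, user_id):
    """Step 4: application stand-in. Point lookup by row key."""
    return hbase_table.get(f"user#{user_id}")


source = ["u1,3", "u2,1", "bad line", "u1,2", "u3,"]
hdfs = extract_to_hdfs(source)
cleaned = clean_with_hive(hdfs)
hbase = store_in_hbase(cleaned)
print(query_from_app(hbase, "u1"))  # prints {'stats:hits': 5}
```

The split mirrors the division of labor: the heavy, batch-style cleaning happens once in the Hive step, and the application only performs cheap key lookups against the stored results.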