Hive entry learning route guide

Hive is used by most enterprises. Learning it will help you master the technology used by the enterprise. Here, I will explain hive from installation and use to concepts, principles and problems encountered in how to use it. I hope it will be helpful to everyone.
This article has a lot of content:
the goals that need to be achieved after reading it 1.
What is
hive 2. Understand the principle of hive
3. Will use hive
4. Will use hive programming







1. hive
First of all, what do we need hive?
Let you really understand what hive is
. The above is very clear
1.hive is a data warehouse
2.hive is based on hadoop.
To sum up in one sentence: hive is a data warehouse based on hadoop.
After hive understands, as we understand what a relational database is, then how do we use it:
First of all, we have to install it. The installation is divided into many types and is divided into single-opportunity cluster installation.
You can refer to the following: (Updating)
Hive Installation Guide

HIVE Completely Distributed Cluster Installation Process (Metabase: MySQL)

It should be noted that the default metabase of hive is not mysql, but because of the limitations of the default metabase, it is best to use mysql. Below we attach a picture, first from the overall understanding.
<ignore_js_op>  
is installed, then we need to practice our hands. We can understand the
basic operations of hive
according to the following content. Basic operations are not enough. Just like we need to understand sql syntax, we need to understand various hive syntaxes.

Hadoop Hive sql syntax detailed explanation 1 -Understand hive and DDL operations

Hadoop Hive sql syntax details 2-Modify table structure

Hadoop Hive sql syntax details 3--DML operations: metadata storage

Hadoop Hive sql syntax details 4--DQL operations: data query SQL

Hadoop Hive sql syntax details 5--The difference between HiveQL and SQL

This includes various operations such as loading data and querying data.
The above are some basic operations, and the following are responsible for some operations such as:
Hive Quick Query: Do not enable Mapreduce job Enable Fetch task Three ways to introduce how

Hive executes the sql statement in the file

Hive Four data import methods Introduce

the three in Hive Different data export methods introduce how

Hive creates indexes

. The above is basically a practical type. We also need advanced and advanced, we need to understand the principles inside, and we need to know how to solve problems encountered.
Principle:
Comprehensive understanding of hive

Hive architecture Introduction to

hive implementation principle

Hive internal table and external table difference Introduce

the topic of Join in HIVE in detail---Join detailed explanation allows you to thoroughly understand the meaning of configuration items in Hive configuration files in

various modes of hive data storage

(Collector's Edition)

Comparison between HIVE and traditional database

hive Detailed explanation
of hive configuration parameters (collector's edition)


hive supports SQL Daquan (collector's edition)


hive create/delete/truncate table



The principle is generally understood, the basic operation is familiar, and we may have a further understanding, so how does hive combine with hadoop and hbase to play a role, how to play its practical significance, and how to be used by us.

First of all, when we use hive, many of them are combined with hbase, so that the advantages of hive are used, and it is more convenient to use hbase in various query data.
Why hive integrates with hbase

Hive integrates with HBase complete guidance

on the correspondence between hadoop, hbase, and hive versions

The above is used in conjunction with hbase, so how to integrate with the programming language, you can refer to the following:

Connect Hive through JDBC driver Operation example

Hive: Using Java code to connect Hiveserver through JDBC Introduction


from MapReduce to Hive Practical analysis of

hive combat

You may already have some understanding of hive. For the following interviews and problems, you can refer to
hive. Can't find the created table.

Shark's analysis of Hive Introduction to supported and unsupported grammars Interview questions:

Use Hadoop MapReduce and hive respectively to count mobile phone traffic





Guess you like

Origin http://43.154.161.224:23101/article/api/json?id=324613192&siteId=291194637