Big data: Apache Hive distributed SQL computing platform, Hive architecture, Hive deployment, first experience with Hive

Big data: Apache Hive distributed SQL computing platform

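Finding a job in 2022 takes a strong combination of education, ability, and luck. In a hiring winter the big companies are not recruiting, so many students who trained for algorithm roles may have to look for development or test-development positions instead.
For test development you need to learn databases: SQL and Oracle, and SQL above all. Many financial companies and security agencies, of course, insist on using Oracle databases.
They regard Oracle as more secure and more powerful than other SQL databases, so you need to learn it. Most importantly, if you plan to take the civil service exam for the cyber police and you do not know this material, do not bother registering; it would be a waste of time!
Likewise, since the cyber police data analysis and application post is the target, the exam will certainly cover data mining fundamentals, so starting today we will go through data mining properly. The most important topic of all is big data: the aptitude test and the interview are minor matters, while the written exam on big data technology is the hardest and most important part.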


Plain MapReduce does not support SQL; jobs have to be written in Java or Python, so it is no wonder people wanted something simpler.

You write the query in SQL, and Hive translates it into MapReduce jobs and runs them for you.
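
To make the translation step more concrete, here is a minimal sketch in plain Python of how a query such as `SELECT gender, COUNT(*) FROM test GROUP BY gender` could be carried out as map, shuffle, and reduce phases. The table contents, the column names, and the function names are made up for illustration. This is not the code Hive generates, only the general MapReduce pattern that such a query maps onto.

```python
from collections import defaultdict

# Hypothetical rows of a table test(id, name, gender); the data is made up.
ROWS = [
    (1, "Alice", "F"),
    (2, "Bob", "M"),
    (3, "Carol", "F"),
    (4, "Dave", "M"),
    (5, "Eve", "F"),
]


def map_phase(rows):
    """Emit one (key, 1) pair per row, keyed by the GROUP BY column."""
    for _id, _name, gender in rows:
        yield gender, 1


def shuffle_phase(pairs):
    """Collect all values that share a key, as happens between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups):
    """Sum the values of each group, which implements COUNT(*)."""
    return {key: sum(values) for key, values in groups.items()}


def count_by_gender(rows):
    """Chain the three phases for the GROUP BY query."""
    return reduce_phase(shuffle_phase(map_phase(rows)))


if __name__ == "__main__":
    print(count_by_gender(ROWS))
```

The point of the sketch is that the SQL author only states what to compute; the three phases are what a framework would otherwise force you to write by hand.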

SQL is simple and easy to use, and with it you can do distributed development on massive data sets, because the SQL is translated into MapReduce.
Everyone is happy: it is simple, and it can handle big data. Cool.

Hive is so awesome

Simulating how Hive is implemented

Hive can do these two things

Hive architecture

Metadata store
SQL parser
User interface
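
As a rough illustration of how the three components listed above could fit together, the following toy sketch wires a metadata store, a very small SQL parser, and a user-facing entry point into one program. Everything in it is an assumption made for illustration: the `METASTORE` dictionary, the single supported query shape, and the function names are hypothetical, and real Hive keeps its metadata in a separate database and its table data in files rather than in memory.

```python
import re

# Metadata store: maps a table name to its columns and its data.
# Assumption: the data lives in memory here instead of in files on a cluster.
METASTORE = {
    "test": {
        "columns": ["id", "name", "gender"],
        "rows": [(1, "Alice", "F"), (2, "Bob", "M"), (3, "Carol", "F")],
    }
}

# SQL parser: understands only
#   SELECT <col>, COUNT(*) FROM <table> GROUP BY <col>
QUERY_PATTERN = re.compile(
    r"^\s*SELECT\s+(\w+)\s*,\s*COUNT\(\*\)\s+FROM\s+(\w+)"
    r"\s+GROUP\s+BY\s+(\w+)\s*;?\s*$",
    re.IGNORECASE,
)


def parse(sql):
    """Turn SQL text into a small plan: which table, which grouping column."""
    match = QUERY_PATTERN.match(sql)
    if match is None:
        raise ValueError("unsupported query: " + sql)
    select_col, table, group_col = match.groups()
    if select_col.lower() != group_col.lower():
        raise ValueError("selected column must match the GROUP BY column")
    return {"table": table.lower(), "group_by": group_col.lower()}


def execute(plan):
    """Look the table up in the metadata store and count rows per group."""
    if plan["table"] not in METASTORE:
        raise ValueError("unknown table: " + plan["table"])
    table = METASTORE[plan["table"]]
    if plan["group_by"] not in table["columns"]:
        raise ValueError("unknown column: " + plan["group_by"])
    index = table["columns"].index(plan["group_by"])
    counts = {}
    for row in table["rows"]:
        counts[row[index]] = counts.get(row[index], 0) + 1
    return counts


def run_query(sql):
    """User interface: accept SQL text and return the result."""
    return execute(parse(sql))


if __name__ == "__main__":
    print(run_query("SELECT gender, COUNT(*) FROM test GROUP BY gender"))
```

The split mirrors the component list: `METASTORE` knows where the data is and what its columns are, `parse` decides what to compute, and `run_query` is the only part a user touches.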


Hive deployment

Hive itself is a single-machine tool that only needs to be deployed on one server, while the MapReduce jobs it submits run across multiple machines.
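
The single-coordinator, many-workers shape can be sketched as follows. This is only an analogy under stated assumptions: threads from Python's standard `concurrent.futures` module stand in for the worker machines, and the names `driver`, `split_input`, and `worker_count` are hypothetical. A real deployment submits jobs to a Hadoop cluster rather than to a thread pool.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


def split_input(values, num_splits):
    """Coordinator side: cut the input into one split per worker."""
    return [values[i::num_splits] for i in range(num_splits)]


def worker_count(split):
    """Worker side: count the values in one split (a partial result)."""
    return Counter(split)


def driver(values, num_workers=3):
    """Single coordinator: dispatch splits to workers, then merge the partial counts."""
    splits = split_input(values, num_workers)
    # Assumption: each thread plays the role of one worker machine.
    with ThreadPoolExecutor(max_workers=num_workers) as pool:
        partials = list(pool.map(worker_count, splits))
    total = Counter()
    for partial in partials:
        total.update(partial)
    return dict(total)


if __name__ == "__main__":
    print(driver(["F", "M", "F", "M", "F"]))
```

Only `driver` needs to exist in one place; the counting work itself is spread over however many workers are available.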

Create a database for Hive to use
If no computation is needed, there is little point in using Hive.

If you do need to compute, Hive shows its advantage when the amount of data is large.



Summary

Tip: key lessons:

1)
2) Learn Oracle well; even in a cold job market, landing a test-development offer should not be a problem. It is also a must if you plan to take the civil service exam for the cyber police.
3) When you are aiming for AC (accepted) in a written test, you can ignore space complexity, but in an interview you must consider both the optimal time complexity and the optimal space complexity.


Origin blog.csdn.net/weixin_46838716/article/details/131003999