Running Python Scripts in Hive

Category: Programming Languages. Published: 2018-12-29 02:11:03

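Hive can load a Python script and run it as part of a query.

Benefits: this helps with deploying models for offline (batch) scoring, and with row-based computations that are awkward to express in SQL.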

Python script:

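import sys

# This script implements the mean, one of the 47 transformations.
# Remove null values from the input before running it.
'''
Table structure:
uid,c1,c2,c3
123,11,22,33
...
...

'''
if __name__ == '__main__':

    for line in sys.stdin:  # Hive feeds rows one at a time; they must be read from standard input
        features = line.strip().split('\t')  # Hive separates the fields of each row with \t
        if len(features) != 4:  # sanity check: uid plus c1, c2, c3; in theory this check could be dropped
            print("error:error1!", file=sys.stderr)  # report on stderr so stdout holds only result rows
            break
        avg = (int(features[1]) + int(features[2]) + int(features[3])) / 3  # Hive passes every value as a string, so convert to int first
        print(str(features[0]) + '\t' + str(avg))  # rows written to stdout go back to Hive; output must be strings separated by '\t'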
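Because the script only reads tab-separated lines from standard input and writes tab-separated lines to standard output, its logic can be checked locally before it is shipped to the cluster. The following is a minimal sketch of that idea, not the original author's code: the helper names `row_average` and `run` are hypothetical, the sample rows are made up for illustration, and malformed rows are skipped here rather than stopping the loop.

```python
import io


def row_average(line, expected_fields=4):
    """Return 'uid<TAB>avg' for one tab-separated row, or None if it is malformed."""
    features = line.rstrip("\n").split("\t")
    if len(features) != expected_fields:
        return None
    values = [int(v) for v in features[1:]]  # every field arrives as a string
    return features[0] + "\t" + str(sum(values) / len(values))


def run(stream_in, stream_out):
    """Process rows the way the Hive script would, but on arbitrary text streams."""
    for line in stream_in:
        result = row_average(line)
        if result is not None:  # assumption: skip bad rows instead of breaking
            stream_out.write(result + "\n")


# Simulate the rows Hive would send on stdin (made-up sample data).
fake_stdin = io.StringIO("123\t11\t22\t33\n456\t1\t2\t3\n")
fake_stdout = io.StringIO()
run(fake_stdin, fake_stdout)
print(fake_stdout.getvalue(), end="")
```

In the real script, `run(sys.stdin, sys.stdout)` would take the place of the in-memory streams.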

Hive code:

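add ARCHIVE hdfs:///tmp/anaconda3_nlp.zip; -- archive containing Python and its packages; this is a path on the cluster's HDFS
add file hdfs:///tmp/02_LSTM/tmp.py; -- the Python script above, uploaded to HDFS beforehand (for example, under your own directory)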

drop table if exists manyorder_model_lstm_columntorow;
create table manyorder_model_lstm_columntorow as
select
transform(
            uid
           ,c1
           ,c2
           ,c3
)
using 'anaconda3_nlp.zip/anaconda3/bin/python tmp.py' -- path to the Python interpreter inside the archive, followed by the script
as (uid,avg)

from tableX
;
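The script's comments ask for null values to be cleaned before the query runs. When that is not practical, the nulls can be handled inside the script instead: by default, TRANSFORM sends a NULL column to the script as the literal string \N, and a \N cell in the script's output is read back as NULL. The sketch below illustrates one possible policy, averaging only the non-null columns. That policy and the helper name `safe_average` are assumptions for illustration, not part of the original post.

```python
HIVE_NULL = r"\N"  # how TRANSFORM represents NULL on stdin and stdout by default


def safe_average(line):
    """Average the non-null numeric columns of one row; emit NULL if none remain."""
    features = line.rstrip("\n").split("\t")
    uid, raw_values = features[0], features[1:]
    # Assumption: NULL columns are ignored rather than treated as zero.
    values = [int(v) for v in raw_values if v != HIVE_NULL]
    avg = str(sum(values) / len(values)) if values else HIVE_NULL
    return uid + "\t" + avg


print(safe_average("123\t11\t\\N\t33\n"))    # prints 123<TAB>22.0
print(safe_average("456\t\\N\t\\N\t\\N\n"))  # prints 456<TAB>\N
```

Whether a missing value should be skipped, counted as zero, or should null out the whole row depends on how the downstream model expects the feature to be computed.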


Reposted from blog.csdn.net/weixin_42247685/article/details/81285965
