python与hive通信交互 -

student_lp

浏览: 441126 次
性别:
来自: 北京

最近访客更多访客>>

james1110

coolworld

suzhiqiang99

zk11231002

博主相关

博客

微博

相册

留言

关于我

文章分类

社区版块

存档分类

python与hive通信交互

博客分类：

python与hive1通讯 python与hive2通讯 python编程大数据操作

一、python与hive1通信

#!/usr/bin/python2.7
#hive --service hiveserver >/dev/null 2>/dev/null&
#/usr/lib/hive/lib/py
import sys
from hive_service import ThriftHive
from hive_service.ttypes import HiveServerException
from thrift import Thrift
from thrift.transport import TSocket
from thrift.transport import TTransport
from thrift.protocol import TBinaryProtocol

def hiveExe(sql):
    try:
        transport = TSocket.TSocket('*.*.*.*', 10000)
        transport = TTransport.TBufferedTransport(transport)
        protocol = TBinaryProtocol.TBinaryProtocol(transport)
        client = ThriftHive.Client(protocol)
        transport.open()

        client.execute(sql)

        print "The return value is : "
        print client.fetchAll()
        print "............"
        transport.close()
    except Thrift.TException, tx:
        print '%s' % (tx.message)

if __name__ == '__main__':
    hiveExe("select * from project.table")

python和hive1通讯是一件非常容易的事情，因为python所需要的依赖包从/usr/lib/hive/lib/py获取即可，导入到hive的扩张中就可以应用了。

二、python与hive2的通信

#!/usr/bin/python2.7
#hive --service hiveserver2 >/dev/null 2>/dev/null&
#install pyhs2,first install cyrus-sasl-devel,gcc,libxml2-devel,libxslt-devel
#hiveserver2 is different from hiveserver on authority

import pyhs2

conn = pyhs2.connect(host='*.*.*.*',port=10000,authMechanism="PLAIN", user='hive', password='', database='project')
cur = conn.cursor()
cur.execute("select * from table limit 10")
for i in cur.fetch():
        print i
cur.close()
conn.close()

python与hive2通信比较费劲，需要安装的依赖比较多（install pyhs2,first install cyrus-sasl-devel,gcc,libxml2-devel,libxslt-devel）。但是安装完成后编程还是很容易的。

两种通讯有一个共同点，就是必须启动hive服务器。

分享到：

mysql架构 | 详解RabbitMQ

2014-08-21 15:41
浏览 3495
评论(0)
分类:互联网
查看更多

发表评论

您还没有登录,请您登录后再发表评论

最近访客更多访客>>

博主相关

文章分类

社区版块

存档分类

最新评论

python与hive通信交互

评论

发表评论

相关推荐

最近访客 更多访客>>

博主相关

文章分类

社区版块

存档分类

最新评论

python与hive通信交互

评论

发表评论

相关推荐

kafka架构介绍

hbase rowkey 查询

HBase Rowkey设计

hbase热点问题(数据倾斜)解决方案---rowkey散列和预分区设计

HBase -ROOT-和.META.表结构【转】

转：Hive小文件合并

hive数据存储组织

hive中数据倾斜汇总

数据仓库架构

Java垃圾回收(2)

java内存管理(1)

spark shuffle详解

Hadoop之Cloudera Manager安装问题总结【转】

spark Streaming详解

spark详解

Hadoop之Cloudera Manager 管理机器的IP

Hadoop之Cloudera Manager CDH4卸载

搭建yum源服务器

yarn详解

Solr详解

最近访客更多访客>>