大数据Flume系列之Flume集群搭建

1. 概念

集群的意思是多台机器,最少有2台机器,一台机器从数据源中获取数据,将数据传送到另一台机器上,然后输出。接下来就要实现Flume集群搭建。集群如下图所示。

2. Flume搭建

2.1 部署准备

  • 部署主机
192.168.9.139    host14 
192.168.9.128    host15
  • host14主机下载flume软件包

# cd /opt/tools
# wget http://mirrors.tuna.tsinghua.edu.cn/apache/flume/1.7.0/apache-flume-1.7.0-bin.tar.gz
  • 上传解压flume
# mkdir -p /apps/svr/flume/
# tar -zxvf /opt/tools/apache-flume-1.7.0-bin.tar.gz -C /apps/svr/flume/

2.2 部署Flume

部署的是集群,需要在2台机安装Flume,host14作为push推送数据,host15作为pull获取数据后显示出来。

  • 修改配置文件
# cd /apps/svr/flume/apache-flume-1.7.0-bin/conf/
# cp flume-env.sh.template flume-env.sh
# cp flume-conf.properties.template flume-telent.conf

# vim flume-env.sh
export JAVA_HOME=/apps/svr/java/jdk1.8.0_172
  • host15主机部署Flume
# scp -r /apps/svr/flume/ host15:/apps/svr/
  • 验证flume
# /apps/svr/flume/apache-flume-1.7.0-bin/bin/flume-ng version
Flume 1.7.0
Source code repository: https://git-wip-us.apache.org/repos/asf/flume.git
Revision: 511d868555dd4d16e6ce4fedc72c2d1454546707
Compiled by bessbd on Wed Oct 12 20:51:10 CEST 2016
From source with checksum 0d21b3ffdc55a07e1d08875872c00523

2.3 Flume集群配置

  • 配置push.conf 

[host14]

# cd /apps/svr/flume/apache-flume-1.7.0-bin/conf
# vim push.conf

# Name the components on this agent
a2.sources= r1
a2.sinks= k1
a2.channels= c1
 
# Describe/configure the source
a2.sources.r1.type= spooldir
a2.sources.r1.spoolDir= /apps/svr/flume/logs
a2.sources.r1.channels= c1
 
# Use a channel which buffers events in memory
a2.channels.c1.type= memory
a2.channels.c1.keep-alive= 10
a2.channels.c1.capacity= 100000
a2.channels.c1.transactionCapacity= 100000
 
# Describe/configure the source
a2.sinks.k1.type= avro
a2.sinks.k1.channel= c1
a2.sinks.k1.hostname= host15
a2.sinks.k1.port= 8899
  • 配置pull.conf

[host15]
# cd /apps/svr/flume/apache-flume-1.7.0-bin/conf
# vim pull.conf

# Name the components on this agent
a1.sources= r1
a1.sinks= k1
a1.channels= c1
 
# Describe/configure the source
a1.sources.r1.type= avro
a1.sources.r1.channels= c1
a1.sources.r1.bind= host15
a1.sources.r1.port= 8899
 
# Describe the sink
a1.sinks.k1.type= logger
a1.sinks.k1.channel = c1
 
# Use a channel which buffers events in memory
a1.channels.c1.type= memory
a1.channels.c1.keep-alive= 10
a1.channels.c1.capacity= 100000
a1.channels.c1.transactionCapacity= 100000
  • 创建spoolDir目录

[host14]
# mkdir -p /apps/svr/flume/logs

2.4 Flume集群启动

  • 启动pull主机

[host15]
# cd /apps/svr/flume/apache-flume-1.7.0-bin/
# ./bin/flume-ng agent -c conf -f conf/pull.conf -n a1 -Dflume.root.logger=INFO,console

显示如图所示则为启动成功

  • 启动push主机

[host14]
# cd /apps/svr/flume/apache-flume-1.7.0-bin/
# ./bin/flume-ng agent -n a2 -c conf -f conf/push.conf -Dflume.root.logger=INFO,console

显示如图所示则为启动成功

  • 验证连接

[host15]

显示如图所示表示连接成功

3. Flume测试

3.1 创建测试用例

[host14]
# cd /apps/svr/flume/logs/
# vim flume-use-case-test.log

HELLO WORLD!!!
HELLO FLUME!!!

3.2 验证测试

  • pull主机

显示如图所示表示测试成功

  • push主机

显示如图所示表示测试成功

结论:用例测试成功,证明Flume集群搭建成功。

原文地址:https://1csh1.github.io/2016/04/21/Flume%E9%9B%86%E7%BE%A4%E6%90%AD%E5%BB%BA/

猜你喜欢

转载自blog.csdn.net/volitationLong/article/details/82186379