Single-node Hadoop to build, run grep instance official

Preparing the Environment

centos 7
hadoop 2.7.7

Environment Installation

安装ssh
[root@swarm02 ~]# yum search ssh
Loaded plugins: fastestmirror
Loading mirror speeds from cached hostfile
 * base: centos.ustc.edu.cn
 * extras: ap.stykers.moe
 * updates: centos.ustc.edu.cn
============================================================================================================================== N/S matched: ssh ==============================================================================================================================
fence-agents-ilo-ssh.x86_64 : Fence agent for HP iLO devices via SSH
ksshaskpass.x86_64 : A KDE version of ssh-askpass with KWallet support
libssh.x86_64 : A library implementing the SSH protocol
libssh-devel.x86_64 : Development files for libssh
libssh2.i686 : A library implementing the SSH2 protocol
libssh2.x86_64 : A library implementing the SSH2 protocol
libssh2-devel.i686 : Development files for libssh2
libssh2-devel.x86_64 : Development files for libssh2
libssh2-docs.noarch : Documentation for libssh2
openssh.x86_64 : An open source implementation of SSH protocol versions 1 and 2
openssh-askpass.x86_64 : A passphrase dialog for OpenSSH and X
openssh-clients.x86_64 : An open source SSH client applications
openssh-keycat.x86_64 : A mls keycat backend for openssh
openssh-ldap.x86_64 : A LDAP support for open source SSH server daemon
openssh-server.x86_64 : An open source SSH server daemon
openssh-server-sysvinit.x86_64 : The SysV initscript to manage the OpenSSH server.
pam_ssh_agent_auth.i686 : PAM module for authentication with ssh-agent
pam_ssh_agent_auth.x86_64 : PAM module for authentication with ssh-agent
sshpass.x86_64 : Non-interactive SSH authentication utility
ansible.noarch : SSH-based configuration management, deployment, and task execution system
jsch.noarch : Pure Java implementation of SSH2
openssh-cavs.x86_64 : CAVS tests for FIPS validation
python-paramiko.noarch : SSH2 protocol library for python
python-paramiko-doc.noarch : Docs and demo for SSH2 protocol library for python

  Name and summary matches only, use "search all" for everything.
[root@swarm02 ~]# yum install fence-agents-ilo-ssh.x86_64
Loaded plugins: fastestmirror
Loading mirror speeds from cached hostfile
 * base: centos.ustc.edu.cn
 * extras: ftp.sjtu.edu.cn
 * updates: mirrors.cn99.com
base                                                                                                                                                                                                                                                   | 3.6 kB  00:00:00     
docker-ce-stable                                                                                                                                                                                                                                       | 3.5 kB  00:00:00     
extras                                                                                                                                                                                                                                                 | 3.4 kB  00:00:00     
updates                                                                                                                                                                                                                                                | 3.4 kB  00:00:00     
(1/2): docker-ce-stable/x86_64/primary_db                                                                                                                                                                                                              |  27 kB  00:00:00     
(2/2): updates/7/x86_64/primary_db                                                                                                                                                                                                                     | 3.4 MB  00:00:03     
Resolving Dependencies
--> Running transaction check
---> Package fence-agents-ilo-ssh.x86_64 0:4.2.1-11.el7_6.7 will be installed
--> Processing Dependency: fence-agents-common >= 4.2.1-11.el7_6.7 for package: fence-agents-ilo-ssh-4.2.1-11.el7_6.7.x86_64
--> Processing Dependency: telnet for package: fence-agents-ilo-ssh-4.2.1-11.el7_6.7.x86_64
--> Running transaction check
---> Package fence-agents-common.x86_64 0:4.2.1-11.el7_6.7 will be installed
--> Processing Dependency: pexpect for package: fence-agents-common-4.2.1-11.el7_6.7.x86_64
---> Package telnet.x86_64 1:0.17-64.el7 will be installed
--> Running transaction check
---> Package pexpect.noarch 0:2.3-11.el7 will be installed
--> Finished Dependency Resolution

Dependencies Resolved

==============================================================================================================================================================================================================================================================================
 Package                                                                  Arch                                                       Version                                                                Repository                                                   Size
==============================================================================================================================================================================================================================================================================
Installing:
 fence-agents-ilo-ssh                                                     x86_64                                                     4.2.1-11.el7_6.7                                                       updates                                                      29 k
Installing for dependencies:
 fence-agents-common                                                      x86_64                                                     4.2.1-11.el7_6.7                                                       updates                                                      75 k
 pexpect                                                                  noarch                                                     2.3-11.el7                                                             base                                                        142 k
 telnet                                                                   x86_64                                                     1:0.17-64.el7                                                          base                                                         64 k

Transaction Summary
==============================================================================================================================================================================================================================================================================
Install  1 Package (+3 Dependent packages)

Total download size: 310 k
Installed size: 1.1 M
Is this ok [y/d/N]: y
Downloading packages:
(1/4): fence-agents-ilo-ssh-4.2.1-11.el7_6.7.x86_64.rpm                                                                                                                                                                                                |  29 kB  00:00:00     
(2/4): telnet-0.17-64.el7.x86_64.rpm                                                                                                                                                                                                                   |  64 kB  00:00:00     
(3/4): fence-agents-common-4.2.1-11.el7_6.7.x86_64.rpm                                                                                                                                                                                                 |  75 kB  00:00:00     
(4/4): pexpect-2.3-11.el7.noarch.rpm                                                                                                                                                                                                                   | 142 kB  00:00:01     
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
Total                                                                                                                                                                                                                                         261 kB/s | 310 kB  00:00:01     
Running transaction check
Running transaction test
Transaction test succeeded
Running transaction
  Installing : pexpect-2.3-11.el7.noarch                                                                                                                                                                                                                                  1/4 
  Installing : fence-agents-common-4.2.1-11.el7_6.7.x86_64                                                                                                                                                                                                                2/4 
  Installing : 1:telnet-0.17-64.el7.x86_64                                                                                                                                                                                                                                3/4 
  Installing : fence-agents-ilo-ssh-4.2.1-11.el7_6.7.x86_64                                                                                                                                                                                                               4/4 
  Verifying  : 1:telnet-0.17-64.el7.x86_64                                                                                                                                                                                                                                1/4 
  Verifying  : fence-agents-common-4.2.1-11.el7_6.7.x86_64                                                                                                                                                                                                                2/4 
  Verifying  : pexpect-2.3-11.el7.noarch                                                                                                                                                                                                                                  3/4 
  Verifying  : fence-agents-ilo-ssh-4.2.1-11.el7_6.7.x86_64                                                                                                                                                                                                               4/4 

Installed:
  fence-agents-ilo-ssh.x86_64 0:4.2.1-11.el7_6.7                                                                                                                                                                                                                              

Dependency Installed:
  fence-agents-common.x86_64 0:4.2.1-11.el7_6.7                                                        pexpect.noarch 0:2.3-11.el7                                                        telnet.x86_64 1:0.17-64.el7                                                       

Complete!
[root@swarm02 ~]# 

安装rsync
yun install libguestfs-rsync.x86_64

Download hadoop

前提之前需要配置java的环境变量
export JAVA_HOME=/usr/local/jdk1.8.0
export CLASSPATH=.:$JAVA_HOME/lib/dt.jar:$JAVA_HOME/lib/tools.jar
export PATH=$PATH:$JAVA_HOME/bin

[root@swarm02 ~]# wget http://mirror.bit.edu.cn/apache/hadoop/common/hadoop-2.7.7/hadoop-2.7.7.tar.gz
解压后拷贝文件到指定的目录文件下面,我这里面的文件夹的位置并不做特殊的要求,只要后面的配置满足这个就行了
[root@swarm02 ~]# tar -zxvf hadoop-2.7.7.tar.gz -C /usr/local

Configuration environment variable

配置hadoop的环境变量
export HADOOP_HOME=/usr/local/hadoop
export PATH=.:$HADOOP_HOME/bin:$JAVA_HOME/bin:$PATH

Environment variables to take effect

source /etc/profile

Detecting installation effect

需要java环境的支持,java环境的配置这里就不在详细讲解安装过程了
[root@swarm01 ~]# java -version
java version "1.8.0_201"
Java(TM) SE Runtime Environment (build 1.8.0_201-b09)
Java HotSpot(TM) 64-Bit Server VM (build 25.201-b09, mixed mode)
[root@swarm01 ~]# hadoop version
Hadoop 2.7.7
Subversion Unknown -r c1aad84bd27cd79c3d1a7dd58202a8c3ee1ed3ac
Compiled by stevel on 2018-07-18T22:47Z
Compiled with protoc 2.5.0
From source with checksum 792e15d20b12c74bd6f19a1fb886490
This command was run using /usr/local/hadoop/share/hadoop/common/hadoop-common-2.7.7.jar
[root@swarm01 ~]# 

grep's demo demo

创建一个文件夹 grep_demo用于存放grep程序的input和output

[root@swarm01 swarm01]# mkdir grep_demo
[root@swarm01 swarm01]# ll
total 213600
drwxr-xr-x. 2 root root         6 Apr 14 21:27 grep_demo
-rw-r--r--. 1 root root 218720521 Jul 20  2018 hadoop-2.7.7.tar.gz
drwxr-xr-x. 3 root root        60 Apr 14 11:37 java-8
-rw-r--r--. 1 root root      1506 Apr 12 11:00 vi.text
drwxr-xr-x. 4 root root        33 Apr 14 13:16 word_regex
[root@swarm01 swarm01]# 

这里可以选择吧hadoop的文件名直接拷贝到这个input文件夹中,也可以自己创建,这里我选择自己创建了,这样更有含义

下面是准备工作,执行命令在gerp_demo文件夹中
[root@swarm01 swarm01]# cd grep_demo/
[root@swarm01 grep_demo]# ll
total 0
[root@swarm01 grep_demo]# mkdir input
[root@swarm01 grep_demo]# ll
total 0
drwxr-xr-x. 2 root root 6 Apr 14 21:29 input
[root@swarm01 grep_demo]# cd input/
[root@swarm01 input]# touch sakura_demo.xml
[root@swarm01 input]# touch licunzhi_demo.xml
[root@swarm01 input]# ll
total 0
-rw-r--r--. 1 root root 0 Apr 14 21:30 licunzhi_demo.xml
-rw-r--r--. 1 root root 0 Apr 14 21:30 sakura_rain.xml
[root@swarm01 input]# 
[root@swarm01 input]# cat licunzhi_demo.xml 
licunzhi_demo_001
sakura_demo_licunzhi
[root@swarm01 input]# cat sakura_demo.xml 
sakura_demo_001
demo_001_sakura
[root@swarm01 input]# 

执行命令,将会生成output文件夹,因此这里面不需要创建output,否则一定会报失败的

[root@swarm01 grep_demo]# hadoop jar /usr/local/hadoop/share/hadoop/mapreduce/hadoop-mapreduce-examples-2.7.7.jar grep input output 'sakura_[a-z.]+'
19/04/14 09:35:45 INFO Configuration.deprecation: session.id is deprecated. Instead, use dfs.metrics.session-id
19/04/14 09:35:45 INFO jvm.JvmMetrics: Initializing JVM Metrics with processName=JobTracker, sessionId=
19/04/14 09:35:47 INFO input.FileInputFormat: Total input paths to process : 4
19/04/14 09:35:47 INFO mapreduce.JobSubmitter: number of splits:4
19/04/14 09:35:48 INFO mapreduce.JobSubmitter: Submitting tokens for job: job_local1775879846_0001
19/04/14 09:35:48 INFO mapreduce.Job: The url to track the job: http://localhost:8080/
19/04/14 09:35:48 INFO mapreduce.Job: Running job: job_local1775879846_0001
19/04/14 09:35:48 INFO mapred.LocalJobRunner: OutputCommitter set in config null
19/04/14 09:35:48 INFO output.FileOutputCommitter: File Output Committer Algorithm version is 1
。。。。。。。。。。。。。。具体的日志信息这里面就不全部展示了

最终的效果
[root@swarm01 grep_demo]# ll
total 0
drwxr-xr-x. 2 root root 102 Apr 14 21:30 input
drwxr-xr-x. 2 root root  88 Apr 14 21:35 output
[root@swarm01 grep_demo]# cd output/
[root@swarm01 output]# ll
total 0
-rw-r--r--. 1 root root 0 Apr 14 21:35 part-r-00000
-rw-r--r--. 1 root root 0 Apr 14 21:35 _SUCCESS
[root@swarm01 output]# 
[root@swarm01 grep_demo]# cat output/part-r-00000 
2	sakura_demo
[root@swarm01 grep_demo]# 
——SUCCESS是执行成功的标志

Note that you

  • Here only a brief introduction
  • The so-called installation is configured based environment
  • Distributed content more relevant and practical introduction and subsequent other core components will continue to be updated
Published 88 original articles · won praise 17 · views 30000 +

Guess you like

Origin blog.csdn.net/qq_32112175/article/details/89297859