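Installation
- Download the installation package
- Upload the package and extract it (the paths below assume it is extracted under /export/server)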
tar -zxvf spark-2.4.7-bin-hadoop2.6.tgz
- Change the owner and group to root
chown -R root /export/server/spark-2.4.7-bin-hadoop2.6
chgrp -R root /export/server/spark-2.4.7-bin-hadoop2.6
- Create a symbolic link
ln -s /export/server/spark-2.4.7-bin-hadoop2.6 /export/server/spark
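
Before moving on to testing, it can help to confirm that the symbolic link really points at the extracted directory. The sketch below is one possible check; `check_link` is a hypothetical helper written for this note, not something shipped with Spark.

```shell
#!/usr/bin/env bash
# check_link is a hypothetical helper (not part of Spark): it succeeds only
# if the first argument is a symlink whose stored target equals the second.
check_link() {
  local link="$1" target="$2"
  if [ -L "$link" ] && [ "$(readlink "$link")" = "$target" ]; then
    echo "ok: $link -> $target"
  else
    echo "missing or wrong link: $link" >&2
    return 1
  fi
}

# Assumed usage on the machine described above:
# check_link /export/server/spark /export/server/spark-2.4.7-bin-hadoop2.6
```

Keeping the version-free path /export/server/spark as a link means later commands do not have to change if a different Spark version is installed and the link is re-pointed.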
Testing
- Start the Spark interactive shell
/export/server/spark/bin/spark-shell
- Test Spark with WordCount
1. Prepare the input file
vim /root/words.txt
Add the following content:
hello me you her
hello me you
hello me
hello
2. Run WordCount (in the spark-shell session, where sc is already defined)
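// Read the local file as an RDD of lines
val textFile = sc.textFile("file:///root/words.txt")
// Split each line into words, pair each word with 1, then sum the 1s per word
val counts = textFile.flatMap(_.split(" ")).map((_,1)).reduceByKey(_ + _)
// Bring the (word, count) pairs back to the driver and print them
counts.collect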
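Result: an array of (word, count) pairs, here hello 4, me 3, you 2, her 1 (the order of the pairs may vary).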
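
The expected counts for this input can be cross-checked without Spark. The sketch below recreates the same four lines non-interactively (a heredoc instead of vim) and counts the words with standard shell tools. The temporary file location and the `word_counts` function name are assumptions made for illustration; the guide itself uses /root/words.txt.

```shell
#!/usr/bin/env bash
# Recreate the sample input in a temporary file (hypothetical location).
words_file="$(mktemp)"
trap 'rm -f "$words_file"' EXIT
cat > "$words_file" <<'EOF'
hello me you her
hello me you
hello me
hello
EOF

# word_counts (hypothetical name): put one word per line, count each
# distinct word, then sort by count (descending) and by word.
word_counts() {
  tr -s ' ' '\n' < "$1" | sort | uniq -c | awk '{print $2, $1}' | sort -k2,2nr -k1,1
}

word_counts "$words_file"
# prints:
# hello 4
# me 3
# you 2
# her 1
```

The pipeline mirrors the Spark job: `tr` plays the role of flatMap with split, and `sort | uniq -c` plays the role of map plus reduceByKey. Spark does not guarantee the order of the collected pairs, so only the counts should be compared.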