Submitting a job to Spark (YARN mode) from IntelliJ IDEA on Windows 7

Submitting a job from Windows to the Spark cluster failed with the following error:
Exit code: 1
Exception message: /bin/bash: line 0: fg: no job control
Stack trace: ExitCodeException exitCode=1: /bin/bash: line 0: fg: no job control
at org.apache.hadoop.util.Shell.runCommand(Shell.java:538)
at org.apache.hadoop.util.Shell.run(Shell.java:455)
at org.apache.hadoop.util.Shell$ShellCommandExecutor.execute(Shell.java:715)
at org.apache.hadoop.yarn.server.nodemanager.DefaultContainerExecutor.launchContainer(DefaultContainerExecutor.java:212)
at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:302)
at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:82)
at java.util.concurrent.FutureTask.run(FutureTask.java:262)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
at java.lang.Thread.run(Thread.java:745)
Container exited with a non-zero exit code 1

This troubled me for a week. I searched Baidu and Google over and over and still could not solve it.

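Finally, while testing with a small program in Eclipse, I found that this Maven dependency
<dependency>
    <groupId>org.apache.spark</groupId>
    <artifactId>spark-yarn_2.11</artifactId>
    <version>2.1.0</version>
    <scope>compile</scope>
</dependency>
conflicts with the jars in Spark's own jars directory. Removing the dependency solved the problem!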

I am recording it here for reference.

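When running the program, I also noticed that it always packs all of the dependent jars into a zip and uploads it to HDFS, which wastes time on every run.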

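To avoid this, package Spark's jars into a zip yourself and upload it to HDFS once.
Then set the following in the program:
conf.set("spark.yarn.archive", "hdfs:////input/spark/spark-jars.zip")
With this setting in place, the upload is no longer repeated.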
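
The packaging step above could be sketched in plain Java as follows. This is only a minimal illustration, not the method used in the original post: the class name SparkJarsArchive and the method zipJars are hypothetical, and it assumes the jars should sit at the root of the zip, which is the layout spark.yarn.archive expects.

```java
import java.io.IOException;
import java.io.OutputStream;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.zip.ZipEntry;
import java.util.zip.ZipOutputStream;

// Hypothetical helper: packs every *.jar in a directory into one zip file.
final class SparkJarsArchive {

    // Writes each jar in jarsDir as a top-level entry of outZip.
    // Returns the number of jars written.
    static int zipJars(Path jarsDir, Path outZip) throws IOException {
        int count = 0;
        try (OutputStream out = Files.newOutputStream(outZip);
             ZipOutputStream zip = new ZipOutputStream(out);
             DirectoryStream<Path> jars = Files.newDirectoryStream(jarsDir, "*.jar")) {
            for (Path jar : jars) {
                // Use only the file name so the entry sits at the zip root.
                zip.putNextEntry(new ZipEntry(jar.getFileName().toString()));
                Files.copy(jar, zip);
                zip.closeEntry();
                count++;
            }
        }
        return count;
    }

    // Example invocation: java SparkJarsArchive <spark jars dir> <output zip>
    public static void main(String[] args) throws IOException {
        if (args.length != 2) {
            System.err.println("usage: SparkJarsArchive <jarsDir> <outZip>");
            return;
        }
        int n = zipJars(Paths.get(args[0]), Paths.get(args[1]));
        System.out.println("Wrote " + n + " jars to " + args[1]);
    }
}
```

The resulting file still has to be copied to HDFS separately (for example with hdfs dfs -put) to the location that spark.yarn.archive points at.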


Reposted from ananbb.iteye.com/blog/2372839