This is the last post in the series.
Spark is a data analysis tool that can run on Hadoop.
Download Spark from the official site: http://spark.apache.org/downloads.html
Then extract the archive and move it into place:
$ tar -xvf spark-1.5.2-bin-hadoop2.4.tgz
$ sudo mv spark-1.5.2-bin-hadoop2.4 /srv/spark-1.5.2
$ sudo ln -s /srv/spark-1.5.2 /srv/spark
Configure the environment variables:
$ sudo su hadoop
$ vim ~/.bashrc
Add the following lines:
export SPARK_HOME=/srv/spark
export PATH=$PATH:$SPARK_HOME/bin
Reload the file so the configuration takes effect:
$ source ~/.bashrc
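To confirm that the two exports above took effect, a quick check like the following can help. This is only a sketch: `check_spark_env` is a hypothetical helper name, not something shipped with Spark, and it assumes a POSIX-compatible shell such as bash.

```shell
# Hypothetical helper (not part of Spark): verifies the variables set in ~/.bashrc.
check_spark_env() {
    # SPARK_HOME must be set and point to an existing directory.
    if [ -z "${SPARK_HOME:-}" ] || [ ! -d "$SPARK_HOME" ]; then
        echo "SPARK_HOME is unset or not a directory"
        return 1
    fi
    # $SPARK_HOME/bin must appear as one of the entries in PATH.
    case ":$PATH:" in
        *":$SPARK_HOME/bin:"*) ;;
        *) echo "$SPARK_HOME/bin is not on PATH"; return 1 ;;
    esac
    echo "Spark environment looks OK: $SPARK_HOME"
}
```

Paste the function into the shell and run `check_spark_env`. If it reports a problem, re-check the lines added to ~/.bashrc and run `source ~/.bashrc` again.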
Run Spark:
$ pyspark
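Besides opening the interactive shell, the installation can be smoke-tested non-interactively. The sketch below assumes the `SPARK_HOME` and `PATH` settings from this post, and uses the `spark-submit` and `run-example` launchers that ship in Spark's `bin` directory. `spark_smoke_test` is a hypothetical helper name, and the "Pi is roughly" text it greps for is what the bundled SparkPi example is assumed to print.

```shell
# Hypothetical helper (not part of Spark): a quick non-interactive smoke test.
spark_smoke_test() {
    # Bail out early if the Spark launchers are not reachable through PATH.
    if ! command -v spark-submit >/dev/null 2>&1; then
        echo "spark-submit not found on PATH"
        return 1
    fi
    # Print the version banner of the installed Spark.
    spark-submit --version
    # Run the bundled SparkPi example with 10 partitions; hide log noise on stderr.
    "$SPARK_HOME/bin/run-example" SparkPi 10 2>/dev/null | grep "Pi is roughly"
}
```

Run `spark_smoke_test` as the hadoop user. A line beginning with "Pi is roughly" suggests that Spark can start a local job.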
This is probably the most trouble-free installation in the whole series.
That completes the pseudo-distributed Hadoop setup and the data analysis applications that run on it. Happy computing!