Ubuntu16.04+GeForce GTX 1070Ti+CUDA8.0+cuDNN5.1+TensorFlow1.2+tf-faster-rcnn训练

1、下载CUDA8.0和CUDNN5.1

百度网盘下载地址(包含8.0和9.0):https://pan.baidu.com/s/1ir3rKhUtU1aIRE7n1BQ5mg

2、安装CUDA8.0

安装方式1(后缀为.deb的):

sudo dpkg -i cuda-repo-ubuntu1604-8-0-local-ga2_8.0.61-1_amd64.deb
sudo apt-get update
sudo apt-get install cuda

卸载CUDA方式:

sudo apt autoremove cuda

安装方式2(后缀为.run的):

sudo sh cuda_8.0.44_linux.run

然后一直Enter到100%,并输入accept

接着出现 Install NVIDIA Accelerated Graphics Driver for Linux-x86_64 361.62?

一定要选择No,否则之前的驱动安装就白安了

安装完毕后会出现以下内容:

=========== 
= Summary = 
===========

Driver: Not Selected 
Toolkit: Installed in /usr/local/cuda-8.0 
Samples: Installed in /home/textminer

Please make sure that 
– PATH includes /usr/local/cuda-8.0/bin 
– LD_LIBRARY_PATH includes /usr/local/cuda-8.0/lib64, or, add /usr/local/cuda-8.0/lib64 to /etc/ld.so.conf and run ldconfig as root

To uninstall the CUDA Toolkit, run the uninstall script in /usr/local/cuda-8.0/bin

Please see CUDA_Installation_Guide_Linux.pdf in /usr/local/cuda-8.0/doc/pdf for detailed information on setting up CUDA.

***WARNING: Incomplete installation! This installation did not install the CUDA Driver. A driver of version at least 361.00 is required for CUDA 8.0 functionality to work. 
To install the driver using this installer, run the following command, replacing with the name of this run file: 
sudo .run -silent -driver

Logfile is /opt/temp//cuda_install_6583.log

接着添加环境变量:

sudo gedit /etc/profile

打开“profile”文件,在末尾处添加(注意不要有空格,不然会报错):

export PATH=/usr/local/cuda-8.0/bin:$PATH
export LD_LIBRARY_PATH=/usr/local/cuda-8.0/lib64$LD_LIBRARY_PATH

测试CUDA是否安装成功,输入:

nvcc -V

3、安装CUDNN V5.1

下载完成之后进入下载目录(将下载的安装包拷贝到home文件夹下),执行以下命令进行解压:

sudo tar -zxvf ./cudnn-8.0-linux-x64-v5.1.tgz

解压之后,得到一个 cudn 文件夹,该文件夹下include 和 lib64 两个文件夹,命令行进入 cuda/include 路径下,然后进行以下操作:

cd cuda/include
sudo cp cudnn.h /usr/local/cuda/include #复制头文件

再将进入lib64目录下的动态文件进行复制和链接:

cd ..
cd lib64
sudo cp lib* /usr/local/cuda/lib64/ #复制动态链接库
cd /usr/local/cuda/lib64/
sudo rm -rf libcudnn.so libcudnn.so.5 #删除原有动态文件
sudo ln -s libcudnn.so.5.1.10 libcudnn.so.5 #生成软衔接
sudo ln -s libcudnn.so.5 libcudnn.so #生成软链接
sudo ldconfig

验证CUDNN是否安装成功:

cat /usr/local/cuda/include/cudnn.h | grep CUDNN_MAJOR -A 2

4、安装Tensorflow-gpu 1.2

根据CUDA和CUDNN版本选择Tensorflow版本,即:tensorflow_gpu-1.2.0

(1)、安装tensorflow虚拟环境(由于支持python3.6,因此我选择3.6安装)

conda create -n tensorflow1.2 python=3.6

(2)、安装tensorflow

pip install tensorflow-gpu==1.2.0

卸载命令:pip uninstall tensorflow-gpu==1.2.0

(3)、安装其它依赖包

pip install Cython
pip install easydict
pip install opencv_python==3.3.1.11
pip install matplotlib
pip install Pillow
pip install scipy
pip install easydict

5、检测tensorflow是否使用gpu进行计算,在pycharm里配好工程环境后新建一个py文件,输入以下代码:

import tensorflow as tf
ess = tf.Session(config=tf.ConfigProto(log_device_placement=True))

输出:

6、训练tf-faster-rcnn

在使用CPU训练的时候,平均速度是训练一次需要2s多一点,使用CPU加速后,每训练一次在0.5s左右,速度确实有了非常明显

的提升。

猜你喜欢

转载自blog.csdn.net/oMoDao1/article/details/83303385