目录
问题描述
rancher上部署应用时报错(以下报错信息量很大,任意一句查一下都可以查出很多东西,非常杂乱):
Unable to mount volumes for pod "prometheus-65l4m-server-6958f84dd8-jwx5v_cp(e631afe7-2300-11ea-9b6b-fa163ed8966c)": timeout expired waiting for volumes to attach or mount for pod "cp"/"prometheus-65l4m-server-6958f84dd8-jwx5v". list of unmounted volumes=[storage-volume]. list of unattached volumes=[config-volume storage-volume prometheus-65l4m-server-token-l27xh]
MountVolume.WaitForAttach failed for volume "pvc-e5d89bb6-2300-11ea-9b6b-fa163ed8966c" : rbd: map failed exit status 2, rbd output: libkmod: ERROR ../libkmod/libkmod.c:586 kmod_search_moddep: could not open moddep file '/lib/modules/3.10.0-514.el7.x86_64/modules.dep.bin' modinfo: ERROR: Module alias rbd not found. modprobe: ERROR: ../libkmod/libkmod.c:586 kmod_search_moddep() could not open moddep file '/lib/modules/3.10.0-514.el7.x86_64/modules.dep.bin' modprobe: FATAL: Module rbd not found in directory /lib/modules/3.10.0-514.el7.x86_64 rbd: failed to load rbd kernel module (1) rbd: sysfs write failed In some cases useful info is found in syslog - try "dmesg | tail" or so. rbd: map failed: (2) No such file or directory
如图:
平台的存储类、创建的pvc状态都是正常的。
查了很多资料,什么升级内核...都没有用!
问题解决
在集群的worker节点上执行:
yum -y install librbd1 && modprobe rbd
原因是worker节点上却少rbd client,所以无法使用持久化存储!