最近在公司服务器上遇到了一个特别离谱的问题,就是在本身在nividia官网上面下载的匹配的显卡驱动,安装之后采用下面命令查看驱动显示:
$ nvidia-smiNVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver.
Make sure that the latest NVIDIA driver is installed and running
开机在开机日志之中显示:显卡与驱动版本不匹配。这里是由于公司本身的服务器搭建架构的问题。下面是日志记录
$ sudo dmesg.......
[ 2.078256] NVRM: The NVIDIA GPU 0002:00:00.0 (PCI ID: 10de:2236)
[ 2.078256] NVRM: installed in this system is not supported by the
[ 2.078256] NVRM: NVIDIA 550.54.14 driver release.
[ 2.078256] NVRM: Please see ‘Appendix A - Supported NVIDIA GPU Products’
[ 2.078256] NVRM: in this release’s README, available on the operating system
[ 2.078256] NVRM: specific graphics driver download page at www.nvidia.com.
[ 2.097526] nvidia: probe of 0002:00:00.0 failed with error -1
.......
我们公司的服务器是按照 硬件服务器-->PVE(虚拟化管理平台类似于Vmvare)-->虚拟机--> 显卡-->驱动-->操作系统-->软件这样搞的。所以这个问题的关键就是不在于重启虚拟机,而在于直接重启节点。也就是下面节点的chat节点,而不是102 这台机子。
重启chat节点之后就可以,就可以显示显卡驱动正常了!
今天写个帖子,希望可以帮到和我遇到相似问题的同学!