Hello, I have a strange problem. Everything was working well, but at some point new VMs stopped starting.
When I try to investigate, running "nvidia-smi" returns "NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver...". I reinstalled the driver once and that solved the problem, but the next day the problem came back and reinstalling the driver no longer helps. What could it be?
6 replies
You have shared too little information (IT Crowd - Have You Tried Turning It Off And On Again? -> YouTube).
You should check the runtime logs, with something like "dmesg | grep NVRM" or "dmesg | grep nvidia". From your previous posts, I assume you have an M60 on ESXi 6.5. You should be able to contact NVIDIA support directly.
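The log check suggested above can be sketched as a small shell helper. This is only an illustration: the function name `nvidia_log_lines` is made up for this example, and it simply combines the two grep patterns from the reply into one case-insensitive filter that reads log text on stdin.

```shell
#!/bin/sh
# Hypothetical helper (name chosen for this example only).
# Keeps the log lines that mention the NVIDIA kernel module ("NVRM")
# or anything containing "nvidia", case-insensitively.
# Reads from stdin, so it works with dmesg output or a saved log file.
nvidia_log_lines() {
    grep -i -E 'NVRM|nvidia'
}

# Typical use on the ESXi host (not executed here):
#   dmesg | nvidia_log_lines
#   vmkload_mod -l | grep -i nvidia             # is the kernel module loaded?
#   esxcli software vib list | grep -i nvidia   # is the driver VIB installed?
```

If the filter prints service start-up messages but no "NVRM" lines at all, that hints that the NVIDIA kernel module never logged anything, which is worth checking with the `vmkload_mod -l` line above.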
It is always worth searching the Knowledge Base - http://nvidia.custhelp.com/app/home/
I could not find anything in the Knowledge Base that helps.
I have ESXi 6.5 and a Tesla M60.

[root@localhost:~] dmesg | grep nvidia
2016-12-09T14:02:27.706Z cpu13:66686)Starting service nvidia-vgpu
2016-12-09T14:02:27.706Z cpu13:66686)Activating Jumpstart plugin nvidia-vgpu.
2016-12-09T14:02:27.908Z cpu13:66686)Jumpstart plugin nvidia-vgpu activated.
[root@localhost:~] nvidia-smi
NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.
[root@localhost:~] dmesg | grep NVRM

I don't know how to troubleshoot this correctly...
[root@localhost:/etc/init.d] esxcli hardware pci list -c 0x300 -m 0xff
0000:07:00.0
   Address: 0000:07:00.0
   Segment: 0x0000
   Bus: 0x07
   Slot: 0x00
   Function: 0x0
   VMkernel Name:
   Vendor Name: ASPEED Technology, Inc.
   Device Name: ASPEED Graphics Family
   Configured Owner: Unknown
   Current Owner: VMkernel
   Vendor ID: 0x1a03
   Device ID: 0x2000
   SubVendor ID: 0x1043
   SubDevice ID: 0x85f9
   Device Class: 0x0300
   Device Class Name: VGA compatible controller
   Programming Interface: 0x00
   Revision ID: 0x30
   Interrupt Line: 0x05
   IRQ: 255
   Interrupt Vector: 0x00
   PCI Pin: 0x00
   Spawned Bus: 0x00
   Flags: 0x3221
   Module ID: -1
   Module Name: None
   Chassis: 0
   Physical Slot: 4294967295
   Slot Description:
   Passthru Capable: true
   Parent Device: PCI 0:6:0:0
   Dependent Device: PCI 0:6:0:0
   Reset Method: Bridge reset
   FPT Sharable: true

0000:83:00.0
   Address: 0000:83:00.0
   Segment: 0x0000
   Bus: 0x83
   Slot: 0x00
   Function: 0x0
   VMkernel Name: vmgfx0
   Vendor Name: NVIDIA Corporation
   Device Name: NVIDIATesla M60
   Configured Owner: VM Passthru
   Current Owner: VM Passthru
   Vendor ID: 0x10de
   Device ID: 0x13f2
   SubVendor ID: 0x10de
   SubDevice ID: 0x115e
   Device Class: 0x0300
   Device Class Name: VGA compatible controller
   Programming Interface: 0x00
   Revision ID: 0xa1
   Interrupt Line: 0x05
   IRQ: 255
   Interrupt Vector: 0x00
   PCI Pin: 0x00
   Spawned Bus: 0x00
   Flags: 0x3401
   Module ID: 20
   Module Name: pciPassthru
   Chassis: 0
   Physical Slot: 4294967295
   Slot Description: Chassis slot 8; function 0; relative bdf 01:00.0
   Passthru Capable: true
   Parent Device: PCI 0:130:8:0
   Dependent Device: PCI 0:131:0:0
   Reset Method: Bridge reset
   FPT Sharable: true

0000:84:00.0
   Address: 0000:84:00.0
   Segment: 0x0000
   Bus: 0x84
   Slot: 0x00
   Function: 0x0
   VMkernel Name: vmgfx1
   Vendor Name: NVIDIA Corporation
   Device Name: NVIDIATesla M60
   Configured Owner: VM Passthru
   Current Owner: VM Passthru
   Vendor ID: 0x10de
   Device ID: 0x13f2
   SubVendor ID: 0x10de
   SubDevice ID: 0x115e
   Device Class: 0x0300
   Device Class Name: VGA compatible controller
   Programming Interface: 0x00
   Revision ID: 0xa1
   Interrupt Line: 0x05
   IRQ: 255
   Interrupt Vector: 0x00
   PCI Pin: 0x00
   Spawned Bus: 0x00
   Flags: 0x3401
   Module ID: 20
   Module Name: pciPassthru
   Chassis: 0
   Physical Slot: 4294967295
   Slot Description: Chassis slot 8; function 0; relative bdf 02:00.0
   Passthru Capable: true
   Parent Device: PCI 0:130:16:0
   Dependent Device: PCI 0:132:0:0
   Reset Method: Bridge reset
   FPT Sharable: true
I suppose that you configured the card as "VM pass-through" (for example, vDGA): your esxcli output lists both M60 GPUs with "Current Owner: VM Passthru". The NVIDIA driver (and nvidia-smi) in ESXi cannot service vDGA cards. You should configure the card as vGPU. For an explanation of Soft3D, vSGA, vGPU and vDGA, see for example http://www.vmware.com/content/dam/digitalmarketing/vmware/en/pdf/techpaper/vmware-horizon-view-graphics-acceleration-deployment.pdf or http://www.vmware.com/content/dam/digitalmarketing/vmware/en/pdf/products/horizon/grid-vgpu-deployment-guide.pdf or http://us.download.nvidia.com/Windows/Quadro_Certified/GRID/348.07/346.68-348.07-nvidia-grid-quick-start-guide.pdf (or newer).
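The pass-through diagnosis can be checked mechanically from the same `esxcli hardware pci list` output. The following is a minimal sketch, not an official tool: the function name `passthru_nvidia_gpus` is hypothetical, and it assumes the "Key: value" layout shown in this thread, where each device block has an `Address`, a `Vendor Name` and a `Current Owner` field.

```shell
#!/bin/sh
# Hypothetical helper (name chosen for this example only).
# Reads `esxcli hardware pci list` output on stdin and prints the PCI
# address of every NVIDIA device whose current owner is "VM Passthru".
# Assumes the "Key: value" layout shown in this thread.
passthru_nvidia_gpus() {
    awk -F': ' '
        # A new device block starts: remember its address, reset the vendor.
        $1 ~ /Address$/       { addr = $2; vendor = "" }
        # Remember the vendor of the current block.
        $1 ~ /Vendor Name$/   { vendor = $2 }
        # Report the block if it is an NVIDIA device owned by pass-through.
        $1 ~ /Current Owner$/ && vendor ~ /NVIDIA/ && $2 == "VM Passthru" { print addr }
    '
}

# Typical use on the ESXi host (not executed here):
#   esxcli hardware pci list -c 0x300 -m 0xff | passthru_nvidia_gpus
```

Any address printed by this filter belongs to a GPU that is handed to a VM through pass-through, which, as the reply explains, the host-side driver and nvidia-smi cannot service.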