
张桂荣

[Q&A]

vSphere shows a purple screen after installing the vGPU driver


GPU: GRID K1
Model: Dell PowerEdge R720
Driver: NVIDIA-VMware_ESXi_6.5_Host_Driver_367.64-1OEM.650.0.0.4240417-offline_bundle.zip
System: vSphere 6.5

The host shows a purple screen (PSOD) after I install the driver and reboot.
Earlier, I tried a different server (another R720) and different system versions (6.0 U2 and 6.5), but it always happens :(

How should I fix it? Thanks.

Steven

Replies (3)

萧治维

2018-9-20 12:00:24

Hi Steven

The driver you've listed above as being installed is the "Offline_Bundle" driver. I'm unsure what this driver is for or whether it will cause you any issues (I don't use it), but I have asked internally at NVIDIA for clarification and I'll post back when I have the information. If anyone else can answer that, please feel free to comment below.

You may want to try uninstalling that driver, rebooting the host and installing the other driver in the download: NVIDIA-kepler-VMware_ESXi_6.5_Host_Driver_367.64-1OEM.650.0.0.4240417.vib. This is the .vib that is in the main .zip file along with the Windows drivers and documentation.

If you'd like to wait for clarification on what the driver differences are before changing them, then that's fine and as said, I'll update as soon as I hear back.

If the host won't boot and keeps giving you the PSOD, you can try disabling the PCIe slot where the GPU is located so it can't be detected (or just temporarily remove the GPU from the host). Once the host has booted, manually uninstall the driver, then enable the PCIe slot / re-fit the GPU and install the new driver.
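For reference, the uninstall / reinstall sequence described above can be sketched roughly as follows. This is only a sketch under assumptions, not an official procedure: it builds the list of ESXi shell commands without running anything, the installed VIB name "NVIDIA-VMware_ESXi_6.5_Host_Driver" is a guess that should be confirmed with `esxcli software vib list`, and the /tmp location of the .vib is hypothetical. Note that `esxcli software vib install` takes a .vib with `-v` and an offline bundle .zip with `-d`.

```python
import shlex


def plan_driver_swap(old_vib_name, new_vib_path):
    """Return the ESXi shell commands for swapping the host driver, in order.

    Nothing is executed here; the list is meant to be reviewed and run by hand.
    """
    return [
        # Put the host into maintenance mode before touching VIBs.
        "esxcli system maintenanceMode set --enable true",
        # Confirm which NVIDIA VIB is installed and what its exact name is.
        "esxcli software vib list | grep -i nvidia",
        # Remove the installed driver by VIB name, then reboot.
        "esxcli software vib remove -n " + shlex.quote(old_vib_name),
        "reboot",
        # After the reboot, install the standalone .vib (-v needs an absolute path).
        "esxcli software vib install -v " + shlex.quote(new_vib_path),
        "reboot",
        # Leave maintenance mode once the host is back up.
        "esxcli system maintenanceMode set --enable false",
    ]


if __name__ == "__main__":
    commands = plan_driver_swap(
        # Assumed VIB name; check the output of `esxcli software vib list`.
        "NVIDIA-VMware_ESXi_6.5_Host_Driver",
        # Hypothetical upload location on the host.
        "/tmp/NVIDIA-kepler-VMware_ESXi_6.5_Host_Driver_367.64-1OEM.650.0.0.4240417.vib",
    )
    for cmd in commands:
        print(cmd)
```

If the host cannot boot with the GPU present, the removal step would be run while the PCIe slot is disabled or the card is out, and the install step only after the card is back in.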

Something else for you to check: is your R720 BIOS fully up to date? You can check this with "Lifecycle Manager" when you start your server, and check the Dell FTP site for updates for the system.

Regards

Ben

李阳

2018-9-20 12:07:06

Hi Ben

It's the same situation when I use the .vib on vSphere.
I have now removed the K1 GPU card and exported the vSphere logs.

Many Thanks

Steven

曾盼丽

2018-9-20 12:18:28

Hi Steven,

It's often worth checking for known configuration issues in our KB search: http://nvidia.custhelp.com/app/home/


A search on "PSOD GRID" shows:

http://nvidia.custhelp.com/app/answers/detail/a_id/4135/kw/grid%20psod


Can you check your MSI config?

Rachel
