We have upgraded an ESXi host to 6.5 and the VIB to the supported NVIDIA-kepler-vSphere-6.5-367.64-369.71 package downloaded from NVIDIA's website, but the base machine will not start with the GPU (PCI shared device) enabled. It complains that there is not enough GPU memory. Running 'nvidia-smi' on the host shows the cards:
nvidia-smi
Thu Nov 24 00:04:52 2016
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 367.64                 Driver Version: 367.64                    |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|===============================+======================+======================|
|   0  GRID K2             On   | 0000:05:00.0     Off |                  Off |
| N/A   25C    P8    28W / 117W |     18MiB /  4095MiB |      0%      Default |
+-------------------------------+----------------------+----------------------+
|   1  GRID K2             On   | 0000:06:00.0     Off |                  Off |
| N/A   23C    P8    27W / 117W |     18MiB /  4095MiB |      0%      Default |
+-------------------------------+----------------------+----------------------+
|   2  GRID K2             On   | 0000:84:00.0     Off |                  Off |
| N/A   26C    P8    28W / 117W |     18MiB /  4095MiB |      0%      Default |
+-------------------------------+----------------------+----------------------+
|   3  GRID K2             On   | 0000:85:00.0     Off |                  Off |
| N/A   24C    P8    27W / 117W |     18MiB /  4095MiB |      0%      Default |
+-------------------------------+----------------------+----------------------+
+-----------------------------------------------------------------------------+
| Processes:                                                       GPU Memory |
|  GPU       PID  Type  Process name                               Usage      |
|=============================================================================|
|    0     68574    G   Xorg                                             7MiB |
|    1     68600    G   Xorg                                             7MiB |
|    2     68641    G   Xorg                                             7MiB |
|    3     68660    G   Xorg                                             7MiB |
+-----------------------------------------------------------------------------+
[root@k2-3:~]
Um, Xorg? The older ESXi host doesn't show that. Output from 'gpuvm':
gpuvm
Xserver unix:0, PCI ID 0:5:0:0, vSGA mode, GPU maximum memory 4173824KB
GPU memory left 4173824KB.
Xserver unix:1, PCI ID 0:6:0:0, vSGA mode, GPU maximum memory 4173824KB
GPU memory left 4173824KB.
Xserver unix:2, PCI ID 0:132:0:0, vSGA mode, GPU maximum memory 4173824KB
GPU memory left 4173824KB.
Xserver unix:3, PCI ID 0:133:0:0, vSGA mode, GPU maximum memory 4173824KB
GPU memory left 4173824KB.
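For anyone comparing the two outputs, here is a minimal Python sketch that parses the gpuvm text above and converts it into the units nvidia-smi uses. It is only an illustration, not an NVIDIA or VMware tool. The regular expression is based solely on the lines shown in this post, and it assumes the gpuvm PCI ID fields are decimal segment:bus:device:function values (so 132 would be bus 0x84). The names parse_gpuvm and GPUVM_OUTPUT are made up for this example.

```python
import re

# gpuvm output copied from this post.
GPUVM_OUTPUT = """\
Xserver unix:0, PCI ID 0:5:0:0, vSGA mode, GPU maximum memory 4173824KB
GPU memory left 4173824KB.
Xserver unix:1, PCI ID 0:6:0:0, vSGA mode, GPU maximum memory 4173824KB
GPU memory left 4173824KB.
Xserver unix:2, PCI ID 0:132:0:0, vSGA mode, GPU maximum memory 4173824KB
GPU memory left 4173824KB.
Xserver unix:3, PCI ID 0:133:0:0, vSGA mode, GPU maximum memory 4173824KB
GPU memory left 4173824KB.
"""

# Assumption: each GPU is reported as two consecutive lines in exactly
# the format seen above; other gpuvm versions may print something else.
ENTRY_RE = re.compile(
    r"Xserver unix:(\d+), PCI ID (\d+):(\d+):(\d+):(\d+), (\w+) mode, "
    r"GPU maximum memory (\d+)KB\s+GPU memory left (\d+)KB"
)


def parse_gpuvm(text):
    """Return one dict per GPU entry found in gpuvm-style text."""
    gpus = []
    for match in ENTRY_RE.finditer(text):
        xserver, seg, bus, dev, func, mode, max_kb, left_kb = match.groups()
        gpus.append({
            "xserver": int(xserver),
            # Assumption: decimal PCI fields, rendered here in the
            # hexadecimal Bus-Id style that nvidia-smi prints.
            "bus_id": "{:04X}:{:02X}:{:02X}.{:X}".format(
                int(seg), int(bus), int(dev), int(func)),
            "mode": mode,
            "max_mib": int(max_kb) // 1024,
            "left_mib": int(left_kb) // 1024,
        })
    return gpus


if __name__ == "__main__":
    for gpu in parse_gpuvm(GPUVM_OUTPUT):
        print("unix:{xserver} {bus_id} {mode} "
              "max={max_mib}MiB left={left_mib}MiB".format(**gpu))
```

Under those assumptions, the four entries map to bus IDs 0000:05:00.0, 0000:06:00.0, 0000:84:00.0 and 0000:85:00.0, which line up with the nvidia-smi table, and every GPU is in vSGA mode with 4076 MiB maximum and 4076 MiB left.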
To me, this suggests the VIB is not correct, but it is the only one available on NVIDIA's website. Downgrading to NVIDIA-GRID-vGPU-kepler-vSphere-6.0-367.64-369.71 on the ESXi host allows the base machine to start with the GPU enabled, but View won't compose a pool because it does not recognize the older GPU.
Anyway, has anyone else upgraded their vSphere to 6.5 and run into this issue, or are we missing something simple?
Thanks.