英伟达
直播中

樊周依

7年用户 180经验值
私信 关注
[问答]

带有TESLA M60卡的DL380 Gen9的风扇速度和噪音怎么改善

你好,
我正在安装2台新的HP DL380 Gen9服务器,每台服务器都配有TESLA M60卡。
我安装了XenServer 7.2(包含所有当前的补丁)。
从我将这些M60卡从COMPUTE切换到GRAPHICS模式的那一刻起,FANS就会达到100%的速度并产生很多噪音。
我已经安装了NVIDIA GRID Manager 5.1版。
还有XenServer平台上的HP SNMP Agent,但每次重启后,在XenServer Hypervisor的启动阶段突然间,风扇再次以100%的速度运行。
在BIOS中,“最佳冷却”功能设置为默认值
我已将BIOS固件更新到最新版本P89 v2.52(10/25/2017),但仍然没有运气,在每次重启后,在XS启动期间,风扇以最大速度吹出一大堆
噪声。
有任何经验建议可以改善吗?
这是HP还是XenServer或Nvidia问题?
换句话说,联系谁来创建支持票。
谢谢,
克里斯马雷尔

以上来自于谷歌翻译


以下为原文

Hello,

I'm in the process of installing 2 new HP DL380 Gen9 servers each equipped with a TESLA M60-card.
I have installed XenServer 7.2 (with all the current patches).
And from the moment I have switched these M60-cards from COMPUTE to GRAPHICS mode the FANS go to 100% of their speed and making a lot of noise.

I have already installed the NVIDIA GRID Manager Version 5.1.
And also the HP SNMP Agents on the XenServer platform, but after each reboot, during the startup-phase of the XenServer Hypervisor at a sudden moment the fans are going to 100% speed again.
In the BIOS the 'optimal cooling' feature is set to the default

I have updated the BIOS-firmware to the last version P89 v2.52 (10/25/2017), but still no luck, after each reboot, during the startup of XS the fans are blowing at their maximum speed an making a lot of noise.

Any suggestions of experiences to improve this ?
Is this an HP or XenServer or Nvidia-issue?  In other words who to contact to create a support-ticket.

Thanks,
  Chris Marreel

回帖(8)

张变英

2018-10-10 17:20:58
嗨......你确定电缆是否能阻挡气流?
电源的功率是多少?
在带有1100 W电源的Dell R730服务器上,我看不到这样的情况。
你有很多其他外围设备可能会产生大量额外的热量吗?

以上来自于谷歌翻译


以下为原文

Hi... Are you sure the cables are such that they are not obstructing the airflow? What is the wattage of your power supplies? I don't see anything like this on Dell R730 servers with 1100 W power supplies.
Do you have a lot of other peripherals in there that may be contributing to a lot of extra heat?
举报

任黎平

2018-10-10 17:36:14
你好,
首先,我想知道你在哪里买了M60主板?
这些是直接来自惠普还是这些通用主板?
我认为这些是通用主板,惠普在其主板上有特定的vbios,所以请联系惠普。
只要该板被nvidia-smi识别并且GPU在空闲时没有100%GPU负载运行,我不明白为什么这应该是Nvidia问题。
问候
西蒙

以上来自于谷歌翻译


以下为原文

Hello,

first of all I would like to know where you bought the M60 boards? Are these directly from HP or are these generic boards? I would assume these are generic boards and HP has a specific vbios on their boards so please contact HP. As long as the board is recognized with nvidia-smi and the GPU is not running on 100% GPU load in idle I don't see why this should be a Nvidia issue.

Regards

Simon
举报

张鹏

2018-10-10 17:49:07
托比亚斯和西蒙你好,
此M60卡是此DL380 Gen9服务器中唯一的额外卡。
如果我们测试几次重新启动,那么在XenServer 7.2的启动阶段,风扇每次都会在同一时刻旋转到100%。
所以我的结论是:它与温度无关,只有一些“逻辑”认为风扇应该达到100%的速度(噪音是恼人的副作用)。
在该服务器中有2x 1400W电源,两者都运行“冗余”,目前仅提供409W。
因此,这些M60板的电源是正确的。
M60卡由惠普提供,nvidia-smi正在识别主板,目前我已经拥有了第一台使用M60卡的Win10工作站,所以一切运行正常,只有风扇速度和生成
噪音是一个问题。
如果没有其他想法,我会在HPE支持上记录一张票。
谢谢和问候, 
克里斯

以上来自于谷歌翻译


以下为原文

Hello Tobias and Simon,
This M60-card is the only extra card in this DL380 Gen9 server.  And the fans are spinning up to 100% during the startup-phase of the XenServer 7.2 each time at exact the same moment if we test a few reboots.  So my conclusion : it has nothing to do with temperature, only with some 'logic' that is thinking the fans should go to 100% speed (with the noise as annoying side effect).

In that server there are 2x 1400W power supply's, both running 'redundant' and at the moment only delivering 409W.  So the power supply's are correct for these M60-boards.

The M60-cards are delivered by HP, and the nvidia-smi is recognizing the board, and at the moment I already have my first Win10-station using the M60-card, so everything is running fine, only the fan speed and the generated noise is an issue.

If there are no other thoughts, I will log a ticket at HPE Support for this.

Thanks and greetings,
  Chris
举报

贾埃罗

2018-10-10 18:08:31
你好
我曾经在各种硬件上看过几次。
如果您还没有这样做,可以记下当前的BIOS配置(如果有任何特殊配置),然后将BIOS重置为出厂默认设置。
重置后,检查所有组件的所有相关“功率/性能”和“冷却”策略,它们都应设置为“平衡”。
尝试一下,看看它是否有帮助,让我们知道你是如何进行的......
问候


以上来自于谷歌翻译


以下为原文

Hi

I've seen this before a few times on various hardware. If you haven't done so already, can you make a note of the current BIOS configuration (in case there is anything special configured) and then reset the BIOS back to factory default. Once reset, check all the associated "Power / Performance" and "Cooling" policies for all components, they should all be set to something like "Balanced".

Give that a try and see if it helps, let us know how you get on ...


Regards

Ben
举报

更多回帖

发帖
×
20
完善资料,
赚取积分