Using NVIDIA GPUs for video transcoding in a virtualized environment

Hello, I have read some topics in this forum, and based on those Q&As I'll try to formulate my questions.

We currently run a virtualized environment on HP BL460 blade servers, using VMware vSphere 5.5, with external NetApp storage.
We would like to build two new VMs on top of it, running the Wowza video streaming software, that would use the GPU capability of the NVIDIA M6 (with NVENC video codec support). We are aware that HP WS460c blades must be used, because for some reason HP does not support the technically identical BL460. So we would buy two new WS460c blades. HA redundancy is the reason for two: in case of a host failure, the VM would restart on the other host.
These two VMs would use all of the GPU memory, but only a relatively small amount of host resources (4 vCPU and 32 GB RAM each). That is why we do not want to dedicate two hosts solely to this HPC-style real-time video transcoding.

https://www.wowza.com/forums/con ... th-Wowza-Transcoder

Could you please advise me on the following:
1. NVIDIA M6 software licensing is described only in desktop virtualization terms. The three licensing models refer to desktop virtualization only, and I have no clue how I would license a video transcoding VM, if at all.
2. Somewhere in this forum I read that HPC is not supported in a virtualized environment, but that comment was from 2014. Why not? Is that really still the case?

So, thank you very much, Daniel

HP WS460c datasheet with a list of supported GPUs:

https://www.google.cz/url?sa=t&a ... mp;bvm=bv.144224172,d.bGs

张涛

2018-9-30 10:53:48

Hi Daniel,

From the link provided I couldn't find out whether you're going to run the software on a client or a server OS.
In general this is a passthrough use case and needs a vWorkstation GRID license for a client OS. If you run it on a server OS, a vApps license might be sufficient, as you don't need the Quadro features, and you would also be able to use passthrough with vApps and a server OS.

Regards

Simon

张超

2018-9-30 11:08:02

More information/clarification needed:

Wowza Transcoder:

1a) only NVIDIA NVENC accelerated encoding
1b) NVIDIA NVENC accelerated encoding + NVIDIA CUDA accelerated video scaling

If 1a) vGPU can be used.
If 1b) only passthrough option (or vGPU profile M6-8Q) is available due to CUDA requirements.

Hardware setup:

2a) One M6 mezzanine (805132-B21) with an M6 (only one MXM mezzanine can be installed).
2b) Partner PCIe blade (Expansion Blade) (836738-B21+775168-B21) with a maximum of 2x multi-GPU carrier (4x MXM, but a maximum of 2x MXM M6 installed in one card) (805133-B21).
2c) Partner PCIe blade (Expansion Blade) (836738-B21+775168-B21) and 2x Tesla P4.

If 2a), two guests must share one M6 (e.g. M6-4Q) and must be 1a) only (no CUDA).
If 2b), the passthrough option can be used (with 1-2 cards, depending on the required performance and on whether multi-GPU is supported in Wowza Transcoder).
2c) can be mentioned as a passthrough option (see HPC). This is a new card and it is not in the support matrix yet, but it should be the cheaper solution.
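
The rules above (1a/1b combined with 2a-2c) can be written down as a small lookup. This is only a sketch of the reasoning in this post, not an NVIDIA or HPE tool; the function name and the returned strings are made up for illustration.

```python
def gpu_attachment(needs_cuda: bool, hardware: str) -> str:
    """Map the transcoder mode (1a: NVENC only, 1b: NVENC + CUDA scaling)
    and the hardware option (2a/2b/2c) to the attachment suggested in this post.

    Hypothetical helper: names and wording are illustrative only.
    """
    if hardware == "2a":
        # One M6 per blade: two guests have to share it through vGPU,
        # which rules out the CUDA-based scaling of case 1b.
        if needs_cuda:
            return "not possible: two guests sharing one M6 cannot use CUDA"
        return "vGPU, shared M6 (e.g. M6-4Q)"
    if hardware == "2b":
        # Expansion blade with M6 cards: give each guest its own card.
        return "passthrough, 1-2 M6 cards"
    if hardware == "2c":
        # Expansion blade with Tesla P4: passthrough, outside the support matrix.
        return "passthrough, Tesla P4 (not in support matrix)"
    raise ValueError(f"unknown hardware option: {hardware!r}")


if __name__ == "__main__":
    for hw in ("2a", "2b", "2c"):
        for cuda in (False, True):
            print(hw, "1b" if cuda else "1a", "->", gpu_attachment(cuda, hw))
```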

Performance, memory and codec requirements (https://developer.nvidia.com/nvenc-application-note): the FPS, resolution, quality and latency needed for encoding should be determined:

3a) 1x M6 ~ 430 FPS (1920x1080/YUV 4:2:0, High Performance, H.264) * 2 encoders * 0.8 (underclocked M6) = 688 FPS
3b) 1x P4 ~ 648 FPS (1920x1080/YUV 4:2:0, High Performance, H.264) * 2 encoders * 0.7 (underclocked P4) = 907 FPS

(The underclocking ratio is estimated and unverified; NVIDIA did not publish accurate benchmarks per card model. UPDATE: NVIDIA has published a detailed performance list, but it does not cover the impact of underclocking on these cards: https://developer.nvidia.com/nvidia-video-codec-sdk#NVENCPerf)
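
The arithmetic behind 3a) and 3b) can be reproduced with a few lines of Python. This is a rough sketch: the per-encoder FPS figures and the derating factors are the unverified estimates from this post, and the 30 FPS per 1080p stream target is an assumption added here for illustration. It counts encoding throughput only, not decoding or scaling.

```python
def estimated_fps(fps_per_encoder: float, encoders: int, derate: float) -> float:
    """Aggregate encode throughput: per-encoder FPS * encoder count * derating factor."""
    return fps_per_encoder * encoders * derate


def max_streams(total_fps: float, stream_fps: float = 30.0) -> int:
    """Whole number of streams that fit, assuming each needs stream_fps (assumed 30)."""
    return int(total_fps // stream_fps)


# Figures quoted in the post (the 0.8 and 0.7 derating factors are unverified estimates).
m6_fps = estimated_fps(430, encoders=2, derate=0.8)
p4_fps = estimated_fps(648, encoders=2, derate=0.7)

if __name__ == "__main__":
    for name, fps in (("M6", m6_fps), ("P4", p4_fps)):
        print(f"{name}: {fps:.0f} FPS total, about {max_streams(fps)} streams at 30 FPS")
```

The real number of output renditions a card can handle also depends on resolution, preset and quality settings, so a proof of concept is still needed.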

Licenses (http://images.nvidia.com/content/pdf/grid/guides/GRID-Packaging-and-Licensing-Guide.pdf, http://images.nvidia.com/content/pdf/grid/guides/GRID-Licensing-Guide.pdf):

- vApps - for vGPU (M6-*A) and passthrough. Warning: I do not know, and have not tested, whether the vApps license limits the number of encoder sessions (GeForce cards, i.e. cards without Quadro features, have an enforced software limit of 2 encoder sessions per system for the Video SDK).
- vWS - for vGPU (M6-*Q) and passthrough.
- Without a license - switch to "Tesla" (and use some vSGA as the primary display).
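
Combined with Simon's reply above, the license choice can be sketched as a small function. This only restates what the two replies say (profile suffix A maps to vApps, suffix Q to vWS; passthrough needs vWS on a client OS, while vApps might be sufficient on a server OS). The function name is hypothetical, and the result should be checked against the licensing guides linked above.

```python
def suggest_license(attachment: str, server_os: bool) -> str:
    """Suggest a GRID license edition.

    attachment: "passthrough" or a vGPU profile name such as "M6-4Q" or "M6-2A".
    server_os:  True if the guest runs a server OS, False for a client OS.
    Hypothetical helper that only encodes the statements made in this thread.
    """
    if attachment.lower() == "passthrough":
        # Per Simon: vWS for a client OS; vApps might be sufficient for a server OS.
        return "vApps" if server_os else "vWS"
    suffix = attachment[-1].upper()
    if suffix == "A":
        return "vApps"
    if suffix == "Q":
        return "vWS"
    raise ValueError(f"unrecognized vGPU profile: {attachment!r}")


if __name__ == "__main__":
    print(suggest_license("passthrough", server_os=True))
    print(suggest_license("M6-4Q", server_os=True))
```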

An HPC/Tesla card presents a large memory region to the PCIe space, and this is/was a problem for VMware ESXi because, for an unknown reason, ESXi can handle only 32-bit PCIe space (but this should not be needed for passthrough). There is at least one success story (https://cto.vmware.com/gpgpu-computing-with-the-nvidia-k80-on-vmware-vsphere-6/). If your questions are not specific to GRID and licensing, you can also try the forum for NVIDIA Video (SDK) technologies (https://devtalk.nvidia.com/default/board/175/), in case there are customers with HPC setups for the Video SDK.

Most local HPE representatives or HPE partners should be able to help you organize a POC.

张玉梅

2018-9-30 11:20:37

Hello gentlemen, I'm pleasantly surprised to have received two relevant answers right after returning to work after the weekend. Thank you very much for that!
Now I'm going to dive into the topic; it looks like it is going to be fun. :)
