DPDK in an Azure Linux VM | Microsoft Learn

Minimum supported operating system versions

The following distributions from Azure Marketplace are supported:

Linux OS

The versions listed are the minimum requirements. Newer versions are also supported.

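Custom kernel support

For any Linux kernel version that isn't listed, see Patches for building an Azure-tuned Linux kernel. For more information, you can also contact aznetdpdk@microsoft.com.

All Azure regions support DPDK.

Accelerated networking must be enabled on the Linux virtual machine. The virtual machine should have at least two network interfaces, one of which is used for management. Enabling accelerated networking on the management interface isn't recommended. Learn how to create a Linux virtual machine with accelerated networking enabled.

On virtual machines that use InfiniBand, make sure the appropriate mlx4_ib or mlx5_ib drivers are loaded (see Enable InfiniBand).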

Install DPDK via system package (recommended)

Ubuntu 18.04

sudo add-apt-repository ppa:canonical-server/server-backports -y
sudo apt-get update
sudo apt-get install -y dpdk
Ubuntu 20.04 and newer
sudo apt-get install -y dpdk
Debian 10 and newer
sudo apt-get install -y dpdk
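The per-distribution steps above can be collected into one small helper. This is only a sketch: dpdk_pkg_steps is a hypothetical name, it prints the commands rather than running them, and it covers only the distributions listed in this section.

```shell
#!/bin/sh
# Print the package-install steps from this section for a given distribution.
# dpdk_pkg_steps is a hypothetical helper name, not part of DPDK or Azure tooling.
dpdk_pkg_steps() {
    id="$1"              # distribution ID, e.g. "ubuntu" or "debian"
    major="${2%%.*}"     # major version, e.g. "18" from "18.04"

    # Reject an empty or non-numeric major version.
    case "$major" in
        ''|*[!0-9]*) return 1 ;;
    esac

    case "$id" in
        ubuntu)
            if [ "$major" -eq 18 ]; then
                # Ubuntu 18.04 needs the backports PPA first.
                echo "sudo add-apt-repository ppa:canonical-server/server-backports -y"
                echo "sudo apt-get update"
            elif [ "$major" -lt 20 ]; then
                return 1     # not covered by this section
            fi
            echo "sudo apt-get install -y dpdk"
            ;;
        debian)
            [ "$major" -ge 10 ] || return 1
            echo "sudo apt-get install -y dpdk"
            ;;
        *)
            return 1         # use the manual installation steps instead
            ;;
    esac
}
```

On a real system the two arguments could come from /etc/os-release, for example: . /etc/os-release; dpdk_pkg_steps "$ID" "$VERSION_ID"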
Install DPDK manually (not recommended)
Install build dependencies
Ubuntu 18.04
sudo add-apt-repository ppa:canonical-server/server-backports -y
sudo apt-get update
sudo apt-get install -y build-essential librdmacm-dev libnuma-dev libmnl-dev meson
Ubuntu 20.04 and newer
sudo apt-get install -y build-essential librdmacm-dev libnuma-dev libmnl-dev meson
Debian 10 and newer
sudo apt-get install -y build-essential librdmacm-dev libnuma-dev libmnl-dev meson
RHEL7.5/CentOS 7.5
sudo yum -y groupinstall "Infiniband Support"
sudo dracut --add-drivers "mlx4_en mlx4_ib mlx5_ib" -f
sudo yum install -y gcc kernel-devel-$(uname -r) numactl-devel.x86_64 librdmacm-devel libmnl-devel meson
SLES 15 SP1
Azure kernel
zypper  \
  --no-gpg-checks \
  --non-interactive \
  --gpg-auto-import-keys install kernel-azure kernel-devel-azure gcc make libnuma-devel numactl librdmacm1 rdma-core-devel meson
Default kernel
zypper \
  --no-gpg-checks \
  --non-interactive \
  --gpg-auto-import-keys install kernel-default-devel gcc make libnuma-devel numactl librdmacm1 rdma-core-devel meson
Compile and install DPDK manually
Download the latest DPDK. Azure requires version 19.11 LTS or newer.
Generate the default configuration with meson builddir.
Compile with ninja -C builddir.
Install with DESTDIR=<output folder> ninja -C builddir install.
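The build steps above can be chained in a short script so that a failed step stops the sequence. This is a minimal sketch that assumes it is run from the top of an unpacked DPDK source tree. The names run, build_dpdk, and DRY_RUN are hypothetical and exist only for this illustration.

```shell
#!/bin/sh
# Sketch of the manual build steps, run from the top of a DPDK source tree.
# run, build_dpdk and DRY_RUN are hypothetical names used only here.
run() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "+ $*"        # dry run: only print the command
    else
        "$@"               # run the command for real
    fi
}

build_dpdk() {
    out_dir="$1"           # the <output folder> from the steps above
    # Generate the default configuration, compile, then install under DESTDIR.
    run meson builddir &&
    run ninja -C builddir &&
    run env DESTDIR="$out_dir" ninja -C builddir install
}
```

Setting DRY_RUN=1 before calling build_dpdk prints the three commands without executing them, which is a cheap way to review the sequence first.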
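Configure the runtime environment
After restarting, run the following commands once:
Hugepages: configure hugepages by running the following command, once for each NUMA node: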
echo 1024 | sudo tee /sys/devices/system/node/node*/hugepages/hugepages-2048kB/nr_hugepages
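Create a directory for mounting with mkdir /mnt/huge.
Mount the hugepages with mount -t hugetlbfs nodev /mnt/huge.
Check that the hugepages are reserved with grep Huge /proc/meminfo.
[Note] You can modify the grub file so that hugepages are reserved at boot by following the instructions for DPDK. The instructions are at the bottom of the page. If you're using an Azure Linux virtual machine, modify the files under /etc/config/grub.d instead to reserve hugepages across reboots.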
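Writing 1024 to nr_hugepages for 2048 kB pages reserves 1024 × 2048 kB = 2 GiB on each NUMA node. The grep check can be turned into a number with a small helper. The sketch below uses a hypothetical function name, hugepages_reserved_kb, and reads the HugePages_Total and Hugepagesize fields that /proc/meminfo reports.

```shell
#!/bin/sh
# Report how much memory is reserved as hugepages, from meminfo-style input.
# hugepages_reserved_kb is a hypothetical helper name for this sketch.
hugepages_reserved_kb() {
    meminfo="${1:-/proc/meminfo}"    # optional file argument, mainly for testing
    awk '
        /^HugePages_Total:/ { pages = $2 }      # number of reserved pages
        /^Hugepagesize:/    { size_kb = $2 }    # size of one page in kB
        END { print pages * size_kb }           # total reserved memory in kB
    ' "$meminfo"
}
```

/proc/meminfo reports the total across all NUMA nodes, so on a machine with two nodes the expected result is twice the per-node amount.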
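MAC and IP addresses: Use ifconfig -a to view the MAC and IP addresses of the network interfaces. The VF network interface and the NETVSC network interface have the same MAC address, but only the NETVSC network interface has an IP address. The VF interface runs as a subordinate interface of the NETVSC interface.
PCI addresses
Use ethtool -i <vf interface name> to find out which PCI address to use for the VF.
If eth0 has accelerated networking enabled, make sure that testpmd doesn't accidentally take over the VF PCI device for eth0. If the DPDK application accidentally takes over the management network interface and causes you to lose your SSH connection, use the serial console to stop the DPDK application. You can also use the serial console to stop or start the virtual machine.
Load ibuverbs after each reboot. On SLES 15 only, also load mlx4_ib.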
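The PCI address that ethtool -i reports appears in its bus-info field. A minimal sketch for pulling out just that value follows. The function name pci_addr_from_ethtool is hypothetical, and the helper only filters text, so it is safe to try on any interface.

```shell
#!/bin/sh
# Extract the PCI address (the "bus-info" field) from `ethtool -i` output on stdin.
# pci_addr_from_ethtool is a hypothetical helper name for this sketch.
pci_addr_from_ethtool() {
    awk '$1 == "bus-info:" { print $2 }'
}
```

Typical use would be ethtool -i <vf interface name> | pci_addr_from_ethtool. Comparing the result with the address reported for the VF that is paired with eth0 is one way to confirm that the management interface is not about to be handed to testpmd.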

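Failsafe PMD
DPDK applications must run over the failsafe PMD that is exposed in Azure. If an application runs directly over the VF PMD, it doesn't receive all the packets destined for the VM, because some packets show up over the synthetic interface.
Running a DPDK application over the failsafe PMD guarantees that the application receives all the packets destined for the VM. It also ensures that the application keeps running in DPDK mode even if the VF is revoked while the host is being serviced. For more information about the failsafe PMD, see Fail-safe poll mode driver library.
Run testpmd
To run testpmd in root mode, use sudo before the testpmd command.
Basic: Sanity check, failsafe adapter initialization
Run the following command to start a single-port testpmd application: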

testpmd -w <pci address from previous step> \
  --vdev="net_vdev_netvsc0,iface=eth1" \
  -- -i \
  --port-topology=chained
Run the following command to start a dual-port testpmd application:
testpmd -w <pci address nic1> \
-w <pci address nic2> \
--vdev="net_vdev_netvsc0,iface=eth1" \
--vdev="net_vdev_netvsc1,iface=eth2" \
-- -i
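To run testpmd with more than two NICs, the --vdev argument follows this pattern: net_vdev_netvsc<id>,iface=<vf's pairing eth>.
After it starts, run show port info all to check the port information. You should see one or two DPDK ports that are net_failsafe (not net_mlx4).
Use start <port> / stop <port> to start or stop traffic.
The preceding commands start testpmd in interactive mode, which is the recommended mode for trying out testpmd commands.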
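The net_vdev_netvsc<id>,iface=<eth> pattern described above extends naturally to any number of NICs. The sketch below builds the interactive command line from a list of pairs. testpmd_cmd is a hypothetical helper, each argument is assumed to have the form <pci address>,<iface>, and the function only prints the command so that it can be reviewed before being run.

```shell
#!/bin/sh
# Build a testpmd command line for any number of NICs, following the
# net_vdev_netvsc<id>,iface=<eth> pattern from this section.
# testpmd_cmd is a hypothetical helper; each argument is "<pci address>,<iface>".
testpmd_cmd() {
    allow=""
    vdevs=""
    id=0
    for pair in "$@"; do
        pci="${pair%%,*}"      # text before the first comma
        iface="${pair#*,}"     # text after the first comma
        allow="$allow -w $pci"
        vdevs="$vdevs --vdev=net_vdev_netvsc$id,iface=$iface"
        id=$((id + 1))         # one vdev id per NIC, starting at 0
    done
    echo "testpmd$allow$vdevs -- -i"
}
```

For example, testpmd_cmd <pci address nic1>,eth1 <pci address nic2>,eth2 prints the same arguments as the dual-port command in this section, and a third pair adds a third -w and --vdev argument.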
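Basic: Single sender/single receiver
The following commands periodically print the packets-per-second statistics:
On the TX side, run the following command: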
testpmd \
  -l <core-list> \
  -n <num of mem channels> \
  -w <pci address of the device you plan to use> \
  --vdev="net_vdev_netvsc<id>,iface=<the iface to attach to>" \
  -- --port-topology=chained \
  --nb-cores <number of cores to use for test pmd> \
  --forward-mode=txonly \
  --eth-peer=<port id>,<receiver peer MAC address> \
  --stats-period <display interval in seconds>
On the RX side, run the following command:
testpmd \
  -l <core-list> \
  -n <num of mem channels> \
  -w <pci address of the device you plan to use> \
  --vdev="net_vdev_netvsc<id>,iface=<the iface to attach to>" \
  -- --port-topology=chained \
  --nb-cores <number of cores to use for test pmd> \
  --forward-mode=rxonly \
  --eth-peer=<port id>,<sender peer MAC address> \
  --stats-period <display interval in seconds>
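To run the preceding commands on virtual machines, first change IP_SRC_ADDR and IP_DST_ADDR in the testpmd source code to match the actual IP addresses of the virtual machines, and then compile. Otherwise, the packets are dropped before they reach the receiver.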
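When editing IP_SRC_ADDR and IP_DST_ADDR, it can help to have each address as a single 32-bit value. The sketch below converts a dotted-quad IPv4 address to hexadecimal. It assumes, which the text above does not state, that the macros take a 32-bit IPv4 value in host byte order, so check how your DPDK version defines them before using the result. ipv4_to_hex is a hypothetical helper name.

```shell
#!/bin/sh
# Convert a dotted-quad IPv4 address to a 32-bit hexadecimal value.
# ipv4_to_hex is a hypothetical helper name for this sketch.
# The body runs in a subshell so the IFS change does not leak to the caller.
ipv4_to_hex() (
    set -f                 # no filename expansion while splitting
    IFS=.
    set -- $1              # split the address into its octets
    [ "$#" -eq 4 ] || exit 1
    printf '0x%08X\n' $(( ($1 << 24) | ($2 << 16) | ($3 << 8) | $4 ))
)
```

For example, ipv4_to_hex 10.0.1.4 prints 0x0A000104. The helper does not validate that each octet is in the range 0 to 255.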

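Advanced: Single sender/single forwarder
The following commands periodically print the packets-per-second statistics:
On the TX side, run the following command: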
testpmd \
  -l <core-list> \
  -n <num of mem channels> \
  -w <pci address of the device you plan to use> \
  --vdev="net_vdev_netvsc<id>,iface=<the iface to attach to>" \
  -- --port-topology=chained \
  --nb-cores <number of cores to use for test pmd> \
  --forward-mode=txonly \
  --eth-peer=<port id>,<receiver peer MAC address> \
  --stats-period <display interval in seconds>
On the FWD side, run the following command (you need as many --vdev arguments as the number of devices used by testpmd, two in this case):
testpmd \
  -l <core-list> \
  -n <num of mem channels> \
  -w <pci address NIC1> \
  -w <pci address NIC2> \
  --vdev="net_vdev_netvsc<id>,iface=<the iface to attach to>" \
  --vdev="net_vdev_netvsc<2nd id>,iface=<2nd iface to attach to>" \
  -- --nb-cores <number of cores to use for test pmd> \
  --forward-mode=io \
  --eth-peer=<recv port id>,<sender peer MAC address> \
  --stats-period <display interval in seconds>
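To run the preceding commands on virtual machines, first change IP_SRC_ADDR and IP_DST_ADDR in the testpmd source code to match the actual IP addresses of the virtual machines, and then compile. Otherwise, the packets are dropped before they reach the forwarder. You can't have a third machine receive the forwarded traffic, because the testpmd forwarder doesn't modify the layer-3 addresses unless you make some code changes.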

EAL options
Testpmd commands
Packet dump commands