NVIDIA Transitions Fully Towards Open-Source Linux GPU Kernel Modules

原始链接: https://developer.nvidia.com/blog/nvidia-transitions-fully-towards-open-source-gpu-kernel-modules/

In May 2022, NVIDIA released open-source Linux GPU kernel modules with the R515 driver under dual GPL and MIT licensing. The initial release targeted datacenter GPUs, with GeForce and workstation GPUs in an alpha state; the goal was to deliver full, high-performance Linux support for GeForce and workstation GPUs in later releases. Since then, the open-source GPU kernel modules have shown equivalent or better application performance and gained additional capabilities such as heterogeneous memory management (HMM) and confidential computing.

Now, after two years of development, NVIDIA plans to use the open-source GPU kernel modules exclusively, starting with the upcoming R560 driver release. Not every GPU can use these open-source modules, due to compatibility constraints. Cutting-edge platforms such as NVIDIA Grace Hopper and Blackwell require them. Newer GPUs based on the Turing, Ampere, Ada Lovelace, or Hopper architectures should make the switch. Older GPUs based on the Maxwell, Pascal, or Volta architectures cannot use the open-source modules and should stay on the proprietary driver, as should mixed deployments that combine older and newer GPUs in the same system.

For those unsure which driver to use, NVIDIA provides a helper script to guide the choice. In addition, the default driver installed by all installation methods is switching from proprietary to open source. Some scenarios need special attention, such as using package managers with the CUDA metapackage or other non-standard procedures. There are also minor changes to the installation flow when selecting the proprietary driver option in the runfile or when using automation tools such as Ansible. For further guidance, see the "Using the installation helper script" section of this post and NVIDIA's driver installation documentation.

Recently, Nvidia announced plans to open-source part of its graphics driver, specifically the kernel portion, while keeping most of the user-space driver closed. The move promises benefits such as greater kernel-side flexibility, allowing features like clock recovery inside Nvidia GPUs, but it also brings challenges such as an unstable application binary interface (ABI) and large firmware files. These issues make it harder for developers to update and test open-source driver builds, and firmware updates must be merged with extra care.

The decision appears to be driven largely by historical reasons rather than an outright aversion to open source. Early Nvidia products used the same hardware in professional and consumer cards, which led users to flash the professional BIOS onto the cheaper consumer cards. To protect sales, Nvidia took countermeasures such as embedding identifying resistors on the board, and eventually developed more sophisticated ways to block BIOS flashing. Now that professional and gaming cards use different hardware, Nvidia can safely begin shipping open-source drivers without worrying about BIOS compatibility issues.

Original text

With the R515 driver, NVIDIA released a set of Linux GPU kernel modules in May 2022 as open source with dual GPL and MIT licensing. The initial release targeted datacenter compute GPUs, with GeForce and Workstation GPUs in an alpha state. 

At the time, we announced that more robust and fully-featured GeForce and Workstation Linux support would follow in subsequent releases and the NVIDIA Open Kernel Modules would eventually supplant the closed-source driver. 

NVIDIA GPUs share a common driver architecture and capability set. The same driver for your desktop or laptop runs the world’s most advanced AI workloads in the cloud. It’s been incredibly important to us that we get it just right. 

Two years on, we’ve achieved equivalent or better application performance with our open-source GPU kernel modules and added substantial new capabilities:

  • Heterogeneous memory management (HMM) support
  • Confidential computing
  • The coherent memory architectures of our Grace platforms
  • And more

We’re now at a point where transitioning fully to the open-source GPU kernel modules is the right move, and we’re making that change in the upcoming R560 driver release.

Supported GPUs

Not every GPU is compatible with the open-source GPU kernel modules.

For cutting-edge platforms such as NVIDIA Grace Hopper or NVIDIA Blackwell, you must use the open-source GPU kernel modules. The proprietary drivers are unsupported on these platforms.

For newer GPUs from the Turing, Ampere, Ada Lovelace, or Hopper architectures, NVIDIA recommends switching to the open-source GPU kernel modules.

For older GPUs from the Maxwell, Pascal, or Volta architectures, the open-source GPU kernel modules are not compatible with your platform. Continue to use the NVIDIA proprietary driver.

For mixed deployments with older and newer GPUs in the same system, continue to use the proprietary driver.
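The selection rules above can be sketched as a small shell helper. This is purely illustrative of the decision table, assuming architecture names as lowercase strings; it is not NVIDIA's actual detection script (that is covered in the next section).

```shell
# Illustrative mapping of GPU architecture to the recommended kernel
# module type, per the rules above. Hypothetical helper, not an NVIDIA tool.
recommend_kernel_modules() {
  case "$1" in
    # Open kernel modules are required on the newest platforms.
    grace-hopper|blackwell) echo "open (required)" ;;
    # Open modules are recommended from Turing onward.
    turing|ampere|ada-lovelace|hopper) echo "open (recommended)" ;;
    # Pre-Turing architectures must stay on the proprietary driver.
    maxwell|pascal|volta) echo "proprietary" ;;
    *) echo "unknown architecture: $1" >&2; return 1 ;;
  esac
}

recommend_kernel_modules ampere   # -> open (recommended)
recommend_kernel_modules pascal   # -> proprietary
```

Mixed deployments fall outside this per-GPU mapping: if any GPU in the system resolves to "proprietary", the whole system stays on the proprietary driver.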

If you are not sure, NVIDIA provides a new detection helper script to help guide you on which driver to pick. For more information, see the Using the installation helper script section later in this post.

Installer changes

In general, the default version of the driver installed by all installation methods is switching from the proprietary driver to the open-source driver. There are a few specific scenarios that deserve special attention:

  • Package managers with the CUDA metapackage
  • Runfile
  • Installation helper script
  • Package manager details
  • Windows Subsystem for Linux
  • CUDA Toolkit

Using package managers with the CUDA metapackage

When you are installing CUDA Toolkit using a package manager (not the .run file), installation metapackages exist and are commonly used. By installing a top-level cuda package, you install a combination of CUDA Toolkit and the associated driver release. For example, by installing cuda during the CUDA 12.5 release time frame, you get the proprietary NVIDIA driver 555 along with CUDA Toolkit 12.5. 

Figure 1 shows this package structure.

Previously, using the open-source GPU kernel modules meant that you could not use the top-level metapackage. You had to install the distro-specific NVIDIA driver open package along with the cuda-toolkit-X-Y package of your choice.

Beginning with the CUDA 12.6 release, the flow effectively switches places (Figure 2).
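The boundary can be encoded in a small shell helper (a hypothetical sketch, not an NVIDIA tool) that answers which driver flavor the top-level cuda metapackage pulls in by default for a given toolkit release:

```shell
# Hypothetical helper: which driver flavor does the top-level `cuda`
# metapackage install by default for a given CUDA Toolkit release?
# Per this post: proprietary before 12.6, open from 12.6 onward.
default_driver_flavor() {
  local version="$1"               # e.g. "12.5" or "12.6"
  local major="${version%%.*}"     # text before the first dot
  local minor="${version#*.}"      # text after the first dot
  if [ "$major" -gt 12 ] || { [ "$major" -eq 12 ] && [ "$minor" -ge 6 ]; }; then
    echo "open"
  else
    echo "proprietary"
  fi
}

default_driver_flavor 12.5   # -> proprietary
default_driver_flavor 12.6   # -> open
```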

Using the runfile

If you install CUDA or the NVIDIA drivers using the .run file, the installer queries your hardware and automatically installs the best-fit driver for your system. UI toggles are also available to select between the proprietary driver and the open source driver, as you choose.

If you’re installing through the CUDA .run file and using the ncurses user interface, you now see a menu similar to the following:

┌──────────────────────────────────────────────────────────────────────────────┐
│ CUDA Driver                                                                  │
│   [ ] Do not install any of the OpenGL-related driver files                  │
│   [ ] Do not install the nvidia-drm kernel module                            │
│   [ ] Update the system X config file to use the NVIDIA X driver             │
│ - [X] Override kernel module type                                            │
│      [X] proprietary                                                         │
│      [ ] open                                                                │
│   Change directory containing the kernel source files                        │
│   Change kernel object output directory                                      │
│   Done                                                                       │
│                                                                              │
│                                                                              │
│                                                                              │
│ Up/Down: Move | Left/Right: Expand | 'Enter': Select | 'A': Advanced options │
└──────────────────────────────────────────────────────────────────────────────┘

If you’re installing through the driver .run file, you see a similar choice presented (Figure 3).

You can also pass overrides using the command line to install without the user interface or if you are using automation tools such as Ansible.

# sh ./cuda_12.6.0_560.22_linux.run --override --kernel-module-type=proprietary

# sh ./NVIDIA-Linux-x86_64-560.run --kernel-module-type=proprietary
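For automation, it can help to build the runfile invocation from a variable rather than hard-coding it. The following is a hypothetical wrapper (the function name and validation are assumptions; only the `--kernel-module-type` flag comes from this post), which returns the command string so a tool like Ansible can run or log it:

```shell
# Hypothetical wrapper for unattended installs: builds the runfile
# command line for a chosen kernel module type. It echoes the command
# rather than running it, so it can be inspected or passed to automation.
build_runfile_cmd() {
  local runfile="$1" module_type="$2"   # module_type: open | proprietary
  case "$module_type" in
    open|proprietary) ;;
    *) echo "module type must be 'open' or 'proprietary'" >&2; return 1 ;;
  esac
  echo "sh ./$runfile --kernel-module-type=$module_type"
}

build_runfile_cmd NVIDIA-Linux-x86_64-560.run open
# -> sh ./NVIDIA-Linux-x86_64-560.run --kernel-module-type=open
```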

Using the installation helper script

As mentioned earlier, if you’re unsure which driver to pick for the GPUs in your system, NVIDIA created a helper script to guide you through the selection process. 

To use it, first install the nvidia-driver-assistant package with your package manager, then run the script:

$ nvidia-driver-assistant

Package manager details

For a consistent experience, NVIDIA recommends that you use package managers to install CUDA Toolkit and the drivers. However, the specific details of which package management systems are used by different distributions or how packages are structured can vary depending on your particular distribution. 

This section outlines the specific details, caveats, or migration steps needed for various platforms. 

apt: Ubuntu and Debian-based distributions

Run the following command:

$ sudo apt-get install nvidia-open

To upgrade using the cuda metapackage on Ubuntu 20.04, first switch to open kernel modules:

$ sudo apt-get install -V nvidia-kernel-source-open

$ sudo apt-get install nvidia-open

dnf: Red Hat Enterprise Linux, Fedora, Kylin, Amazon Linux, or Rocky Linux

Run the following command:

$ sudo dnf module install nvidia-driver:open-dkms

To upgrade using the cuda metapackage on dnf-based distros, module streams must be disabled:

$ echo "module_hotfixes=1" | sudo tee -a /etc/yum.repos.d/cuda*.repo
$ sudo dnf install --allowerasing nvidia-open
$ sudo dnf module reset nvidia-driver

zypper: SUSE Linux Enterprise Server, or OpenSUSE

Run one of the following commands:

# default kernel flavor
$ sudo zypper install nvidia-open
# azure kernel flavor (sles15/x86_64)
$ sudo zypper install nvidia-open-azure
# 64kb kernel flavor (sles15/sbsa) required for Grace-Hopper
$ sudo zypper install nvidia-open-64k
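The flavor-to-package mapping above can be captured in a small helper (a hypothetical sketch; the package names are the ones listed in this post):

```shell
# Hypothetical helper: map a SUSE kernel flavor to the matching
# open kernel module package named above.
zypper_open_package() {
  case "$1" in
    default) echo "nvidia-open" ;;        # standard SLES/openSUSE kernel
    azure)   echo "nvidia-open-azure" ;;  # azure kernel flavor (sles15/x86_64)
    64k)     echo "nvidia-open-64k" ;;    # 64 kB pages (sles15/sbsa, Grace Hopper)
    *) echo "unknown kernel flavor: $1" >&2; return 1 ;;
  esac
}
```

For example, `sudo zypper install "$(zypper_open_package azure)"` installs the azure-flavor package.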

Package manager summary

For simplification, we’ve condensed the package manager recommendations in table format. All releases beyond driver version 560 and CUDA Toolkit 12.6 will use these packaging conventions.

Distro              Install the latest                           Install a specific release
Fedora/RHEL/Kylin   dnf module install nvidia-driver:open-dkms   dnf module install nvidia-driver:560-open
openSUSE/SLES       zypper install nvidia-open{-azure,-64k}      zypper install nvidia-open-560{-azure,-64k}
Debian              apt-get install nvidia-open                  apt-get install nvidia-open-560
Ubuntu              apt-get install nvidia-open                  apt-get install nvidia-open-560

Table 1. Package manager installation recommendations

For more information, see NVIDIA Datacenter Drivers.

Windows Subsystem for Linux

Windows Subsystem for Linux (WSL) uses the NVIDIA kernel driver from the host Windows operating system. You shouldn’t install any driver into this platform specifically. If you are using WSL, no change or action is required.

CUDA Toolkit

The installation of CUDA Toolkit remains unchanged through package managers. Run the following command:

$ sudo apt-get/dnf/zypper install cuda-toolkit

More information

For more information about how to install NVIDIA drivers or the CUDA Toolkit, including how to ensure that you install the proprietary drivers if you’re unable to migrate to the open-source GPU kernel modules at this time, see Driver Installation in the CUDA Installation Guide.
