Summary
DeepEP expert-parallel NVSHMEM kernels are not available on GB10 (sm_121a). The kernel build system hardcodes 9.0a (Hopper) and 10.0a (datacenter Blackwell) in its arch list, excluding sm_121a.
Impact
- Expert parallelism via DeepEP is unavailable
- Standard tensor-parallel inference is unaffected
- Attempting to use DeepEP kernels will fail at runtime
What is needed upstream
DeepEP needs to add sm_121a to its NVSHMEM kernel arch list and validate on GB10 hardware.
vLLM commit
Verified against v0.23.1rc0 (e3e3cd54589cee689b785aab5bda81b3e4203191).
Summary
DeepEP expert-parallel NVSHMEM kernels are not available on GB10 (sm_121a). The kernel build system hardcodes
9.0a(Hopper) and10.0a(datacenter Blackwell) in its arch list, excluding sm_121a.Impact
What is needed upstream
DeepEP needs to add sm_121a to its NVSHMEM kernel arch list and validate on GB10 hardware.
vLLM commit
Verified against
v0.23.1rc0(e3e3cd54589cee689b785aab5bda81b3e4203191).