DeepEP / expert-parallel NVSHMEM kernels unavailable on GB10: arch list hardcodes 9.0a/10.0a #28

Description

@timothystewart6

Summary

DeepEP expert-parallel NVSHMEM kernels are not available on GB10 (compute capability 12.1, sm_121a). The kernel build system hardcodes 9.0a (Hopper, sm_90a) and 10.0a (datacenter Blackwell, sm_100a) in its arch list, so no kernels are compiled for sm_121a.
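A minimal sketch of why the hardcoded list leaves GB10 uncovered. Architecture-specific targets (the `a` suffix) are not forward compatible, so a device only matches an entry that names its exact architecture. The helper names `normalize_arch` and `is_covered` are hypothetical, and the semicolon-separated list format is an assumption; neither is taken from the DeepEP or vLLM build scripts.

```python
def normalize_arch(arch: str) -> str:
    """Map '12.1a', '121a' or 'sm_121a' to the canonical 'sm_121a' form."""
    a = arch.strip().lower()
    if a.startswith("sm_"):
        a = a[3:]
    # '12.1a' -> '121a', '9.0a' -> '90a'
    a = a.replace(".", "")
    return f"sm_{a}"


def is_covered(device_arch: str, arch_list: str) -> bool:
    """Return True if device_arch matches an entry of arch_list exactly.

    arch_list is assumed to be semicolon-separated (hypothetical format).
    Exact matching models arch-specific ('a') targets, which only run on
    the architecture they were compiled for.
    """
    target = normalize_arch(device_arch)
    entries = [e for e in arch_list.split(";") if e.strip()]
    return any(normalize_arch(e) == target for e in entries)


# The hardcoded list from this issue does not cover GB10.
print(is_covered("sm_121a", "9.0a;10.0a"))
# Adding 12.1a to the list would cover it.
print(is_covered("sm_121a", "9.0a;10.0a;12.1a"))
```

With the list from this issue the first check is false; it becomes true only once `12.1a` is added, which is the change requested upstream.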

Impact

  • Expert parallelism via DeepEP is unavailable
  • Standard tensor-parallel inference is unaffected
  • Attempting to use DeepEP kernels on GB10 fails at runtime

What is needed upstream

DeepEP needs to add sm_121a to its NVSHMEM kernel arch list and validate on GB10 hardware.

vLLM commit

Verified against v0.23.1rc0 (e3e3cd54589cee689b785aab5bda81b3e4203191).

Labels

upstream (Fix should be applied upstream)
