Skip to content

feat(gpu): move device selection to driver config#1815

Open
elezar wants to merge 1 commit into
mainfrom
gpu-device-driver-config-elezar
Open

feat(gpu): move device selection to driver config#1815
elezar wants to merge 1 commit into
mainfrom
gpu-device-driver-config-elezar

Conversation

@elezar

@elezar elezar commented Jun 8, 2026

Copy link
Copy Markdown
Member

Summary

Move exact GPU device selection out of the public sandbox proto/API and into driver-specific driver_config fields.

Related Issue

Related to #1716 and #1812.

Changes

  • Remove public and driver proto gpu_device fields, reserving the field numbers and names.
  • Remove the CLI --gpu-device flag.
  • Add Docker/Podman exact GPU selection through driver_config.cdi_devices.
  • Add VM exact GPU selection through driver_config.gpu_device_ids, currently limited to one entry.
  • Require gpu=true when exact GPU device config is supplied, and reject exact selection for Kubernetes.
  • Update GPU e2e coverage and user-facing docs for the new --driver-config-json path.

Testing

  • mise run pre-commit passes
  • Unit tests added/updated
  • E2E tests added/updated

Checklist

  • Follows Conventional Commits
  • Commits are signed off (DCO)

@elezar elezar requested a review from a team as a code owner June 8, 2026 19:59
Signed-off-by: Evan Lezar <elezar@nvidia.com>
@elezar elezar force-pushed the gpu-device-driver-config-elezar branch from 14e335a to 5b6ab51 Compare June 8, 2026 20:16
@github-actions

github-actions Bot commented Jun 8, 2026

Copy link
Copy Markdown

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant