Skip to content

Add non-CUDA runtime fallback and documentation#595

Open
arunshar wants to merge 1 commit into
karpathy:masterfrom
arunshar:master
Open

Add non-CUDA runtime fallback and documentation#595
arunshar wants to merge 1 commit into
karpathy:masterfrom
arunshar:master

Conversation

@arunshar

Copy link
Copy Markdown

Summary

  • add device fallback in training flow: CUDA -> MPS -> CPU
  • add non-CUDA attention/runtime fallbacks so validation runs can complete on macOS/CPU
  • make dataloader/evaluation device-aware in prepare utilities
  • document the new non-CUDA runtime profile and requirements in README

Validation

  • uv run prepare.py
  • uv run train.py (completed on MPS using reduced runtime profile)

Conversation: https://app.warp.dev/conversation/bd4fddb2-351a-4f25-8ddb-7a4bb3d4dbb9

Co-Authored-By: Oz oz-agent@warp.dev

Add non-CUDA runtime fallback and docs
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant