7.9% improvement in a single iteration on M4 Mac Mini (16GB) vs 2.8% in 126 iterations on H100 from Karpathy autoresearch #392
ediestel started this conversation in Show and tell
I applied an alternative optimization methodology to the autoresearch benchmark. Instead of iterative search, the method analyzes
the training script and hardware constraints to recommend changes directly.
Results
The key finding: infrastructure and architecture optimizations (batch size, torch.compile on MPS, ReLU² → SwiGLU) required no
iteration — the method identified them from the code and constraints in a single pass.
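To make the ReLU² → SwiGLU change concrete, here is a minimal pure-Python sketch of the two activations and of the hidden width a SwiGLU block needs to stay within the same weight budget. It is not the code from the paper or from the autoresearch training script. The function names, the 4x expansion factor, the bias-free layers, and `d_model = 768` are all assumptions chosen for illustration.

```python
import math


def relu_squared(x: float) -> float:
    """ReLU² activation: clamp at zero, then square."""
    return max(0.0, x) ** 2


def silu(x: float) -> float:
    """SiLU (swish): x * sigmoid(x), written to avoid overflow for large |x|."""
    if x >= 0:
        return x / (1.0 + math.exp(-x))
    e = math.exp(x)
    return x * e / (1.0 + e)


def swiglu(gate: float, up: float) -> float:
    """SwiGLU gating: SiLU of the gate projection times the up projection."""
    return silu(gate) * up


def relu2_mlp_params(d_model: int, expansion: int = 4) -> int:
    """Weight count of a two-matrix MLP (up and down projections, no biases)."""
    return 2 * d_model * (expansion * d_model)


def swiglu_hidden_for_budget(d_model: int, param_budget: int) -> int:
    """Largest hidden width whose three matrices (gate, up, down) fit the budget."""
    return param_budget // (3 * d_model)


d_model = 768  # hypothetical model width, not taken from the post
budget = relu2_mlp_params(d_model)
hidden = swiglu_hidden_for_budget(d_model, budget)
print(f"ReLU² MLP weights: {budget}, matching SwiGLU hidden width: {hidden}")
```

Because SwiGLU uses three weight matrices instead of two, keeping the parameter count fixed means shrinking the hidden width to roughly two thirds of the original expansion. This is one reason the swap can be evaluated from the code and memory constraints alone, without a search loop.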
Full paper available as a PDF at https://drive.google.com/file/d/1ULb5Fx5Be-HKBbvnDtu0xad0-_3hceBY/view?usp=sharing
References: Discussion #43, Discussion #32.
For questions or discussion: diestel.research@gmail.com