Official Implementation of "Generalization Error Analysis for Selective State-Space Models Through the Lens of Attention"
-
Updated
Nov 4, 2025 - Jupyter Notebook
Official Implementation of "Generalization Error Analysis for Selective State-Space Models Through the Lens of Attention"
Training Distribution Selection for Provable OOD Performance
🔥🔥🔥 This repository curates research on Weak-to-Strong Generalization across LLMs, multimodal learning, and beyond, focusing on how strong models learn from weak supervision and surpass their teachers. Stay tuned for the latest updates!
Generalization bounds for two-layer ReLU neural networks via scale-invariant complexity measures
Add a description, image, and links to the generalization-bounds topic page so that developers can more easily learn about it.
To associate your repository with the generalization-bounds topic, visit your repo's landing page and select "manage topics."