Multi strategy logit fusion between Ouro-1.4B (Universal Transformer) and HRM-Text-1B (prefix-LM) for improved text generation
entropy transformers fusion benchmarks language-model kl-divergence perplexity universal-transformer logits llm ouro-1-4b hrm-text-1b js-div
-
Updated
Jun 22, 2026 - Python