Implementation of Universal Transformer in Pytorch
-
Updated
Nov 19, 2018 - Python
Implementation of Universal Transformer in Pytorch
The implementation of Adaptive Computation function used in RNN "https://arxiv.org/pdf/1603.08983.pdf" and Universal Transformer Network "https://arxiv.org/pdf/1807.03819.pdf"
Universal Transformers implementation in PyTorch
Unofficial Implementation of Universal Transformer https://arxiv.org/abs/1807.03819
When does recurrent depth beat width? Controlled experiments on length extrapolation, cheap test-time compute, and composition via orchestration.
Simple from-scratch implementations of transformer-based models that match the state of the art.
Multi strategy logit fusion between Ouro-1.4B (Universal Transformer) and HRM-Text-1B (prefix-LM) for improved text generation
Add a description, image, and links to the universal-transformer topic page so that developers can more easily learn about it.
To associate your repository with the universal-transformer topic, visit your repo's landing page and select "manage topics."