Neural Memory Reproduction: TITANS + MIRAS + NL

PyTorch reproduction of the core equations from three interconnected Google Research papers on neural memory systems.

Why This Matters: These papers study test-time memorization, where a model updates an internal memory during inference so that it can adapt to new inputs without retraining. This is one route toward continual learning. Read more: Google Research Blog

This repository provides verified implementations of core equations from:

TITANS: Learning to Memorize at Test Time
MIRAS: It's All Connected - Test-Time Memorization, Attentional Bias & Retention
NL: Nested Learning - The Illusion of Deep Learning Architecture

Features

Core Equation Implementations - Core equations from all three papers (see Implementation Status for per-paper scope)
Verified with Tests - 52 tests with 87% code coverage
Three MIRAS Variants - Moneta (ℓ_p), Yaad (Huber), Memora (KL)
Cross-Paper Integration - Tests verify paper dependencies work together
Docker Support - Run anywhere without setup hassles
CI/CD Pipeline - Automated testing on every commit

Quick Start

Option 1: Using UV (Recommended - Fastest)

# Clone the repository
git clone https://github.com/aryateja2106/neural-memory-reproduction.git
cd neural-memory-reproduction

# Install UV if you don't have it
pip install uv

# Create environment and install dependencies
uv venv
source .venv/bin/activate  # On Windows: .venv\Scripts\activate
uv pip install -e ".[dev]"

# Run tests to verify everything works
pytest tests/ -v

Option 2: Using Docker (No Python Setup Required)

# Clone the repository
git clone https://github.com/aryateja2106/neural-memory-reproduction.git
cd neural-memory-reproduction

# Run tests in Docker
docker compose up test

# Or build and run manually
docker build -t neural-memory .
docker run --rm neural-memory

Option 3: Using pip

git clone https://github.com/aryateja2106/neural-memory-reproduction.git
cd neural-memory-reproduction
python -m venv .venv
source .venv/bin/activate  # On Windows: .venv\Scripts\activate
pip install -e ".[dev]"
pytest tests/ -v

Installation

Prerequisites

Python 3.10 or higher - Download Python
Git - Download Git
Optional: Docker - Download Docker

Detailed Installation

See SETUP_GUIDE.md for step-by-step instructions with screenshots and troubleshooting tips.

Usage

Running Tests

# Run all tests
pytest tests/ -v

# Run specific paper tests
pytest tests/test_equations/test_titans_memory.py -v
pytest tests/test_equations/test_miras_memory.py -v
pytest tests/test_equations/test_nl_optimizers.py -v

# Run integration tests
pytest tests/test_integration/ -v

# Run with coverage report
pytest tests/ --cov=src --cov-report=term-missing

Verify Implementation is Real (Not Hardcoded)

Run this single command as a quick sanity check that the memory module computes real, differentiable outputs and that a memory update lowers surprise:

python -c 'import torch; from src.titans.memory import MLPMemory, memory_update, compute_surprise; m = MLPMemory(64, 128); x1, x2 = torch.randn(8, 64), torch.randn(8, 64); print("Test 1 - Different outputs:", not torch.allclose(m(x1), m(x2))); print("Test 2 - Deterministic:", torch.allclose(m(x1), m(x1))); key, value = torch.randn(4, 64), torch.randn(4, 128); s1 = compute_surprise(m, key, value); memory_update(m, key, value, 0.1); s2 = compute_surprise(m, key, value); print(f"Test 3 - Learning: {s1:.2f} -> {s2:.2f} = {s2 < s1}"); x = torch.randn(4, 64, requires_grad=True); m(x).sum().backward(); print("Test 4 - Gradients:", x.grad is not None); print("Real implementation verified!")'

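Expected output (the surprise values in Test 3 depend on the random inputs and weight initialization, so the exact numbers will differ between runs):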

Test 1 - Different outputs: True
Test 2 - Deterministic: True
Test 3 - Learning: 1.08 -> 1.06 = True
Test 4 - Gradients: True
Real implementation verified!

Test	What it proves
Different outputs	Not hardcoded - different inputs produce different outputs
Deterministic	Consistent neural network behavior
Learning	Core TITANS concept - memory updates reduce surprise
Gradients	Real differentiable neural network with backprop
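
To make the "Learning" row concrete, here is a minimal, dependency-free sketch of the idea it checks: one gradient step on a squared-error surprise lowers that surprise for the same key/value pair. This is not the repository's MLPMemory. It assumes a plain linear memory stored as nested lists, and the helper names (predict, surprise, memory_step) are hypothetical, chosen only for illustration.

```python
def predict(W, key):
    """Linear memory readout: value_hat = W @ key."""
    return [sum(w * k for w, k in zip(row, key)) for row in W]


def surprise(W, key, value):
    """Squared-error loss between the readout and the target value."""
    return sum((p - v) ** 2 for p, v in zip(predict(W, key), value))


def memory_step(W, key, value, eta):
    """One gradient-descent step on the surprise; returns new weights."""
    err = [p - v for p, v in zip(predict(W, key), value)]
    # d/dW[i][j] of sum_i (W[i] . key - value[i])^2 is 2 * err[i] * key[j]
    return [
        [w - eta * 2.0 * e * k for w, k in zip(row, key)]
        for row, e in zip(W, err)
    ]


# Fixed toy data so the result is reproducible (assumed values).
key = [1.0, -0.5, 0.25, 2.0]
value = [0.5, -1.0, 1.5]
W = [[0.0] * len(key) for _ in value]  # empty memory: all-zero weights

before = surprise(W, key, value)
W = memory_step(W, key, value, eta=0.05)
after = surprise(W, key, value)
print(f"surprise: {before:.4f} -> {after:.4f}")  # surprise: 3.5000 -> 0.7690
```

In this linear case the step only shrinks the error when eta times the squared norm of the key is below 1. The step size therefore matters, and a value that is too large would increase the surprise instead.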

Using the Implementations

import torch
from src.titans.memory import MLPMemory, memory_update, compute_surprise
from src.miras.memory import MonetaMemory, YaadMemory, MemoraMemory
from src.nl.optimizers import M3Optimizer

# TITANS: Gradient-based memory
titans_memory = MLPMemory(input_dim=64, output_dim=128)
key = torch.randn(8, 64)
value = torch.randn(8, 128)
output = titans_memory(key)

# Compute surprise and update memory (core TITANS concept)
surprise = compute_surprise(titans_memory, key, value)
memory_update(titans_memory, key, value, eta=0.1)

# MIRAS: Moneta with ℓ_p attentional bias
moneta = MonetaMemory(input_dim=64, output_dim=128, p=3.0)
output = moneta(key)

# NL: M3 Optimizer
optimizer = M3Optimizer(titans_memory.parameters(), lr=0.001)

Interactive Notebook

# Start Jupyter
jupyter notebook notebooks/quickstart.ipynb

# Or use Docker
docker compose up jupyter
# Then open http://localhost:8888 in your browser

Docker

Quick Commands

# Run tests
docker compose up test

# Run tests with coverage report
docker compose up coverage

# Development environment (interactive shell)
docker compose run --rm dev bash

# Start Jupyter notebook server
docker compose up jupyter

# Run linting checks
docker compose up lint

# Auto-format code
docker compose up format

Building Images

# Build production image
docker build -t neural-memory .

# Build development image
docker build --target dev -t neural-memory:dev .

# Run with custom command
docker run --rm neural-memory python -c "import src; print('Success!')"

Project Structure

neural-memory-reproduction/
├── src/                          # Source code
│   ├── common/                   # Shared utilities
│   │   └── attention.py          # Attention mechanisms (Eq 1-5)
│   ├── titans/                   # TITANS implementation
│   │   └── memory.py             # Memory module (Eq 8-14)
│   ├── miras/                    # MIRAS implementation
│   │   └── memory.py             # Moneta, Yaad, Memora variants
│   ├── nl/                       # NL implementation
│   │   └── optimizers.py         # GD, Momentum, M3 optimizer
│   └── utils/                    # Helper functions
│
├── tests/                        # Test suite
│   ├── test_equations/           # Per-equation tests
│   └── test_integration/         # Cross-paper tests
│
├── notebooks/                    # Jupyter notebooks
│   └── quickstart.ipynb          # Interactive demo
│
├── .github/workflows/            # CI/CD
│   └── ci.yml                    # GitHub Actions
│
├── Dockerfile                    # Container definition
├── docker-compose.yml            # Multi-container setup
├── pyproject.toml                # Project configuration
└── README.md                     # This file

Papers

TITANS: Learning to Memorize at Test Time

Authors: Ali Behrouz, Peilin Zhong, Vahab Mirrokni
Link: arXiv:2501.00663
Key Contribution: Gradient-based memory updates during inference

MIRAS: It's All Connected

Authors: Ali Behrouz, Meisam Razaviyayn, Peilin Zhong, Vahab Mirrokni
Link: arXiv:2504.13173
Key Contribution: Unified framework with ℓ_p attentional bias and retention gates

NL: Nested Learning

Authors: Ali Behrouz, Meisam Razaviyayn, Peilin Zhong, Vahab Mirrokni
Link: PDF
Key Contribution: M3 optimizer and nested learning architecture
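
For orientation, the optimizers module is listed as covering plain gradient descent and momentum alongside M3. The sketch below shows classical heavy-ball momentum on a one-dimensional quadratic. It is not the M3 optimizer and not the repository's code. The objective, function names, and hyperparameters are assumptions picked for illustration.

```python
def grad(w):
    """Gradient of the toy objective f(w) = (w - 3)^2."""
    return 2.0 * (w - 3.0)


def gd_momentum(w, lr=0.1, beta=0.9, steps=200):
    """Heavy-ball momentum: m <- beta * m + grad(w); w <- w - lr * m."""
    m = 0.0  # running accumulation of past gradients
    for _ in range(steps):
        m = beta * m + grad(w)
        w = w - lr * m
    return w


w_final = gd_momentum(0.0)
print(f"w after 200 steps: {w_final:.4f} (minimum is at 3.0)")
```

The momentum buffer m is a decaying sum of past gradients, which is the sense in which an optimizer state can itself be read as a small memory.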

Implementation Status

Paper	Equations	Implemented	Tests	Coverage
TITANS	35	Core (Eq 8-14)	5	100%
MIRAS	32	Full (Moneta, Yaad, Memora)	24	82%
NL	121	Optimizers (Eq 1-13, Alg 1)	4	92%
Common	-	Attention (Eq 1-5)	8	85%
Integration	-	Cross-paper	11	-
Total	188	Core + MIRAS	52	87%

Architecture Diagram

TITANS (Foundation)
   │
   ├──► MIRAS (Generalization)
   │     - Moneta: ℓ_p attentional bias (p=3)
   │     - Yaad: Huber loss (outlier robust)
   │     - Memora: KL divergence retention
   │
   └──► NL (Application)
         - M3 optimizer
         - Nested learning principles
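
The three objectives named in the diagram can be sketched in plain Python to show how they differ on the same residuals. These are the textbook definitions, written without PyTorch. The function names and the Huber threshold delta are assumptions, and how each MIRAS variant wires its objective into the memory update is defined by the paper and by src/miras/memory.py, not by this sketch.

```python
import math


def lp_loss(pred, target, p=3.0):
    """l_p objective: sum_i |pred_i - target_i|^p."""
    return sum(abs(a - b) ** p for a, b in zip(pred, target))


def huber_loss(pred, target, delta=1.0):
    """Huber loss: quadratic for residuals up to delta, linear beyond it."""
    total = 0.0
    for a, b in zip(pred, target):
        r = abs(a - b)
        total += 0.5 * r * r if r <= delta else delta * (r - 0.5 * delta)
    return total


def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions with q_i > 0."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0.0)


pred = [0.0, 0.0, 0.0]
target = [0.5, -1.0, 3.0]  # the last entry plays the role of an outlier

print("l_3  :", lp_loss(pred, target, p=3.0))
print("Huber:", huber_loss(pred, target, delta=1.0))
print("KL   :", kl_divergence([0.5, 0.5], [0.25, 0.75]))
```

On this sample the residual of 3 contributes 27 to the ℓ_3 loss but only 2.5 to the Huber loss, which is the outlier robustness the diagram attributes to Yaad.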

Contributing

We welcome contributions! Please see CONTRIBUTING.md for guidelines.

Quick Contribution Guide

1. Fork the repository
2. Create a feature branch: git checkout -b feature/my-feature
3. Make changes and write tests
4. Run checks: pytest tests/ && ruff check src/
5. Commit: git commit -m "feat: add my feature"
6. Push and create a Pull Request

License

This project is licensed under the MIT License - see LICENSE for details.

Academic Use

If you use this code in academic work, please cite the original papers:

@article{behrouz2025titans,
  title={TITANS: Learning to Memorize at Test Time},
  author={Behrouz, Ali and Zhong, Peilin and Mirrokni, Vahab},
  journal={arXiv preprint arXiv:2501.00663},
  year={2025}
}

@article{behrouz2025miras,
  title={It's All Connected: A Journey Through Test-Time Memorization, Attentional Bias, Retention, and Online Optimization},
  author={Behrouz, Ali and Razaviyayn, Meisam and Zhong, Peilin and Mirrokni, Vahab},
  journal={arXiv preprint arXiv:2504.13173},
  year={2025}
}

@inproceedings{behrouz2025nested,
  title={Nested Learning: The Illusion of Deep Learning Architecture},
  author={Behrouz, Ali and Razaviyayn, Meisam and Zhong, Peilin and Mirrokni, Vahab},
  booktitle={NeurIPS},
  year={2025}
}

Citation

If you find this reproduction helpful, please star this repository and cite:

@misc{rudraraju2025neuralmemory,
  author = {Rudraraju, Arya Teja},
  title = {Neural Memory Reproduction: TITANS + MIRAS + NL},
  year = {2025},
  publisher = {GitHub},
  url = {https://github.com/aryateja2106/neural-memory-reproduction}
}

Acknowledgments

Original paper authors at Google Research
PyTorch team for the deep learning framework
Claude Code with Opus 4.5 for AI-assisted reproduction using the research-reproduction skill

Made with research by Arya Teja Rudraraju
