shodh-redb

Embedded multi-modal database engine for edge AI. Key-value, vector search (IVF-PQ), blob storage, TTL, merge operators, CDC, composite queries -- all in one ACID-transactional file with a no_std core.

95% recall@1 at 1,000+ QPS on SIFT1M. 1.6--3x faster writes than LMDB, 2--3x faster reads than RocksDB. Single-threaded. 8 GB RAM. No GPU.

vs. alternatives

	shodh-redb	SQLite + sqlite-vec	LMDB	RocksDB	Qdrant
KV transactions	ACID	ACID	ACID	--	--
Vector search	IVF-PQ	flat scan	--	--	HNSW
Blob storage	built-in	external	--	--	--
TTL / expiration	per-key	manual	--	TTL CF	--
CDC	built-in	triggers	--	--	--
Merge operators	built-in	--	--	built-in	--
`no_std` / wasm	core lib	--	--	--	--
Deployment	embed	embed	embed	embed	server
File format	single file	single file	single file	LSM dir	server

[dependencies]
shodh-redb = "0.4"

When to choose shodh-redb

shodh-redb occupies a specific niche: embedded Rust applications that need KV + vector search + blob storage in a single ACID-transactional process. It is not a drop-in replacement for any single tool above; it replaces combinations of them.

Choose shodh-redb when:

You need vector similarity search and KV in the same process, and you do not want to compose RocksDB + FAISS (or LMDB + hnswlib) yourself.
You ship a single Rust binary and want to avoid a C++ dependency or a separate vector-database service.
You target embedded / no_std / wasm32 and need a real B-tree, not a sled-style log.
Your workload is read-heavy or read/write balanced at single-host scale (gigabytes to low terabytes).
You want a small, auditable codebase (~50K LOC core) instead of a 600K-LOC C++ project.

Choose something else when:

You need RocksDB's write-optimized LSM: shodh-redb is a B-tree. For sustained 100K+ writes/sec at multi-terabyte scale, RocksDB's LSM architecture wins by design.
You need multiple writer processes or shared-database concurrency. shodh-redb is single-process, single-writer (try_lock enforced). Use a server database (Postgres, Qdrant, MongoDB) instead.
You need state-of-the-art ANN recall. IVF-PQ is a solid baseline; for graph indexes (HNSW, DiskANN) at the recall/latency frontier, use Qdrant, Milvus, or FAISS directly.
You need a query language (SQL, Cypher). shodh-redb is a programmatic API.
You require Jepsen-grade crash-safety evidence. The B-tree engine inherits redb's COW + checksum design, but no formal linearizability tests have been published.

The published benchmark numbers (1.6–3× writes vs LMDB, 2–3× reads vs RocksDB) are at single-host, single-thread, 5M-key scale on the workloads in crates/redb-bench. They are real, reproducible micro-benchmarks -- not extrapolations to RocksDB's strong regime (large-scale, write-heavy, concurrent).

Why shodh-redb?

Every edge vector database (sqlite-vec, LanceDB, Qdrant) bolts vector search onto a storage engine that wasn't designed for it. shodh-redb builds both from the same B-tree:

Vector search -- IVF-PQ with integer ADC, SIMD distance, and ACID-transactional index updates
Blob storage -- content-addressed dedup, chunked writes, causal lineage graph
TTL tables -- per-key expiration for sessions, caches, and ephemeral agent state
CDC -- row-level change capture for replication and reactive pipelines
Composite queries -- ranked fusion across semantic, temporal, and causal signals
Runs anywhere -- no_std core compiles to wasm32, ARM Cortex, RISC-V. No allocator, no filesystem required

One binary. One file. ACID crash-safe.

Single-process recommended. shodh-redb uses try_lock/try_lock_shared to prevent the same process from opening a database twice (returns DatabaseAlreadyOpen). However, on platforms where file locks are unsupported (e.g., some network filesystems), the lock is silently skipped. Multi-process access to the same file is not supported and will corrupt data.

Benchmarks (SIFT1M)

1M vectors, 128 dimensions, Euclidean distance. Single-threaded on Intel i7-1355U, 8 GB RAM.

Mode	nprobe	recall@1	recall@10	QPS	p50 latency
Top-10 reranked	10	95.3%	92.6%	1,018	0.90 ms
Full rerank	10	95.7%	95.2%	337	2.72 ms
Full rerank	50	99.1%	99.9%	157	5.75 ms
PQ-only scan	10	42.5%	52.1%	1,521	0.64 ms

Index build: 30.8s (single-threaded, 256 clusters, M=16 subquantizers).

Full results and reproduction steps: BENCHMARKS.md

Quick start

use shodh_redb::{Database, TableDefinition, ReadableDatabase, ReadableTable};

const TABLE: TableDefinition<&str, u64> = TableDefinition::new("agent_state");

let db = Database::create("agent.redb")?;

let wtxn = db.begin_write()?;
{
    let mut table = wtxn.open_table(TABLE)?;
    table.insert("task_count", &42u64)?;
}
wtxn.commit()?;

let rtxn = db.begin_read()?;
let table = rtxn.open_table(TABLE)?;
assert_eq!(table.get("task_count")?.unwrap().value(), 42);

Vector search

Native IVF-PQ index with integer ADC, residual encoding, and per-vector metadata filtering.

use shodh_redb::{FixedVec, DistanceMetric, nearest_k, TableDefinition};

// Brute-force top-k (small datasets)
const EMBEDDINGS: TableDefinition<u64, FixedVec<384>> = TableDefinition::new("memory");
let results = nearest_k(
    table.iter()?.map(|r| { let (k, v) = r.unwrap(); (k.value(), v.value().to_vec()) }),
    &query, 10, |a, b| DistanceMetric::Cosine.compute(a, b),
);

Three quantization levels:

Type	Compression	Distance
`FixedVec<N>` / `DynVec`	None (f32)	Cosine, L2, dot, Manhattan
`ScalarQuantized<N>`	~4x (u8)	Approximate L2
`BinaryQuantized<N>`	32x (1-bit)	Hamming

For datasets beyond brute-force range, use the IVF-PQ index -- inverted file with product quantization, integer asymmetric distance tables, and optional reranking from stored raw vectors.

Blob store

Content-addressed storage with SHA-256 dedup, chunked streaming writes, and causal lineage tracking.

let blob_id = wtxn.store_blob(data, ContentType::ImagePng, "screenshot", &opts)?;
let reader = rtxn.blob_reader(blob_id)?;

Per-blob tags, namespaces, temporal indexing, and causal graph with BFS traversal.

TTL tables

Per-key expiration. Lazy filtering on read, bulk purge on demand.

let mut table = wtxn.open_table(CACHE)?;
table.insert_with_ttl(&"result_abc", data, Duration::from_secs(1800))?;
table.purge_expired()?;

Merge operators

Atomic read-modify-write without manual locking.

table.merge(&"tool_calls", &1u64.to_le_bytes(), &NumericAdd)?;

Built-in: NumericAdd, SaturatingAdd, FloatAdd, NumericMax, NumericMin, BitwiseOr, BytesAppend. Implement MergeOperator for custom logic.

CDC (change data capture)

Row-level change streaming with cursors, retention pruning, and HLC timestamps.

Composite queries

Multi-signal ranked retrieval fusing semantic similarity, temporal recency, causal proximity, and namespace/tag filtering into a single scored result set.

Multimap tables

Multiple values per key for tag indices, inverted lookups, and many-to-many relationships.

Group commit

Batched fsync for write-heavy workloads. Multiple operations share a single transaction and disk sync.

Performance

KV engine (vs LMDB, RocksDB)

5 million key-value pairs, 24-byte keys, 150-byte values. Windows 11, x86_64.

Operation	shodh-redb	LMDB	RocksDB
Bulk load 5M	57.9s	95.3s	39.4s
Individual writes 1K	73ms	228ms	2,123ms
Batch writes 100K	3.6s	8.8s	778ms
Random reads 1M	1.97s	2.18s	5.70s
Range scans 500K	2.12s	1.76s	14.1s
Multi-threaded reads (32t)	1.23s	0.67s	4.59s

Blob store

Operation	Ops/sec	Throughput
Small writes (1K x 4KB)	18,686	73 MB/s
Large writes (100 x 1MB)	127	127 MB/s
Sequential reads (1K blobs)	137,170	536 MB/s
Range reads (1K x 1KB slice)	185,957	182 MB/s
Dedup writes (1K identical)	47,612	1000x ratio

SIMD vector ops

Distance functions use hand-written AVX2 intrinsics on x86_64 with runtime feature detection and scalar fallback.

Function	dim=128	dim=384	dim=1536	Peak throughput
`dot_product`	6.2 ns	17.8 ns	75.8 ns	173 GB/s
`euclidean_distance_sq`	9.6 ns	23.8 ns	92.5 ns	133 GB/s
`cosine_similarity`	13.7 ns	34.6 ns	124.8 ns	99 GB/s
`manhattan_distance`	7.3 ns	21.2 ns	102.1 ns	145 GB/s
`hamming_distance`	18.5 ns	12.5 ns	--	62 GB/s

Function	Strategy
`dot_product`	`_mm256_mul_ps` + `_mm256_add_ps`, 8 f32/iter
`euclidean_distance_sq`	Fused sub+mul, 8 f32/iter
`cosine_similarity`	3 accumulators, 8 f32/iter
`manhattan_distance`	`_mm256_andnot_ps` for abs
`hamming_distance`	Mula's vectorized popcount (`pshufb` + SAD), 32 bytes/iter

On no_std or non-x86 targets, scalar code is structured for LLVM auto-vectorization.

Zero-copy serialization

On little-endian targets (x86, ARM LE, RISC-V LE), FixedVec and DynVec use bulk copy_nonoverlapping -- the f32 memory layout matches the on-disk LE format directly.

`no_std` support

The core library compiles with #![no_std] (disable the std feature). Vector types, distance functions, quantization, IVF-PQ indexing, and the B-tree engine all work without the standard library. Targets: wasm32, ARM Cortex-M, RISC-V bare metal.

The std feature adds file backends, TTL tables, group commit, and runtime SIMD dispatch. A flash translation layer (FTL) backend supports raw NOR/NAND flash on embedded targets without a filesystem.

Compression

Value-level compression is built in via two algorithms, off by default and selected per database:

Algorithm	Cargo feature	Crate	When to use
LZ4	`compression_lz4`	`lz4_flex`	Latency-sensitive workloads -- LZ4 prioritizes decompression speed over ratio
zstd	`compression_zstd`	`zstd`	Storage-bound or cold data -- better compression ratio at higher CPU cost (level 1-22, default 3)
Both	`compression`	--	Build with both available, choose at database open time

[dependencies]
shodh-redb = { version = "0.4", features = ["compression_lz4"] }
# or: features = ["compression_zstd"]
# or: features = ["compression"]    # both

Why off by default:

no_std / embedded targets (Cortex-M, wasm32) often cannot pull in zstd
Adds CPU on every value insert and read; not always a win
Smaller default dependency tree, faster compile, smaller attack surface

Compression operates at the value level: each value is compressed before entering the B-tree, with a self-describing 5-byte envelope (1-byte flags + 4-byte original size). Values that don't shrink are stored raw with zero envelope overhead. The chosen algorithm is recorded in the database header and validated on open -- a database written with compression_zstd will refuse to open without that feature.

Feature flags

Flag	Default	Description
`std`	Yes	File backends, group commit, TTL, SIMD dispatch
`logging`	No	`log` crate integration
`cache_metrics`	No	Cache hit/miss counters
`compression_lz4`	No	LZ4 value compression (see Compression)
`compression_zstd`	No	Zstandard value compression (see Compression)
`compression`	No	Enable both LZ4 and zstd

Architecture

                    +--------------------+
                    |   Database API     |
                    |  (typed, ACID)     |
                    +--------+-----------+
                             |
              +--------------+--------------+
              |              |              |
        +-----+------+ +----+-----+ +------+------+
        | Key-Value  | | Blob     | | Vector      |
        | Tables     | | Store    | | Index       |
        | TTL, Merge | | CDC,     | | IVF-PQ,     |
        | Multimap   | | dedup,   | | int ADC,    |
        |            | | causal   | | SIMD        |
        +-----+------+ +----+-----+ +------+------+
              |              |              |
              +--------------+--------------+
                             |
                   +---------+---------+
                   | B-tree Engine     |
                   | (COW pages, MVCC, |
                   |  crash-safe,      |
                   |  no_std core)     |
                   +-------------------+

Credits

Core B-tree page store and crash recovery derived from redb. All extensions -- vector indexing, blob store, TTL, merge operators, HLC, CDC, composite queries, group commit, SIMD distance, quantization -- are original work.

License

Apache License, Version 2.0

Name		Name	Last commit message	Last commit date
Latest commit History 1,805 Commits
.cargo		.cargo
.github		.github
benches		benches
crates		crates
docs		docs
examples		examples
fuzz		fuzz
src		src
tests		tests
.dockerignore		.dockerignore
.editorconfig		.editorconfig
.gitignore		.gitignore
BENCHMARKS.md		BENCHMARKS.md
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
Dockerfile.bench		Dockerfile.bench
LICENSE		LICENSE
LIMITATIONS.md		LIMITATIONS.md
README.md		README.md
build.rs		build.rs
clippy.toml		clippy.toml
deny.toml		deny.toml
justfile		justfile
rust-toolchain		rust-toolchain
rustfmt.toml		rustfmt.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

shodh-redb

vs. alternatives

When to choose shodh-redb

Why shodh-redb?

Benchmarks (SIFT1M)

Quick start

Vector search

Blob store

TTL tables

Merge operators

CDC (change data capture)

Composite queries

Multimap tables

Group commit

Performance

KV engine (vs LMDB, RocksDB)

Blob store

SIMD vector ops

Zero-copy serialization

`no_std` support

Compression

Feature flags

Architecture

Credits

License

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

shodh-redb

vs. alternatives

When to choose shodh-redb

Why shodh-redb?

Benchmarks (SIFT1M)

Quick start

Vector search

Blob store

TTL tables

Merge operators

CDC (change data capture)

Composite queries

Multimap tables

Group commit

Performance

KV engine (vs LMDB, RocksDB)

Blob store

SIMD vector ops

Zero-copy serialization

no_std support

Compression

Feature flags

Architecture

Credits

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`no_std` support

Packages