Autonomous Web3 security auditor for Claude Code and OpenAI Codex CLI.
Orchestrates 18-100 AI agents across 8 phases to produce audit reports with verified PoC exploits — for smart contracts and L1 node-client infrastructure.
Supports EVM/Solidity, Solana/Anchor, Aptos Move, Sui Move, Soroban/Stellar, DAML/Canton, and L1 Go/Rust node clients.
Claude Code CLI or OpenAI Codex CLI, Python 3.11-3.12 + pip, Node.js 18+, Git
Backend CLIs. Install at least one. If you only have time to install one, pick Claude Code — it has the broadest MCP support (Slither, ChromaDB, Solodit). Codex is a strong alternative when you'd rather use OpenAI models, but pure-LLM phases fall back to WebSearch where MCPs aren't available. You can install both side-by-side; the audit wizard lets you pick per-run.
Claude Code:
npm install -g @anthropic-ai/claude-codeOpenAI Codex CLI — install without
sudousing a user-local npm prefix to avoidEACCESon Homebrew/system Node installs:mkdir -p ~/.npm-global && npm config set prefix ~/.npm-global echo 'export PATH="$HOME/.npm-global/bin:$PATH"' >> ~/.zshrc # or ~/.bashrc npm install -g @openai/codexCodex doesn't yet support every MCP server — pure-LLM phases use a WebSearch fallback. See docs/mcp-servers.md.
macOS: Also run
xcode-select --install(needed for C++ dependency compilation).Windows: Enable Developer Mode before installing (required for symlinks). Settings > System > For Developers > toggle ON. Or in admin PowerShell:
reg add HKLM\SOFTWARE\Microsoft\Windows\CurrentVersion\AppModelUnlock /v AllowDevelopmentWithoutDevLicense /t REG_DWORD /d 1 /fPer-language tools (Foundry, Solana CLI, etc.) are installed automatically via
plamen setup.PEP 668 / externally-managed Python: On Homebrew Python and Ubuntu 23.04+, system
piprefuses to write into the system site-packages.plamen installdetects this and adds--break-system-packagesto its pip invocations, printing a notice on stderr. If you'd rather isolate Plamen's Python deps in a virtualenv, activate one beforeplamen installand setPIP_BREAK_SYSTEM_PACKAGES=0to opt out.
Open Claude Code or Codex CLI in any project directory and paste the contents
of SETUP.md. It is the only Plamen doc designed for AI-assistant
consumption — it has step-by-step error handling, expected-output anchors,
and stops the assistant from running the heavy RAG build or the toolchain
wizard from a non-TTY context. The assistant handles cloning, the
non-interactive install (plamen install), and plamen install --codex if
you have Codex.
Do not paste
docs/setup.mdordocs/getting-started.mdinto the AI — those are long-form manuals for humans and contain the RAG build inline.
After paste-setup, run plamen setup from a real terminal yourself to install
chain toolchains (Foundry, Solana CLI, Anchor, etc.) and plamen rag to
build the optional vulnerability DB (~6GB RAM).
Linux / macOS:
git clone --recurse-submodules https://github.com/PlamenTSV/plamen.git ~/.plamen
cd ~/.plamen && python3 plamen.py install
python3 plamen.py install --codex # optional: add Codex CLI backendWindows (PowerShell):
git clone --recurse-submodules https://github.com/PlamenTSV/plamen.git $HOME\.plamen
cd $HOME\.plamen; python plamen.py install
python plamen.py install --codex # optional: add Codex CLI backendUse
git clone --recurse-submodules, not "Download ZIP". The repo shipscustom-mcp/slither-mcp/andcustom-mcp/farofino-mcp/as git submodules; ZIP downloads silently omit them. If you already cloned without--recurse-submodules, rungit submodule update --init --recursivefrom inside~/.plamen/beforeplamen install.
installvssetup:plamen installis non-interactive (symlinks + config + Python deps + dangling-hook self-heal) and is safe in any context — Claude Code Bash, Codex shell, CI, headless servers.plamen setupruns the install then drops into an interactive toolchain wizard (Foundry, Solana CLI, etc.) — run it from a real terminal. In a non-TTY context,plamen setupexits cleanly after the install rather than crashing on the picker.Before building the RAG database: add
SOLODIT_API_KEYto~/.claude/settings.json→"env"section (or~/.codex/config.toml→[env]for Codex). Free key from solodit.cyfrin.io. This is the only place the key is reliably visible to bothplamen ragand audit agent subprocesses. A terminalexportis not sufficient — Claude Code and Codex CLI spawn non-interactive subshells that don't source.bashrc/.zshrc.Python dependencies are installed automatically on first run. On macOS/Linux use
python3, on Windows usepython.
After install, add to PATH so you can run plamen from anywhere:
Linux (bash):
echo 'export PATH="$HOME/.plamen:$PATH"' >> ~/.bashrc && source ~/.bashrcmacOS (zsh):
echo 'export PATH="$HOME/.plamen:$PATH"' >> ~/.zshrc && source ~/.zshrcWindows (PowerShell, one-time):
[System.Environment]::SetEnvironmentVariable("Path", "$env:USERPROFILE\.plamen;" + [System.Environment]::GetEnvironmentVariable("Path", "User"), "User")Then use plamen from anywhere:
plamen # interactive audit wizard
plamen resume # resume an interrupted audit from last checkpoint
plamen doctor # verify install (no audit run, no API calls)
plamen setup # toolchain wizard + optional RAG build
plamen migrate # upgrade a v1.x install layout
plamen rag # rebuild RAG database only
plamen compare # diff two audit reports
plamen uninstall # remove Plamen symlinks
plamen help # full command + option referenceAudit runs accept additional options (
--tier,--modules,--network,--notes,--claude, ...). Runplamen helpor see docs/usage.md for the complete list.
Important: Always use
plamen(notpython3 plamen.py) after PATH is set. Thepython3 plamen.pyform only works from inside~/.plamen/.
The installer (plamen install):
- Creates symlinks from
~/.plameninto~/.claude/so Claude Code discovers Plamen's agents, rules, prompts, and commands - Merges Plamen's permissions into your existing
~/.claude/settings.json(additive only — won't remove your entries) - Merges MCP server definitions into
~/.claude/mcp.json(won't overwrite your existing servers) - Injects Plamen instructions into
~/.claude/CLAUDE.mdbetween<!-- PLAMEN:START/END -->markers (preserves your content) - Installs Python dependencies (RAG database is built separately via
plamen rag)
For Codex CLI support, also run plamen install --codex. This sets up ~/.codex/plamen/ (symlinked from ~/.plamen/) with:
- Codex orchestrator config in
~/.codex/AGENTS.md(equivalent ofCLAUDE.md), generated fromcodex-adapter/AGENTS.md - MCP/tool config in
~/.codex/config.toml(equivalent ofsettings.json+mcp.json), generated fromcodex-adapter/config.toml - Codex-specific slash commands in
~/.codex/commands/, generated fromcodex-adapter/commands/
Your existing Claude Code and Codex CLI configuration is preserved.
How symlinks work
The Plamen repo stays at ~/.plamen. The installer creates symlinks (shortcuts) pointing back to ~/.plamen/:
- Claude Code (
plamen install): symlinks into~/.claude/— agents, rules, skills, prompts, commands - Codex CLI (
plamen install --codex): symlinks~/.codex/plamen/→~/.plamen/(shared methodology), and copiescodex-adapter/{AGENTS.md,config.toml,agents/,skills/,commands/}into~/.codex/
When the AI runtime reads ~/.claude/agents/depth-edge-case.md (or ~/.codex/plamen/agents/depth-edge-case.md), the OS transparently reads ~/.plamen/agents/depth-edge-case.md. This means:
git pullin~/.plamenupdates symlinked files (agents, rules, skills, prompts) automatically for both backends- You still need
plamen install(andplamen install --codex) after pull —CLAUDE.md/AGENTS.md,settings.json/config.toml, andmcp.jsonare injected/merged copies, not symlinks. Without re-install, the orchestrator follows stale rules. See docs/updating.md. - Your own files in
~/.claude/or~/.codex/(custom agents, commands) are untouched - Deleting
~/.plamenwould break the symlinks for both backends — don't delete it while Plamen is installed
| Platform | How links are created | Requirements |
|---|---|---|
| Linux / macOS | Standard symlinks (os.symlink) |
None |
| Windows (directories) | Junctions (mklink /J) |
None |
| Windows (files) | Symlinks (os.symlink) |
Developer Mode enabled |
Migrating from v1.0.x (installed directly in
~/.claude): Close Claude Code (and Codex CLI if running) first, then run:Linux/macOS:
cd ~/.plamen 2>/dev/null || cd ~/.claude # cd into whichever exists python3 plamen.py migrateWindows (PowerShell):
cd $HOME\.plamen 2>$null; if (-not $?) { cd $HOME\.claude } # cd into whichever exists python plamen.py migrate
plamen migratestrips any dangling Plamen hook references from~/.claude/settings.json(which would otherwise block PreToolUse Bash and lock you out of shell commands in Claude Code), moves the repo to~/.plamen, runs the non-interactive install, and verifies theCLAUDE.mdmarker block. If you prefer the manual path:Linux/macOS:
mv ~/.claude ~/.plamen && cd ~/.plamen && python3 plamen.py installWindows (PowerShell):
Rename-Item $HOME\.claude $HOME\.plamen; cd $HOME\.plamen; python plamen.py installEither route moves the repo to
~/.plamenand recreates~/.claudewith symlinks + merged config. Claude Code will not work between the move and install — run them together (or just useplamen migrate). For Codex support, follow up withplamen install --codex.
Click to expand (~5-10 min)
Option B handles this automatically. These commands are for reference only.
cd ~/.plamen
# 1. Python deps (~2GB download — PyTorch for embeddings)
pip install -r requirements.txt
pip install -r custom-mcp/unified-vuln-db/requirements.txt
pip install -e custom-mcp/solana-fender
pip install -r custom-mcp/farofino-mcp/requirements.txt
pip install -e custom-mcp/slither-mcp # EVM only (needs Python 3.11+)
# 2. Build RAG database (~10-20 min, includes network download of indexers)
# IMPORTANT: a `export SOLODIT_API_KEY=...` here works for this terminal only.
# For `plamen rag` and audit agents to see it later, put the key in
# ~/.claude/settings.json -> "env" (Claude Code) or ~/.codex/config.toml -> [env]
# (Codex CLI). See the callout above. Below uses an inline export so this manual
# loop completes in one shell:
export SOLODIT_API_KEY=your_key_here # free at solodit.cyfrin.io
cd custom-mcp/unified-vuln-db
python3 -m unified_vuln.indexer index -s solodit --max-pages 10
python3 -m unified_vuln.indexer index -s defihacklabs
python3 -m unified_vuln.indexer index -s immunefi
python3 -m unified_vuln.indexer index -s immunefi-competitions
cd ../..
# Note: on Windows use 'python' instead of 'python3'
# 3. Chain tools (install what you need)
curl -L https://foundry.paradigm.xyz | bash && foundryup # EVM
pip install slither-analyzer # EVM static analysis
# See docs/setup.md for Solana, Aptos, Sui, Medusa, Trident
# 4. Codex backend (optional — after steps 1-3)
python3 plamen.py install --codex
# This generates ~/.codex/plamen/ with AGENTS.md, config.toml, and commands/
# from the Claude-side manifests. No additional deps needed.Windows + Solana: Enable Developer Mode (Settings > System > For Developers) and install OpenSSL (
winget install ShiningLight.OpenSSL.Dev) before building. See docs/dependencies.md.
See docs/setup.md for the full guide with all per-language prerequisites.
# macOS / Linux
cd ~/.plamen && git pull && plamen install
plamen install --codex # if using Codex backend# Windows (PowerShell)
cd $HOME\.plamen; git pull; plamen install
plamen install --codex # if using Codex backendThat's it. plamen install is idempotent — it re-links symlinks, re-injects the updated CLAUDE.md, and merges any new settings.json/mcp.json entries. Adding --codex does the same for AGENTS.md and config.toml. Neither wipes your RAG database, re-installs toolchains, or overwrites your API keys.
Why
plamen installafter pull? Most files auto-update via symlinks, but injected/merged files (CLAUDE.md/AGENTS.md,settings.json/config.toml,mcp.json) are copies, not symlinks. Without re-install, the orchestrator follows stale rules while everything else is updated.plamenwill warn you if it detects a version mismatch.
See docs/updating.md for details on what updates automatically and what doesn't.
plamen # terminal wrapper with interactive wizardOr inside Claude Code: /plamen · Inside Codex CLI: /plamen core /path/to/project
When an audit finishes, the headline deliverable is written to the root of the audited project:
<project>/AUDIT_REPORT.md
AUDIT_REPORT.md is a self-contained Markdown report (per
rules/report-template.md) containing:
- An Executive Summary and a severity summary table (Critical / High / Medium / Low / Informational counts).
- A "Components Audited" table listing the contracts/modules in scope.
- Severity-tiered findings — every finding gets its own section with Severity, Location (
file:Lnnn), a Description (with the offending code), an Impact statement, the PoC Result ([POC-PASS]/[POC-FAIL]/[CODE-TRACE]— see docs/glossary.md), and a Recommendation (a minimal fix diff when the PoC passed). Cosmetic Low/Info items may be grouped into a compact "Quality Observations" table. - A Priority Remediation Order — a numbered, most-urgent-first list using the clean client-facing IDs (
C-01,H-01,M-01, …). - Appendix A — findings excluded as false-positives or duplicates (client-facing summary).
- Appendix B — flagged obligations: any unfinished work the haltless pipeline could not fully complete is surfaced here for human triage instead of silently dropped (see Resumable Pipeline).
Intermediate artifacts live in a per-audit workspace inside the project:
<project>/.scratchpad/
This holds everything the pipeline produced on the way to the report — recon
context, the findings inventory, depth traces, verification PoCs, the
report index, and the resume checkpoint. It is preserved between runs so the
audit can resume on crash, and discarded only on a --fresh restart. You
normally never need to open it; AUDIT_REPORT.md is the deliverable. See
docs/glossary.md for the .scratchpad/ layout.
v2.1.0 changes how the pipeline runs, not what the agents analyze. Highlights:
- PTY-supervised execution + disk-derived completion — workers are driven through a pseudo-terminal and completion is inferred from on-disk
<!-- PLAMEN_STATUS: COMPLETE -->markers, eliminating the 0-byte-stdio silent-hang class. - Haltless recovery —
report_index, verify, inventory, and resume paths repair-then-degrade and surface unfinished obligations as Appendix-B items instead of halting. - Codex CLI backend (cost-saving BETA) — OpenAI Codex (
codex exec) as an alternative worker backend. See docs/codex-backend.md for its known limitations. - Opus 4.8 defaults — Opus-tier phases default to
claude-opus-4-8(override withPLAMEN_OPUS_MODEL/PLAMEN_THOROUGH_OPUS_MODEL). - Ecosystem auto-detection — the audited language is detected and auto-corrected at startup, no halt-to-rerun.
Full release notes: CHANGELOG.md. Upgrade guidance: docs/updating.md.
| Mode | Plan | Agents | Indicative Cost | Key Features |
|---|---|---|---|---|
| Light | Pro | ~18-22 | ~$1–5 / ~10-25 min | Fast scan, all Sonnet, no fuzzing |
| Core | Max | ~30-50 | ~$10–30 / ~30-90 min | Full depth, PoC verification for Medium+ |
| Thorough | Max | ~40-100 | ~$30–100+ / ~1-4 hr | Iterative depth, invariant fuzzing, Medusa, skeptic-judge, Exploration-Completeness skeptic |
On the Claude backend, Opus phases default to
claude-opus-4-8(override withPLAMEN_OPUS_MODEL/PLAMEN_THOROUGH_OPUS_MODEL).
Cost / runtime are rough indicators for a ~5k-line codebase on a Claude subscription. Larger codebases scale roughly linearly. The wizard runs
plamen --estimate(an internal flag invoked by the wizard //plamenslash command — not a direct-CLI option, see docs/usage.md) before each audit to show a per-project number based on lines, scope, and target plan — use the interactiveplamenwizard for a standalone estimate. API-key users (pay-as-you-go) see costs ~2–3× higher than subscription users.
See docs/audit-modes.md for the full comparison.
Plamen also audits L1 node clients and blockchain infrastructure — consensus engines, p2p networking, mempool logic, RPC surfaces, and validator lifecycle code in Go and Rust.
plamen l1 core /path/to/node-clientOr inside Claude Code: /plamen l1 core · Inside Codex CLI: /plamen l1 core /path/to/node-client
L1 mode adds:
- 22+ injectable skills covering consensus safety, fork choice, p2p DoS/eclipse, mempool asymmetric DoS, BLS aggregation, light client proofs, state sync/pruning, execution client hardening, validator lifecycle, and more
- 2 new depth agents:
depth-consensus-invariantanddepth-network-surface - Phase 0.5 "Bake": Batch-indexes repos with scip-go / rust-analyzer SCIP before depth agents run
- L1-specific severity matrix aligned with Immunefi v2.3 classification
- Go and Rust language support with concurrency safety and unsafe-block auditing
See docs/l1-mode/design.md for the full L1 architecture.
Terminal wrapper (recommended — includes setup, cost estimation):
plamen # interactive wizard
plamen core /path/to/project # skip wizard
plamen thorough /path/to/project --proven-only # strict evidence mode
plamen setup # install tools onlyInside Claude Code:
> /plamen core
> /plamen thorough docs: whitepaper.pdf scope: scope.txt
Inside Codex CLI:
> /plamen core
> /plamen l1 thorough /path/to/node-client
See docs/usage.md for PATH setup and all CLI options.
Plamen is a Python orchestrator that drives Claude (or Codex) workers. Phases run in one of three shapes: LLM phase session (single claude -p / codex exec subprocess), Python mechanical (no LLM), or Direct PTY worker pool (driver supervises one Claude PTY per worker artifact — used for breadth, depth, and rescan). PTY-supervised execution drives each worker through a pseudo-terminal and infers turn completion from artifacts written to disk rather than a fragile stdout/JSON envelope — eliminating the 0-byte-stdio ambiguity and silent-hang class. A dedicated PTY transport preflight runs at startup to pick a working terminal transport. For worker-pool phases the driver treats disk artifacts with <!-- PLAMEN_STATUS: COMPLETE --> markers as the only source of truth — Claude saying "done" is no longer trusted. If usage runs out or the process crashes, re-run the same command — it auto-resumes from the last successful checkpoint and, for worker-pool phases, only retries missing or IN_PROGRESS rows (completed worker rows are preserved). Stale or corrupt checkpoints recover rather than stranding the run.
Haltless resilience. A finished audit is never thrown away at the finish line. The report_index, verify, inventory, and resume paths repair-then-degrade — surfacing unfinished obligations as flagged items in AUDIT_REPORT.md (Appendix B) instead of halting the pipeline. Several formerly fragile LLM phases are now deterministic Python (LLM out of the loop): mechanical smart-contract report_index recovery, verify backfill / queue manifests, the data-loss-free report_dedup builder, and the recon prepass.
# Launch via wizard (interactive)
plamen # terminal wrapper starts wizard
/plamen-wizard # inside Claude Code
/plamen-wizard # inside Codex CLI
# Resume a crashed/interrupted audit
python3 ~/.plamen/scripts/plamen_driver.py /path/to/project/.scratchpad/config.jsonThe driver handles: phase scheduling, worker-pool orchestration, artifact gating, rate-limit pauses, retry-with-degradation, and subprocess isolation via the plamen_home() abstraction (resolves to ~/.claude/ or ~/.codex/plamen/ based on the configured backend). The LLM handles: in-phase agent reasoning for phase-LLM phases, finding analysis, PoC execution, and report content. For worker-pool phases the LLM is a bounded executor — one role, one output file, one artifact.
After install, ~/.plamen/ is canonical (the Git checkout). ~/.claude/ (and ~/.codex/plamen/ if Codex is installed) are install-created runtime integration symlinks — edits go in ~/.plamen/. /plamen and /plamen-wizard can be launched from inside an active Claude Code session: the driver strips parent Claude identity env vars from child subprocesses so nested invocations don't collide with the parent session.
Plamen supports OpenAI Codex CLI as an alternative, cost-saving backend (beta). The V2 driver translates prompt text (Write→apply_patch, Bash→shell, Task()→spawn_agent, ~/.claude/→~/.codex/plamen/) and adapts sandbox constraints. Note: this is prompt-text rewriting, not an MCP transport shim — MCP servers run natively on both backends. The Claude PTY transport is Claude-only; Codex invokes codex exec directly — one codex exec per depth job, so depth fans out cleanly across jobs. Codex usage-cap errors (which Codex emits as natural-language text, not structured codes) are detected and the driver auto-waits instead of halting, and context-exceeded no longer perma-fails. Codex depth runs real Devil's-Advocate iteration 2 and seeds the full mandatory first-pass artifact set so recon/depth stop degrading lossily.
# Install Codex backend (after standard install)
plamen install --codex
# Run via Codex
/plamen core /path/to/project # inside Codex CLICodex shares methodology via ~/.codex/plamen/ (symlinked to ~/.plamen/). Config files are copied to ~/.codex/:
| Claude Code | Codex CLI | Purpose |
|---|---|---|
~/.claude/CLAUDE.md |
~/.codex/AGENTS.md |
Orchestrator rules |
~/.claude/settings.json |
~/.codex/config.toml |
Permissions, env vars |
~/.claude/mcp.json |
~/.codex/config.toml ([mcp_servers.*] blocks, generated by scripts/codex_adapter.py) |
MCP server definitions |
~/.claude/commands/ |
~/.codex/commands/ (from codex-adapter/commands/) |
Slash commands |
| Language | Build Tool | Static Analysis | Fuzzing |
|---|---|---|---|
| EVM/Solidity | Foundry, Hardhat | Slither, Aderyn | Foundry invariant, Medusa |
| Solana/Anchor | Anchor, cargo-build-sbf | Fender | Trident, proptest |
| Aptos Move | aptos CLI | Move Prover | Parameterized tests |
| Sui Move | sui CLI | -- | Parameterized tests |
| Soroban/Stellar | Stellar CLI | -- | proptest, cargo-fuzz |
| DAML/Canton | daml CLI (daml build/daml test) |
-- (DLint is style-only) | DAML Script boundary-value tests |
| L1 Go/Rust | go build, cargo | scip-go, rust-analyzer, Opengrep | proptest, go test -fuzz |
Ecosystem (language) is auto-detected and auto-corrected at startup with no halt-to-rerun — the resolved ecosystem is shown on the startup banner. Detection uses manifest-priority rules (a suffix-only match never clobbers an explicit config; Pinocchio / native-SDK Solana is detected at high confidence).
| Topic | Link |
|---|---|
| Docs index | docs/README.md |
| Glossary of terms | docs/glossary.md |
| Full setup guide | docs/setup.md |
| Updating after git pull | docs/updating.md |
| Platform dependencies | docs/dependencies.md |
| Audit mode comparison | docs/audit-modes.md |
| Pipeline architecture | docs/architecture.md |
| MCP servers & API keys | docs/mcp-servers.md |
| Codex backend (BETA) limitations | docs/codex-backend.md |
| Usage & CLI options | docs/usage.md |
| Skills, rules & internals | docs/internals.md |
| Repository structure | docs/repository-structure.md |
| L1 mode design | docs/l1-mode/design.md |
| L1 severity matrix | docs/l1-mode/severity-matrix.md |
| Automated setup (Claude) | SETUP.md |
See CONTRIBUTING.md. Skills are the most impactful contribution — teach methodology (how to look), not patterns (what to find).
- Trail of Bits — Slither MCP server
- Farofino — Aderyn integration
- SunWeb3Sec — DeFiHackLabs exploit corpus
- Solodit — Audit finding database
- Immunefi — Bug bounty & audit competition findings
- Anthropic — Claude Code runtime
- OpenAI — Codex CLI runtime