Changelog

3.41.0

Minor Changes

323e267: Add an observability skill and a four-tier observability-placement model across the portfolio.

The new skill teaches wide events / canonical log lines as the default instrumentation primitive (with the pillars-vs-events debate handled honestly), the OpenTelemetry substrate for TypeScript services, a cardinality routing rule, the head/tail sampling ladder, SLO/error-budget discipline with multiwindow burn-rate alerting, symptom-based paging with mandatory runbooks, and telemetry as test-driven behavior via in-memory exporters. The hexagonal-architecture skill's Logging section becomes a four-tier model organized by GOOS's "logging is a feature" test: technical telemetry stays in adapters, domain-significant observations flow through a Domain Probe driven port or a domain-event subscriber (a generic Logger port is banned), correlation and wide-event assembly live at the edges, and instrumentation is tested with recording fakes. Canonical events are guaranteed on every terminal outcome — clean finish, exception, and client abort. Small cross-references land in twelve-factor, ci-debugging, api-design (trace ID as the RFC 9457 correlation key), and domain-driven-design.

3.40.0

Minor Changes

08d046a: Reconcile the hexagonal-architecture and folder-structure skills with the source pattern, from a cover-to-cover read of Hexagonal Architecture Explained (Cockburn & Garrido de Paz).

Corrections: the pattern is symmetric (inside vs outside) — the left/right asymmetry exists only in implementation as who-knows-whom (provided vs required interfaces); runtime configurability of driven actors is the actual pattern requirement (parameter injection is one of three sanctioned configurator shapes, kept as this skill's house default). Disclosures: the source convention purpose-names every port (ForGettingTaxRates, ForPaying) — this skill's role-noun driven ports are now a documented house choice with book-style names as equal alternatives. New: actor/interactor and primary/secondary vocabulary, pattern-requirements vs house-style checklist tiers, the "a port is only real if it has a test interactor" principle, two anti-patterns (port for a domain concept; nested hexagons), a greenfield-sequence resource (folders first, disposable first test, walking skeleton), purpose-named ports/ folder conventions, and per-chapter citations in references. Fakes location reconciled to adapters/fakes/ across both skills.

3.39.0

Minor Changes

891d693: Improve the cli-design skill with a full sweep of the Command Line Interface Guidelines (clig.dev) and a recommended TypeScript stack.

One genuine correction: secrets get a channel hierarchy instead of a flat "files/stdin/env" list — never via flags; OS keychain or a 0600 credential file preferred, then stdin (--with-token < token.txt), with env vars acceptable only where the platform injects them (CI secret stores), never as the primary documented path (they leak to child processes and crash reports). New guidance: tiered confirmation by severity (y/N → suggest --dry-run → typed resource name for irreversible actions), a "State Changes and Transparency" section (confirm what changed, the status pattern, make hidden file/network access explicit, page long output via $PAGER), a "Robustness" section (validate early, 100ms responsiveness, configurable network timeouts with exit 75, recover by re-run, expect misuse), the variadic-args exception to the flags-over-args rule, help support-links and typo suggestions, general-purpose env vars (DEBUG, EDITOR, PAGER, proxies), deprecation warnings that stop once users migrate, and a "Naming, Distribution, Telemetry" section (opt-in telemetry only).

Also adds a "Recommended TypeScript Stack" section: Stricli (commands as pure typed functions with injected context — testable without subprocess spawning, introspectable command tree, light startup) plus @clack/prompts (interactivity as an optional TTY-gated presentation adapter), with honest trade-offs against commander, oclif, and @effect/cli. All sources credited in skills/REFERENCES.md.

3.38.0

Minor Changes

f68299e: Add the event-sourcing skill: a functional-TypeScript guide to persisting state as an append-only log of events and rebuilding it by folding them (the Decider), positioned as the top rung of the complexity ladder rather than a default.

Builds on the existing domain-driven-design (Decider), hexagonal-architecture (event store as a driven port, CQRS-lite), typescript-strict, and testing skills. The main SKILL.md covers when to use it (and when not), the Decider write model, the command-handler loop, the event store port, events-as-data, projections, and behaviour-driven testing. Eight deep-dive resources plus source notes cover the decision framework, event modelling (EventStorming), rehydration and decider composition, the event store and Postgres storage, projections and read models, event versioning (tolerant reader/upcasting), testing event-sourced systems, and production concerns (snapshots, sagas, delivery guarantees, GDPR crypto-shredding). Grounded in primary sources — Young, Fowler, Chassaing, Wlaschin, Brandolini, Verraes, Dudycz — recorded in skills/REFERENCES.md. The domain-driven-design and hexagonal-architecture skills now cross-link into it at their event-sourcing decision points (the domain-events complexity ladder and the CQRS-lite upgrade path).

Patch Changes

331a637: Fix install-claude.sh mishandling skills installed by skills CLI ≥ 1.5, which copies each skill into ~/.claude/skills/<name> as a regular directory (tracked in ~/.agents/.skill-lock.json) instead of symlinking through the universal ~/.agents/skills/ cache. The installer treated every regular directory as a pre-skills.sh leftover, so on each run it moved all CLI-managed skills aside as "legacy" and then warned that the freshly reinstalled copies "won't be visible to non-Claude agents" — an endless move-and-reinstall cycle that could leave skills missing if an install step failed after the move (and made a newly merged skill look like it never installed).

The installer now consults the skills CLI lock file: symlinked entries and lock-tracked directories are both recognised as CLI-managed and left alone; only genuinely unmanaged directories are migrated aside or warned about. Adds test/install-claude-skill-layout.sh covering all three layouts.
331a637: Quality sweep across 20 existing skills, driven by a four-agent adversarial review (every finding independently verified — compile claims against tsc 5.7.2 --strict, factual claims against primary sources or live registries/CLIs):
- Non-compiling snippets fixed (9): union member access without narrowing in testing and DDD's testing-by-layer (replaced with whole-value toEqual assertions); implicit-any parameters and await in a non-async callback in testing's extraction example; implicit-any in react-testing's legacy resource; a Record indexed on a union member missing the key in hexagonal's cross-cutting concerns (now Extract<...>); a spread that silently overwrote its own sanitised fields plus a zero-arg call to a one-arg factory in DDD's aggregate design; an evolve that could not typecheck against the discriminated-union state DDD itself mandates; a dangling result variable in api-design's idempotency example; a non-UUID literal passed to crypto.randomUUID mocking in characterisation-tests; and the incoherent over-engineered compose example in functional.
- Stale/wrong facts corrected: Stryker's trim() mutation (removed, not trimStart()); deprecated Zod idioms (z.string().email() → z.email()); a fabricated npm package and nonexistent "Vitest Jest-compatibility mode" in characterisation-tests; getByDisplayValue wrongly listed as a Vitest Browser Mode locator; RFC 9745's Deprecation header (published, @-timestamp syntax — no longer a draft with HTTP-date); "Stripe never made a v2" (they did, 2024); 422's renamed reason phrase; a dead attribution link in test-design-reviewer; npx skills check documented as read-only when it applies updates; PlantUML mxgraph stencils flagged as a docu.md-viewer extension that standard renderers reject, with a new "where will this be viewed?" rule in diagrams; stale Gemini model example in double-check.
- Contradictions reconciled: DDD's Occasion entity now matches every snippet that uses it; cli-design's FakeLogger now implements the real Logger port and two falsely-justified type assertions are gone; optimistic-locking guidance now follows the skills' own error-modeling rule; CORS vs "disable OPTIONS" conflict resolved; Given/When/Then framing in find-gaps/story-splitting reworded to precondition → trigger → observable outcome (this repo's BDD is not Gherkin); refactoring's priority table and "when not to refactor" bullets no longer forbid all refactoring; storyboard's /harden references corrected to /impeccable harden and its commit steps now respect the commit-approval gate; three wrong relative resource paths fixed.

3.37.1

Patch Changes

17046cf: Fix double-check skill being silently skipped by the skills.sh installer

The skill's description frontmatter contained an inline ": " (in Provider-agnostic: it always picks...). YAML reads colon-space inside an unquoted plain scalar as a nested mapping, so npx skills add failed to parse the frontmatter and silently dropped the skill — the installer found 27 of 28 skills and double-check never landed in ~/.agents/skills/. Replaced the colon with an em dash to match the description's existing style.

3.37.0

Minor Changes

476f512: Add the double-check skill — independent cross-provider verification

New auto-discovered skill that gets a second opinion on finished work from a different AI provider's CLI agent (codex, claude, gemini, or cursor-agent) running locally, then drives a constructive back-and-forth between the two agents until both genuinely agree. Host-agnostic: it detects the hosting agent's model lab and always picks a verifier from a different lab, configured for the best available model and highest reasoning effort, in a read-only sandbox. Includes a provider command reference and a verifier brief template in resources/.

3.36.1

Patch Changes

3852666: teach-me: add quality-gated reading lists to HTML lessons and indexes

3.36.0

Minor Changes

6caba93: teach-me: HTML lessons, mission grounding, trusted sources, learning records, and a living glossary

Integrates the best ideas from mattpocock/skills' teach skill into the existing tutoring-first teach-me skill:
- HTML lessons — each session can now produce a self-contained, Tufte-style HTML lesson (lessons/NNNN-slug.html): printable, cited, cross-linked to the cheat sheet and glossary, with a reveal-answer recap quiz. New resources/html-lessons.md covers format, design principles, and a full template. Includes capture mode: when the learner is short on time, the lesson is generated up front instead of after the interactive flow — logged as captured-not-taught, with the deferred retrieval owed at the next session's review.
- Mission grounding — the discovery interview now distills a concrete mission ("ship a Rust CLI to my team", not "learn Rust") persisted at the top of plan.md; every session objective must trace back to it.
- Trusted-source curation — new resources.md per topic: 2-3 high-trust annotated sources gathered before the first session, cited while teaching, one recommended per session. No teaching from parametric knowledge alone on fast-moving or factual topics.
- Learning records — ADR-style decision-grade insights (LR-NNNN) in the session log, used to compute the zone of proximal development; covers demonstrated understanding, disclosed prior knowledge, corrected misconceptions, and mission shifts, with supersession rules.
- Living glossary — glossary.md as canonical terminology; terms promoted only on demonstrated understanding.
- Fluency vs storage strength — new learning-science section (Bjork) with the asymmetric difficulty rule: difficulty is the enemy during TEACH, the tool during PRACTICE; mastery judged only on spaced performance.
- Wisdom delegation — real-world-judgment questions route to high-reputation communities recorded in resources.md.
- Answer-option hygiene — multiple-choice rules (equal-length options, parallel grammar, misconception-based distractors) in assessment patterns.
- One workspace per topic — canonical locations (learning/[topic-slug]/ at the repo root for project topics, ~/.claude/learning/[topic-slug]/ for general ones); artifacts are never split across locations or duplicate slugs, resume consolidates strays, and first project use asks whether to commit or gitignore the workspace.

3.35.0

Minor Changes

87827c4: Comprehensive quality pass across all Claude config: skills, agents, commands, settings

Correctness fixes
- folder-structure: lint examples now block bare react/next/react-dom imports (minimatch react/** only matches subpaths, so the rule as written missed the most common violation)
- front-end-testing/react-testing: MSW guidance now covers setupWorker from msw/browser for Browser Mode (previously only setupServer from msw/node was shown, which does not work in a real browser); act() guidance scoped correctly for renderHook
- functional: fixed example that returned boolean | undefined under strict mode
- typescript-strict: branded-type example now uses a validating constructor instead of bare as assertions (which violated the skill's own rule); added schemas-at-trust-boundaries and Standard Schema guidance
- expectations: removed "then commit" guidance that contradicted the wait-for-commit-approval rule; added MUTATE/KILL MUTANTS to the cycle
- tdd: added commit-approval STOP to the workflow; added triangulation guidance to GREEN
- mutation-testing: removed non-existent Stryker testFiles option, named @stryker-mutator/vitest-runner as required, added Browser Mode caveat
- testing: jest.spyOn → vi.spyOn; documented the callback-prop exception to the spy anti-pattern
- api-design: rate limiting updated to current IETF draft fields (RateLimit/RateLimit-Policy)
- generate-pr-review: removed Go-style string concatenation artifacts that would corrupt generated /pr commands
- agents: added missing Write tool to progress-guardian, twelve-factor-audit, docs-guardian (their jobs require creating files); pr-reviewer greps now cover Vitest
Progressive disclosure restructures (lean SKILL.md + on-demand resources)
- front-end-testing: 1061 → 360 lines + 3 new resources (async-patterns, msw, dom-testing-library-legacy); universal query-priority/philosophy content un-mislabeled from "Legacy"
- react-testing: 590 → 319 lines + testing-library-react-legacy resource; forms/error-boundaries/portals/Suspense ported to Browser Mode dialect; RSC note added
- functional: 732 → 273 lines + immutability-catalog and composition-patterns resources
- twelve-factor: 413 → 237 lines + node-patterns resource
- api-design: RFC 9457 deep detail extracted to problem-details resource
- cli-design: intra-file duplicate tables removed; composability examples extracted
Deduplication and consistency
- domain-driven-design: branded types and illegal-states sections now defer to typescript-strict, keeping only DDD-specific guidance; testing-priority table deduplicated with hexagonal-architecture; use-case placement reconciled with folder-structure across both architecture skills; broken ../REFERENCES.md link fixed for standalone installs
- CLAUDE.md: added missing folder-structure and production-parity-skill-builder routing (previously undiscoverable); seo-audit added to external skills list
- Sharper trigger descriptions for tdd, refactoring, mutation-testing, ci-debugging, expectations, find-gaps; mutually exclusive scopes for the four review agents (tdd-guardian/ts-enforcer/refactor-scan/pr-reviewer)
- ci-debugging: added "Getting the data" (gh run commands) and TDD handoff sections
- expectations: rewritten around a learning-destination matrix (CLAUDE.md vs ADR vs plans vs memory) with routing to learn/adr agents
- commands: /pr gets main-branch guard + package-manager permissions for its quality gate; /continue gets dirty-worktree and merge-state checks; /plan and /pr get argument hints
- settings.json: hook uses npx --no-install (no silent registry installs), runs eslint even when prettier fails, 30s timeout, NotebookEdit matcher
- README: stale counts corrected (27 skills, ~160-line CLAUDE.md, 5 api-design resources)

3.34.0

Minor Changes

b1aec2e: Strengthen DDD aggregate design guidance with invariant-first approach. Add 'Relationship-Driven Aggregates' anti-pattern, 'Design From Invariants, Not Relationships' section with litmus test and code examples, 'Aggregates Serve Commands, Not Queries' section connecting CQRS thinking to aggregate boundary decisions, 'Enforcing Boundaries in TypeScript' section with three concrete patterns (accept child IDs not objects, create children through the root, expose ReadonlyArray), lifecycle identity as a boundary discovery heuristic, and data locality / cross-aggregate eventual consistency principles. Add two checklist items for invariant-justified boundaries and no query-only properties in aggregates.

3.33.0

Minor Changes

2335c7f: Add a folder-structure skill for designing screaming, feature-based project structures with source-backed guidance for vertical slices, colocation, protected DDD/hex domain cores, linted import boundaries, shared boundaries, and monorepo organization. Cross-reference it from the DDD and hexagonal architecture skills, while making clear those physical layout and lint rules apply only after folder-structure has been adopted.

3.32.0

Minor Changes

aaec53b: Add the Vercel Labs Next.js skills bundle to the default external skills installed via skills.sh.

The installer now includes vercel-labs/next-skills, which currently provides next-best-practices, next-cache-components, and next-upgrade.

3.31.0

Minor Changes

c624a29: Enable OpenCode's built-in LSP support, including TypeScript, and include the OpenCode package in the stow installer.

3.30.1

Patch Changes

fad45fd: Clarify that story-splitting hands implementation to planning, and that every planned implementation slice must load and apply the full tdd, testing, mutation-testing, and refactoring cycle before moving to the next slice.
2dcc9f1: Clarify the requirements-to-code skill pipeline so consumers know when to use grill-me, story-splitting, find-gaps, planning, and the TDD execution skills. Also enforce loading the full tdd, testing, mutation-testing, and refactoring skill cycle before implementing planned slices.

3.30.0

Minor Changes

33cc791: Add a story-splitting skill for breaking large stories, epics, features, and backlog items into small end-to-end user-value slices.

The skill synthesizes Tim Ottinger's story-splitting resource list and linked articles, with on-demand resources for the detailed pattern catalog and source notes.

3.29.1

Patch Changes

2c32802: Clarify that generated production parity skills must be stored directly inside the target project repository, never in user or global skill folders.

3.29.0

Minor Changes

8c06947: Add production-parity-skill-builder skill for creating app-specific skills that detect and fix drift between production and non-production environments. Idea credited to GitHub user @dm.

3.28.3

Patch Changes

c5e188e: Clarify that a vertical slice maps to a PR, not a single commit

The planning skill previously equated slice = commit, which contradicted the reality that TDD increments within a slice naturally produce multiple commits. Updated to clarify that a slice is the smallest independently mergeable PR, and that multiple TDD commits within a slice are expected.

3.28.2

Patch Changes

f0aaed5: Improve the planning skill to default to vertical slices.

The planning guidance now asks agents to plan thin end-to-end behaviours through the real production path, while keeping existing TDD, mutation testing, and commit-approval quality gates intact.

3.28.1

Patch Changes

7624f8e: Fix the Claude installer on macOS Bash by avoiding the Bash 4-only mapfile builtin when de-duplicating skill agent targets.

3.28.0

Minor Changes

7f35175: Add seo-audit as an external community skill from coreyhaines31/marketingskills

The installer now adds the seo-audit skill from coreyhaines31/marketingskills via skills.sh. Documentation now lists the marketing skill alongside the existing external skills and attributes the upstream source and MIT license.
9a774de: Improve mutation testing guidance with a Stryker-first workflow, an on-demand mutator-rules resource, full-project and diff-against-main commands, survivor triage, and mutation-aware TDD/testing guidance on when to fix issues directly versus asking for human judgment.

3.27.0

Minor Changes

4d08be3: Add Matt Pocock's grill-me skill to the default skills installer.

The installer now adds grill-me from mattpocock/skills alongside the existing external skill sources, and the docs explain when to use it for plan and design interrogation.

3.26.0

Minor Changes

618aab7: feat(install): install skills via the skills.sh CLI

Replace the three direct-curl skill install blocks (own, web-quality-skills, impeccable) in install-claude.sh with npx skills add -g -a claude-code -s '*' -y calls against skills.sh. The installer shrinks from ~700 to ~370 lines and skills can now be managed with npx skills list/update/find/remove after install instead of re-running the installer.

Why it matters:
- Multi-agent portability — the same skills are now installable against 40+ coding agents (Claude Code, Cursor, Codex, Copilot, OpenCode, Gemini CLI, Cline, Continue, Windsurf, …) via the -a <agent> flag. --with-opencode is now just an extra -a opencode on the existing install rather than a duplicated tree.
- New --agent <name> flag — repeatable, lets you target any supported agent without knowing the npx skills add syntax (e.g. --agent codex --agent cursor). Paired with --no-claude-code to target only non-Claude agents. Default remains claude-code.
- Lifecycle management — npx skills update -g propagates upstream changes instead of requiring a full reinstall, and npx skills find <query> surfaces skills beyond this repo from the open ecosystem.
- Installer no longer grows with the skill list — three curl loops with hard-coded file lists (including every resources/*.md and references/*.md) collapsed into three npx skills add calls; adding a new skill to claude/.claude/skills/ no longer requires a matching installer edit.
Migration from the old curl-based installer is automatic. The installer detects any pre-existing non-symlink directories under ~/.claude/skills/ (left behind by the curl-based installer), moves them aside to ~/.claude/skills.pre-skills-sh.<timestamp>/, then lets the skills CLI install each source through the universal ~/.agents/skills/ cache. Without this step, the skills CLI would keep the stale directories in place as Claude-Code-specific installs and they'd stay invisible to non-Claude agents (Codex, OpenCode, etc.), defeating the multi-agent point. After install, a verification pass warns if anything ended up as a regular directory.

Trade-offs (documented in the README):
- --skills-only now requires Node.js for npx. The script preflights and errors clearly if Node is missing; --claude-only and --agents-only still work without it.
- The skills CLI doesn't surface ref pinning yet, so skills always install from the latest upstream commit. --version still pins CLAUDE.md, commands, and agents.
CLAUDE.md, slash commands, and agents continue to download directly from this repo — they aren't skills and aren't part of the skills.sh ecosystem.

3.25.1

Patch Changes

4bf77b1: fix(find-gaps): use AskUserQuestion for enumerable decisions

Weave the AskUserQuestion tool into the find-gaps loop wherever the choice space is the value (failure strategies, severity re-triage, state-variance scoping, parking decisions, write-back confirmation, variant comparison via preview) — while keeping free text for questions where the user's specific words are the value (microcopy, domain vocabulary, novel decisions).

Why it matters: structured options surface trade-offs the user might not have thought through ("silent retry once" is rarely what they say first, but often the right call), speed up picking vs. generating, and let you batch 2–4 tightly-related sub-questions in one turn without reverting to a gap dump.

What's added:
- New Asking with Structure section with the core heuristic, good-fit table, bad-fit list, structural rules, and a worked payment-decline example showing how to batch two related sub-questions in one AskUserQuestion call
- Step 4 of the gap-closing loop now routes enumerable questions to AskUserQuestion and uses it for the write as-is / edit inline / discard close, with optional preview for comparing two G/W/T drafts side-by-side
- "One question per turn" rule clarified: a structured call with 2–4 tightly-related sub-questions is still one turn; batching unrelated gaps is a gap dump wearing a hat
What's forbidden:
- Inventing options just to use the tool (fabricated options anchor the user to guesses and hide their real answer)
- Using AskUserQuestion for microcopy (destroys the exact wording that is the point)
- Batching unrelated gaps into one call
No other docs change — this is an internal refinement of how the skill interacts with the user. README/CLAUDE.md entries are unchanged.

3.25.0

Minor Changes

dc9d98e: feat: add find-gaps skill for collaborative pre-implementation review

Add a new find-gaps skill that systematically surfaces missing states, unhandled edge cases, unstated assumptions, and unverifiable criteria in plans, acceptance criteria, and design mocks — then works interactively with the user to close each gap, turning answers into new acceptance criteria, plan updates, or mock-state specs written back to the source of truth.

Two core principles:
1. What isn't on the page is more dangerous than what is. Treat silence as a red flag, not a green light.
2. Every gap is a conversation, not a comment. A gap list nobody answers is a todo list with extra steps. The output of this skill is a tightened artifact, not a gap report.
The shape is a conversational loop:
- Survey the artifact against an artifact-specific checklist (plans / AC / mocks)
- Triage candidates into Blocker / Should-address / Nice-to-have
- Tell the user how many gaps and where — get agreement to proceed
- Walk them one at a time (or a tightly-coupled pair), starting with Blockers
- For each: ask the concrete question, refine vague answers until testable, convert to an artifact update, show it back, confirm, write to source of truth
- Recap every 3–5 closed gaps to catch contradictions
- Exit when Blockers + Should-addresses are closed, or park explicitly with owner if the user calls time
Answer-to-artifact conversion patterns (with worked examples in the skill):
- AC: Given / When / Then with a single observable outcome, actor, and any emitted events. A QA engineer should be able to execute it without follow-up questions.
- Plan update: the sentence or subsection that would have been in the plan if the author had thought of it — section, trade-off, failure mode named.
- Mock state spec: name / trigger / visual / behaviour / exit for each missed state, so it can be implemented without re-asking.
Working-with-the-user patterns the skill enforces:
- One question per turn (no bundling)
- Mirror the user's vocabulary verbatim — no silent "buyer → user" promotions
- Every confirmed answer is visible in the artifact before moving on
- Surface downstream trade-offs explicitly
- Escalate gaps the user can't decide — name the actual owner, park with question
- Don't re-ask triage — you already decided severity
Per-artifact discovery checklists (the engine for surfacing candidates):
- Plans: scope boundary, prerequisites, sequencing, failure & recovery, state & data, observability, security, testing, tribal knowledge
- Acceptance criteria: measurability (vague-word hit list), G/W/T discipline, negative paths, input edge cases, time & locale, actors, non-functional targets, completion
- Design mocks: UI states (loading/empty/error/offline/rate-limited), content variance, interaction states, responsive, accessibility, theme, permissions, i18n
Outputs:
- Primary: the updated artifact written to the source of truth (AC list, plan doc, states.md)
- Secondary: a resolution log with closed gaps (→ location) and parked gaps (→ owner)
Registered in CLAUDE.md (skills list + pointer), README.md (skill count 23→24, Key Sections row, two Quick Navigation by Problem rows), and install-claude.sh (mkdir + skills-array entry).

3.24.0

Minor Changes

640ee0a: feat: add find-skills skill for discovering agent skills from the open ecosystem

Add the find-skills skill sourced from vercel-labs/skills. Helps Claude discover and install skills from the open agent skills ecosystem (npx skills, skills.sh) when the user asks "how do I do X", "find a skill for X", or expresses interest in extending capabilities.

What it does:
- Activates when users ask how to accomplish something that might exist as an installable skill
- Checks the skills.sh leaderboard first for well-known skills
- Runs npx skills find [query] with domain-appropriate keywords
- Verifies quality before recommending (install count ≥ 1K, trusted sources, GitHub stars)
- Presents options with install commands and links; offers to install with npx skills add <owner/repo@skill> -g -y
- Falls back to direct help or suggesting npx skills init when no skill matches
Licensing: Vendored under MIT. The upstream repository declares MIT in its package.json and README but does not ship a root LICENSE file, so a reproduced MIT notice is included at claude/.claude/skills/find-skills/LICENSE to preserve attribution. The install script downloads both SKILL.md and LICENSE.

Patch Changes

640ee0a: fix: install-claude.sh now covers storyboard, teach-me, and diagrams skills

Three skills that had been committed to the repo were never added to the installer's skill list, so they were silently missing from ~/.claude/skills/ for anyone running install-claude.sh:
- storyboard (added in #128) — multi-surface design audit
- teach-me (added in #126) — evidence-based private tutor, plus 4 resource files
- diagrams (added in #122) — Mermaid/Graphviz/Vega-Lite/etc., plus LICENSE, examples.md, and 8 reference files
The installer now creates the missing directories, downloads all three SKILL.md files, and pulls in each skill's resources/references and any vendored LICENSE file.

Also updates README.md to reflect the actual catalog:
- Skill count bumped 21 → 23 (summary line and detailed install breakdown)
- Key Sections and Quick Navigation by Problem tables both now have rows for teach-me and diagrams, with MIT attribution visible inline for diagrams

3.23.0

Minor Changes

ed5ef9b: Add storyboard skill for multi-surface design audits

The storyboard skill produces a single HTML page that stitches every UX surface in a scope of work into one reviewable view — each existing mock embedded as a live <iframe>, a flow diagram showing how the user moves through them, per-mock audit checklists, and gap cards for mocks still to produce.

Solves the "open a dozen tabs and try to hold the flow in your head" problem. Use before any feature whose scope touches ≥ 2 UX surfaces begins implementation, before a planned chunk of work that covers multiple surfaces, or whenever the user asks for "the whole flow in one place" / "audit the mocks".

Pairs with the impeccable family (/shape, /critique, /layout, /clarify, /audit, /polish, /adapt, /harden, /distill) — the audit checklists reference these, and gap-mock production runs them.

Non-negotiables baked in: live iframes not screenshots (screenshots go stale), every gap card has brainstorm questions (forcing function for decisions before drafting), one scrollable page not a tree, <pre> for ASCII diagrams (prettier-safe).

3.22.0

Minor Changes

42599bf: feat: add teach-me skill for structured learning and tutoring

Add a new /teach-me [topic] skill that turns Claude into an evidence-based private tutor for any topic. Grounded in learning science research (active recall, spaced repetition, Bloom's Taxonomy, Feynman Technique, deliberate practice, metacognition).

What it does:
- Discovery interview to assess learner level, goals, and context
- Generates structured learning plans using the 80/20 principle and spiral curriculum
- Interactive sessions: review → teach → check → practice → reflect → log
- Socratic questioning — guides discovery rather than giving answers
- Progressive difficulty through Bloom's Taxonomy levels
- Spaced repetition scheduling across sessions
- Confidence calibration (self-rated vs actual performance)
- Integrates with existing skills when the topic matches (e.g., /teach-me hexagonal-architecture uses the hex arch skill as curriculum)
Course generation:
- Creates structured, standalone course materials with sessions and exercises
- Project-local (learning/) or general (~/.claude/learning/) placement
- Work-derived courses that reference actual project code as examples
- Cheat sheet / reference card generation
Persistence:
- Learning files (plan, session log, cheat sheet) persist on disk
- Memory system integration for cross-session continuity
- Automatic resume on re-invocation with spaced review
Resources (4):
- learning-science.md — evidence-based techniques reference (active recall, spaced repetition, interleaving, elaborative interrogation, desirable difficulties, testing effect, concrete examples, dual coding)
- assessment-patterns.md — Bloom's Taxonomy question bank, quiz design, feedback patterns, confidence calibration, code exercise patterns (PEMC)
- course-generation.md — templates for learning plans, course files, session materials, exercises, session logs
- session-management.md — multi-session tracking, spaced repetition scheduling, adaptation signals, graduation criteria

3.21.0

Minor Changes

a9e65a8: feat: integrate impeccable design skills from pbakaus/impeccable

Replace the frontend-design skill with the comprehensive impeccable design system by Paul Bakaus. This adds 18 externally-fetched design skills (1 core + 17 steering commands) with a systematic methodology for creating distinctive, high-quality frontend interfaces.

What changed:
- Removed frontend-design skill (replaced by impeccable, which is a strict superset)
- Added external fetch of 18 impeccable skills from pbakaus/impeccable:
  - Core: impeccable (with 9 reference files for typography, color, spatial design, motion, interaction, responsive, UX writing, craft flow, extract flow)
  - Steering commands: shape, critique, audit, polish, harden, typeset, colorize, animate, layout, clarify, adapt, bolder, quieter, distill, delight, optimize, overdrive
  - Critique reference files: cognitive-load, heuristics-scoring, personas
- Added --no-impeccable install flag (and --no-external now skips both web-quality-skills and impeccable)
- Added impeccable workflow documentation to README
- Apache 2.0 license and NOTICE files stored alongside skills for attribution compliance
Getting started:
- /impeccable teach - Set up design context for your project
- /impeccable craft [feature] - Full shape-build-iterate design flow
- /critique - UX review with Nielsen's heuristics scoring
- /polish - Final quality pass
Attribution: Paul Bakaus (Apache 2.0 License)

3.20.0

Minor Changes

067db3c: Add consolidated diagrams skill for Markdown visualizations
- New diagrams skill with decision-tree router for 8 diagram engines
- Covers Mermaid, Graphviz, Vega-Lite, PlantUML, Infographic, JSON Canvas, Architecture (HTML), and Infocard (HTML)
- Consolidates 15 upstream skills into 1 directory with 8 reference files
- PlantUML reference covers UML, cloud (AWS/Azure/GCP), network, security, ArchiMate, BPMN, data analytics, and IoT
- Includes examples file with 15 rendered diagram samples
- Adapted from markdown-viewer/skills (MIT license) with proper attribution

3.19.3

Patch Changes

0551337: fix: improve characterisation-tests and finding-seams skills

characterisation-tests:
- Add async characterisation guidance with worked examples (SKILL.md + writing-process.md)
- Mention "golden master testing" as alternative name for discoverability
- Replace beforeEach/afterEach fake timers with withFrozenTime helper in modern-tooling.md
- Replace jest-extended-snapshot example with pure Vitest it.each + inline snapshot approach
- Add "not awaiting async results" to common mistakes table
finding-seams:
- Add inline worked example to main SKILL.md so it's useful without loading resources
- Add code smell → technique quick-lookup table
- Fix duplicate const calculateOrder variable name in seam-types.md
- Add async seam patterns (seam-types.md + creating-seams.md Technique 6)
- Add seam granularity guidance (when to create a seam vs when not to)

3.19.2

Patch Changes

c71ebe3: Final polish for legacy code skills based on second review round

characterisation-tests:
- Add explicit "When NOT to Use" section (greenfield code, existing specs, adequate test coverage, permanent strategy)
- Add "Naming and Identification" section: characterises prefix in test names, .characterisation.test.ts file suffix, block comment explaining purpose and lifecycle, SUSPICIOUS markers for potential bugs. Another LLM or human should immediately recognise these as temporary characterisation tests.
- Update worked example to follow naming conventions
- Fix External Service Responses guidance in modern-tooling.md to recommend parameter injection first (consistent with finding-seams "last resort" messaging)
finding-seams:
- Fix sensing example in seam-types.md: remove type assertion, return defensive copy from closure
- Fix inMemoryStorage in creating-seams.md: return defensive copy, simplify verbose return type annotation

3.19.1

Patch Changes

ef3e5ba: Align finding-seams and characterisation-tests skills with FP-first principles

Both skills were too class-heavy, reading like they were written for an OOP/Java audience rather than for a TypeScript FP workflow. This brings them in line with the functional skill's conventions.

finding-seams:
- Reorder to lead with function parameter injection as the primary seam technique
- Move class-based patterns (object seams, extract and override, parameterize constructor) to a separate resources/oop-patterns.md with clear "legacy OOP" framing
- Add React/Next.js seam examples (props as seams, context as seams, MSW for API boundaries)
- Add connection to hexagonal architecture (ports = designed-in seams)
- Strengthen vi.mock() warning as last-resort scaffolding
- Replace over-engineered class examples with simple default parameters
characterisation-tests:
- Add "when to stop" heuristic (cover every branch your change touches + one layer out)
- Add mutation testing validation step after characterising
- Replace monkey-patching sensing with parameter injection
- Add anti-pattern for vi.mock() sensing in common mistakes table

3.19.0

Minor Changes

7fd646d: Add finding-seams and characterisation-tests skills

Two new skills extracted from Michael Feathers' Working Effectively with Legacy Code (2004), adapted for TypeScript/JavaScript. These fill a gap in the existing skill set -- the current workflow (tdd, testing, mutation-testing, refactoring) assumes code is already testable. These two skills address the prerequisite step: making untestable legacy code testable and documenting its existing behavior before changing it.

finding-seams -- identify substitution points (seams) that make legacy or tightly-coupled code testable without editing at the call site:
- SKILL.md: core concept, seam types quick reference, how to find seams, progression from quick-fix to proper design
- resources/seam-types.md: module, object, function parameter, and configuration seams with TypeScript examples
- resources/creating-seams.md: six techniques for introducing seams (extract and override, parameterize method/constructor, extract interface, wrap static calls, module indirection)
characterisation-tests -- document actual behavior of existing code before making changes:
- SKILL.md: core concept, the 5-step algorithm, heuristics, handling bugs, temporary nature of characterisation tests
- resources/writing-process.md: worked example with targeted testing, sensing variables, pinch points
- resources/modern-tooling.md: Vitest snapshots, combination testing, non-determinism handling, approval testing, coverage-guided characterisation

3.18.0

Minor Changes

3cfe887: Add cli-design skill for Unix-composable CLI patterns

New skill covering how to build CLI tools that compose well in Unix pipelines. Language-agnostic core principles (stdout/stderr stream separation, format flags, exit codes, TTY detection, composability, error design) with TypeScript implementation patterns in resources/.
- SKILL.md: language-agnostic CLI design principles
- resources/output-architecture.md: TypeScript patterns (Result types, entry point wiring, formatters, JSON envelope)
- resources/testing-cli.md: Vitest testing patterns (stream separation, exit codes, pipe simulation, contract tests)
- resources/stream-contracts.md: buffering behavior, NDJSON, signal handling, crash-only design
Synthesized from 8 authoritative sources: clig.dev, 12 Factor CLI Apps, Heroku CLI Style Guide, galligan's three-layer architecture, yogin16/better-cli, steipete/create-cli, lirantal/nodejs-cli-apps-best-practices, Orhun Parmaksiz stdout vs stderr.

3.17.1

Patch Changes

a0192cb: Make api-design skill flexible on error response format

RFC 9457 Problem Details is now recommended for public APIs with external consumers. Internal APIs with a single frontend can use a simpler consistent shape (error code + optional message + field errors). The key requirement is consistency across endpoints, not a specific format.
- Add "Choosing an Error Format" section with guidance by API type
- Show simpler ApiError shape as a valid alternative
- Update verification checklist to accept either format
- Update Content-Type guidance for both formats

3.17.0

Minor Changes

cd92a7e: feat: enrich api-design skill with RFC BCP guidance (HTTP fundamentals, JWT/OAuth security, caching)

New resources:
- http-fundamentals.md: HTTP protocol guidance from RFC 9205 (BCP 56) — caching, URI design, browser security, content negotiation, status code discipline
- auth-security.md: JWT and OAuth 2.0 security deep-dive from RFC 8725 (BCP 225) and RFC 9700 (BCP 240) — algorithm allowlisting, PKCE, token handling, redirect validation
Updates:
- api-design/SKILL.md: added HTTP Caching section, URI Ownership principle, header naming guidance (no X- prefix), browser security headers in red flags and verification checklist
- api-security.md: expanded JWT/OAuth sections with RFC references, added Browser Security Headers and Transport Security sections, expanded security checklist
- REFERENCES.md: added 9 new RFC/BCP sources (RFC 9205, 8820, 8725, 9700, 9325, 8996, 6648, 8941, 6302)
- twelve-factor/SKILL.md: added RFC 6302 (BCP 162) logging recommendations for internet-facing servers to Factor XI

Patch Changes

8cbaacc: fix: add missing resource files and REFERENCES.md to install script

The install script only downloaded SKILL.md for each skill but missed 15 deep-dive resource files across hexagonal-architecture (5), domain-driven-design (6), and api-design (4), plus REFERENCES.md. Also removes accidentally committed plans/ddd-hex-arch-95-plus.md.
a60cb31: fix: remove protocol-spec guidance not relevant to normal web development

Removed content aimed at protocol specification authors rather than web developers:
- URI Ownership / URI Design & Discovery (RFC 8820) — not relevant when documenting your own API
- Content Negotiation custom media type registration — most web devs use application/json
- Versioning via HTTP Mechanisms (link relations, media types) — already covered practically in api-evolution.md
- Protocol Version Independence — devs don't specify HTTP versions
- Weak Algorithm Avoidance details (deterministic ECDSA, RSA-PKCS1 v1.5) — implementation details most devs never touch
- Client Authentication (mTLS, Private Key JWT) — enterprise-grade, not typical web dev
- Mix-Up Attack Defense — niche scenario (multiple auth servers)
- Structured Fields (RFC 8941) references — most devs don't design new HTTP header formats

3.16.0

Minor Changes

05483d7: feat: add api-design skill with deep-dive resources

Adapted from addyosmani/agent-skills, significantly expanded and modified to align with existing conventions.

Main skill covers:
- Hyrum's Law, One-Version Rule, contract-first development
- RFC 9457 error semantics (Problem Details for HTTP APIs) with security considerations
- Idempotency patterns (Stripe's idempotency keys for POST)
- Rate limiting (standard headers, 429 responses, Retry-After)
- REST conventions, pagination, filtering, input/output separation
- Backward compatibility, red flags, rationalizations, verification checklist
Deep-dive resources:
- resources/api-evolution.md — versioning strategies (Stripe date-pinning, URL, header), Postel's Law, Sunset/Deprecation headers, enum evolution, consumer-driven contract testing (Pact)
- resources/api-security.md — OWASP API Security Top 10 with TypeScript examples, authentication patterns (API keys, OAuth2+PKCE, JWT), security checklist
REFERENCES.md updated with authoritative sources (RFC 9457, RFC 8594, OWASP, Google/Microsoft/Zalando API guides, Brandur Leach, Phil Sturgeon, Arnaud Lauret, Joshua Bloch)

3.15.0

Minor Changes

eb50c18: Reorder TDD cycle to RED-GREEN-MUTATE-KILL MUTANTS-REFACTOR (credit: Eran Boudjnah)

Core change: Mutation testing now comes before refactoring in the TDD cycle, not after. You verify test strength before restructuring code, so you refactor with genuine confidence that your tests catch real bugs.

Rename: The "FIX" step (previously only in the planning skill) is renamed to "KILL MUTANTS" and promoted to a core step everywhere.

The full cycle is now: RED → GREEN → MUTATE → KILL MUTANTS → REFACTOR

Why this order matters: The previous RED-GREEN-REFACTOR-MUTATE ordering meant refactoring code whose test effectiveness was unverified. By mutating first, you validate your safety net before changing structure. This insight was pointed out by Eran Boudjnah on LinkedIn.

tdd skill:
- Core cycle updated to RED-GREEN-MUTATE-KILL MUTANTS-REFACTOR
- MUTATE and KILL MUTANTS added as explicit phases with guidance
- Commit history examples updated to show mutation testing step
- Summary checklist includes mutation testing verification
mutation-testing skill:
- Integration diagram updated to show MUTATE as step 3 of the core cycle (not a separate validation step)
- Added rationale for why MUTATE comes before REFACTOR
refactoring skill:
- Repositioned as the final step of TDD (after mutation testing)
- Workflow updated to include MUTATE and KILL MUTANTS before refactoring
planning skill:
- Extended cycle reordered to CONFIRM-RED-GREEN-MUTATE-KILL MUTANTS-REFACTOR-STOP
- Step template and quick reference updated
plan command:
- Step template reordered to match new cycle
tdd-guardian agent:
- Sacred Cycle updated to 5 steps
- Added MUTATE and KILL MUTANTS phase coaching guidance
- Response patterns updated to include mutation testing before refactoring
refactor-scan agent:
- Description updated: invoked after mutation testing, not after GREEN
progress-guardian agent:
- Workflow reference updated
agents README:
- All cycle references updated
- Workflow diagrams updated
CLAUDE.md:
- Core principle and quick reference updated
README.md:
- All workflow descriptions updated with new cycle and rationale
REFERENCES.md:
- Added Eran Boudjnah credit for the RED-GREEN-MUTATE-KILL MUTANTS-REFACTOR reordering insight

3.14.0

Minor Changes

a593cf0: Mutation testing skill now instructs literal code mutation and test execution, not just analysis

mutation-testing skill:
- Replaced analytical "Generate Mental Mutants" process with literal mutate-run-revert cycles
- AI actually changes production code, runs the test suite, evaluates results, and reverts
- Produces a structured mutation testing report (killed/survived/score)
- Added nuance for surviving mutants: fix critical ones immediately, ask the human when value is unclear
planning skill:
- Added CONFIRM gate: human must approve acceptance criteria before each step begins
- Expanded cycle to RED-GREEN-REFACTOR-MUTATE-FIX with explicit "kill surviving mutants" step
- Human reviews mutation testing report and approves before every commit
- Step template now requires specific, observable acceptance criteria per step
- Clarified test level guidance: prefer unit tests (vitest) for logic, browser tests (vitest browser mode) for UI, Playwright only for end-to-end flows
- Now references tdd skill for workflow alongside testing skill for factory patterns

3.13.0

Minor Changes

753c1bb: Restructure DDD and hexagonal architecture skills with decision frameworks, deep-dive resources, and authoritative references

DDD skill (6 resources) based on Evans, Vernon, Fowler, Stemmler, Wlaschin, Khorikov, Chassaing, Microsoft:
- "Where Does This Code Belong?" decision framework (purity is necessary but not sufficient)
- Model evolution as first-class principle ("Resisting Model Evolution" anti-pattern)
- Domain services with comparison table vs use cases
- Always-valid entities principle
- Make Illegal States Unrepresentable: boolean-to-union + exhaustive switch with never
- Domain Events: Decider pattern, in-process dispatch, outbox pattern, process managers
- Value object equality, Currency type, Zod bridging, reconstitution from persistence
- Glossary supports multiple bounded contexts
- Branded type factories with validation-then-brand pattern
- Specifications (predicate functions) as named building block
- Bounded Contexts: ACL, context mapping, comprehensive discovery methodology (language test, signal strength, workflow mapping)
- Error modeling (result types for business outcomes, exceptions for invariant violations, factory-vs-schema boundary)
- Property-based testing with fast-check
- Optimistic locking with version fields
- Interface vs type rationale for repository ports
- Use case placement resolved to domain/ (no ambiguity)
- Resource loading heuristics ("Load when..." table)
- Resources: aggregate-design.md, domain-services.md, testing-by-layer.md, domain-events.md, bounded-contexts.md, error-modeling.md
Hex arch skill (5 resources) based on Cockburn, Pierrain, Graca, Netflix, Seemann, Valentina Jemuović:
- Driving (left) vs driven (right) adapter distinction with visual diagram
- CQRS-lite (reads bypass repositories, query functions JOIN freely)
- DI via impureim sandwich (Seemann), wrong/right comparison, composition roots
- Event-driven driving adapters (SQS consumer) + event publishing port
- Adapter error handling (domain-specific errors for constraint violations)
- Cross-cutting concerns (auth vs authz, logging, transactions, error formatting)
- Anti-patterns with code examples (5 patterns, all wrong/right)
- Use case naming convention (business language, not pattern suffixes)
- Full stack worked example (one feature through every layer with tests and file map)
- Incremental adoption guide (strangler fig, step-by-step extraction)
- File organization accurately labels use cases as orchestration
- Mutable fakes acknowledged as deliberate testing-only exception
- createTestDb helper (fresh DB per test, no shared state)
- Inline Valentina Jemuović attribution in testing resources
- Resource loading heuristics ("Load when..." table)
- Resources: cqrs-lite.md, testing-hex-arch.md, worked-example.md, cross-cutting-concerns.md, incremental-adoption.md
New: REFERENCES.md — 15+ authoritative sources with clickable URLs and bidirectional traceability. Sources: Evans, Vernon, Fowler, Wlaschin, Chassaing, Khorikov, Greg Young, Udi Dahan, Stemmler, Gorodinski, Microsoft, Cockburn, Pierrain, Graca, Netflix, Seemann, Valentina Jemuović (5 article URLs), Farley, Beck, Bernhardt.

README — dedicated showcase sections for hex arch and DDD with code examples, matching the format of testing, TypeScript, TDD sections.

3.12.1

Patch Changes

4fe1ab1: Add "extract for readability, not testability" rule to testing and refactoring skills
- testing skill: new section "Don't Extract for Testability" with examples showing inline code tested through behavioral tests vs over-extracted unit-tested functions
- refactoring skill: added to "When NOT to Refactor" list, referencing existing DRY rules

3.12.0

Minor Changes

5cf7c5b: Add twelve-factor app skill and audit agent
- New twelve-factor skill with actionable TypeScript patterns for 12-factor compliant services
- New twelve-factor-audit agent for auditing existing codebases against the methodology
- Greenfield projects must follow all factors; brownfield projects adopt incrementally
- Covers config, dependencies, backing services, stateless processes, disposability, logging
- Integrated with /setup command for automatic detection of 12-factor patterns

3.11.2

Patch Changes

ea35242: Enforce TDD in plan documents
- Plan step template now uses RED/GREEN/REFACTOR labels instead of Test/Implementation
- Acceptance criteria must describe observable behaviour, not implementation details
- Plans must read project CLAUDE.md and testing rules before writing steps
- DDD glossary check wording clarified: mandatory when the project uses DDD
- Added constraints: TDD mandatory, test behaviour not implementation, read project testing rules

3.11.1

Patch Changes

7a7099a: fix: strip incompatible frontmatter when copying agents/commands to OpenCode

OpenCode validates agent frontmatter strictly — tools must be an object (not a string), color must be hex (not a named color), and allowed-tools is not a recognised field. The installer now copies files with sed to strip these Claude Code-specific fields instead of symlinking, fixing the "Configuration is invalid" error on startup.

3.11.0

Minor Changes

052edae: feat: add full OpenCode compatibility for commands and agents

OpenCode uses different directory paths for discovering slash commands and agents:
- Commands: ~/.config/opencode/command/ (singular) vs Claude Code's ~/.claude/commands/
- Agents: ~/.config/opencode/agent/ (singular) vs Claude Code's ~/.claude/agents/
The installer now creates symlinks from OpenCode's expected directories to the Claude Code source files when using --with-opencode or --opencode-only, so all 5 slash commands and 9 agents work identically in both tools with zero duplication.

Also updated opencode.json to include agent instructions in the instructions array.

3.10.0

Minor Changes

314bbeb: Replace single PLAN.md with plans/ directory system:
- Plans now live in plans/<feature-name>.md — multiple plans can coexist without conflicts across branches or worktrees
- Remove WIP.md and LEARNINGS.md — simplify to just plan files that get deleted when complete
- Remove PLANS.md index file — the directory itself is the index, avoiding merge conflicts
- Update /plan command, /continue command, planning skill, progress-guardian agent, agents README, and main README for consistency
- Fix /plan command to create regular PR (not draft)

3.9.1

Patch Changes

4178488: Add pre-PR quality gate and small PRs preference:
- Add mutation testing and refactoring assessment as explicit pre-PR steps in recommended flow, agents README, /plan and /pr commands
- Add "Prefer Multiple Small PRs" section to planning skill
- Remove arbitrary numeric thresholds (line counts, minute counts, file counts) from planning skill, expectations skill, refactor-scan agent, and learn agent

3.9.0

Minor Changes

0819e82: Improvements based on Claude Code insights analysis (63 sessions):
- Add output guardrails section to CLAUDE.md (write to files, plan-only mode, incremental output)
- Add ci-debugging skill for systematic CI failure diagnosis
- Add /plan slash command for plan-only workflows
- Add /continue slash command for post-merge workflow (pull, branch, update plan)
- Improve /generate-pr-review to also generate project hooks and /pr command
- Deduplicate typescript-strict and functional skills (~460 lines removed)
- Extract hexagonal-architecture as opt-in skill (not all projects use it)
- Add domain-driven-design as opt-in skill with glossary enforcement
- Add typecheck hook pattern docs and hexagonal-architecture reference to CLAUDE.md
- Add agent decision framework to agents README
- Trim adr agent (~585 → ~250 lines) and ts-enforcer agent (~649 → ~300 lines) via cross-references
- Update front-end-testing and react-testing skills to recommend Vitest Browser Mode
- Add cross-references between testing, mutation-testing, and test-design-reviewer skills
- Add Pick tip to testing skill factory pattern
- Add corrected example to refactoring skill speculative code section
- Update skills list in CLAUDE.md header to include all 15 skills
- Add /setup command for one-shot project onboarding (replaces /init)
- Update README.md with all new skills, commands, and Vitest Browser Mode references
- Update install-claude.sh to include all 15 skills and 5 commands
- Add "Recommended Flow" section to README and agents README showing full command lifecycle with rationale
- Improve skill frontmatter descriptions with trigger phrases and negative triggers per Anthropic best practices
- Add Playwright/Browser Mode test idempotency requirement to front-end-testing and react-testing skills

3.8.0

Minor Changes

04e245e: Add frontend-design skill from anthropics/skills (Apache 2.0 licensed)

3.7.0

Minor Changes

11071d7: Add browser automation section to CLAUDE.md

Documents agent-browser as the preferred tool for web automation, with fallback guidance to use other available tools (WebFetch, curl, MCP browser tools) when agent-browser is not installed.

3.6.0

Minor Changes

3109de0: Add web quality skills from addyosmani/web-quality-skills

The install script now fetches 6 web quality skills directly from Addy Osmani's web-quality-skills repository at install time, ensuring users always get the latest versions:
- accessibility - WCAG compliance, screen reader support, keyboard navigation
- best-practices - Security, modern APIs, code quality patterns
- core-web-vitals - LCP, INP, CLS specific optimizations
- performance - Loading speed, runtime efficiency, resource optimization
- seo - Search engine optimization, crawlability, structured data
- web-quality-audit - Comprehensive Lighthouse-based quality review
Skills are sourced from upstream rather than vendored, so they stay current as the original repository is updated. The upstream MIT License is preserved alongside the installed skills.

Use --no-external flag to skip external community skills during installation.

Attribution: Addy Osmani - web-quality-skills (MIT License).

3.5.0

Minor Changes

565c59b: Add test-design-reviewer skill for evaluating test quality using Dave Farley's principles

New skill that scores tests on 8 properties and provides a comprehensive Farley Score:
- test-design-reviewer skill: Evaluates test quality against Dave Farley's testing best practices
- Scores 8 properties: Understandable, Maintainable, Repeatable, Atomic, Necessary, Granular, Fast, First (TDD)
- Calculates weighted Farley Score (1-10) with detailed breakdown
- Provides actionable recommendations prioritized by impact
- Identifies test brittleness, maintenance issues, and TDD violations
- Runs in forked context using Explore agent for isolated analysis
The skill helps developers write tests that serve as living documentation and reliable safety nets. It provides specific evidence for each score and suggests concrete improvements with code examples.

Why a skill instead of an agent:

Skills are Anthropic's recommended pattern for analysis frameworks that should be auto-discovered by Claude. The test-design-reviewer is loaded on-demand when reviewing tests and can fork to an isolated context for analysis, providing the same capabilities as an agent while following modern Claude Code architecture patterns.

Usage examples:
- Auto-discovered when asking: "Review my authentication tests"
- Manual invocation: /test-design-reviewer path/to/tests
- Contextual analysis: "Are these tests maintainable?"
Attribution: This skill is adapted from Andrea Laforgia's claude-code-agents repository. Special thanks to Andrea for creating and sharing this comprehensive test design review framework.

Documentation updates:
- Moved test-design-reviewer from agents to skills directory
- Updated skill count from 10 to 11 across all documentation
- Updated agent count from 10 to 9 across all documentation
- Converted frontmatter to skill format with context: fork
- Added test-design-reviewer to skills installation in install-claude.sh
- Removed test-design-reviewer from agents installation
- Updated attribution to clarify it's now a skill
- Added to Key Sections table in main README
- Removed agent section #10 from main README
- Removed from agents/README.md
Reference: Based on Dave Farley's Properties of Good Tests: https://www.linkedin.com/pulse/tdd-properties-good-tests-dave-farley-iexge/

3.4.0

Minor Changes

6a4043c: Add mutation testing skill for verifying test effectiveness

New skill that provides systematic guidance for mutation testing analysis:
- Comprehensive mutation operator reference (arithmetic, conditional, logical, boolean, etc.)
- Weak vs strong test pattern examples for each operator
- Systematic branch analysis process (4-step workflow)
- Equivalent mutant identification and handling
- Test strengthening patterns (boundary values, branch coverage, avoiding identity values)
- Integration with TDD workflow (verify after GREEN phase)
- Optional Stryker integration guide
- Quick reference for operators most likely to have surviving mutants
Key insight: Code coverage measures execution; mutation testing measures detection. A test suite with 100% coverage can still miss 40% of potential bugs if tests don't make proper assertions.

3.3.0

Minor Changes

059e176: Add PR reviewer agent with direct GitHub commenting

New features:
- pr-reviewer agent: Comprehensive pull request review for TDD compliance, TypeScript strictness, testing quality, functional patterns, and general code quality
- /generate-pr-review command: Creates project-specific PR review automation combining global rules with project conventions
- Direct PR commenting: Agent posts reviews directly to GitHub PRs using MCP tools
The pr-reviewer agent reviews PRs across five categories:
1. TDD Compliance - Was test-first development followed?
2. Testing Quality - Are tests behavior-focused?
3. TypeScript Strictness - No any, proper types?
4. Functional Patterns - Immutability, pure functions?
5. General Quality - Clean code, security, scope?
Design decisions:
- Manual invocation only: Designed for use during Claude Code sessions rather than automated CI/CD pipelines. This saves significant API costs while still providing comprehensive reviews when needed.
- Direct GitHub integration: Posts reviews as PR comments using GitHub MCP tools (add_issue_comment, pull_request_review_write, add_comment_to_pending_review)
The /generate-pr-review command analyzes multiple sources to create project-specific reviewers:
- AI/LLM config files (.cursorrules, CLAUDE.md, .github/copilot-instructions.md, .aider.conf.yml)
- Architecture Decision Records (docs/adr/*.md)
- Project documentation (CONTRIBUTING.md, DEVELOPMENT.md, CODING_STANDARDS.md)
- Tech stack (package.json, tsconfig.json, eslint configs)
- Existing code patterns and conventions

3.2.0

Minor Changes

d65d843: # Add Front-End Testing Skills for Testing Library Patterns

Context: The existing testing skill covers general testing patterns (factories, public API, coverage theater) and tdd skill covers the RED-GREEN-REFACTOR workflow. However, there was no dedicated guidance for Testing Library-specific patterns and best practices.

This minor release adds two new skills to fill that gap:
1. front-end-testing: Framework-agnostic DOM Testing Library patterns
2. react-testing: React-specific Testing Library patterns
New Skills

1. front-end-testing (~890 lines) - Framework-Agnostic DOM Testing Library

DOM Testing Library patterns for behavior-driven UI testing across all frameworks (React, Vue, Svelte, etc.)

Key Sections

1. Core Philosophy (80 lines)
- Test behavior users see, not implementation details
- False negatives (tests break on refactor) vs false positives (bugs pass)
- Kent C. Dodds principle: "Test how software is used"
- Framework-agnostic examples (vanilla JS/HTML)
2. Query Selection Priority (100 lines) ⭐ MOST CRITICAL
- Accessibility-first query hierarchy (getByRole → getByLabelText → ... → getByTestId)
- Query variants: getBy* (throws), queryBy* (null), findBy* (async)
- Common mistakes with correct alternatives
- Works across all Testing Library implementations
3. User Event Simulation (80 lines)
- userEvent vs fireEvent (why userEvent is superior)
- userEvent.setup() pattern (2025 best practice)
- Common interactions: click, type, keyboard, select
- Framework-agnostic patterns
4. Async Testing Patterns (110 lines)
- findBy queries for async elements
- waitFor utility for complex conditions
- waitForElementToBeRemoved for disappearance
- Common patterns: loading states, API responses, debounced inputs
5. MSW Integration (90 lines)
- Why MSW (network-level interception)
- setupServer pattern for test setup
- Per-test overrides with server.use()
- Works across all frameworks
6. Accessibility-First Testing (70 lines)
- Why accessible queries improve both tests AND app quality
- When to add ARIA (custom components only)
- Semantic HTML priority over ARIA
7. Testing Library Anti-Patterns (200 lines) ⭐ HIGH VALUE 14 common mistakes with ❌ WRONG and ✅ CORRECT examples:
1. Not using screen object
2. Using querySelector
3. Testing implementation details
4. Not using jest-dom matchers
5. Manual cleanup() calls
6. Wrong assertion methods
7. beforeEach render pattern
8. Multiple assertions in waitFor
9. Side effects in waitFor
10. Exact string matching
11. Wrong query variants
12. Wrapping findBy in waitFor
13. Using testId when role available
14. Not installing ESLint plugins
8. Summary Checklist (30 lines) Quick reference for test review with cross-references to tdd, testing, and react-testing skills

2. react-testing (~460 lines) - React-Specific Patterns

React Testing Library patterns for testing React components, hooks, and context

Key Sections

1. Opening Paragraph (10 lines)
- References front-end-testing skill for general DOM patterns
- References tdd skill for RED-GREEN-REFACTOR workflow
- References testing skill for factory patterns
2. Testing React Components (60 lines)
- Components as functions: props → rendered DOM
- Testing props and their effects
- Testing conditional rendering
- Example patterns with ❌/✅ comparisons
3. Testing React Hooks (60 lines)
- renderHook API (built into RTL since v13)
- result.current pattern
- act() for state updates
- rerender() for testing with different props
4. Testing Context (60 lines)
- wrapper option for context providers
- Multiple providers pattern
- Testing components that consume context
- Custom render helpers
5. Testing Forms (60 lines)
- Controlled inputs
- Form submissions
- Form validation
- userEvent integration
6. React-Specific Anti-Patterns (80 lines) ⭐ HIGH VALUE 5 React-specific mistakes:
1. Unnecessary act() wrapping (RTL handles it)
2. Manual cleanup() calls (automatic since RTL 9)
3. beforeEach render pattern (use factories)
4. Testing component internals (state, methods)
5. Shallow rendering (use full render)
7. Advanced React Patterns (90 lines)
- Testing loading states
- Testing error boundaries
- Testing portals
- Testing Suspense
8. Summary Checklist (20 lines) React-specific checks with cross-references to front-end-testing, tdd, and testing skills

Separation of Concerns

front-end-testing (Framework-Agnostic) DOES cover:
- DOM Testing Library query APIs (works with React, Vue, Svelte)
- userEvent vs fireEvent
- Async patterns (findBy, waitFor)
- MSW integration for API mocking
- Accessibility-first querying
- Testing Library anti-patterns (screen, jest-dom)
- Generic UI testing patterns
front-end-testing does NOT cover:
- React-specific APIs (renderHook, wrapper option)
- React component testing patterns
- React hooks testing
- React context testing
- Framework-specific anti-patterns
react-testing (React-Specific) DOES cover:
- React Testing Library specific APIs
- renderHook for custom hooks
- wrapper option for context providers
- Testing React components, hooks, context
- React-specific anti-patterns (act, cleanup)
- React patterns (Suspense, error boundaries, portals)
react-testing does NOT cover:
- Generic DOM Testing Library patterns (delegates to front-end-testing)
- General testing patterns (delegates to testing skill)
- TDD workflow (delegates to tdd skill)
Cross-References

front-end-testing references:
- tdd skill for RED-GREEN-REFACTOR workflow
- testing skill for factory patterns
- react-testing skill for React-specific patterns
react-testing references:
- front-end-testing skill for general DOM Testing Library patterns
- tdd skill for TDD workflow
- testing skill for factory patterns
CLAUDE.md references:
- Both skills in Architecture section skill list
- Testing Principles section references both skills
Files Modified

New Files
- claude/.claude/skills/front-end-testing/SKILL.md (~890 lines)
- claude/.claude/skills/react-testing/SKILL.md (~460 lines)
Updated Files
- README.md: Updated skill count (8 → 9), added both skills to Key Sections table and Quick Navigation table
- install-claude.sh: Added both skills to directory creation, skills array, and install summary
- ~/.claude/CLAUDE.md (user's global): Updated skill list in Architecture section and Testing Principles section
Key Principles from Sources

From fullstack-react-tdd-example (https://github.com/citypaul/fullstack-react-tdd-example)
- "Testing against behavior rather than implementation details provides more value"
- "The purpose of good tests is to give us the confidence to make changes over time"
- User-perspective testing with accessible selectors
- MSW for consistent mocking across tests and development
From Testing Library Philosophy
- "The more your tests resemble the way your software is used, the more confidence they can give you"
- Accessibility queries improve both tests AND app quality
- Query priority: role → label → text → testId
- Framework-agnostic patterns (DOM Testing Library works everywhere)
From Kent C. Dodds' React Testing Library Best Practices
- False negatives (break on refactor) = brittle tests
- False positives (bugs pass) = useless tests
- Test the contract (public API), not the implementation
- userEvent over fireEvent (realistic simulation)
Impact

Before:
- No dedicated Testing Library guidance
- React developers had to piece together patterns from general testing skill
- No query selection priority guidance
- No Testing Library anti-patterns catalog
- MSW integration patterns missing
After:
- ✅ Two comprehensive Testing Library skills (~1350 lines total)
- ✅ Framework-agnostic patterns work across React, Vue, Svelte
- ✅ React-specific patterns separated into dedicated skill
- ✅ Clear query selection hierarchy (accessibility-first)
- ✅ 14 general + 5 React-specific anti-patterns with solutions
- ✅ MSW integration patterns documented
- ✅ userEvent best practices (setup() pattern)
- ✅ renderHook patterns for custom hooks
- ✅ All cross-references between skills maintained
Total skills: 8 → 9 (tdd, testing, front-end-testing, react-testing, typescript-strict, functional, refactoring, expectations, planning)

3.1.0

Minor Changes

36fbd1e: Add OpenCode support for using CLAUDE.md guidelines with opencode.ai
- Added opencode/.config/opencode/opencode.json configuration file that loads CLAUDE.md and skills
- Added --with-opencode and --opencode-only flags to install-claude.sh
- Updated README.md with OpenCode integration documentation
OpenCode (https://opencode.ai) is an open source AI coding agent. This configuration allows OpenCode users to use the same CLAUDE.md guidelines and skills that work with Claude Code.

Patch Changes

5227203: Document how to pass arguments to one-liner installation
- Add bash -s -- pattern for passing flags to curl-piped scripts
- Add --with-opencode to the install options list
- Update OpenCode installation section with one-liner examples

3.0.1

Patch Changes

380cbfa: # Global Skills Restoration: v2.0.0 → v3.0.0 Complete Recovery + Reorganization

Context: The v3.0.0 CLAUDE.md refactor successfully reduced the main file from 4,936 to ~350 lines by moving content to skills, but investigation revealed 1,714 lines (48%) of critical guidance was lost rather than moved.

This patch:
1. Restores all missing content to the global skills system
2. Reorganizes testing/tdd skills to eliminate duplication
3. Removes project-specific content to make skills universally applicable
Restoration Summary

testing skill (+381 lines - Previously Missing!)

The testing skill was completely missing from the dotfiles repo (69 lines → 425 lines)
- ✅ Core Principle: Test behavior, not implementation
- ✅ Test Through Public API Only (with detailed examples)
- ✅ Coverage Through Behavior (how to achieve coverage without testing implementation)
- ✅ Test Factory Pattern (moved from tdd skill - belongs here)
- ✅ Coverage Theater Detection (4 anti-patterns consolidated from both skills)
- ✅ No 1:1 Mapping Between Tests and Implementation
- ✅ All examples now generic (removed scenarist-specific content)
tdd skill Reorganization (581 → 343 lines)

Focused on TDD workflow, references testing skill for "how to write good tests"
- ✅ Opening reference to testing skill
- ✅ Removed Test Factory Pattern (moved to testing)
- ✅ Removed Coverage Theater Detection (consolidated in testing)
- ✅ References testing skill for anti-patterns
- ✅ All examples now generic (removed scenarist-specific content)
Critical Restorations

typescript-strict skill (+653 lines, was 94% missing)
- ✅ Schema Placement Architecture (500 lines) - CRITICAL: Schemas ALWAYS in core, NEVER in adapters
- ✅ Dependency Injection Pattern (100 lines) - CRITICAL: Domain logic must NEVER create port implementations
- ✅ Type vs Interface Rationale (80 lines) - WHY interface for contracts, type for data
- ✅ Strict Mode Configuration, Immutability Patterns, Factory Pattern
tdd skill (+529 lines, was 83% missing)
- ✅ Coverage Verification Protocol (78 lines) - CRITICAL: "NEVER trust coverage claims without verification"
- ✅ Coverage Theater Detection (4 patterns that give fake 100% coverage)
- ✅ TDD Evidence in Commit History (40 lines) - How to document multi-session TDD work
- ✅ 100% Coverage Exception Process (23 lines) - Formal process for requesting exceptions
- ✅ Test Factory Pattern (107 lines) - Factory composition, schema validation, anti-patterns
functional skill (+563 lines, was 76% missing)
- ✅ No Comments / Self-Documenting Code (30 lines) - FIXES ORPHANED ITEM from CLAUDE.md
- ✅ Array Methods Over Loops (40 lines) - FIXES ORPHANED ITEM from CLAUDE.md
- ✅ Options Objects Over Positional Parameters (30 lines) - FIXES ORPHANED ITEM from CLAUDE.md
- ✅ Pure Functions (50 lines)
- ✅ Composition Over Complex Logic (40 lines)
- ✅ Readonly Keyword for Immutability (25 lines)
- ✅ Deep Nesting Limitation (12 lines)
refactoring skill (+62 lines, was 71% missing)
- ✅ Commit Before Refactoring - WHY (15 lines) - Safety net for experimentation
- ✅ Speculative Code is TDD Violation (15 lines) - Delete "just in case" logic
- ✅ When NOT to Refactor (20 lines) - Criteria for deferring refactoring
- ✅ Commit Messages for Refactoring (10 lines)
- ✅ FIXED: Now properly referenced in CLAUDE.md (was completely missing from documentation)
CLAUDE.md Fixes
- ✅ Added pointer to functional skill in Code Style section
- ✅ Added pointer to tdd skill in Development Workflow
- ✅ Added pointer to refactoring skill in Development Workflow (was completely missing)
- ✅ Added pointer to refactoring skill in Working with Claude
- ✅ Added pointer to testing skill in Testing Principles section (fixes orphaned content)
- ✅ Added pointer to typescript-strict skill in TypeScript Guidelines section (fixes orphaned content)
- ✅ All orphaned quick reference items now have proper skill coverage
Final Verification Additions (Agent-Discovered)

After multi-agent verification against v2.0.0, additional critical content was added:

functional skill (additional +70 lines):
- ✅ "Why Immutability Matters" section with 5 key benefits (predictable, debuggable, testable, React-friendly, concurrency-safe)
- ✅ "Functional Light" philosophy section explaining practical FP approach (no category theory/monads)
- ✅ Complete array mutation catalog with pop, shift, unshift, reverse, index assignment alternatives
typescript-strict skill (additional +10 lines):
- ✅ 5 critical compiler flags added:
  - noUncheckedIndexedAccess (arrays return T | undefined)
  - exactOptionalPropertyTypes (precise optional types)
  - noPropertyAccessFromIndexSignature (safer dynamic access)
  - forceConsistentCasingInFileNames (cross-OS safety)
  - allowUnusedLabels: false (catch accidental labels)
Total Impact

Lines Restored: 1,807 lines across 4 skills
- typescript-strict: +653 lines (CRITICAL)
- tdd: +529 lines (CRITICAL)
- functional: +563 lines (fixes 3 orphaned items)
- refactoring: +62 lines (now discoverable)
Issues Fixed:
1. ❌ → ✅ Schema placement guidance prevents adapter duplication (violates hexagonal architecture)
2. ❌ → ✅ Coverage verification protocol prevents fake coverage from passing reviews
3. ❌ → ✅ DI pattern guidance preserves hexagonal architecture
4. ❌ → ✅ CLAUDE.md quick reference items all have proper skill coverage (no more orphaned content)
5. ❌ → ✅ Refactoring skill now discoverable (was hidden)
6. ❌ → ✅ Multi-session TDD work can be properly documented in PRs
7. ❌ → ✅ Formal process for coverage exceptions
8. ❌ → ✅ Type vs Interface has WHY explained, not just WHAT
9. ❌ → ✅ Refactoring assessment framework accessible
10. ❌ → ✅ Functional programming patterns prevent code quality degradation
Documentation Added

Three comprehensive documentation files added to track the analysis and restoration:
- docs/v2-v3-detail-restoration-plan.md - Original testing skill analysis (358 lines)
- docs/comprehensive-v2-v3-restoration-plan.md - Complete skill-by-skill analysis (744 lines)
- docs/v2-v3-critical-findings-summary.md - Executive summary with top 10 critical issues (295 lines)
Verification

All restored content emphasizes:
- ✅ Behavior-driven testing (not implementation testing)
- ✅ WHY explained for every principle (not just WHAT)
- ✅ Real-world examples with WRONG ❌ vs CORRECT ✅ patterns
- ✅ Architectural rationale (hexagonal architecture, DI, schema placement)
- ✅ All CLAUDE.md quick reference items have full skill coverage
Impact on Development

Without these restorations, developers would:
- ❌ Duplicate schemas across adapters (violates hexagonal architecture)
- ❌ Submit fake 100% coverage (no verification protocol)
- ❌ Hardcode implementations (no DI pattern guidance)
- ❌ Follow CLAUDE.md quick ref to dead ends (3 orphaned principles)
- ❌ Miss refactoring methodology (skill existed but not mentioned)
- ❌ Document multi-session TDD incorrectly (no PR pattern)
- ❌ Request coverage exceptions informally (no formal process)
- ❌ Follow rules without understanding WHY
- ❌ Refactor without methodology
- ❌ Write imperative instead of functional code
All issues resolved ✅

3.0.0

Major Changes

fd148e9: feat: skills-based architecture with planning workflow (v3.0)

Skills (7 auto-discovered patterns)
- tdd - RED-GREEN-REFACTOR workflow
- testing - Factory patterns and behavior testing
- typescript-strict - TypeScript strict mode patterns
- functional - Functional programming with immutability
- refactoring - Assessment framework and priorities
- expectations - Working expectations and documentation practices
- planning - NEW Small increments, three-document model, commit approval
Planning Workflow (NEW)

Three-document model for significant work:
- PLAN.md - What we're doing (changes require approval)
- WIP.md - Where we are now (constantly updated)
- LEARNINGS.md - What we discovered (merged at end, then deleted)
Key principles:
- All work in small, known-good increments
- TDD non-negotiable (RED-GREEN-REFACTOR)
- Commit approval required before every commit
- Learnings captured as they occur, merged into CLAUDE.md/ADRs at end
Agents
- Renamed wip-guardian → progress-guardian
- progress-guardian now manages three-document model
Commands (1 slash command)
- /pr - Create pull requests (no test plan needed with TDD)
Context Optimization
- CLAUDE.md reduced to ~100 lines (always loaded)
- No @imports - fully self-contained
- Detailed patterns loaded on-demand via skills
Breaking Changes from v2.0
- Removed docs/ directory entirely
- Content migrated to skills (loaded on-demand instead of always)
- wip-guardian renamed to progress-guardian with enhanced functionality
Migration from v2.0
- Use --version v2.0.0 with install script to keep modular docs
- Skills provide same content but with better context efficiency

2.3.0

Minor Changes

Add Claude Code settings.json with claude-powerline statusline and plugins
- Add claude/.claude/settings.json with personal Claude Code configuration
- Configure claude-powerline for vim-style statusline with usage tracking and git integration
- Enable official Anthropic plugins: feature-dev, frontend-design, hookify, learning-output-style, plugin-dev, security-guidance
- Document settings.json in README with explanation of each plugin and how to use/merge with existing settings

2.2.0

Minor Changes

db4fc5c: Add use-case-data-patterns agent for architectural analysis

Added a new agent that analyzes how user-facing use cases map to underlying data access patterns and architectural implementation in the codebase. This agent helps developers understand existing patterns before implementing new features.

This agent is adapted from Kieran O'Hara's dotfiles. Thank you to Kieran O'Hara for creating and sharing this excellent agent specification.

Key features:
- Creates comprehensive analytical reports mapping use cases to data patterns
- Traces through architecture layers (endpoints, middleware, business logic, data access)
- Identifies database patterns, caching strategies, and external integrations
- Highlights gaps and provides recommendations
- Does NOT edit files - purely analytical

2.1.2

Patch Changes

1e63ef7: Fix YAML frontmatter syntax in all agent files

All custom agent files had malformed YAML in the description field causing parsing errors on GitHub ("mapping values are not allowed in this context").

Fixed:
- Removed embedded examples with 'nn' pseudo-newlines from description fields
- Converted descriptions to YAML folded block scalar (>) format for proper parsing
- All agent files now have valid YAML frontmatter per Claude Code documentation
Agents Updated:
- refactor-scan.md
- tdd-guardian.md
- ts-enforcer.md
- docs-guardian.md
- learn.md
- wip-guardian.md
- adr.md
Per Claude Code official docs, the description field should be a concise natural language description for task matching, not include examples. Examples belong in the system prompt body, not YAML frontmatter.

2.1.1

Patch Changes

87aec6a: fix: add correct front matter to agent files

Updated agent files (adr.md, wip-guardian.md, docs-guardian.md, learn.md, refactor-scan.md, tdd-guardian.md, ts-enforcer.md) to include proper front matter with name, description, tools, model, and color fields required for agent functionality.

2.1.0

Minor Changes

4c773ac: Add wip-guardian and adr agents for workflow management and architectural decisions

New Claude Code Agents:

Added two new specialized agents that integrate with the existing agent system:
1. wip-guardian - Work In Progress Guardian
  - Creates and maintains living WIP.md plan documents for complex, multi-step features
  - Tracks current progress, next steps, and blockers
  - Enforces small PRs, incremental work, tests always passing
  - Orchestrates all other agents at appropriate times (tdd-guardian, ts-enforcer, refactor-scan, adr, learn, docs-guardian)
  - Updates plan as learning occurs
  - Deletes WIP.md when work completes (ephemeral short-term memory)
  - Identifies ADR opportunities during development
  - Prevents context loss during multi-day features
2. adr - Architecture Decision Records
  - Creates ADRs for significant architectural decisions
  - 5-question decision framework for determining when ADRs are needed
  - Documents alternatives considered, trade-offs, and consequences
  - Maintains ADR index in docs/adr/README.md
  - Integrated with wip-guardian and docs-guardian
  - Prevents "why did we do it this way?" confusion
  - Clear guidance on when NOT to create ADRs (trivial choices, temporary workarounds, standard patterns)
Agent System Enhancements:
- Updated .claude/agents/README.md with comprehensive overview of all 7 agents
- Added clear distinctions between agent purposes and lifespans
- Added complete workflow integration showing how agents work together
- Added decision matrix for which agent to use when
- Added documentation type comparison table (wip vs adr vs learn vs docs)
Key Features:
- wip-guardian orchestrates the entire development workflow:
  - Invokes tdd-guardian for RED-GREEN-REFACTOR cycle
  - Invokes ts-enforcer before commits/PRs
  - Invokes refactor-scan after green tests
  - Invokes adr when architectural decisions arise
  - Invokes learn when significant learnings occur
  - Invokes docs-guardian when features complete
- Clear documentation boundaries established:
  - wip-guardian: Temporary progress tracking (deleted when done)
  - adr: Permanent "why" (architectural decisions)
  - learn: Permanent "how" (gotchas, patterns)
  - docs-guardian: Permanent "what" (features, API, setup)
Documentation Updates:
- Updated README.md agent count from 5 to 7 agents
- Added comprehensive sections for both new agents in README
- Updated installation instructions to include new agent download commands
- Updated all agent count references throughout documentation

2.0.4

Patch Changes

546c057: Improve installation documentation order and clarity

Fixed:
- Option 3 now correctly uses v1.0.0 single-file version (v2.0.0 had broken imports)
- Installation options now ordered by recommendation (global install first, not third)
- Navigation table accurately describes each option's purpose
Added:
- Quick navigation table by user situation
- "Best for" and "Why choose this" sections for each option
- Clear tradeoffs for v1.0.0 vs v2.0.0 choice
- Workflow explanation before installation options
- Agent invocation examples integrated into workflow section
Changed:
- Moved version note to end of section (less critical information)
- Removed duplicate sections for cleaner structure
- First navigation row now says "I want this on all my personal projects" instead of misleading "I want this working in 30 seconds"

2.0.3

Patch Changes

c673d57: Fix automated tagging and releases for private packages

The workflow wasn't creating git tags or GitHub releases automatically because pnpm changeset tag only works for packages published to npm. Since this package has "private": true (GitHub releases only, no npm), we need to manually create tags and releases.

This adds a new workflow step that:
- Reads the version from package.json after changesets bumps it
- Creates and pushes a git tag (v2.0.x format)
- Creates a GitHub Release from that tag
Future releases (v2.0.2+) will now be fully automated when the Version Packages PR is merged.

2.0.2

Patch Changes

e5377a7: Fix automated tagging and releases for private packages

The workflow wasn't creating git tags or GitHub releases automatically because pnpm changeset tag only works for packages published to npm. Since this package has "private": true (GitHub releases only, no npm), we need to manually create tags and releases.

This adds a new workflow step that:
- Reads the version from package.json after changesets bumps it
- Creates and pushes a git tag (v2.0.x format)
- Creates a GitHub Release from that tag
Future releases (v2.0.2+) will now be fully automated when the Version Packages PR is merged.

2.0.1

Patch Changes

e5b9d00: Add node_modules to .gitignore

The changesets action accidentally committed node_modules/ directory in PR #21. This adds node_modules/ to .gitignore to prevent this from happening.
004d570: Fix GitHub Actions workflow pnpm version incompatibility

The release workflow failed again after adding pnpm-lock.yaml because the lockfile was generated with pnpm v10 but the workflow used pnpm v8, causing:

WARN Ignoring not compatible lockfile at pnpm-lock.yaml ERR_PNPM_NO_LOCKFILE Cannot install with "frozen-lockfile"

This updates the GitHub Actions workflow to use pnpm v10 to match the lockfile.
039e448: Fix GitHub Actions workflow to correctly run changesets versioning

The workflow was failing with "No commits between main and changeset-release/main" because the version command was configured as pnpm version (which just prints version info) instead of pnpm changeset version (which actually bumps versions).

This also simplifies the workflow to use a single changesets action call that handles both creating the Version Packages PR and creating GitHub releases.

All notable changes to this project will be documented in this file.

The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.

Unreleased

2.0.0 - 2025-11-01

BREAKING CHANGES

Modular CLAUDE.md Structure

CLAUDE.md has been split from a single 1,818-line file into a modular structure:

Main CLAUDE.md reduced to 156 lines (core philosophy + quick reference)
Detailed content extracted to 6 separate files in docs/ directory
All imports use absolute paths (@~/.claude/docs/...) for dotfiles compatibility

Why this is breaking:

If you manually created symlinks or custom imports, you'll need to update paths
The file structure has changed (though the monolithic version still works if you have it)

Migration: See MIGRATION.md for detailed upgrade instructions.

Added

Modular Documentation Structure:

docs/testing.md (238 lines) - Testing principles and behavior-driven development
docs/typescript.md (305 lines) - TypeScript guidelines and schema-first approach
docs/code-style.md (370 lines) - Functional programming and immutability patterns
docs/workflow.md (671 lines) - TDD process, refactoring, and git workflow
docs/examples.md (118 lines) - Code examples and anti-patterns
docs/working-with-claude.md (74 lines) - Expectations and learning capture

Versioning Infrastructure:

package.json - Version tracking (not an npm package, just for semver)
CHANGELOG.md - This file
MIGRATION.md - Upgrade guide from v1.x to v2.x

Changed

CLAUDE.md: Reduced from 1,818 lines to 156 lines
Import paths: All use absolute paths (@~/.claude/docs/...) instead of relative
README.md: Added separate installation instructions for CLAUDE.md-only vs full dotfiles

Documentation

Updated SPLIT-CLAUDE-MD-PLAN.md with absolute path requirements
Documented why absolute paths are required for dotfiles installation
Added examples showing project vs dotfiles import syntax differences

1.0.0 - 2025-11-01

Added

CLAUDE.md Development Framework (1,818 lines):

📄 View the v1.0.0 monolithic file:

Content:

Core philosophy: TDD is non-negotiable
Testing principles: Behavior-driven testing with 100% coverage
TypeScript guidelines: Strict mode with schema-first approach
Code style: Functional programming with immutability
Development workflow: RED-GREEN-REFACTOR TDD process
Refactoring guidance: Semantic vs structural decision framework
Working with Claude: Expectations and learning documentation

Claude Code Agents:

tdd-guardian - Proactive TDD coaching and reactive compliance verification
ts-enforcer - TypeScript best practices and schema-first enforcement
learn - Proactive learning capture and CLAUDE.md documentation
refactor-scan - Refactoring assessment and semantic analysis

Documentation:

Comprehensive README with agent documentation
Git aliases documentation (30+ aliases)
Personal dotfiles for shell and git configuration

Initial Release

This release represents the state of the repository before the modular split. All functionality was contained in a single CLAUDE.md file.

Version History

2.0.0 - Modular CLAUDE.md structure with absolute imports
1.0.0 - Initial release with monolithic CLAUDE.md

FilesExpand file tree

CHANGELOG.md

Latest commit

History

CHANGELOG.md

File metadata and controls

Changelog

3.41.0

Minor Changes

3.40.0

Minor Changes

3.39.0

Minor Changes

3.38.0

Minor Changes

Patch Changes

3.37.1

Patch Changes

3.37.0

Minor Changes

3.36.1

Patch Changes

3.36.0

Minor Changes

3.35.0

Minor Changes

3.34.0

Minor Changes

3.33.0

Minor Changes

3.32.0

Minor Changes

3.31.0

Minor Changes

3.30.1

Patch Changes

3.30.0

Minor Changes

3.29.1

Patch Changes

3.29.0

Minor Changes

3.28.3

Patch Changes

3.28.2

Patch Changes

3.28.1

Patch Changes

3.28.0

Minor Changes

3.27.0

Minor Changes

3.26.0

Minor Changes

3.25.1

Patch Changes

3.25.0

Minor Changes

3.24.0

Minor Changes

Patch Changes

3.23.0

Minor Changes

3.22.0

Minor Changes

3.21.0

Minor Changes

3.20.0

Minor Changes

3.19.3

Patch Changes

3.19.2

Patch Changes

3.19.1

Patch Changes

3.19.0

Minor Changes

3.18.0

Minor Changes

3.17.1