docs: align goal session specs with /goal

wuman001 · wuman001 · commit 42c07ff337fa · 2026-05-28T11:10:59.000+08:00
diff --git a/openspec/changes/2026-04-27-ralph-session-loop-plugin/README.md b/openspec/changes/2026-04-27-ralph-session-loop-plugin/README.md
@@ -1,9 +1,10 @@
 # 2026-04-27-ralph-session-loop-plugin
 
-Track the phase-1 design for introducing Ralph session-loop commands in AWorld.
+Track the phase-1 design lineage that ultimately landed as the shared `/goal` session loop in AWorld.
 
 Implementation note:
 
-- the user-facing Ralph commands remain `/ralph-loop` and `/cancel-ralph`
-- the shared persisted contract now lives in the built-in `goal-session` plugin
+- the user-facing entrypoint is now `/goal`
+- `/goal "..."` starts a session goal directly
+- `/goal status`, `/goal pause`, and `/goal clear` control the persisted goal state
 - continuation is driven from task lifecycle hooks, while the `stop` hook only blocks accidental exit from an active goal
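
The README hunk above describes one `/goal` entrypoint that either starts a goal or runs an exact control action. A minimal sketch of that routing follows. The names `GoalRequest` and `route_goal_command`, and the choice to reject an empty argument string, are assumptions made for illustration and are not taken from the AWorld code base.

```python
import shlex
from dataclasses import dataclass, field
from typing import List, Optional

# Control actions named in the README hunk; anything else starts a goal.
CONTROL_ACTIONS = {"status", "pause", "clear"}


@dataclass
class GoalRequest:
    """Hypothetical parsed form of a `/goal` invocation."""
    action: str                      # "start", "status", "pause", or "clear"
    objective: Optional[str] = None  # set only when action == "start"
    extra_args: List[str] = field(default_factory=list)


def route_goal_command(raw_args: str) -> GoalRequest:
    """Route the text after `/goal` to a control action or a new goal."""
    stripped = raw_args.strip()
    # Exact, unquoted match only: `/goal "status"` is treated as an objective.
    if stripped in CONTROL_ACTIONS:
        return GoalRequest(action=stripped)
    tokens = shlex.split(stripped)
    if not tokens:
        # Assumption: a bare `/goal` is rejected rather than treated as status.
        raise ValueError("usage: /goal \"<objective>\" | status | pause | clear")
    return GoalRequest(action="start", objective=tokens[0], extra_args=tokens[1:])
```

Matching control actions against the raw, unquoted text keeps a quoted objective such as `/goal "clear"` from being mistaken for a control action.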
diff --git a/openspec/changes/2026-04-27-ralph-session-loop-plugin/design.md b/openspec/changes/2026-04-27-ralph-session-loop-plugin/design.md
@@ -40,7 +40,7 @@ The current `RalphRunner` is not the right primary abstraction for this phase be
 - Do not implement fresh-process or fresh-session orchestration in phase 1.
 - Do not execute verification commands inside the stop hook in phase 1.
 - Do not add phase-1 loop-local runtime overrides such as `--model` or `--work-dir`.
-- Do not introduce Claude-specific state files such as `.claude/ralph-loop.local.md`.
+- Do not introduce Claude-specific state files such as `.claude/goal.local.md`.
 - Do not redesign the general plugin framework as part of this change.
 
 ## Decisions
@@ -51,9 +51,10 @@ The first AWorld Ralph capability should be implemented as a normal plugin, not
 
 The plugin should own these entrypoints:
 
-- `commands/ralph-loop.md`
-- `commands/cancel-ralph.md`
-- `hooks/stop_hook.py`
+- `hooks/stop.py`
+- `hooks/task_completed.py`
+- `hooks/task_error.py`
+- `hooks/task_interrupted.py`
 - `hud/status.py`
 - `.aworld-plugin/plugin.json`
 
@@ -83,7 +84,7 @@ Implications:
 
 - the phase-1 plugin must not invoke `RalphRunner` implicitly
 - the plugin must not treat `RalphRunner` as the only way AWorld can expose Ralph semantics
-- plugin `--max-iterations` applies only to session continuation
+- plugin `--max-turns` applies only to session continuation
 - runner `completion_criteria.max_iterations` applies only to inner task execution
 - phase 1 does not define a priority or override relationship between those two limits
 
@@ -97,7 +98,7 @@ Why:
 
 The phase-1 control path should be:
 
-1. `/ralph-loop` initializes loop state.
+1. `/goal "..."` initializes loop state.
 2. The current session executes the task.
 3. The shared `goal-session` task lifecycle hooks update turn state and decide whether unfinished work should continue immediately.
 4. When the operator attempts to exit, the `goal-session` `stop` hook only decides whether exit is safe, paused, or should be denied.
@@ -128,7 +129,7 @@ Recommended minimum state shape:
     "pytest tests/api -q",
     "ruff check ."
   ],
-  "source": "ralph_compat",
+  "source": "goal",
   "started_at": "2026-04-27T10:00:00Z",
   "last_task_status": "initialized",
   "last_final_answer_excerpt": null
@@ -146,18 +147,18 @@ Why:
 - The plugin framework already exposes persisted state APIs.
 - It avoids introducing a second state persistence model just for Ralph.
 
-### Decision: `/ralph-loop` stores structured verify requirements in goal-session state, but hooks do not run them
+### Decision: `/goal` stores structured verify requirements in goal-session state, but hooks do not run them
 
 Phase-1 verification requirements should be declared structurally and then injected into the effective prompt.
 
 Recommended user-facing contract:
 
 ```text
-/ralph-loop "Implement the todo API" \
+/goal "Implement the todo API" \
   --verify "pytest tests/api -q" \
   --verify "ruff check ." \
   --completion-promise "COMPLETE" \
-  --max-iterations 20
+  --max-turns 20
 ```
 
 The plugin should persist these `verification_commands` in goal-session state and normalize the working prompt into a goal contract similar to:
@@ -167,7 +168,7 @@ The plugin should persist these `verification_commands` in goal-session state an
 Objective: Implement the todo API
 Status: active
 Turns: 1/20
-Source: ralph_compat
+Source: goal
 Verification commands:
 1. pytest tests/api -q
 2. ruff check .
@@ -245,12 +246,12 @@ Why:
 
 ### Command Contract
 
-`/ralph-loop`
+`/goal`
 
 - accepts task prompt text
 - accepts repeatable `--verify`
 - accepts optional `--completion-promise`
-- accepts optional `--max-iterations`
+- accepts optional `--max-turns`
 - initializes or replaces the active Ralph session state
 - emits a confirmation message describing the active loop policy
 
@@ -259,10 +260,10 @@ Explicitly deferred from the phase-1 command surface:
 - `--model`
 - `--work-dir`
 
-`/cancel-ralph`
+`/goal clear`
 
-- clears the active Ralph loop state
-- emits a confirmation message describing that the loop has been cancelled
+- clears the active goal loop state
+- emits a confirmation message stating that the loop has been cleared
 
 ### Hook Contract
 
@@ -296,8 +297,8 @@ The HUD provider reads plugin state and renders status lines only.
 Phase-1 validation should cover:
 
 - command registration and manifest loading
-- `/ralph-loop` state initialization
-- `/cancel-ralph` state clearing
+- `/goal` state initialization
+- `/goal clear` state clearing
 - `/goal` status, pause, and clear behavior
 - task-completed continuation behavior
 - exact completion-promise match behavior
@@ -309,11 +310,11 @@ Phase-1 validation should cover:
 Recommended simple acceptance cases for phase 1:
 
 - default unbounded loop:
-  `/ralph-loop "Build a Python course"`
+  `/goal "Build a Python course"`
 - explicit iteration cap:
-  `/ralph-loop "Build a REST API" --max-iterations 5`
+  `/goal "Build a REST API" --max-turns 5`
 - declarative verification:
-  `/ralph-loop "Create a CLI tool" --verify "pytest tests/cli -q" --completion-promise "COMPLETE"`
+  `/goal "Create a CLI tool" --verify "pytest tests/cli -q" --completion-promise "COMPLETE"`
 
 Examples intentionally not adopted as phase-1 acceptance cases:
 
diff --git a/openspec/changes/2026-04-27-ralph-session-loop-plugin/implementation-plan.md b/openspec/changes/2026-04-27-ralph-session-loop-plugin/implementation-plan.md
@@ -1,12 +1,12 @@
-# Ralph Session Loop Plugin Implementation Plan
+# Goal Session Loop Implementation Plan
 
-> Historical note: this implementation plan predates the shared `goal-session` refactor. The shipped phase-1 behavior keeps `/ralph-loop` and `/cancel-ralph`, but uses the built-in `goal-session` plugin as the single persisted contract and loop-control surface.
+> Historical note: this implementation plan predates the final command unification. The shipped phase-1 behavior uses the built-in `goal-session` plugin as the single persisted contract and exposes `/goal` as the only user-facing command surface.
 
 > **For agentic workers:** REQUIRED SUB-SKILL: Use superpowers:subagent-driven-development (recommended) or superpowers:executing-plans to implement this plan task-by-task. Steps use checkbox (`- [ ]`) syntax for tracking.
 
-**Goal:** Build a standalone phase-1 Ralph plugin for the interactive AWorld CLI that loops within the current session using plugin commands, plugin state, and stop hooks.
+**Goal:** Build a standalone phase-1 session-goal loop for the interactive AWorld CLI that runs within the current session using plugin commands, plugin state, task lifecycle hooks, and a stop hook.
 
-**Architecture:** Extend plugin commands so a plugin can contribute Python-backed slash commands in addition to markdown prompt commands. Implement a built-in Ralph plugin with a prompt command for `/ralph-loop`, a tool command for `/cancel-ralph`, task-state hooks for final-answer diagnostics, a stop hook for continuation, and a HUD provider for loop status.
+**Architecture:** Extend plugin commands so a plugin can contribute Python-backed slash commands in addition to markdown prompt commands. Implement a built-in goal-session plugin whose `/goal` slash command starts a goal via `/goal "..."` and handles the exact control actions `/goal status`, `/goal pause`, and `/goal clear`. Task lifecycle hooks drive continuation, the stop hook gates exit, and a HUD provider renders loop status.
 
 **Tech Stack:** Python, pytest, AWorld CLI plugin framework, plugin state store, command registry, plugin hooks, HUD providers
 
@@ -27,7 +27,7 @@ Add tests that require:
 ```python
 def test_register_python_backed_plugin_command_from_manifest():
     ...
-    command = CommandRegistry.get("ralph-loop")
+    command = CommandRegistry.get("goal")
     assert command is not None
     assert command.command_type == "prompt"
 
@@ -71,38 +71,37 @@ else:
 Run: `pytest tests/plugins/test_plugin_commands.py -k "python_backed or session_id" -v`
 Expected: PASS
 
-### Task 2: Built-in Ralph Plugin Commands
+### Task 2: Built-in Goal Plugin Commands
 
 **Files:**
-- Create: `aworld-cli/src/aworld_cli/builtin_plugins/ralph_session_loop/.aworld-plugin/plugin.json`
-- Create: `aworld-cli/src/aworld_cli/builtin_plugins/ralph_session_loop/__init__.py`
-- Create: `aworld-cli/src/aworld_cli/builtin_plugins/ralph_session_loop/commands/ralph_loop.py`
-- Create: `aworld-cli/src/aworld_cli/builtin_plugins/ralph_session_loop/commands/cancel_ralph.py`
+- Create: `aworld-cli/src/aworld_cli/builtin_plugins/goal_session/.aworld-plugin/plugin.json`
+- Create: `aworld-cli/src/aworld_cli/builtin_plugins/goal_session/common.py`
+- Create: `aworld-cli/src/aworld_cli/builtin_plugins/goal_session/hooks/stop.py`
 - Test: `tests/plugins/test_plugin_commands.py`
 
-- [ ] **Step 1: Write the failing tests for `/ralph-loop` and `/cancel-ralph` state behavior**
+- [ ] **Step 1: Write the failing tests for `/goal "..."` and `/goal clear` state behavior**
 
 Add tests that require:
 
 ```python
-async def test_ralph_loop_command_initializes_session_state(tmp_path):
+async def test_goal_command_initializes_session_state(tmp_path):
     ...
     payload = json.loads(state_path.read_text())
     assert payload["active"] is True
-    assert payload["iteration"] == 1
-    assert payload["verify_commands"] == ["pytest tests/api -q"]
+    assert payload["turn_count"] == 1
+    assert payload["verification_commands"] == ["pytest tests/api -q"]
 
-async def test_cancel_ralph_clears_session_state(tmp_path):
+async def test_goal_clear_clears_session_state(tmp_path):
     ...
     result = await command.execute(context)
-    assert "cancelled" in result.lower()
+    assert "cleared" in result.lower()
     assert handle.read() == {}
 ```
 
 - [ ] **Step 2: Run the focused tests to verify they fail**
 
-Run: `pytest tests/plugins/test_plugin_commands.py -k "ralph_loop_command or cancel_ralph" -v`
-Expected: FAIL because the built-in plugin and command modules do not exist yet.
+Run: `pytest tests/plugins/test_plugin_commands.py -k "goal_command or goal_clear" -v`
+Expected: FAIL because the built-in goal-session command behavior does not exist yet.
 
 - [ ] **Step 3: Implement the built-in plugin commands with minimal argument parsing**
 
@@ -111,13 +110,13 @@ Required behavior:
 ```python
 state = {
     "active": True,
-    "prompt": prompt_text,
-    "iteration": 1,
-    "max_iterations": max_iterations,
+    "status": "active",
+    "objective": prompt_text,
+    "turn_count": 1,
+    "max_turns": max_turns,
     "completion_promise": completion_promise,
-    "verify_commands": verify_commands,
+    "verification_commands": verify_commands,
     "started_at": started_at,
-    "last_stop_reason": None,
     "last_final_answer_excerpt": None,
 }
 ```
@@ -137,17 +136,17 @@ Completion rule:
 
 - [ ] **Step 4: Run the focused tests to verify they pass**
 
-Run: `pytest tests/plugins/test_plugin_commands.py -k "ralph_loop_command or cancel_ralph" -v`
+Run: `pytest tests/plugins/test_plugin_commands.py -k "goal_command or goal_clear" -v`
 Expected: PASS
 
-### Task 3: Ralph Hooks And HUD
+### Task 3: Goal Hooks And HUD
 
 **Files:**
-- Create: `aworld-cli/src/aworld_cli/builtin_plugins/ralph_session_loop/hooks/task_completed.py`
-- Create: `aworld-cli/src/aworld_cli/builtin_plugins/ralph_session_loop/hooks/task_error.py`
-- Create: `aworld-cli/src/aworld_cli/builtin_plugins/ralph_session_loop/hooks/task_interrupted.py`
-- Create: `aworld-cli/src/aworld_cli/builtin_plugins/ralph_session_loop/hooks/stop.py`
-- Create: `aworld-cli/src/aworld_cli/builtin_plugins/ralph_session_loop/hud/status.py`
+- Create: `aworld-cli/src/aworld_cli/builtin_plugins/goal_session/hooks/task_completed.py`
+- Create: `aworld-cli/src/aworld_cli/builtin_plugins/goal_session/hooks/task_error.py`
+- Create: `aworld-cli/src/aworld_cli/builtin_plugins/goal_session/hooks/task_interrupted.py`
+- Create: `aworld-cli/src/aworld_cli/builtin_plugins/goal_session/hooks/stop.py`
+- Create: `aworld-cli/src/aworld_cli/builtin_plugins/goal_session/hud/status.py`
 - Test: `tests/plugins/test_plugin_hooks.py`
 - Test: `tests/plugins/test_plugin_hud.py`
 
@@ -156,23 +155,23 @@ Expected: PASS
 Add tests that require:
 
 ```python
-async def test_ralph_stop_hook_blocks_and_continues_when_active(...):
+async def test_goal_task_completed_hook_blocks_and_continues_when_active(...):
     assert result.action == "block_and_continue"
-    assert "Task:" in result.follow_up_prompt
+    assert "<goal_contract>" in result.follow_up_prompt
 
-async def test_ralph_stop_hook_allows_exit_on_exact_completion_promise(...):
-    assert result.action == "allow"
+async def test_goal_stop_hook_denies_exit_when_goal_is_active(...):
+    assert result.action == "deny"
 
-async def test_ralph_stop_hook_allows_exit_when_max_iterations_reached(...):
+async def test_goal_task_completed_hook_marks_goal_complete_on_exact_completion_promise(...):
     assert result.action == "allow"
 
-def test_ralph_hud_renders_active_state(...):
-    assert any("Ralph: active" in segment for segment in lines[0].segments + lines[1].segments)
+def test_goal_hud_renders_active_state(...):
+    assert any("Goal: active" in segment for segment in lines[0].segments + lines[1].segments)
 ```
 
 - [ ] **Step 2: Run the focused tests to verify they fail**
 
-Run: `pytest tests/plugins/test_plugin_hooks.py tests/plugins/test_plugin_hud.py -k "ralph" -v`
+Run: `pytest tests/plugins/test_plugin_hooks.py tests/plugins/test_plugin_hud.py -k "goal" -v`
 Expected: FAIL because the hook and HUD modules do not exist yet.
 
 - [ ] **Step 3: Implement task-state hooks, stop hook, and HUD provider**
@@ -192,18 +191,18 @@ Stop hook policy:
 if not active:
     return {"action": "allow"}
 if completion promise matched:
-    handle.clear()
+    handle.update({"status": "complete", "active": False})
     return {"action": "allow"}
-if max iterations reached:
-    handle.clear()
+if max turns reached:
+    handle.update({"status": "budget_limited", "active": False})
     return {"action": "allow"}
-handle.update({"iteration": next_iteration, ...})
+handle.update({"turn_count": next_turn, ...})
 return {"action": "block_and_continue", "follow_up_prompt": normalized_prompt}
 ```
 
 - [ ] **Step 4: Run the focused tests to verify they pass**
 
-Run: `pytest tests/plugins/test_plugin_hooks.py tests/plugins/test_plugin_hud.py -k "ralph" -v`
+Run: `pytest tests/plugins/test_plugin_hooks.py tests/plugins/test_plugin_hud.py -k "goal" -v`
 Expected: PASS
 
 ### Task 4: Runtime Integration
@@ -217,16 +216,16 @@ Expected: PASS
 Add a test that:
 
 ```python
-1. loads the built-in Ralph plugin
-2. executes `/ralph-loop ...`
+1. loads the built-in goal plugin
+2. executes `/goal ...`
 3. simulates a completed task hook update without promise match
 4. runs the stop hook
 5. asserts `block_and_continue` and iteration increment
 ```
 
 - [ ] **Step 2: Run the focused integration test to verify it fails**
 
-Run: `pytest tests/plugins/test_plugin_end_to_end.py -k "ralph" -v`
+Run: `pytest tests/plugins/test_plugin_end_to_end.py -k "goal" -v`
 Expected: FAIL until the full plugin surface is wired together.
 
 - [ ] **Step 3: Implement any missing glue and keep scope minimal**
@@ -235,7 +234,7 @@ Only fill gaps required to make the built-in plugin work through the existing pl
 
 - [ ] **Step 4: Run the focused integration test to verify it passes**
 
-Run: `pytest tests/plugins/test_plugin_end_to_end.py -k "ralph" -v`
+Run: `pytest tests/plugins/test_plugin_end_to_end.py -k "goal" -v`
 Expected: PASS
 
 ### Task 5: Final Verification
@@ -245,9 +244,9 @@ Expected: PASS
 
 - [ ] **Step 1: Mark completed OpenSpec tasks that were actually implemented**
 
-- [ ] **Step 2: Run the complete Ralph-related verification suite**
+- [ ] **Step 2: Run the complete goal-related verification suite**
 
-Run: `pytest tests/plugins/test_plugin_commands.py tests/plugins/test_plugin_hooks.py tests/plugins/test_plugin_hud.py tests/plugins/test_plugin_end_to_end.py -k "ralph or python_backed or session_id" -v`
+Run: `pytest tests/plugins/test_plugin_commands.py tests/plugins/test_plugin_hooks.py tests/plugins/test_plugin_hud.py tests/plugins/test_plugin_end_to_end.py -k "goal or python_backed or session_id" -v`
 Expected: PASS
 
 - [ ] **Step 3: Run manifest and runtime regression checks around the touched plugin framework paths**
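
The plan above splits loop control between the task-completed hook (continuation) and the stop hook (exit gating). The sketch below shows one way those two policies could fit together. `InMemoryHandle` is a hypothetical stand-in for the plugin state handle, the rule that the promise must appear alone on a line is an assumed reading of "exact match", and the follow-up prompt is reduced to a placeholder contract.

```python
from typing import Optional


class InMemoryHandle:
    """Hypothetical stand-in for the plugin state handle (read/update only)."""

    def __init__(self, state: dict):
        self._state = dict(state)

    def read(self) -> dict:
        return dict(self._state)

    def update(self, patch: dict) -> None:
        self._state.update(patch)


def promise_matched(final_answer: str, promise: Optional[str]) -> bool:
    # Assumption: "exact match" means some line equals the promise verbatim.
    if not promise:
        return False
    return any(line.strip() == promise for line in final_answer.splitlines())


def on_task_completed(handle, final_answer: str) -> dict:
    """Continuation policy: finish, stop on budget, or continue the goal."""
    state = handle.read()
    if not state.get("active"):
        return {"action": "allow"}
    if promise_matched(final_answer, state.get("completion_promise")):
        handle.update({"status": "complete", "active": False})
        return {"action": "allow"}
    max_turns = state.get("max_turns")
    if max_turns is not None and state["turn_count"] >= max_turns:
        handle.update({"status": "budget_limited", "active": False})
        return {"action": "allow"}
    handle.update({"turn_count": state["turn_count"] + 1})
    prompt = f"<goal_contract>\nObjective: {state['objective']}\n</goal_contract>"
    return {"action": "block_and_continue", "follow_up_prompt": prompt}


def on_stop(handle) -> dict:
    """Exit gating only: deny exit while a goal is active, never continue."""
    if handle.read().get("active"):
        return {"action": "deny"}
    return {"action": "allow"}
```

Keeping `on_stop` free of continuation logic mirrors the README note that the `stop` hook only blocks accidental exit from an active goal.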
diff --git a/openspec/changes/2026-04-27-ralph-session-loop-plugin/proposal.md b/openspec/changes/2026-04-27-ralph-session-loop-plugin/proposal.md
@@ -22,7 +22,7 @@ For AWorld, phase 1 should optimize for the smallest clean integration boundary:
 ## What Changes
 
 - Introduce a standalone AWorld goal-session controller plugin that provides the shared in-session loop contract for the interactive CLI.
-- Keep `/ralph-loop` and `/cancel-ralph` as compatibility-facing Ralph commands layered on top of that shared goal-session state.
+- Use `/goal` as the only user-facing command surface for starting and controlling that shared goal-session state.
 - Define the phase-1 plugin shape around:
   - prompt commands
   - task lifecycle hooks that update and continue the active goal
@@ -37,8 +37,8 @@ For AWorld, phase 1 should optimize for the smallest clean integration boundary:
 
 ### New Capabilities
 
-- `ralph-session-loop-plugin`: Adds a standalone plugin-hosted Ralph interaction model for the AWorld interactive CLI.
-- `goal-session-plugin`: Adds the shared persisted goal contract and exit-control surface used by Ralph compatibility commands.
+- `ralph-session-loop-plugin`: Tracks the original design lineage for the interactive session loop that now ships as `/goal`.
+- `goal-session-plugin`: Adds the shared persisted goal contract and exit-control surface used by `/goal`.
 
 ### Modified Capabilities
 
@@ -47,7 +47,7 @@ For AWorld, phase 1 should optimize for the smallest clean integration boundary:
 ## Impact
 
 - Affects plugin manifests and plugin entrypoint usage under the AWorld CLI plugin framework.
-- Affects the interactive CLI experience by adding Ralph-specific slash commands plus a shared goal-status surface (`/goal`) for pause, clear, and status inspection.
+- Affects the interactive CLI experience by adding a single `/goal` slash-command surface for start, pause, clear, and status inspection.
 - Moves continuation control to task lifecycle hooks while leaving stop-hook behavior focused on exit gating.
 - Does not require `aworld/core` changes for phase 1.
 - Does not require `RalphRunner` changes for phase 1.
diff --git a/openspec/changes/2026-04-27-ralph-session-loop-plugin/tasks.md b/openspec/changes/2026-04-27-ralph-session-loop-plugin/tasks.md