diff --git a/audit/reports/llm_attention_selection_implementation.md b/audit/reports/llm_attention_selection_implementation.md
Lines changed: 5 additions & 4 deletions
diff --git a/config/zulong_config.yaml b/config/zulong_config.yaml
Lines changed: 9 additions & 7 deletions
diff --git a/docs/LLM_Attention_Selection_Integration_Guide.md b/docs/LLM_Attention_Selection_Integration_Guide.md
Lines changed: 10 additions & 4 deletions
diff --git a/docs/architecture/system-overview.md b/docs/architecture/system-overview.md
Lines changed: 9 additions & 10 deletions
diff --git a/docs/task-system/fc-loop.md b/docs/task-system/fc-loop.md
Lines changed: 5 additions & 5 deletions
@@ -49,11 +49,12 @@
 | Option | Default | Description |
 |--------|--------|------|
 | enabled | True | Feature switch |
-| pressure_threshold_high | 0.9 | High-pressure threshold |
-| pressure_threshold_medium | 0.75 | Medium-pressure threshold |
+| threshold_budget_ratio | 0.5 | Threshold budget = 50% of the LLM's raw context window |
+| pressure_threshold_high | 1.0 | RED trigger line: context pressure > 100% |
+| pressure_threshold_medium | 0.9 | YELLOW trigger line: context pressure > 90% |
 | cooldown_base_seconds | 30.0 | Base cooldown time |
 | fallback_mode | FOCUS | Fallback mode |
-| decision_timeout_ms | 500 | Decision timeout |
+| decision_timeout_ms | None | Wait indefinitely for the LLM to finish attention selection |
 | oscillation_detection_window | 10 | Oscillation detection window |
 
 **Core methods**:
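Review note: the new defaults in the table above can be read as a two-step rule: derive a token budget from the context window, then compare usage against that budget. The sketch below illustrates that reading only. The diff does not show how pressure is computed, so `context_pressure`, `classify`, `ThresholdConfig`, and `PressureLevel` are hypothetical names, and "pressure = used tokens / threshold budget" is an assumption.

```python
from dataclasses import dataclass
from enum import Enum


class PressureLevel(Enum):
    GREEN = "green"
    YELLOW = "yellow"
    RED = "red"


@dataclass(frozen=True)
class ThresholdConfig:
    # Defaults copied from the table above; the class itself is hypothetical.
    threshold_budget_ratio: float = 0.5
    pressure_threshold_high: float = 1.0
    pressure_threshold_medium: float = 0.9


def context_pressure(used_tokens: int, context_window: int, cfg: ThresholdConfig) -> float:
    """Assumed definition: tokens in use divided by the threshold budget."""
    budget = context_window * cfg.threshold_budget_ratio
    if budget <= 0:
        raise ValueError("threshold budget must be positive")
    return used_tokens / budget


def classify(pressure: float, cfg: ThresholdConfig) -> PressureLevel:
    """Strict '>' comparisons, matching the '> 100%' and '> 90%' wording."""
    if pressure > cfg.pressure_threshold_high:
        return PressureLevel.RED
    if pressure > cfg.pressure_threshold_medium:
        return PressureLevel.YELLOW
    return PressureLevel.GREEN
```

With strict comparisons, a pressure of exactly 1.0 stays YELLOW; whether the real `check_threshold` uses `>` or `>=` is not visible in this diff.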
@@ -244,7 +245,7 @@ async def _try_llm_mode_selection(self):
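 | Metric | Target | Status |
 |------|--------|----------|
 | Pressure detection latency | < 5ms | ✅ Implemented, monitored via performance logs |
-| LLM decision timeout | 500ms | ✅ Implemented with asyncio.wait_for() |
+| LLM decision wait | Waits indefinitely by default | ✅ Implemented; `asyncio.wait_for()` is used only when a positive value is configured |
 | Cooldown time | From 30 seconds | ✅ Implemented, adjusted dynamically |
 | Oscillation detection | Real time | ✅ Implemented, ABA/ABAB pattern detection |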
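Review note: the changed "LLM decision wait" row describes a conditional timeout: no limit by default, and `asyncio.wait_for()` only for a positive `decision_timeout_ms`. A minimal sketch of that behavior follows. `await_decision` is a hypothetical helper, not the body of `_try_llm_mode_selection`, which this diff does not show.

```python
import asyncio
from typing import Awaitable, Optional, TypeVar

T = TypeVar("T")


async def await_decision(decision: Awaitable[T], decision_timeout_ms: Optional[float]) -> T:
    """Wait for the LLM's mode decision, bounded only if a positive timeout is set."""
    if decision_timeout_ms is None or decision_timeout_ms <= 0:
        # Default path: wait for the decision without a time limit.
        return await decision
    # Bounded path: asyncio.wait_for takes seconds, the config stores milliseconds.
    return await asyncio.wait_for(decision, timeout=decision_timeout_ms / 1000)
```

On the bounded path a slow decision raises `asyncio.TimeoutError`, so the caller would still need to choose what to do next, for example applying `fallback_mode`.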
 
 
@@ -1,13 +1,14 @@
 attention_selection:
   cooldown_base_seconds: 30.0
-  decision_timeout_ms: 500
+  decision_timeout_ms: null
   enabled: true
   fallback_mode: FOCUS
   max_switch_history: 50
   min_confidence_threshold: 0.3
   oscillation_detection_window: 10
-  pressure_threshold_high: '0.1'
-  pressure_threshold_medium: '0.1'
+  threshold_budget_ratio: '0.15'
+  pressure_threshold_high: '1.0'
+  pressure_threshold_medium: '0.9'
 audio:
   asr:
     backend: sensevoice
@@ -130,9 +131,10 @@ l2_inference:
   backup_model: deepseek-v4-pro
   circuit_breaker:
     bfs_min_interval: 3
-    context_red_ratio: '0.1'
+    threshold_budget_ratio: '0.15'
+    context_red_ratio: '1.0'
     context_window_size: 131072
-    context_yellow_ratio: '0.1'
+    context_yellow_ratio: '0.9'
     enabled: true
     max_yellow_before_red: 4
     no_progress_red: 8
@@ -229,7 +231,7 @@ llm:
     base_url: http://localhost:11434/v1
     model_id: qwen3.5:4b
   deepseek:
-    api_key: sk-f5fbf20095de478fb711cfd0d573739e
+    api_key: ${DEEPSEEK_API_KEY}
     backend: deepseek
     base_url: https://api.deepseek.com
     model_id: deepseek-v4-pro
@@ -296,7 +298,7 @@ llm:
     schedule_policy: lpm
     tp_size: 1
   siliconflow:
-    api_key: sk-fmylzckwftjrirdovmqygassifruqckzjsqzpbmyjrbqivcy
+    api_key: ${SILICONFLOW_API_KEY}
     backend: siliconflow
     base_url: https://api.siliconflow.cn/v1
     model_id: deepseek-ai/DeepSeek-V4-Flash
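Review note: replacing the literal keys with `${DEEPSEEK_API_KEY}` and `${SILICONFLOW_API_KEY}` only works if something expands those placeholders, because YAML itself treats them as plain strings. The diff does not show the config loader, so the following is a sketch of one way to do it. `resolve_env` is a hypothetical helper, and failing on an unset variable is a design assumption.

```python
import os
import re
from typing import Mapping, Optional

# Matches ${NAME} where NAME is a conventional environment-variable name.
_PLACEHOLDER = re.compile(r"\$\{([A-Za-z_][A-Za-z0-9_]*)\}")


def resolve_env(value: str, env: Optional[Mapping[str, str]] = None) -> str:
    """Replace every ${NAME} in value with the matching environment variable."""
    source = os.environ if env is None else env

    def _substitute(match: "re.Match[str]") -> str:
        name = match.group(1)
        if name not in source:
            # Fail loudly rather than send a literal "${...}" as an API key.
            raise KeyError(f"environment variable {name} is not set")
        return source[name]

    return _PLACEHOLDER.sub(_substitute, value)
```

Since the old keys appear in the removed lines of this diff, they remain in repository history and should be treated as exposed.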
 
@@ -323,8 +323,9 @@ attention_selection:
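   enabled: true                      # Enable/disable autonomous LLM selection
 
   # Pressure threshold settings
-  pressure_threshold_high: 0.6       # High-pressure threshold
-  pressure_threshold_medium: 0.5     # Medium-pressure threshold
+  threshold_budget_ratio: 0.5        # Threshold budget = 50% of the LLM's raw context window
+  pressure_threshold_high: 1.0       # RED trigger line: context pressure > 100%
+  pressure_threshold_medium: 0.9     # YELLOW trigger line: context pressure > 90%
 
   # Cooldown settings
   cooldown_base_seconds: 30.0        # Base cooldown time (seconds)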
@@ -333,7 +334,7 @@ attention_selection:
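   fallback_mode: "FOCUS"             # Default fallback mode
 
   # Performance settings
-  decision_timeout_ms: 500           # LLM decision timeout (milliseconds)
+  decision_timeout_ms: null          # null means wait for the LLM to finish attention selection
 
   # Oscillation detection settings
   oscillation_detection_window: 10   # Oscillation detection window size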
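Review note: `oscillation_detection_window` bounds how much switch history the oscillation check looks at, and the audit report names ABA and ABAB as the detected patterns. The detector itself is not in this diff, so the sketch below is one plausible shape. `OscillationDetector` and `record` are hypothetical names.

```python
from collections import deque
from typing import Deque, Optional


class OscillationDetector:
    """Keeps the most recent mode switches and flags ABA / ABAB flip-flops."""

    def __init__(self, window: int = 10) -> None:
        # Older entries fall off automatically once the window is full.
        self._history: Deque[str] = deque(maxlen=window)

    def record(self, mode: str) -> Optional[str]:
        """Record a switch to mode; return 'ABAB', 'ABA', or None."""
        self._history.append(mode)
        h = list(self._history)
        # Check the longer pattern first, since every ABAB also ends in an ABA.
        if len(h) >= 4 and h[-4] == h[-2] and h[-3] == h[-1] and h[-1] != h[-2]:
            return "ABAB"
        if len(h) >= 3 and h[-3] == h[-1] and h[-2] != h[-1]:
            return "ABA"
        return None
```

A caller could lengthen the cooldown when a pattern is reported, which would fit the "adjusted dynamically" cooldown in the audit report, though that link is not stated in the diff.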
@@ -359,7 +360,12 @@ def test_pressure_detection():
 
 def test_threshold_check():
     """Test the threshold check."""
-    config = AttentionConfig(pressure_threshold_high=0.6)
+    config = AttentionConfig(
+        threshold_budget_ratio=0.5,
+        pressure_threshold_high=1.0,
+        pressure_threshold_medium=0.9,
+        decision_timeout_ms=None,
+    )
     detector = PressureDetector(mock_awm, config)
     metrics = PressureMetrics(current_pressure=1.0, ...)
     result = detector.check_threshold(metrics)
 
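@@ -383,16 +383,15 @@ Requirement (depth 0)
 **Source file**: `zulong/l2/attention_window.py`
 **Maturity**: basically usable
 
-**Three-mode state machine**:
-- **GLOBAL**: Global view; attends to the outline and overall structure, with weights decreasing for deeper nodes
-- **FOCUS**: Focuses on a specific node and raises the weight of related context
-- **SINGLE_CHAIN**: Single-chain reasoning; keeps only high-weight information from the current execution chain
-
-**Mode switching is driven by tool calls** (zero LLM overhead):
-- `recall_memory` / `read_memory_node` → GLOBAL → FOCUS
-- `exec_write_file` / `exec_run_command` → FOCUS → SINGLE_CHAIN
-- `task_view_overview` / `submit_final_answer` → force a return to GLOBAL
-- `navigate_attention` → three navigation types: deeper / broader / jump
+**Three-mode dynamic attention**:
+- **GLOBAL**: Global view; attends to the complete task structure, cross-branch dependencies, historical evidence, and final review.
+- **FOCUS**: Local attention; injects context on demand around the current node, requirement gaps, and key evidence, temporarily excluding unrelated context.
+- **SINGLE_CHAIN**: Single-chain attention; injects the necessary context on demand around the current reasoning or debugging chain. When a cross-branch judgment is needed, other context can be re-injected or attention can float back up to GLOBAL.
+
+**Mode-switching principles**:
+- Context-pressure threshold monitoring is the primary signal that triggers L2 dynamic attention switching.
+- The LLM autonomously selects GLOBAL / FOCUS / SINGLE_CHAIN based on the pressure value, the current TaskGraph node, uncovered TaskSpec items, MemoryGraph evidence, and tool results.
+- The explicit attention-navigation interface can carry out the LLM's attention selection. Ordinary tool calls must not be bound as the primary rule for L2 attention; they may only feed the tool ledger, pressure observation, and quality evidence.
 
 **Token budget**: `budget = (context_window - reserved) × 90%`, reserved = 7096 tokens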
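Review note: the token budget formula above is easy to check with concrete numbers. The sketch below applies it to a 131072-token window, the `context_window_size` from the config hunk; pairing that value with this formula is an assumption. `token_budget` is a hypothetical helper, and flooring to a whole token and clamping at zero are illustrative choices.

```python
RESERVED_TOKENS = 7096   # reserved = 7096 tokens, from the formula above
BUDGET_FRACTION = 0.90   # the 90% factor from the formula above


def token_budget(context_window: int, reserved: int = RESERVED_TOKENS) -> int:
    """budget = (context_window - reserved) × 90%, floored to whole tokens."""
    usable = max(context_window - reserved, 0)  # never go negative
    return int(usable * BUDGET_FRACTION)


# (131072 - 7096) × 0.9 = 123976 × 0.9 = 111578.4, floored to 111578 tokens.
budget = token_budget(131072)
```

This budget is a different quantity from the `threshold_budget_ratio` budget introduced elsewhere in this change; the diff does not say how the two interact.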
 
 
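@@ -70,10 +70,10 @@ The fc_graph.py path completely lacks the following capabilities:
 ### 3.3 AttentionWindow (attention_window.py, 948 lines, production-grade 4/5)
 
 - **Three modes**:
-  - GLOBAL: high outline weight, decreasing with depth (depth0→×1.2, depth4+→×0.3)
-  - FOCUS: current node ×3.0, ancestors/dependencies ×2.0, siblings ×1.5, unrelated ×0.5
-  - SINGLE_CHAIN: current chain ×5.0, ancestors ×3.0, dependencies ×2.5, unrelated ×0.2
-- **Automatic state machine**: recall_memory→FOCUS, exec_write_file→SINGLE_CHAIN, task_view_overview→GLOBAL
+  - GLOBAL: keeps a global view of the task and memory context; used for planning, review, summarization, and cross-branch judgment
+  - FOCUS: the LLM selects a focus node when context pressure or the task stage calls for it, injecting the current node, ancestors, dependencies, and key evidence on demand and temporarily setting aside unrelated context
+  - SINGLE_CHAIN: the LLM selects the current reasoning chain during deep single-chain execution or debugging, prioritizing the context that chain needs; when a cross-branch judgment is needed, it should be able to re-inject other context or return to GLOBAL
+- **Trigger principle**: context-pressure threshold monitoring is the primary signal that triggers L2 dynamic attention switching; the LLM autonomously selects GLOBAL / FOCUS / SINGLE_CHAIN based on the current task, pressure, coverage gaps, and evidence state. Ordinary tool calls may only feed the tool ledger, pressure observation, and quality evidence; they must not be bound as the primary rule or trigger a mode switch directly.
 - **Weight formula**: `base × time_decay(0.95^age) × mode_mult × memory_boost(1.0~1.5)`
 - **Message grouping**: tool_group ensures assistant+tool messages are evicted atomically
 - **Eviction handling**: eviction summaries are persisted to MemoryGraph + TaskGraph, and the LLM is prompted to restore them with recall_memory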
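Review note: the weight formula kept in the list above multiplies four factors. A small sketch makes the interaction concrete. `message_weight` is a hypothetical function, the unit of `age` is not stated in the doc (turns are assumed here), and `mode_mult` is left as a parameter because this change removes the fixed per-mode multipliers from the documentation.

```python
def message_weight(
    base: float,
    age: int,
    mode_mult: float,
    memory_boost: float = 1.0,
    decay: float = 0.95,
) -> float:
    """base × time_decay(decay^age) × mode_mult × memory_boost."""
    if age < 0:
        raise ValueError("age must be non-negative")
    if not 1.0 <= memory_boost <= 1.5:
        # The doc gives memory_boost a range of 1.0 to 1.5.
        raise ValueError("memory_boost must be within [1.0, 1.5]")
    return base * (decay ** age) * mode_mult * memory_boost


# A fresh message keeps its base weight; one aged 2 units decays by
# 0.95² = 0.9025 before the mode and memory multipliers are applied.
fresh = message_weight(base=1.0, age=0, mode_mult=1.0)
aged = message_weight(base=2.0, age=2, mode_mult=1.0, memory_boost=1.5)
```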
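@@ -229,7 +229,7 @@ Cline v3.82.0 fork → zulong-ide plugin → full rewrite of the communication protocol (XML → W
 |------|------|---------|---------|
 | `circuit_breaker.py` | L61 | `CB_RETAINED_NAMES` allowlist | Still callable in the CB RED state |
 | `circuit_breaker.py` | L74 | Terminal-tool allowlist | Classified as a "terminal tool" |
-| `attention_window.py` | L89 | GLOBAL_TRIGGER_TOOLS set | Automatically switches to GLOBAL mode after the call |
+| `attention_window.py` | Legacy implementation | GLOBAL_TRIGGER_TOOLS set | Deprecated: ordinary tool names must not directly trigger GLOBAL mode |
 | `attention_window.py` | L519 | Special-case branch | Clears the focus state when called |
 | `task_graph.py` | L129 | Comment/documentation | The model completes the task through it |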