工作流引擎：用 YAML 编排多 Agent 协作

当你有 10+ agent 时，不能靠即兴指令来协调。你需要结构——但又不能僵化到无法应对现实。

为什么需要工作流引擎#

多 Agent 团队面临的核心挑战不是”agent 能不能做”，而是”谁先做、谁后做、做完了找谁验收、验收不过怎么办”。

没有工作流引擎时，协调者（PraestoClaw）需要在脑子里维护所有状态。一旦 session 重启或上下文窗口溢出，状态就丢了。

工作流引擎把这些逻辑外化为可版本控制的 YAML 文件。

架构#

1
触发（/实现 xxx）
2
    │
3
    ▼
4
加载 YAML 定义
5
    │
6
    ▼
7
生成执行计划（DAG）
8
    │
9
    ▼
10
逐节点执行
11
    │  ├── 派工给指定 agent
12
    │  ├── 收集产出物
13
    │  ├── 检查门禁条件
14
    │  └── 审查不过则循环
15
    │
16
    ▼
17
PR ready → 人类 review
18
    │
19
    ▼
20
自动复盘

6 种节点类型#

`bash` — 确定性节点#

无 AI 参与，100% 可靠。用于 git 操作、构建、测试等。

1
- id: create-branch
2
  type: bash
3
  command: "git fetch origin main && git checkout -b {{branch}} origin/main"

`agent` — AI 节点#

派给指定 agent 执行，绑定领域专家 skill。

1
- id: product-spec
2
  type: agent
3
  agent: xiaomi          # 奶茶（产品经理）
4
  skill: product-master  # 绑定产品大师知识库
5
  prompt: "撰写产品设计文档"
6
  artifacts:
7
    - docs/product-spec.md

支持输出门禁——agent 输出必须包含指定 token 才算通过：

1
  pass_if_output_contains: "PR_READY"
2
  on_output_missing: "escalate"

`parallel` — 并行节点#

多个子节点同时执行。典型场景：三方审查并行。

1
- id: parallel-review
2
  type: parallel
3
  nodes:
4
    - id: visual-review
5
      agent: kele
6
      skill: visual-design-master
7
    - id: product-review
8
      agent: xiaomi
9
      skill: product-master
10
    - id: qa-review
11
      agent: niunai
12
      skill: qa-master

`gate` — 门禁节点#

必须通过才能继续。用于自动化测试、构建检查。

1
- id: run-tests
2
  type: gate
3
  command: "cd {{repo}} && npm run test"
4
  on_fail: abort  # abort | retry | notify

`loop` — 循环节点#

审查-修复循环的核心。内置迭代上限和自动升级。

1
- id: dev-review-loop
2
  type: loop
3
  until: ALL_UPSTREAM_APPROVED
4
  max_iterations: 10
5
  on_max_iterations: escalate
6
  nodes:
7
    - id: fix-task
8
      type: agent
9
      agent: tangyuan
10
    - id: validate
11
      type: gate
12
      command: "npm run test"
13
    - id: product-revalidate
14
      type: agent
15
      agent: xiaomi
16
    - id: visual-revalidate
17
      type: agent
18
      agent: kele
19
    - id: arch-revalidate
20
      type: agent
21
      agent: yuni
22
    - id: merge-review-issues
23
      type: agent
24
      agent: yuni

关键设计：默认最大 10 轮。达到上限仍未收口时，engine 自动把 run 标记为 escalated，暂停执行，等待人类决策。

`approve` — 人工审批节点#

工作流暂停，等待人类操作。

1
- id: human-review
2
  type: approve
3
  prompt: "PR 已创建，等待 review"
4
  notify: "chat:oc_2e18504f35810ae7949c149098cd4364"

10 种内置工作流#

命令	流水线	典型参与者
`/实现`	PRD → 线框图 → 视觉设计 → 架构设计 → 开发 → 三方审查	全员
`/测试`	测试计划 → 证据采集 → 执行 → 修复 → 审查	牛奶 + 年糕 + 工程师
`/修复`	拆任务 → 修复 → 截图 → 内审 → 三方验收	芋泥 + 工程师 + 年糕
`/视觉审查`	证据采集 → 视觉审查 → 交叉 review → 修复 → 验收	可乐 + 奶茶 + 年糕
`/产品审查`	证据采集 → 产品审查 → 交叉 review → 修复 → 验收	奶茶 + 可乐 + 年糕
`/功能审查`	证据采集 → 功能审查 → 交叉 review → 修复 → 验收	牛奶 + 奶茶 + 年糕
`/架构审查`	架构审查 → 拆任务 → 修复 → 验收	芋泥
`/隐私审查`	证据采集 → 隐私审查 → 修复 → 验收	奶茶 + 毛球
`/安全审查`	证据采集 → 安全审查 → 修复 → 验收	毛球
`/全审查`	证据采集 → 三路并行审查 → 合并去重 → 修复 → 验收	全部审查者

完整示例：`/修复` 工作流#

1
name: fix
2
trigger: "/修复"
3
description: "Bug 修复：拆任务 → 修复 → 截图 → 内审 → 三方验收 → PR"
4
branch_pattern: "fix/{{name}}"
5

6
retry:
7
  max: 2
8
  on_timeout: "model: gpt-5.4"
9

10
nodes:
11
  - id: create-branch
12
    type: bash
13
    command: "git fetch origin main && git checkout -b fix/{{name}} origin/main"
14

15
  - id: task-breakdown
16
    type: agent
17
    depends_on: [create-branch]
18
    agent: yuni
19
    skill: architecture-master
20
    prompt: "分析问题「{{name}}」，拆解修复任务。"
21

22
  - id: dev-review-loop
23
    type: loop
24
    depends_on: [task-breakdown]
25
    until: ALL_UPSTREAM_APPROVED
26
    max_iterations: 10
27
    nodes:
28
      - id: fix-task
29
        type: agent
30
        agent: tangyuan
31
        skill: fullstack-master
32
        prompt: "修复任务。完成后 git commit。"
33
      - id: validate
34
        type: gate
35
        command: "cd {{repo}} && npm run test"
36
        on_fail: retry
37
      - id: post-fix-screenshot
38
        type: agent
39
        agent: niangao
40
        timeout: 1800
41
        prompt: "采集修复后证据包。"
42
      - id: internal-review
43
        type: agent
44
        agent: yuni
45
        skill: architecture-master
46
        prompt: "全量内审所有改动。"
47
      - id: product-revalidate
48
        type: agent
49
        agent: xiaomi
50
        skill: product-master
51
        prompt: "验收修复结果。通过输出 APPROVED，否则按统一 schema 输出问题。"
52
      - id: visual-revalidate
53
        type: agent
54
        agent: kele
55
        skill: visual-design-master
56
        prompt: "验收修复结果。通过输出 APPROVED，否则按统一 schema 输出问题。"
57
      - id: arch-revalidate
58
        type: agent
59
        agent: yuni
60
        skill: architecture-master
61
        prompt: "验收修复结果。通过输出 APPROVED，否则按统一 schema 输出问题。"
62
      - id: merge-review-issues
63
        type: agent
64
        agent: yuni
65
        prompt: "汇总三方 review，去重合并，统一优先级。三方都通过则输出 ALL_UPSTREAM_APPROVED。"
66

67
  - id: pr-ready-checklist
68
    type: agent
69
    depends_on: [dev-review-loop]
70
    agent: yuni
71
    pass_if_output_contains: "PR_READY"
72
    on_output_missing: "escalate"
73

74
  - id: push-and-pr
75
    type: bash
76
    depends_on: [pr-ready-checklist]
77
    command: |
78
      git push -u origin fix/{{name}}
79
      gh pr create --title "fix: {{name}}" --reviewer <reviewer>
80

81
  - id: retro
82
    type: agent
83
    depends_on: [push-and-pr]
84
    agent: main
85
    skill: workflow-retro
86
    prompt: "执行工作流复盘。"

统一问题输出 Schema#

所有 review 节点的问题输出遵循同一格式：

1
issue_id: ISSUE-001
2
page_or_module: profile/edit
3
severity: blocker        # blocker | high | medium | low
4
tags: [product, state]
5
suggested_fix: "保存失败时需要展示错误提示"
6
blocker: true
7
owner_role: tangyuan     # 最适合接手的角色
8
source_reviewers: [xiaomi, kele, yuni]

这使得 merge-review-issues 节点可以自动去重、合并、统一优先级，而不是让工程师面对三份格式不同的问题清单。

升级与恢复#

当 loop 达到上限被自动升级后，支持四种恢复动作：

动作	效果
`resume`	继续下一轮迭代
`abort`	终止工作流
`force-pass`	人工认可通过，继续后续节点
`reset-iteration`	清零迭代计数，重新开始

设计取舍#

为什么用 YAML 而不是代码？

Git-diffable，人类可读
工作流定义和执行逻辑分离
新增工作流类型只需要写一个文件

为什么循环要有上限？ 真实的审查很少一轮就过。但无限循环浪费资源。10 轮上限是在”彻底”和”务实”之间的平衡。如果 10 轮解决不了，说明需要人类来看。

为什么需要自动升级？ Agent 团队会陷入审查循环——每一轮修复引入新问题。自动升级机制防止这种情况无限持续。

为什么需要工作流引擎#

架构#

6 种节点类型#

bash — 确定性节点#

agent — AI 节点#

parallel — 并行节点#

gate — 门禁节点#

loop — 循环节点#

approve — 人工审批节点#