Batch Executor

Corpus-scale processing: classify → prioritize → spawn → checkpoint → reconcile.

Unlike task-extractor (for 3-12 inline tasks) or batch-cognition (for idea analysis), this skill EXECUTES at scale with sub-agent parallelism.

When to Use

- Google Drive folder dump (mixed docs, notes, spreadsheets)
- ChatGPT conversation export (3K+ prompts)
- Apple Notes dump (years of ideas)
- Any input > 20 items or > 10K tokens of raw content
- File-based input (not inline chat messages; use task-extractor for those)

Architecture

CODEBLOCK0

Phase 1: INGEST

Save ALL raw input to systems/batch-executor/corpus/YYYY-MM-DD-SOURCE.md BEFORE any processing.
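A minimal sketch of the save-first step, assuming the raw input is already available as a string. The helper names and the slug rule (lowercase, non-alphanumerics turned into hyphens) are illustrative assumptions; only the directory and the `YYYY-MM-DD-SOURCE.md` pattern come from the text above.

```python
from datetime import date
from pathlib import Path

CORPUS_DIR = Path("systems/batch-executor/corpus")


def corpus_path(source: str, day: date, root: Path = CORPUS_DIR) -> Path:
    """Build the YYYY-MM-DD-SOURCE.md path for a raw corpus dump."""
    # Assumption: SOURCE is slugged by lowercasing and hyphenating.
    slug = "".join(c if c.isalnum() else "-" for c in source.lower()).strip("-")
    return root / f"{day.isoformat()}-{slug}.md"


def save_raw(raw: str, source: str, day: date, root: Path = CORPUS_DIR) -> Path:
    """Write the untouched input to disk and return where it landed."""
    path = corpus_path(source, day, root)
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(raw, encoding="utf-8")
    return path
```

Calling `save_raw` before any parsing keeps the original recoverable if a later phase fails.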

For file inputs:

- PDF → extract text via pdf tool
- CSV/JSON → parse, one item per row/object
- Markdown → split on ## headers or --- separators
- ChatGPT export → parse conversations.json, group by chain_id
- Google Drive → process each file, flatten into items
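As one illustration of the rules above, the Markdown case might look like the sketch below. The function name is hypothetical, and treating any line of three or more hyphens as a separator is an assumption.

```python
import re


def split_markdown(text: str) -> list[str]:
    """Split a Markdown dump into items on '## ' headers or '---' separators."""
    items: list[str] = []
    current: list[str] = []
    for line in text.splitlines():
        is_header = line.startswith("## ")
        is_separator = re.fullmatch(r"-{3,}\s*", line) is not None
        # A header or separator closes the item collected so far.
        if (is_header or is_separator) and current:
            items.append("\n".join(current).strip())
            current = []
        # Headers stay with their item; separator lines are dropped.
        if not is_separator:
            current.append(line)
    if current:
        items.append("\n".join(current).strip())
    return [item for item in items if item]
```

Each returned string becomes one manifest item; headers stay attached to the text they introduce.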

Create the manifest:
CODEBLOCK1

Phase 2: CLASSIFY

For each item, assign:

Type	Description	Action
TASK	Has a clear action verb + deliverable	EXECUTE
IDEA

Effort per item:

- TRIVIAL (< 1 min): file rename, note capture, config change
- QUICK (1-5 min): web search, small edit, API call
- MEDIUM (5-30 min): build a page, write a doc, research topic
- HEAVY (30+ min): full app build, deep research, multi-step workflow
- BLOCKED: needs human input, credentials, or external dependency

Update manifest with Type + Effort columns.
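The effort buckets can be expressed as a small lookup, sketched here under the assumption that a rough minute estimate exists for each item. The ranges above overlap at exactly 5 and 30 minutes; this sketch pushes those boundary values into the larger bucket, which is a choice made here, not something the skill specifies.

```python
from enum import Enum


class Effort(Enum):
    TRIVIAL = "TRIVIAL"
    QUICK = "QUICK"
    MEDIUM = "MEDIUM"
    HEAVY = "HEAVY"
    BLOCKED = "BLOCKED"


def effort_for(minutes: float, blocked: bool = False) -> Effort:
    """Map an estimated duration in minutes to an effort bucket."""
    if blocked:
        # Needs human input, credentials, or an external dependency.
        return Effort.BLOCKED
    if minutes < 1:
        return Effort.TRIVIAL
    if minutes < 5:
        return Effort.QUICK
    if minutes < 30:
        return Effort.MEDIUM
    return Effort.HEAVY
```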

Phase 3: TRIAGE

Score each TASK and IDEA using quick ICE:

- I (Impact): 1-5. How much does this move the needle?
- C (Cost): 1-5. How cheap/fast to do? (inverted: 5 = trivial)
- E (Exploit): 1-5. How quickly does this produce value?
- Score = I × C × E (max 125)

Sort by score descending. Group by dependency chains.
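A short sketch of the scoring and sort, assuming each item already has its three ratings. The `ScoredItem` class and `rank` helper are illustrative names, not part of the skill.

```python
from dataclasses import dataclass


@dataclass
class ScoredItem:
    item_id: int
    impact: int   # I: 1-5
    cost: int     # C: 1-5, inverted (5 = trivial)
    exploit: int  # E: 1-5

    def __post_init__(self) -> None:
        for name in ("impact", "cost", "exploit"):
            value = getattr(self, name)
            if not 1 <= value <= 5:
                raise ValueError(f"{name} must be 1-5, got {value}")

    @property
    def score(self) -> int:
        """I x C x E, so the maximum is 5 * 5 * 5 = 125."""
        return self.impact * self.cost * self.exploit


def rank(items: list[ScoredItem]) -> list[ScoredItem]:
    """Sort by score, highest first; ties keep their input order."""
    return sorted(items, key=lambda item: item.score, reverse=True)
```

Because the score is a product, a single rating of 1 caps an item at 25, so weak items sink quickly.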

Create execution plan:
CODEBLOCK2
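One way to turn dependency chains into waves is sketched below, assuming dependencies are recorded as a mapping from item number to the set of items it waits on. That data shape and the `plan_waves` name are assumptions for illustration.

```python
def plan_waves(deps: dict[int, set[int]]) -> list[list[int]]:
    """Group items into waves; each wave depends only on earlier waves."""
    remaining = {item: set(needs) for item, needs in deps.items()}
    done: set[int] = set()
    waves: list[list[int]] = []
    while remaining:
        # An item is ready once everything it waits on has been scheduled.
        ready = sorted(item for item, needs in remaining.items() if needs <= done)
        if not ready:
            raise ValueError(
                f"dependency cycle or unknown item among {sorted(remaining)}"
            )
        waves.append(ready)
        done.update(ready)
        for item in ready:
            del remaining[item]
    return waves
```

Items inside one wave have no dependencies on each other, so they are the candidates for parallel sub-agents.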

Phase 4: EXECUTE

Rules:

1. Max 3 sub-agents concurrent. Wait for one to complete before spawning another.
2. QUICK items: execute inline (no sub-agent overhead for < 5 min tasks).
3. MEDIUM/HEAVY items: spawn sub-agent with clear task description + acceptance criteria.
4. Each sub-agent gets: the item content, relevant context from other items, and the target artifact path.
5. Track in manifest: status → EXECUTING, then ✅ DONE / ❌ FAILED / ⚠️ PARTIAL.
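The concurrency cap in rule 1 can be sketched with a semaphore, assuming a sub-agent can be modeled as an awaitable. The `worker` callable is a stand-in for whatever spawn mechanism the host environment provides; it is not an API of this skill.

```python
import asyncio

MAX_SUB_AGENTS = 3


async def run_with_cap(tasks, worker, limit: int = MAX_SUB_AGENTS) -> list:
    """Run worker(task) for every task, never more than `limit` at once."""
    semaphore = asyncio.Semaphore(limit)

    async def guarded(task):
        # A new sub-agent starts only when one of the slots is free.
        async with semaphore:
            return await worker(task)

    # return_exceptions=True keeps one failure from cancelling the rest;
    # failed items come back as exception objects in the result list.
    return await asyncio.gather(
        *(guarded(task) for task in tasks), return_exceptions=True
    )
```

Results come back in task order, so they can be matched to manifest rows by position.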

Sub-agent spawn template:
CODEBLOCK3

Checkpoint every 5 completed items:

- Update manifest
- Report to user: "[X]/[N] done. [Y] in progress. Top findings so far: [...]"
- If user is idle (no response in 30s), continue
- Commit progress to git
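The checkpoint trigger and the progress message could be sketched as follows. The function names are hypothetical, and limiting the report to three findings is an assumption. The manifest write and the git commit are left out because they depend on the host environment.

```python
CHECKPOINT_EVERY = 5


def should_checkpoint(completed: int, every: int = CHECKPOINT_EVERY) -> bool:
    """True each time the completed count reaches a multiple of `every`."""
    return completed > 0 and completed % every == 0


def progress_line(done: int, total: int, in_progress: int, findings: list[str]) -> str:
    """Format the user-facing update in the shape quoted above."""
    top = "; ".join(findings[:3]) or "none yet"
    return f"{done}/{total} done. {in_progress} in progress. Top findings so far: {top}"
```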

Phase 5: RECONCILE

After all waves complete (or all sub-agents return):

1. Re-read manifest
2. For each ❌ FAILED: log reason, decide retry or escalate
3. For each 🔄 sub-agent still running: check status, kill if stale (> 30 min no progress)
4. For each ⚠️ PARTIAL: note what's left
5. Retry failed items once (different approach if possible)
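A sketch of the reconcile pass, assuming each manifest entry records a status string, a last-progress timestamp, and a retry count. The `Entry` fields and the action labels are illustrative, not the skill's actual manifest schema.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta

STALE_AFTER = timedelta(minutes=30)


@dataclass
class Entry:
    item_id: int
    status: str  # "DONE", "FAILED", "PARTIAL", or "EXECUTING"
    last_progress: datetime
    retries: int = 0


def reconcile(entries: list[Entry], now: datetime) -> dict[int, str]:
    """Decide a follow-up action for every entry that needs one."""
    actions: dict[int, str] = {}
    for entry in entries:
        if entry.status == "FAILED":
            # Failed items get exactly one retry, then go to a human.
            actions[entry.item_id] = "retry" if entry.retries < 1 else "escalate"
        elif entry.status == "EXECUTING" and now - entry.last_progress > STALE_AFTER:
            actions[entry.item_id] = "kill"
        elif entry.status == "PARTIAL":
            actions[entry.item_id] = "note-remaining"
    return actions
```

Entries that are done, or still running with recent progress, get no action and are left alone.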

Phase 6: REPORT

Generate final report at systems/batch-executor/reports/YYYY-MM-DD-SOURCE-report.md:

CODEBLOCK4

Append to systems/batch-cognition/value-stack.md (shared with batch-cognition skill).
Log learnings to .learnings/LEARNINGS.md.

Commands

- INLINECODE8: show manifest progress
- pause: stop spawning, let running agents finish
- resume: continue from where we left off (re-read manifest)
- skip [#]: skip item number
- retry [#]: retry failed item
- block [#] [reason]: mark as blocked
- priority [#]: move item to top of queue
- done: trigger report even if items remain
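A minimal parser for the commands whose names are given above, as a sketch. It returns a `(name, item_number, reason)` tuple; that return shape, the helper names, and accepting an optional leading `#` on item numbers are all assumptions.

```python
from typing import Optional, Tuple

NO_ARG = {"pause", "resume", "done"}
ITEM_ARG = {"skip", "retry", "priority"}


def _item_number(token: str) -> int:
    """Accept '14' or '#14'; raises ValueError for anything else."""
    return int(token.lstrip("#"))


def parse_command(line: str) -> Tuple[str, Optional[int], Optional[str]]:
    """Parse one user command into (name, item_number, reason)."""
    parts = line.strip().split(maxsplit=2)
    if not parts:
        raise ValueError("empty command")
    name = parts[0].lower()
    if name in NO_ARG and len(parts) == 1:
        return (name, None, None)
    if name in ITEM_ARG and len(parts) == 2:
        return (name, _item_number(parts[1]), None)
    if name == "block" and len(parts) == 3:
        return (name, _item_number(parts[1]), parts[2])
    raise ValueError(f"unrecognized command: {line!r}")
```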

Key Rules

1. INGEST FIRST. Raw content hits disk before ANY processing.
2. Max 3 concurrent sub-agents. More = chaos, dropped results, context confusion.
3. Checkpoint every 5. Git commit progress. User update.
4. Never mark ✅ without artifact evidence. File exists, build passes, URL responds.
5. NOISE is not failure. Skipping noise is correct behavior. Report it transparently.
6. Corpus items cross-reference. Item #14 may be context for item #27. Pass relevant context to sub-agents.
7. Resume is first-class. If session dies, resume re-reads manifest and continues from last checkpoint.
8. ICE scoring is fast. 30 seconds per item max. Don't overthink triage; execute.
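The artifact-evidence rule can be enforced in code for the simplest case, a file on disk. This is a sketch: the manifest row is modeled as a plain dict, and the helper names are hypothetical. Build and URL checks are omitted.

```python
from pathlib import Path


def has_artifact(path: Path) -> bool:
    """True only if the artifact is a non-empty file on disk."""
    return path.is_file() and path.stat().st_size > 0


def mark_done(row: dict, artifact: Path) -> dict:
    """Return a copy of the row marked DONE, or raise if evidence is missing."""
    if not has_artifact(artifact):
        raise FileNotFoundError(f"no artifact evidence at {artifact}")
    return {**row, "status": "DONE", "artifact": str(artifact)}
```

Raising instead of silently downgrading the status makes a missing artifact visible at the point where the item would have been marked complete.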

Integration with Other Skills

- task-extractor: For inline chat messages (3-12 items). Batch-executor is for file/corpus scale (20+).
- batch-cognition: For idea analysis (THINK-heavy). Batch-executor is for execution (PLAY-heavy).
- orchestrator: Batch-executor can be invoked BY the orchestrator when it detects a corpus dump.
- recorder: After batch-executor completes, route to recorder to update STATUS.md.
