Description

Control Varie Workstation sessions (Claude Code multi-session orchestration). Use when: (1) user wants to work on / start / resume a coding project, (2) chec...

Usage (SKILL.md)

Workstation Control

Name: Coding Agent Orchestrator
Author: masqueradeljb

Control Varie Workstation coding sessions via wctl.

Step 0: Check Pending Prompts (ALWAYS DO THIS FIRST)

Before ANY routing or session work, check if a session is waiting for user input:

cat ~/.openclaw/workspace/pending-prompts.json 2>/dev/null || echo '{"prompts":[]}'

If prompts array is non-empty AND the user's message looks like a response (a number, "approve", "yes", "no", "reject", short answer, or references a project in the pending list): → This is a reply to a pending prompt. Go directly to "Responding to Session Prompts" section below.

If prompts array is empty OR user's message is clearly a new request (mentions a different project, asks to start/create something, etc.): → Continue to Smart Routing below.
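The Step 0 decision can be sketched as a small shell function. This is only an illustration under assumptions: `route_step0` is a hypothetical helper, the caller is assumed to have already counted the entries in the prompts array, and only the exact-match cases (a bare number or one of the listed keywords) are covered. Short free-form answers and project references still need judgment.

```shell
# Hypothetical helper (not part of wctl): decide between "reply" and "route".
# $1 = number of pending prompts, $2 = the user's message.
route_step0() {
  pending_count=$1
  message=$(printf '%s' "$2" | tr '[:upper:]' '[:lower:]')

  # No pending prompts: always continue to Smart Routing.
  if [ "$pending_count" -eq 0 ]; then
    echo route
    return 0
  fi

  case "$message" in
    approve|yes|no|reject) echo reply ;;  # keyword responses
    ''|*[!0-9]*)           echo route ;;  # empty or contains non-digits
    *)                     echo reply ;;  # digits only, e.g. "2"
  esac
}
```

For example, `route_step0 1 "approve"` prints `reply`, while `route_step0 1 "start a new api project"` prints `route`.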

Smart Routing (Main Workflow)

When the user mentions working on a project (e.g., "work on my-api", "resume frontend work", "start auth refactor"), follow this decision tree silently — do NOT ask the user unless you hit an ambiguous case:

Step 1: Check daemon + list sessions

wctl list

(If daemon not running, tell user to start the Workstation app.)

Step 2: Match project

Look at the repo field in each worker. Match the user's project mention against repo names (fuzzy — "frontend" matches "my-frontend-app", "api" matches "backend-api-service").

If session exists and task context aligns (user's request fits the current taskId/workContext): → wctl dispatch <session-id> "<user's message>"

If session exists but task context doesn't align (user wants to work on something different in the same repo): → Ask: "There's already a session for {repo} working on {taskId}. Should I send this to that session, or create a fresh one?"

If no session exists for the project: → Go to Step 3.

If multiple repos match (e.g., "api" could be frontend-api or backend-api): → Ask which one.
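The fuzzy match in Step 2 could be approximated with a case-insensitive substring test. The sketch below assumes the repo names have already been extracted from the `wctl list` output; `match_repos` is a hypothetical helper, and substring matching is one simple interpretation of "fuzzy", not necessarily what the author intends.

```shell
# Hypothetical helper: print every repo whose name contains the mention.
# $1 = the user's project mention, remaining args = repo names.
match_repos() {
  needle=$(printf '%s' "$1" | tr '[:upper:]' '[:lower:]')
  shift
  for repo in "$@"; do
    lower=$(printf '%s' "$repo" | tr '[:upper:]' '[:lower:]')
    case "$lower" in
      *"$needle"*) printf '%s\n' "$repo" ;;  # quoted, so matched literally
    esac
  done
}
```

No output means no session exists for the project (go to Step 3), one line identifies the candidate session, and more than one line is the ambiguous case where the user should be asked.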

Step 3: Auto-create session (no matching session found)

wctl discover

Find the project path from the discovered list, then:

wctl create <repo> <path> <task-id>

Derive task-id from the user's message (e.g., "work on auth refactor" → task-id: auth-refactor). Keep it short, lowercase, hyphenated.

After creation, confirm: "Started new session for {repo} ({task-id})."

If project not found in discover results, ask the user for the repo path.
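Deriving a short, lowercase, hyphenated task-id from the user's message might look like the following. `derive_task_id` is a hypothetical helper, and the list of leading verbs it strips ("work on", "start", "resume") is an assumption based on the examples in this section.

```shell
# Hypothetical helper: turn a request into a short, hyphenated task-id.
derive_task_id() {
  printf '%s' "$1" \
    | tr '[:upper:]' '[:lower:]' \
    | sed -E 's/^(work on|start|resume) +//' \
    | sed -E 's/[^a-z0-9]+/-/g; s/^-+//; s/-+$//'
}
```

The result can then be passed as the third argument of `wctl create`.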

Commands Reference

Command	Use
`wctl status --human`	Check daemon alive
`wctl list`	List sessions (JSON, for parsing)
`wctl list --human`	List sessions (readable, for user)
`wctl dispatch <id> "<msg>"`	Send message to existing session
`wctl dispatch-answers <id> <a1> <a2>...`	Send multi-question answers. Use `next:N` for multi-select
`wctl create <repo> <path> [task]`	Create new session
`wctl escape <id>`	Send Escape key (cancel prompt/menu)
`wctl interrupt <id>`	Send Ctrl+C (stop running process)
`wctl enter <id>`	Send Enter key (confirm/dismiss)
`wctl screenshot <id>`	Screenshot a session (focus + capture)
`wctl screenshot --screen`	Screenshot main display
`wctl set-remote-mode on|off`	Enable/disable remote mode (bridge auto-focus for screenshots)
`wctl discover`	Scan for project repos

Session Control (Escape / Interrupt)

When the user wants to stop, cancel, or interrupt a session:

User says	Command
"stop session X", "cancel", "kill it", "abort"	`wctl interrupt <id>` (sends Ctrl+C)
"escape", "go back", "cancel prompt", "dismiss"	`wctl escape <id>` (sends Escape key)
"press enter", "confirm", "continue", "submit"	`wctl enter <id>` (sends Enter key)

Strategy: If unsure, try escape first (safe — cancels UI prompts). If still stuck, use interrupt (harder — sends SIGINT).

Screenshots

To show the user what a session looks like:

# 1. Capture the session
wctl screenshot <session-id>
# Returns: { "status": "ok", "imagePath": "/path/to/screenshot.png" }

# 2. Send to user using the built-in message tool

To deliver the screenshot, use your built-in message tool (not bash) with action: "send" and mediaUrl pointing to the captured image path. The message tool is session-bound — it automatically targets the channel and user you're currently chatting with. No need to specify channel or target manually.

If the message tool is unavailable, fall back to the CLI:

openclaw message send --media <imagePath> --channel <channel> --target <target>

Replace <channel> and <target> with the values from the current conversation (e.g., telegram + the user's chat ID, or whatsapp + their phone number).

For full screen (e.g., to see Chrome, other apps): wctl screenshot --screen

When to use: User says "show me", "screenshot", "what does it look like", "what's happening in session X".

Always deliver the image after capturing, using the message tool or, as a fallback, openclaw message send --media. wctl only saves the file locally.
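To pass the captured file on, the image path has to be pulled out of the JSON that `wctl screenshot` returns. A minimal sketch, assuming the JSON arrives on stdout as a single line shaped like the example above; `image_path` is a hypothetical helper, and a proper JSON parser would be more robust where one is available.

```shell
# Hypothetical helper: read the screenshot JSON on stdin, print imagePath.
# Assumes single-line JSON and a path without embedded double quotes.
image_path() {
  sed -n 's/.*"imagePath"[[:space:]]*:[[:space:]]*"\([^"]*\)".*/\1/p'
}
```

Usage would be along the lines of `wctl screenshot <session-id> | image_path`. Empty output means no path was found and nothing should be sent.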

Critical Rules

dispatch for existing sessions — always. It types directly into the terminal. Never use wctl route (it may restart Claude and disrupt work).
Never prepend claude to messages — just pass the user's message as-is to dispatch.
Add --human when showing output to user — JSON otherwise for your own parsing.
Ask when unsure — if you can't confidently match the user's message to exactly one session/project, ask to confirm. Wrong dispatches disrupt real coding work. Autonomy is good, but correctness matters more.
Never guess or hallucinate — don't invent project names, session IDs, or options. Always check wctl list and pending-prompts.json for ground truth.
Use "Chat about this" as fallback — if you can't confidently map the user's answer to option numbers for a multi-question prompt, use --chat-arrows 20 to select "Chat about this" and then dispatch their message as text. A stuck question modal is worse than falling back to chat.

Responding to Session Prompts

When Step 0 finds pending prompts and the user's message is a response:

Step 1: Identify the target session

The pending prompt has a project field. Use it to find the session:

wctl list

Find the session whose repo matches the pending prompt's project. Use its sessionId.

If multiple prompts are pending, match the user's message to the most relevant one (by project name mention or most recent).

Step 2: Map intent to response

Plan approval (4 options):

User says	Dispatch
"1", "clear context", "bypass all"	`wctl dispatch <id> "1"`
"2", "bypass permissions", "yes bypass"	`wctl dispatch <id> "2"`
"3", "approve", "yes", "go ahead", "lgtm", "manually approve"	`wctl dispatch <id> "3"`
"reject", "no", feedback like "change X to Y"	Two steps: `wctl dispatch <id> "4"` then wait 2s then `wctl dispatch <id> "<their feedback>"`

Default to option 3 ("yes, manually approve edits") when user says generic approval like "yes", "approve", "go ahead".

Important for option 4 (feedback/reject): This is a two-step process. First dispatch "4" to select the text input option, wait 2 seconds for the text prompt to appear, then dispatch the feedback text. Example:

wctl dispatch abc123 "4"
sleep 2
wctl dispatch abc123 "don't modify the database schema"
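The plan-approval table above can be read as a lookup from the user's reply to an option number. The sketch below is a hypothetical helper (`plan_option` is not a wctl command) that encodes only the phrases listed in the table. Treating every other reply as feedback for option 4 is an assumption; a reply that is genuinely unclear should still be confirmed with the user first.

```shell
# Hypothetical helper: map a plan-approval reply to its option number.
plan_option() {
  reply=$(printf '%s' "$1" | tr '[:upper:]' '[:lower:]')
  case "$reply" in
    1|"clear context"|"bypass all")                   echo 1 ;;
    2|"bypass permissions"|"yes bypass")              echo 2 ;;
    3|approve|yes|"go ahead"|lgtm|"manually approve") echo 3 ;;
    *)                                                echo 4 ;;  # reject, no, or feedback text
  esac
}
```

When the result is 4, the two-step sequence applies: dispatch "4", wait, then dispatch the feedback text.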

Question — ALWAYS dispatch the OPTION NUMBER, never text:

Look up the user's answer in the pending prompt's questions array and find the matching option number. Example: if options are ["1. Night", "2. Day", "3. Morning"] and user says "night", dispatch "1" (not "night").

User says	Action
A number ("1", "2")	Dispatch that number directly
A word matching an option label ("night", "dog")	Find the option number and dispatch the NUMBER
Free text not matching any option	Dispatch the text (for "Other" option)
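Mapping an answer to its option number could be sketched as follows. `option_number` is a hypothetical helper; it assumes option labels carry an "N. " prefix as in the example above and that an option's number equals its position in the list.

```shell
# Hypothetical helper: print the option number for an answer.
# $1 = the user's answer, remaining args = option labels such as "1. Night".
# Returns 1 with no output when nothing matches.
option_number() {
  answer=$(printf '%s' "$1" | tr '[:upper:]' '[:lower:]')
  shift
  case "$answer" in
    ''|*[!0-9]*) ;;                           # not a bare number, keep looking
    *) printf '%s\n' "$answer"; return 0 ;;   # already a number
  esac
  index=1
  for option in "$@"; do
    label=$(printf '%s' "$option" | sed -E 's/^[0-9]+\. *//' | tr '[:upper:]' '[:lower:]')
    if [ "$label" = "$answer" ]; then
      printf '%s\n' "$index"
      return 0
    fi
    index=$((index + 1))
  done
  return 1
}
```

A non-zero return corresponds to the free-text row of the table: dispatch the text for the "Other" option, or fall back to "Chat about this" if the intent is unclear.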

Single question: Use regular dispatch: wctl dispatch <id> "2"

Multiple questions: Use dispatch-answers — it sends each answer without Enter (Claude auto-advances on single-select), then sends Enter at the end to submit. Map EACH answer to its option NUMBER, then pass them all in one command:

wctl dispatch-answers <id> 2 1 3

This sends: "2" → wait → "1" → wait → "3" → wait → Enter (submit). No chaining or sleep needed — timing is handled internally.

Multi-select questions (checkboxes — check the multiSelect field in pending-prompts.json): Typing a number toggles it on/off but does NOT advance (cursor stays at position 1). After selecting all options, use next:N to arrow-down N times to the "Next"/"Submit" button and press Enter. N = the number of options for that question (including "Other"), from questions[i].options.length.

wctl dispatch-answers <id> 1 2 next:5 2

This sends: "1" (toggle) → "2" (toggle) → arrow-down×5 to "Next" → Enter → "2" (next question, single-select) → Enter (submit all).

Example with 4 questions (multi/5opts, single, single, multi/5opts):

wctl dispatch-answers <id> 1 4 next:5 2 1 1 3 next:5

Each next:N is self-contained — N is always questions[i].options.length for that specific multi-select question.

How to tell if a question is multi-select: The pending prompt's questions array has a multiSelect field per question. If multiSelect: true, you MUST add next:N after their selections. If multiSelect: false (or missing), it's single-select and auto-advances — no next needed.

If the last question is multi-select, use next:N as the last token — it will click "Submit" instead of "Next" (same button position). The final Enter to confirm all answers is sent automatically after all tokens.
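The token rules above can be sketched as a function that assembles the argument list for `wctl dispatch-answers`. The `single:<n>` and `multi:<option-count>:<picks>` spec strings are a hypothetical encoding invented for this illustration; they are not a wctl or pending-prompts.json format.

```shell
# Hypothetical helper: build dispatch-answers tokens from question specs.
#   single:<n>                  one answer for a single-select question
#   multi:<count>:<a,b,...>     picks for a multi-select question with
#                               <count> options (including "Other")
build_answer_tokens() {
  tokens=""
  for spec in "$@"; do
    kind=${spec%%:*}
    rest=${spec#*:}
    case "$kind" in
      single)
        tokens="$tokens $rest" ;;
      multi)
        count=${rest%%:*}
        picks=${rest#*:}
        # Toggle each pick, then arrow down <count> times to Next/Submit.
        tokens="$tokens $(printf '%s' "$picks" | tr ',' ' ') next:$count" ;;
      *)
        echo "unknown question spec: $spec" >&2
        return 1 ;;
    esac
  done
  printf '%s\n' "${tokens# }"
}
```

With these specs, `build_answer_tokens multi:5:1,4 single:2 single:1 multi:5:1,3` reproduces the token sequence of the four-question example above.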

"Chat about this" — at the very bottom of the question modal (below all options and Next/Submit), there's a "Chat about this" option. Arrow keys do NOT wrap/circulate, so you can safely overshoot. Use --chat-arrows N to select it. Calculate N based on the first question only:

First question is multi-select with K options (incl. Other): --chat-arrows K+1 (extra arrow for Next button)
First question is single-select with K options (incl. Other): --chat-arrows K

# Example: first question is multi-select with 5 options → 6 arrows
wctl dispatch-answers <id> --chat-arrows 6
# Example: first question is single-select with 3 options → 3 arrows
wctl dispatch-answers <id> --chat-arrows 3

When using --chat-arrows, no answer tokens are needed — it replaces the entire answer flow.
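The arrow count follows directly from the two rules above. A minimal sketch, where `chat_arrows` is a hypothetical helper rather than part of wctl:

```shell
# Hypothetical helper: compute N for --chat-arrows.
# $1 = "multi" or "single" (type of the FIRST question)
# $2 = K, its option count including "Other"
chat_arrows() {
  case "$1" in
    multi)  echo $(( $2 + 1 )) ;;  # one extra arrow for the Next button
    single) echo "$2" ;;
    *)      return 1 ;;
  esac
}
```

For the two examples above, `chat_arrows multi 5` prints 6 and `chat_arrows single 3` prints 3.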

FALLBACK RULE: If you are unsure how to map the user's answers to option numbers, or the user's message is vague/unclear, always use --chat-arrows instead of guessing. This lets the user follow up with a simple text prompt rather than getting stuck on a broken selection. Since arrows don't wrap, you can safely use --chat-arrows 20 if unsure about the exact count — it will land on "Chat about this" regardless.

After selecting "Chat about this", immediately dispatch the user's message as a follow-up:

wctl dispatch-answers <id> --chat-arrows 20
sleep 3
wctl dispatch <id> "<user's original message>"

Step 3: Confirm

After dispatching, tell the user: "Sent response to {project}."

Errors

daemon not running → tell user to start Workstation app
session not found → wctl list to show valid IDs
project not in discover → ask user for repo path
timeout → session busy, retry shortly
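"Retry shortly" could be handled with a small wrapper such as the one below. This is a generic sketch: `retry_cmd` is a hypothetical helper, and it assumes the wrapped command signals failure through a non-zero exit status, which this document does not confirm for wctl.

```shell
# Hypothetical helper: run a command up to N times with a pause between tries.
# $1 = max attempts, $2 = seconds between attempts, rest = command to run.
retry_cmd() {
  attempts=$1
  delay=$2
  shift 2
  try=1
  while :; do
    "$@" && return 0                          # success: stop retrying
    [ "$try" -ge "$attempts" ] && return 1    # out of attempts
    try=$((try + 1))
    sleep "$delay"
  done
}
```

A read-only call such as `retry_cmd 3 2 wctl list` is the safest candidate. Retrying a dispatch may type the same message twice if the first attempt reached the session before timing out.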

Quick Start

Install

Install the Varie Workstation Electron app (macOS arm64).

Install wctl (the CLI that bridges OpenClaw to Workstation):

# wctl ships with Workstation — symlink it to your PATH:
ln -sf /path/to/varie-workstation/openclaw/wctl.js ~/.local/bin/wctl
chmod +x ~/.local/bin/wctl

Copy this skill to your OpenClaw workspace:

cp -r workstation ~/.openclaw/workspace/skills/workstation

Configure

Launch the Workstation app and verify it's running: wctl status
Enable remote mode for mobile screenshot support: wctl set-remote-mode on
The OpenClaw-Workstation bridge (bundled in the app) writes pending prompts to ~/.openclaw/workspace/pending-prompts.json — this enables bidirectional question/approval flows from your phone.

Verify

wctl status --human    # Should show "Workstation is running"
wctl list --human      # Should list active sessions (if any)

Prerequisites

This skill requires the Varie Workstation app — an Electron-based multi-session Claude Code orchestration environment. The skill is the mobile control layer: it lets you manage Workstation sessions from Telegram, WhatsApp, or any OpenClaw channel.

Dependency	What it does	Required?
Varie Workstation	Electron app hosting Claude Code terminals	Yes
`wctl` CLI	Bridges OpenClaw commands to Workstation's Unix socket	Yes (ships with Workstation)
OpenClaw-Workstation bridge	Forwards session events (questions, approvals) to OpenClaw for mobile notifications	Yes (bundled in Workstation)

Without Workstation running, the skill will report "daemon not running" for all commands.

Security & Guardrails

Permissions

wctl communicates with Workstation via a local Unix socket (/tmp/varie-workstation.sock). No network calls — all traffic is local.
Screenshot capture requires macOS Screen Recording permission for the Workstation app.

Declared File Access

~/.openclaw/workspace/pending-prompts.json (read-only) — This file is read on every invocation (Step 0) to check if any Claude Code session is waiting for user input. It is written by the OpenClaw-Workstation bridge, not by this skill. Contents: question text, option labels, and project identifiers from active sessions. No credentials, secrets, or user data. The file may not exist until the bridge creates it — the skill handles this gracefully with a fallback empty response.

Screenshots

Session screenshots (wctl screenshot <id>) capture only the specific Workstation terminal window for the targeted session.
Full-screen screenshots (wctl screenshot --screen) capture the entire display, which may include unrelated windows and sensitive content. This command is only executed when the user explicitly requests a full-screen capture (e.g., "screenshot my screen", "show me everything").
Screenshots are saved locally to ~/.openclaw/media/ with a 30-minute TTL cleanup.
Screenshots are sent only to the user's own messaging channel (Telegram/WhatsApp) — never to third parties or external services.

Confirmations Before Risky Actions

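The skill asks for confirmation when multiple repos match ambiguously, or before creating a fresh session in a repo that already has one working on a different task. When no session exists for a project, it creates one automatically and reports it to the user.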
wctl interrupt (Ctrl+C) is reserved for explicit user requests — the skill never sends it autonomously.

Data Handling

openclaw message send routes media through your configured OpenClaw channel (Telegram/WhatsApp). Images traverse the channel provider's servers but are only sent to the requesting user's conversation.

Input Validation

The skill maps user intent to option numbers before dispatching — free text is never injected into PTY commands without validation.
The "Chat about this" fallback is used whenever intent mapping is uncertain, preventing wrong selections.

External Endpoints

Endpoint	Protocol	Data Sent
`/tmp/varie-workstation.sock`	Unix socket (local)	Session commands (list, dispatch, create, screenshot)
`~/.openclaw/workspace/pending-prompts.json`	Local file read	None (read-only)
`openclaw message send --channel --target`	OpenClaw channel (Telegram/WhatsApp)	Screenshot images (when user requests)

No external APIs are called directly by this skill. All network communication goes through OpenClaw's channel layer.

Trust Statement

This skill controls local Claude Code sessions running inside the Varie Workstation app. All communication is via local Unix socket — no data leaves your machine unless you request a screenshot, which is sent through your configured OpenClaw messaging channel. Only install if you trust the Varie Workstation app and your OpenClaw channel configuration.

Publisher

@masqueradeljb

Links

Varie Workstation — The Electron app this skill controls
OpenClaw — The AI agent gateway this skill runs on

Security Recommendations

This skill appears to be a legitimate controller for Varie Workstation via the wctl CLI, but pay attention to what it will read and send: SKILL.md tells the agent to read ~/.openclaw/workspace/pending-prompts.json, scan/discover local repos, capture screenshots, and send image files through the messaging tool or CLI. Before installing, confirm you trust the wctl binary and the skill's source (check the GitHub repo). Ask the publisher to (1) declare the config path(s) the skill reads, (2) confirm exactly what will be transmitted (screenshots may include secrets), and (3) document any messaging fallbacks that use channel/target values. If you want to limit risk, run wctl and this skill in an isolated account/container, or require a manual approval step before dispatching actions or sending screenshots. Because metadata doesn't list the file access the instructions perform, treat this as suspicious until the author clarifies the intended file accesses and privacy implications.

Capability Analysis

Type: OpenClaw Skill
Name: coding-agent-orchestrator
Version: 1.0.1

The skill provides remote orchestration for 'Varie Workstation' coding sessions using the `wctl` CLI, which involves high-risk capabilities such as terminal command dispatch (`wctl dispatch`), session creation, and full-screen captures (`wctl screenshot --screen`). While these functions are aligned with the stated purpose, the instructions in `SKILL.md` encourage passing raw user input directly into shell commands, creating a significant risk for shell injection. Additionally, the ability to capture the entire display and read local workspace files (`pending-prompts.json`) constitutes a broad attack surface, although no explicit evidence of malicious intent or data exfiltration was found.

Capability Assessment

ℹ Purpose & Capability

Name/description and required binary (wctl) align with a Workstation session controller. However, the SKILL.md expects access to ~/.openclaw/workspace/pending-prompts.json and to local screenshot files, yet the registry metadata declares no required config paths. That mismatch (instructions needing local OpenClaw workspace files but metadata not declaring them) is an incoherence.

⚠ Instruction Scope

The runtime instructions tell the agent to read a local file (~/.openclaw/workspace/pending-prompts.json), run wctl commands that can control sessions (create/dispatch/interrupt/escape) and capture screenshots, and then transmit captured images via the messaging tool or CLI fallback. Those actions legitimately belong to a workstation orchestrator, but they involve reading local state and capturing/sending potentially sensitive screen contents — and the skill's metadata does not advertise or restrict that access.

✓ Install Mechanism

This is an instruction-only skill with no install spec and no code files. No additional packages are downloaded or installed by the skill itself, which minimizes install-time risk.

⚠ Credentials

The skill requests no environment variables or credentials (appropriate), but the instructions access local files and artifacts (pending-prompts.json, discovered repo paths, screenshot files) without declaring required config paths. The absence of declared config paths or explicit permission statements is disproportionate to the metadata and makes the access less transparent.

✓ Persistence & Privilege

always=false and the skill is user-invocable; it does not request forced persistent inclusion. Autonomous invocation (model invocation enabled) is normal and not flagged by itself.

Version History

v1.0.1

Improved security documentation: declared pending-prompts.json file access, clarified screenshot scope (session-only by default, full-screen only on explicit request), noted screenshots are sent only to the requesting user's channel.

v1.0.0

Initial release: enables orchestration and control of Varie Workstation coding sessions via CLI.

- Supports starting, resuming, and dispatching messages to coding sessions.
- Handles user replies to session prompts, including plan approvals and feedback.
- Implements smart routing to match user requests with existing sessions or create new ones.
- Allows listing, status checks, and control commands (interrupt, escape, enter) for active sessions.
- Provides functionality to capture and deliver session or screen screenshots to the user.
- Enforces strict rules for command safety, correct dispatching, and user confirmation on ambiguities.

Metadata

Slug coding-agent-orchestrator

Version 1.0.1

License MIT-0

Total installs 0

Current installs 0

Number of versions 2

FAQ

What is Coding Agent Orchestrator?

Control Varie Workstation sessions (Claude Code multi-session orchestration). Use when: (1) user wants to work on / start / resume a coding project, (2) chec... It is an AI Agent Skill plugin for Claude Code / OpenClaw, with 263 cumulative downloads to date.

How do I install Coding Agent Orchestrator?

Run the command "/install coding-agent-orchestrator" in an OpenClaw or Claude Code chat to install it in one step, with no additional configuration required.

Is Coding Agent Orchestrator free?

Yes. Coding Agent Orchestrator is completely free and released under the MIT-0 license; you may download, install, and use it freely.

Which platforms does Coding Agent Orchestrator support?

Coding Agent Orchestrator is cross-platform and can be used in any environment where OpenClaw / Claude Code is deployed.

Who developed Coding Agent Orchestrator?

It is developed and maintained by masqueradeljb (@masqueradeljb). The current version is v1.0.1.
