← 返回 Skills 市场
wealthvisionai-source

Autooptimise

作者 WealthVisionAI-Source · GitHub ↗ · v0.1.0 · MIT-0
cross-platform ⚠ suspicious
106
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install autooptimise
功能描述
Autonomously optimise any OpenClaw skill using a benchmark-driven experiment loop. Scores skill outputs 0-10 across 4 dimensions, identifies the lowest-scori...
使用说明 (SKILL.md)

autooptimise

Autonomous benchmark-driven skill optimisation for OpenClaw. Inspired by Andrej Karpathy's autoresearch — the same modify → test → score → keep/discard loop, applied to agent skill quality instead of GPU training.

Trigger Phrases

  • "optimise my weather skill"
  • "run autooptimise on [skill-name]"
  • "benchmark my [skill-name] skill"
  • "improve my skill overnight"

Key Files

File Purpose
benchmark/tasks.json Test task suite (prompts + expected qualities)
benchmark/scorer.md LLM judge scoring rubric
runner/run_experiment.md Autonomous loop instructions (load this next)
runner/experiment_log.md Auto-created run log (gitignored)

How to Run

  1. Read runner/run_experiment.md — it contains the full loop instructions
  2. Confirm the target skill with the user if not specified
  3. Execute the loop (max 3 iterations)
  4. Present proposed changes for human approval — never auto-apply

Scoring

Use the best available LLM judge model (prefer a strong reasoning model). Score each task 0–10 on:

  • Accuracy — correct answer / correct tool called
  • Conciseness — no padding, no unnecessary text
  • Tool usage — right tool, right parameters
  • Formatting — output matches expected format

Full rubric: benchmark/scorer.md

Safety Rules

  • Never auto-apply changes. Always present a diff and wait for explicit human approval.
  • Never modify benchmark/tasks.json or benchmark/scorer.md during a run.
  • Never exceed 3 iterations per run in v0.1.
  • Log every action to runner/experiment_log.md.
安全使用建议
This skill conceptually fits its purpose but has a few red flags you should consider before running it against real skills: - The README/SCHEMA claims "no external network calls" but the tool explicitly describes live validation (wttr.in, gh) and running real tool calls; assume the loop may trigger network and CLI activity. If you need offline-only behaviour, don't run it until that is clarified. - The agent will read (and with your approval, write) other skills' SKILL.md files. Inspect target SKILL.md files first for any sensitive content and avoid running autooptimise on skills that access secrets, credentials, or perform destructive actions. - Require explicit human approval for every proposed change (the skill states this, but enforce it operationally). Prefer to run initial experiments in a sandbox or test environment, not against production skills or accounts. - If you plan to use the heartbeat/scheduling suggestions, be explicit about limiting scope (which skills may be optimised) and frequency to avoid unexpected automated runs. If you want to proceed, ask for clarifications from the author about the network claim vs live validation and confirm the exact filesystem paths the skill will access. Running one dry/manual iteration on a harmless skill first (e.g., a simple local test skill) is recommended to verify behaviour.
功能分析
Type: OpenClaw Skill Name: autooptimise Version: 0.1.0 The 'autooptimise' skill is a meta-tool designed to autonomously modify other OpenClaw skills by running benchmarks, scoring results with an LLM judge, and proposing edits to SKILL.md files. While the instructions in 'runner/run_experiment.md' and 'SKILL.md' include safety gates such as mandatory human approval for all changes and a 3-iteration limit, the skill's core functionality involves high-risk behaviors: reading and writing to other skill files and executing arbitrary prompts. There is no evidence of malicious intent or data exfiltration, but the capability to modify agent instructions programmatically creates a significant surface for prompt injection or unintended system changes.
能力评估
Purpose & Capability
Name and description (optimise other skills) match the instructions (read a target SKILL.md, run benchmark tasks, propose diffs). However README/SKILL.md assert "No external dependencies" / "no network calls beyond your existing model provider" while multiple places describe live validation and real tool/API calls (wttr.in, gh CLI). That contradiction between claimed constraints and actual behaviour is unexpected.
Instruction Scope
Runtime instructions explicitly tell the agent to read target skill files, send prompts that 'activate the target skill', run live tool calls where possible, and apply diffs to the skill file (only after approval). Those actions are necessary for an optimiser, but they grant the agent broad capability to exercise the target skill (which itself may read env vars, call network endpoints, or run tools). The docs also reference filesystem paths (e.g. ~/.openclaw/skills/...) despite the skill declaring no required config paths—this implicit file I/O should be made explicit.
Install Mechanism
Instruction-only (no install, no binaries, no extracted archives). This minimizes supply-chain risk since nothing is written by an installer. The only code is runtime instructions and bundled benchmark files.
Credentials
The skill declares no environment variables or credentials (good), but it implicitly relies on access to your OpenClaw installation, installed tools (gh, wttr.in access), and whatever model provider you already have configured. It does not declare required config paths even though it expects to read and (with approval) write other skills' SKILL.md files—this implicit need for filesystem access should be disclosed and considered.
Persistence & Privilege
always is false and autonomous invocation is permitted (the platform default). The skill does not demand permanent inclusion or hidden privileges, and it documents a human approval gate before applying changes. Scheduling/heartbeat suggestions could enable periodic runs if the user configures them, so users should opt into that intentionally.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install autooptimise
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /autooptimise 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v0.1.0
- Initial release of autooptimise: Autonomous benchmark-driven skill improvement for OpenClaw. Measure quality objectively, propose targeted changes, validate with live testing. - Implements a modify → test → score → keep/discard experiment loop inspired by autoresearch. - Scores skills 0–10 across four dimensions: Accuracy, Conciseness, Tool usage, and Formatting. - Identifies weakest performance areas, proposes targeted SKILL.md changes, and re-tests up to 3 iterations per run. - Always presents proposed changes for human approval; never auto-applies modifications. - Includes clear safety rules and logging; does not alter benchmarks or scoring rubrics during runs.
元数据
Slug autooptimise
版本 0.1.0
许可证 MIT-0
累计安装 0
当前安装数 0
历史版本数 1
常见问题

Autooptimise 是什么?

Autonomously optimise any OpenClaw skill using a benchmark-driven experiment loop. Scores skill outputs 0-10 across 4 dimensions, identifies the lowest-scori... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 106 次。

如何安装 Autooptimise?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install autooptimise」即可一键安装,无需额外配置。

Autooptimise 是免费的吗?

是的,Autooptimise 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Autooptimise 支持哪些平台?

Autooptimise 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Autooptimise?

由 WealthVisionAI-Source(@wealthvisionai-source)开发并维护,当前版本 v0.1.0。

💬 留言讨论