← Back to Skills Marketplace
wealthvisionai-source

Autooptimise

by WealthVisionAI-Source · GitHub ↗ · v0.1.0 · MIT-0
cross-platform ⚠ suspicious
106
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install autooptimise
Description
Autonomously optimise any OpenClaw skill using a benchmark-driven experiment loop. Scores skill outputs 0-10 across 4 dimensions, identifies the lowest-scori...
README (SKILL.md)

autooptimise

Autonomous benchmark-driven skill optimisation for OpenClaw. Inspired by Andrej Karpathy's autoresearch — the same modify → test → score → keep/discard loop, applied to agent skill quality instead of GPU training.

Trigger Phrases

  • "optimise my weather skill"
  • "run autooptimise on [skill-name]"
  • "benchmark my [skill-name] skill"
  • "improve my skill overnight"

Key Files

File Purpose
benchmark/tasks.json Test task suite (prompts + expected qualities)
benchmark/scorer.md LLM judge scoring rubric
runner/run_experiment.md Autonomous loop instructions (load this next)
runner/experiment_log.md Auto-created run log (gitignored)

How to Run

  1. Read runner/run_experiment.md — it contains the full loop instructions
  2. Confirm the target skill with the user if not specified
  3. Execute the loop (max 3 iterations)
  4. Present proposed changes for human approval — never auto-apply

Scoring

Use the best available LLM judge model (prefer a strong reasoning model). Score each task 0–10 on:

  • Accuracy — correct answer / correct tool called
  • Conciseness — no padding, no unnecessary text
  • Tool usage — right tool, right parameters
  • Formatting — output matches expected format

Full rubric: benchmark/scorer.md

Safety Rules

  • Never auto-apply changes. Always present a diff and wait for explicit human approval.
  • Never modify benchmark/tasks.json or benchmark/scorer.md during a run.
  • Never exceed 3 iterations per run in v0.1.
  • Log every action to runner/experiment_log.md.
Usage Guidance
This skill conceptually fits its purpose but has a few red flags you should consider before running it against real skills: - The README/SCHEMA claims "no external network calls" but the tool explicitly describes live validation (wttr.in, gh) and running real tool calls; assume the loop may trigger network and CLI activity. If you need offline-only behaviour, don't run it until that is clarified. - The agent will read (and with your approval, write) other skills' SKILL.md files. Inspect target SKILL.md files first for any sensitive content and avoid running autooptimise on skills that access secrets, credentials, or perform destructive actions. - Require explicit human approval for every proposed change (the skill states this, but enforce it operationally). Prefer to run initial experiments in a sandbox or test environment, not against production skills or accounts. - If you plan to use the heartbeat/scheduling suggestions, be explicit about limiting scope (which skills may be optimised) and frequency to avoid unexpected automated runs. If you want to proceed, ask for clarifications from the author about the network claim vs live validation and confirm the exact filesystem paths the skill will access. Running one dry/manual iteration on a harmless skill first (e.g., a simple local test skill) is recommended to verify behaviour.
Capability Analysis
Type: OpenClaw Skill Name: autooptimise Version: 0.1.0 The 'autooptimise' skill is a meta-tool designed to autonomously modify other OpenClaw skills by running benchmarks, scoring results with an LLM judge, and proposing edits to SKILL.md files. While the instructions in 'runner/run_experiment.md' and 'SKILL.md' include safety gates such as mandatory human approval for all changes and a 3-iteration limit, the skill's core functionality involves high-risk behaviors: reading and writing to other skill files and executing arbitrary prompts. There is no evidence of malicious intent or data exfiltration, but the capability to modify agent instructions programmatically creates a significant surface for prompt injection or unintended system changes.
Capability Assessment
Purpose & Capability
Name and description (optimise other skills) match the instructions (read a target SKILL.md, run benchmark tasks, propose diffs). However README/SKILL.md assert "No external dependencies" / "no network calls beyond your existing model provider" while multiple places describe live validation and real tool/API calls (wttr.in, gh CLI). That contradiction between claimed constraints and actual behaviour is unexpected.
Instruction Scope
Runtime instructions explicitly tell the agent to read target skill files, send prompts that 'activate the target skill', run live tool calls where possible, and apply diffs to the skill file (only after approval). Those actions are necessary for an optimiser, but they grant the agent broad capability to exercise the target skill (which itself may read env vars, call network endpoints, or run tools). The docs also reference filesystem paths (e.g. ~/.openclaw/skills/...) despite the skill declaring no required config paths—this implicit file I/O should be made explicit.
Install Mechanism
Instruction-only (no install, no binaries, no extracted archives). This minimizes supply-chain risk since nothing is written by an installer. The only code is runtime instructions and bundled benchmark files.
Credentials
The skill declares no environment variables or credentials (good), but it implicitly relies on access to your OpenClaw installation, installed tools (gh, wttr.in access), and whatever model provider you already have configured. It does not declare required config paths even though it expects to read and (with approval) write other skills' SKILL.md files—this implicit need for filesystem access should be disclosed and considered.
Persistence & Privilege
always is false and autonomous invocation is permitted (the platform default). The skill does not demand permanent inclusion or hidden privileges, and it documents a human approval gate before applying changes. Scheduling/heartbeat suggestions could enable periodic runs if the user configures them, so users should opt into that intentionally.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install autooptimise
  3. After installation, invoke the skill by name or use /autooptimise
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v0.1.0
- Initial release of autooptimise: Autonomous benchmark-driven skill improvement for OpenClaw. Measure quality objectively, propose targeted changes, validate with live testing. - Implements a modify → test → score → keep/discard experiment loop inspired by autoresearch. - Scores skills 0–10 across four dimensions: Accuracy, Conciseness, Tool usage, and Formatting. - Identifies weakest performance areas, proposes targeted SKILL.md changes, and re-tests up to 3 iterations per run. - Always presents proposed changes for human approval; never auto-applies modifications. - Includes clear safety rules and logging; does not alter benchmarks or scoring rubrics during runs.
Metadata
Slug autooptimise
Version 0.1.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is Autooptimise?

Autonomously optimise any OpenClaw skill using a benchmark-driven experiment loop. Scores skill outputs 0-10 across 4 dimensions, identifies the lowest-scori... It is an AI Agent Skill for Claude Code / OpenClaw, with 106 downloads so far.

How do I install Autooptimise?

Run "/install autooptimise" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Autooptimise free?

Yes, Autooptimise is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Autooptimise support?

Autooptimise is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Autooptimise?

It is built and maintained by WealthVisionAI-Source (@wealthvisionai-source); the current version is v0.1.0.

💬 Comments