← Back to Skills Marketplace

Autooptimise

Name: Autooptimise
Author: wealthvisionai-source

by WealthVisionAI-Source · GitHub ↗ · v0.1.0 · MIT-0

cross-platform ⚠ suspicious

106

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install autooptimise

Description

Autonomously optimise any OpenClaw skill using a benchmark-driven experiment loop. Scores skill outputs 0-10 across 4 dimensions, identifies the lowest-scori...

README (SKILL.md)

autooptimise

Autonomous benchmark-driven skill optimisation for OpenClaw. Inspired by Andrej Karpathy's autoresearch — the same modify → test → score → keep/discard loop, applied to agent skill quality instead of GPU training.

Trigger Phrases

"optimise my weather skill"
"run autooptimise on [skill-name]"
"benchmark my [skill-name] skill"
"improve my skill overnight"

Key Files

File	Purpose
`benchmark/tasks.json`	Test task suite (prompts + expected qualities)
`benchmark/scorer.md`	LLM judge scoring rubric
`runner/run_experiment.md`	Autonomous loop instructions (load this next)
`runner/experiment_log.md`	Auto-created run log (gitignored)

How to Run

Read runner/run_experiment.md — it contains the full loop instructions
Confirm the target skill with the user if not specified
Execute the loop (max 3 iterations)
Present proposed changes for human approval — never auto-apply

Scoring

Use the best available LLM judge model (prefer a strong reasoning model). Score each task 0–10 on:

Accuracy — correct answer / correct tool called
Conciseness — no padding, no unnecessary text
Tool usage — right tool, right parameters
Formatting — output matches expected format

Full rubric: benchmark/scorer.md

Safety Rules

Never auto-apply changes. Always present a diff and wait for explicit human approval.
Never modify benchmark/tasks.json or benchmark/scorer.md during a run.
Never exceed 3 iterations per run in v0.1.
Log every action to runner/experiment_log.md.

Usage Guidance

This skill conceptually fits its purpose but has a few red flags you should consider before running it against real skills: - The README/SCHEMA claims "no external network calls" but the tool explicitly describes live validation (wttr.in, gh) and running real tool calls; assume the loop may trigger network and CLI activity. If you need offline-only behaviour, don't run it until that is clarified. - The agent will read (and with your approval, write) other skills' SKILL.md files. Inspect target SKILL.md files first for any sensitive content and avoid running autooptimise on skills that access secrets, credentials, or perform destructive actions. - Require explicit human approval for every proposed change (the skill states this, but enforce it operationally). Prefer to run initial experiments in a sandbox or test environment, not against production skills or accounts. - If you plan to use the heartbeat/scheduling suggestions, be explicit about limiting scope (which skills may be optimised) and frequency to avoid unexpected automated runs. If you want to proceed, ask for clarifications from the author about the network claim vs live validation and confirm the exact filesystem paths the skill will access. Running one dry/manual iteration on a harmless skill first (e.g., a simple local test skill) is recommended to verify behaviour.

Capability Analysis

Type: OpenClaw Skill Name: autooptimise Version: 0.1.0 The 'autooptimise' skill is a meta-tool designed to autonomously modify other OpenClaw skills by running benchmarks, scoring results with an LLM judge, and proposing edits to SKILL.md files. While the instructions in 'runner/run_experiment.md' and 'SKILL.md' include safety gates such as mandatory human approval for all changes and a 3-iteration limit, the skill's core functionality involves high-risk behaviors: reading and writing to other skill files and executing arbitrary prompts. There is no evidence of malicious intent or data exfiltration, but the capability to modify agent instructions programmatically creates a significant surface for prompt injection or unintended system changes.

Capability Assessment

ℹ Purpose & Capability

Name and description (optimise other skills) match the instructions (read a target SKILL.md, run benchmark tasks, propose diffs). However README/SKILL.md assert "No external dependencies" / "no network calls beyond your existing model provider" while multiple places describe live validation and real tool/API calls (wttr.in, gh CLI). That contradiction between claimed constraints and actual behaviour is unexpected.

⚠ Instruction Scope

Runtime instructions explicitly tell the agent to read target skill files, send prompts that 'activate the target skill', run live tool calls where possible, and apply diffs to the skill file (only after approval). Those actions are necessary for an optimiser, but they grant the agent broad capability to exercise the target skill (which itself may read env vars, call network endpoints, or run tools). The docs also reference filesystem paths (e.g. ~/.openclaw/skills/...) despite the skill declaring no required config paths—this implicit file I/O should be made explicit.

✓ Install Mechanism

Instruction-only (no install, no binaries, no extracted archives). This minimizes supply-chain risk since nothing is written by an installer. The only code is runtime instructions and bundled benchmark files.

ℹ Credentials

The skill declares no environment variables or credentials (good), but it implicitly relies on access to your OpenClaw installation, installed tools (gh, wttr.in access), and whatever model provider you already have configured. It does not declare required config paths even though it expects to read and (with approval) write other skills' SKILL.md files—this implicit need for filesystem access should be disclosed and considered.

✓ Persistence & Privilege

always is false and autonomous invocation is permitted (the platform default). The skill does not demand permanent inclusion or hidden privileges, and it documents a human approval gate before applying changes. Scheduling/heartbeat suggestions could enable periodic runs if the user configures them, so users should opt into that intentionally.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install autooptimise
After installation, invoke the skill by name or use /autooptimise
Provide required inputs per the skill's parameter spec and get structured output

Version History

v0.1.0

- Initial release of autooptimise: Autonomous benchmark-driven skill improvement for OpenClaw. Measure quality objectively, propose targeted changes, validate with live testing. - Implements a modify → test → score → keep/discard experiment loop inspired by autoresearch. - Scores skills 0–10 across four dimensions: Accuracy, Conciseness, Tool usage, and Formatting. - Identifies weakest performance areas, proposes targeted SKILL.md changes, and re-tests up to 3 iterations per run. - Always presents proposed changes for human approval; never auto-applies modifications. - Includes clear safety rules and logging; does not alter benchmarks or scoring rubrics during runs.

Metadata

Slug autooptimise

Version 0.1.0

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 1

Frequently Asked Questions

What is Autooptimise?

Autonomously optimise any OpenClaw skill using a benchmark-driven experiment loop. Scores skill outputs 0-10 across 4 dimensions, identifies the lowest-scori... It is an AI Agent Skill for Claude Code / OpenClaw, with 106 downloads so far.

How do I install Autooptimise?

Run "/install autooptimise" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Autooptimise free?

Yes, Autooptimise is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Autooptimise support?

Autooptimise is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Autooptimise?

It is built and maintained by WealthVisionAI-Source (@wealthvisionai-source); the current version is v0.1.0.

More Skills