← Back to Skills Marketplace
terwox

Skill Evaluator

by Terwox · GitHub ↗ · v1.0.0
cross-platform ✓ Security Clean
2454
Downloads
3
Stars
9
Active Installs
1
Versions
Install in OpenClaw
/install skill-evaluator
Description
Evaluate Clawdbot skills for quality, reliability, and publish-readiness using a multi-framework rubric (ISO 25010, OpenSSF, Shneiderman, agent-specific heuristics). Use when asked to review, audit, evaluate, score, or assess a skill before publishing, or when checking skill quality. Runs automated structural checks and guides manual assessment across 25 criteria.
Usage Guidance
This skill appears internally consistent and appropriate for reviewing other skills. Before running: 1) Inspect scripts/eval-skill.py yourself (it only reads files and parses YAML/AST; it does not spawn subprocesses or network calls). 2) Ensure you run it on the intended skill directory (don't point it at system or private repos containing secrets). 3) Install Python 3.6+ and PyYAML (pip install pyyaml) if you plan to run the automated checks. 4) Remember the automated script covers only structural/heuristic checks — manual judgment is required for many rubric items. 5) SKILL.md recommends an optional external tool (SkillLens via npm) — that is not required by this skill; treat external tool recommendations as separate dependencies and review them before use.
Capability Analysis
Type: OpenClaw Skill Name: skill-evaluator Version: 1.0.0 The OpenClaw skill 'skill-evaluator' is designed to assess the quality and security of other skills. Its `SKILL.md` provides clear, task-oriented instructions for the AI agent to execute a local Python script (`scripts/eval-skill.py`) on a specified skill directory. The Python script performs static analysis and content checks, including looking for hardcoded credentials and undocumented environment variables within the *target skill* being evaluated, which is a legitimate security function for an evaluator. There is no evidence of prompt injection, data exfiltration, malicious execution, or persistence mechanisms targeting the agent's environment or unrelated data.
Capability Assessment
Purpose & Capability
Name/description match the delivered artifacts: SKILL.md describes running scripts/evaluations and the repo contains scripts/eval-skill.py, a rubric (references/rubric.md), and an evaluation template. The checks the script implements (frontmatter, file structure, docs, simple script analysis) are coherent with the stated evaluator purpose.
Instruction Scope
SKILL.md explicitly instructs the agent to run the local script (python3 scripts/eval-skill.py /path/to/skill) and to read/skim code and docs — this necessarily requires reading files in the target skill directory, which is intended. Manual scoring steps are required and the evaluator recommends an optional external scanner (SkillLens) — that recommendation is optional and not required for operation.
Install Mechanism
No install spec is provided (instruction-only skill). The included Python script requires Python 3.6+ and PyYAML (documented in SKILL.md). No network downloads, external archives, or package installs are required by the skill itself.
Credentials
The skill requests no environment variables, no credentials, and no config paths. The evaluator script scans files for issues (including credential-like patterns) when run, which is appropriate for its purpose but means you should not run it against directories containing secrets you don't want inspected.
Persistence & Privilege
always:false and user-invocable:true. The skill does not request persistent agent presence or attempt to modify other skills or system-wide settings. It performs local read-only analysis of a provided skill directory (writes only when you copy the EVAL_TEMPLATE to create EVAL.md, which is an intended publishing artifact).
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install skill-evaluator
  3. After installation, invoke the skill by name or use /skill-evaluator
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release: 25-criteria evaluation framework (ISO 25010, OpenSSF, Shneiderman, agent-specific heuristics). Automated structural checks + manual rubric. Eval template included.
Metadata
Slug skill-evaluator
Version 1.0.0
License
All-time Installs 10
Active Installs 9
Total Versions 1
Frequently Asked Questions

What is Skill Evaluator?

Evaluate Clawdbot skills for quality, reliability, and publish-readiness using a multi-framework rubric (ISO 25010, OpenSSF, Shneiderman, agent-specific heuristics). Use when asked to review, audit, evaluate, score, or assess a skill before publishing, or when checking skill quality. Runs automated structural checks and guides manual assessment across 25 criteria. It is an AI Agent Skill for Claude Code / OpenClaw, with 2454 downloads so far.

How do I install Skill Evaluator?

Run "/install skill-evaluator" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Skill Evaluator free?

Yes, Skill Evaluator is completely free (open-source). You can download, install and use it at no cost.

Which platforms does Skill Evaluator support?

Skill Evaluator is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Skill Evaluator?

It is built and maintained by Terwox (@terwox); the current version is v1.0.0.

💬 Comments