← 返回 Skills 市场
fwwdn

Skill Test

作者 Weiwei Fan · GitHub ↗ · v0.1.1 · MIT-0
cross-platform ✓ 安全检测通过
149
总下载
0
收藏
1
当前安装
2
版本数
在 OpenClaw 中安装
/install skills-test
功能描述
Evaluate and QA a skill before release on ClawHub, skills.sh, and similar directories. Includes the bundled static evaluator `scripts/eval_skill.py` plus gui...
安全使用建议
This package appears coherent and non-malicious, but treat it like any tool that reads and parses local files: 1) Inspect scripts/eval_skill.py before running (there is a visible typo/truncation that may crash the script). 2) Run it in an isolated disposable workspace or container so the evaluator can only read the target skill directory. 3) If you plan to automate evals in CI, run the script on a small sample first and fix the code bug (the 'metada' typo / truncated function) to avoid false failures. 4) If you need runtime rubric grading that contacts external model providers, confirm and provide only the intended API keys to those providers — this package itself does not require credentials.
功能分析
Type: OpenClaw Skill Name: skills-test Version: 0.1.1 The skill bundle is a legitimate utility designed for the static analysis and quality assurance of other OpenClaw skills. The core logic in `scripts/eval_skill.py` is a safe Python script that performs regex-based checks on Markdown files and verifies local file existence without any network activity, shell execution, or obfuscation. Furthermore, the documentation in `SKILL.md` and `references/sandbox-testing.md` explicitly promotes security best practices, instructing the AI agent to treat third-party code as untrusted and to use isolated environments for testing.
能力评估
Purpose & Capability
Name, description, SKILL.md instructions, and the bundled scripts/evidence files align: the skill is a skill-evaluator and bundles scripts/eval_skill.py to perform static checks. Required binary is only python3 and there are no unrelated environment variables or config paths.
Instruction Scope
SKILL.md explicitly instructs the assistant to read the target SKILL.md and run python3 scripts/eval_skill.py against a target skill and to prefer isolation. Those instructions stay within the stated purpose. The bundled evaluator performs filesystem reads and regex parsing of frontmatter and docs (expected). Note: the included scripts/eval_skill.py listing contains a clear coding issue (a typo 'metada' / truncated code near platform-readiness checks) which may cause runtime exceptions — this is a bug, not malicious behavior, but you should inspect/fix before running.
Install Mechanism
No install spec and no external downloads; this is instruction-only with a bundled Python script. Risk is low from install mechanics. The only code executed if you follow instructions is the local Python script; there are no network calls or archive extracts shown in the provided files.
Credentials
No environment variables, primary credentials, or config paths are requested. That is proportionate for a static skill-evaluator.
Persistence & Privilege
The skill does not request always:true, does not declare persistence, and provides no indication it will modify other skills or system-wide agent settings. It performs read/analysis of files in the target skill directory only (as expected).
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install skills-test
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /skills-test 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v0.1.1
- Added bundled static evaluator script (`scripts/eval_skill.py`) for out-of-the-box quality and readiness checks. - Added support for user-defined deterministic and LLM rubric grading via optional workflows, clarified that only the static evaluator is bundled. - Introduced an example `eval.yaml` to guide skill regression test suite creation. - Updated documentation for clearer separation of bundled vs. optional grading tools, including revised prerequisites and notes about LLM/API key handling. - Clarified typical use cases, evaluation modes, and workflow steps for skill QA prior to publication.
v0.1.0
Initial release of skill-test — a toolkit for evaluating and QA'ing agent skills before release on ClawHub, skills.sh, etc. - Provides static and rubric-based evaluation of agent skills via `python3 scripts/eval_skill.py <skill-path>` - Supports trigger tests, deterministic graders, LLM scoring, and regression tracking - Offers two core modes: Trial Mode (first-pass trust) and Publish Evaluation Mode (release readiness) - Details clear workflows, assistant responsibilities, and next test recommendations - Explicitly documents when to use (and not use) this skill for skill QA - Lists prerequisites and offers sample prompts and output/report structure
元数据
Slug skills-test
版本 0.1.1
许可证 MIT-0
累计安装 1
当前安装数 1
历史版本数 2
常见问题

Skill Test 是什么?

Evaluate and QA a skill before release on ClawHub, skills.sh, and similar directories. Includes the bundled static evaluator `scripts/eval_skill.py` plus gui... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 149 次。

如何安装 Skill Test?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install skills-test」即可一键安装,无需额外配置。

Skill Test 是免费的吗?

是的,Skill Test 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Skill Test 支持哪些平台?

Skill Test 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Skill Test?

由 Weiwei Fan(@fwwdn)开发并维护,当前版本 v0.1.1。

💬 留言讨论