← 返回 Skills 市场
sheepxux

Skill Forge

作者 SheepXu · GitHub ↗ · v0.4.3 · MIT-0
cross-platform ⚠ suspicious
35
总下载
0
收藏
0
当前安装
2
版本数
在 OpenClaw 中安装
/install skills-forge
功能描述
Detect repeated capability gaps, convert recurring user needs into candidate skills, scaffold new OpenClaw-compatible skills, and validate them before instal...
使用说明 (SKILL.md)

skill-forge

Use this skill to turn repeated demand into a reviewed skill candidate.

Current version: v0.4.3 "Safety Tightening".

Core jobs

  1. Detect repeated capability gaps from logs, .learnings, and feature requests.
  2. Decide whether the pattern is stable enough to deserve its own skill.
  3. Classify the opportunity as academic, product, integration, script, or workflow.
  4. Generate a candidate skill folder with a usable SKILL.md, agents/openai.yaml, and profile-specific resources.
  5. Validate and score the candidate before proposing installation.
  6. Run hidden smoke evaluation and Agent profile authorization before confirmed installation.
  7. Record feedback from later usage and propose reviewed updates instead of mutating installed skills directly.
  8. Provide the Somnia runtime used by scheduled nightly review to find skills with bugs, weak scores, or update-worthy feedback.
  9. Use redacted replay cases to check whether candidates actually cover real feedback-derived tasks.

Trigger Cues

Use this skill when the user or agent mentions:

  • repeated failure
  • missing capability
  • recurring workflow
  • feature request
  • make a new skill
  • scaffold a skill
  • validate a generated skill

Default Workflow

  1. Detect repeated capability gaps from learning files or session-derived notes.
  2. Classify the strongest opportunity into a skill profile.
  3. Scaffold a candidate skill with profile-specific resources.
  4. Validate the candidate and inspect score, warnings, and references.
  5. Run hidden smoke evaluation without exposing simulated cases to users.
  6. Propose installation, then require Telegram approval before applying it.
  7. Record future usage feedback and run an evolution pipeline when enough feedback accumulates.
  8. During scheduled reviews, write summary reports and optionally propose updates without exposing hidden evaluation details.
  9. Run replay evaluation before approving evolved candidates when replay cases exist.

Commands

Detect

Run:

python3 {baseDir}/scripts/detect_skill_opportunities.py --json

Add --source paths when a specific workspace or learnings file should be analyzed.

Choose

Only create a skill when the pattern is:

  • recurring
  • broad enough to reuse
  • structured enough to document
  • more stable than a one-off prompt

If the need is too narrow, keep it as a note or workflow rule instead.

Scaffold

Run:

python3 {baseDir}/scripts/generate_skill_scaffold.py \
  --skill-name my-skill \
  --output ./generated \
  --goal "What the skill should achieve." \
  --triggers "keyword1, keyword2" \
  --template auto

Validate

Run:

python3 {baseDir}/scripts/validate_skill_candidate.py ./generated/my-skill

Propose Installation

Run:

python3 {baseDir}/scripts/propose_skill_install.py ./generated/my-skill

Use --apply only after the candidate has been reviewed.

Record Feedback

Run:

python3 {baseDir}/scripts/record_skill_feedback.py \
  --skill my-skill \
  --agent-name StudyAgent \
  --rating negative \
  --feedback "触发词: literature review. The skill should handle literature review planning better." \
  --json

Propose Evolution

Run:

python3 {baseDir}/scripts/evolve_skill_pipeline.py \
  --skill my-skill \
  --output ./generated-updates \
  --install plan \
  --replay hidden \
  --agent-name StudyAgent \
  --json

Replay Evaluation

Run:

python3 {baseDir}/scripts/replay/collect_replay_cases.py \
  --feedback-file ~/.openclaw/workspace/.learnings/skill-feedback.jsonl \
  --skill my-skill \
  --json

python3 {baseDir}/scripts/replay/run_replay_eval.py \
  --skill-dir ./generated-updates/my-skill \
  --skill my-skill \
  --json

Nightly Review

Somnia is now packaged as its own skill for scheduled maintenance. These compatibility commands remain available from Skill Forge:

python3 {baseDir}/scripts/nightly_skill_review.py \
  --scope managed \
  --propose-updates \
  --replay hidden \
  --update-install plan \
  --json

Install a macOS daily sleep-hour schedule, defaulting to 03:00 local time:

python3 {baseDir}/scripts/schedule_nightly_review.py \
  --hour 3 \
  --minute 0 \
  --scope managed \
  --propose-updates \
  --update-install plan \
  --apply \
  --json

Use --telegram-report with --env-file ~/.openclaw/skill-forge.env when scheduled runs should send a summary report to Telegram. The env file should define TELEGRAM_BOT_TOKEN and TELEGRAM_CHAT_ID.

Uninstall or rollback a generated skill:

python3 {baseDir}/scripts/propose_skill_install.py my-skill --uninstall --apply
python3 {baseDir}/scripts/propose_skill_install.py my-skill --uninstall --restore-backup --apply

Full Pipeline

For one-shot operation:

python3 {baseDir}/scripts/forge_pipeline.py \
  --source ~/.openclaw/workspace/.learnings/FEATURE_REQUESTS.md \
  --source ~/.openclaw/workspace/.learnings/ERRORS.md \
  --output ./generated \
  --eval hidden \
  --json

Ask through Telegram before installing:

export TELEGRAM_BOT_TOKEN="..."
export TELEGRAM_CHAT_ID="..."

python3 {baseDir}/scripts/forge_pipeline.py \
  --source ~/.openclaw/workspace/.learnings/FEATURE_REQUESTS.md \
  --source ~/.openclaw/workspace/.learnings/ERRORS.md \
  --output ./generated \
  --install telegram \
  --agent-name StudyAgent \
  --json

Decision rules

  • Prefer creating a skill over a new agent when the new capability is narrow.
  • Prefer a new agent over a skill when the work needs a distinct role, tools, and long-term memory boundary.
  • Prefer references/ when the skill mainly teaches structure and judgment.
  • Prefer scripts/ when the same code would otherwise be rewritten repeatedly.
  • Default to --install plan; use --install telegram for any mutation.
  • Treat --install ask and --install auto as blocked compatibility aliases.
  • Do not call propose_skill_install.py --apply directly; apply is guarded for Telegram-approved pipelines.
  • Keep --eval hidden for user-facing flows so simulated checks and prompts are not exposed.
  • Use --agent-name before installation when a specific agent will receive the skill.
  • Never hard-code Telegram tokens; discover them from OpenClaw/env files or environment variables.
  • Redact feedback text before storing it.
  • Treat feedback-driven changes as update candidates, not direct edits to installed skills.
  • Install evolved skills only after validation, hidden evaluation, authorization, and approval.
  • Use replay as a regression gate when feedback-derived cases exist.
  • Scheduled nightly review may propose updates, but install changes still require Telegram approval.
  • Nightly reports should show only health summaries, not hidden evaluation prompts or simulated checks.
  • Default nightly review scope is managed; use --scope all only for explicit full inventory audits.

Output Contract

The skill-forge output should include:

  • detected opportunity name
  • recommended template profile
  • generated candidate path
  • validation score and grade
  • install status
  • approval status when Telegram confirmation is used
  • nightly review report path when running scheduled review
  • install plan
  • review warnings, if any

Quality Gates

Before proposing installation, confirm:

  • the skill name is concrete and reusable
  • the description has clear trigger conditions
  • the generated structure matches the intended job
  • the candidate improves a real recurring workflow
  • validation score is at least 70
  • grade=milestone is preferred before sharing externally
  • installation threshold is at least 85 unless the user explicitly chooses otherwise
  • hidden smoke evaluation passes
  • target Agent profile policy allows the generated skill profile
  • Telegram approval is available before applying install or uninstall changes
  • feedback-derived updates are reviewed as candidates before replacing an installed skill
  • replay evaluation passes when replay cases exist
  • nightly review reports are written before any update proposal is installed
  • scheduled automation uses explicit launchd configuration and can be uninstalled

Read references/skill-quality-rubric.md when evaluating a draft.

Resources

References:

  • references/heuristics.md
  • references/skill-quality-rubric.md
  • references/milestone-architecture.md

Scripts:

  • scripts/detect_skill_opportunities.py
  • scripts/evaluate_skill_candidate.py
  • scripts/generate_skill_scaffold.py
  • scripts/validate_skill_candidate.py
  • scripts/forge_pipeline.py
  • scripts/install/propose_skill_install.py
  • scripts/install/telegram_approval.py
  • scripts/evolve/evolve_skill_pipeline.py
  • scripts/evolve/propose_skill_update.py
  • scripts/evolve/record_skill_feedback.py
  • scripts/replay/collect_replay_cases.py
  • scripts/replay/compare_replay_outputs.py
  • scripts/replay/redact_replay_case.py
  • scripts/replay/replay_report.py
  • scripts/replay/run_replay_eval.py
  • scripts/somnia/nightly_skill_review.py
  • scripts/somnia/schedule_nightly_review.py

Compatibility wrappers:

  • scripts/propose_skill_install.py
  • scripts/telegram_approval.py
  • scripts/evolve_skill_pipeline.py
  • scripts/propose_skill_update.py
  • scripts/record_skill_feedback.py
  • scripts/nightly_skill_review.py
  • scripts/schedule_nightly_review.py
安全使用建议
Install this only if you want an agent to help create and manage other skills. Before applying installs or updates, inspect the generated SKILL.md and references, especially evolution-feedback.md; keep Telegram credentials isolated; and enable the nightly schedule only if you want persistent maintenance.
功能分析
Type: OpenClaw Skill Name: skills-forge Version: 0.4.3 The 'skill-forge' bundle is a meta-skill designed to allow an agent to autonomously detect capability gaps, generate new code (scaffolding), and install new skills into its own runtime. It possesses high-risk capabilities including system persistence via macOS LaunchAgents (scripts/somnia/schedule_nightly_review.py), external network communication with the Telegram API for approvals (scripts/install/telegram_approval.py), and the ability to modify the agent's execution environment by symlinking or copying generated code (scripts/install/propose_skill_install.py). While the bundle includes extensive safety scaffolding—such as mandatory out-of-band Telegram approval gates, automated data redaction for logs (scripts/replay/replay_common.py), and profile-based authorization policies (scripts/lib/policy.py)—the inherent power of a self-modifying code framework is significant and warrants a suspicious classification despite the lack of clear evidence of intentional malice.
能力标签
requires-oauth-tokenrequires-sensitive-credentials
能力评估
Purpose & Capability
The stated purpose matches the artifacts: it detects recurring needs, scaffolds candidate skills, validates them, and proposes installation. That purpose inherently affects agent behavior, so users should treat it as a high-impact automation skill.
Instruction Scope
Feedback entries are written into candidate skill reference material and linked from SKILL.md, which can let untrusted or mistaken feedback persist as future agent context without a clear instruction/data boundary.
Install Mechanism
There is no package install spec, but SKILL.md directs users to run bundled Python scripts. Mutating install actions are presented as explicit commands and generally gated by review or Telegram approval.
Credentials
Default file access is scoped to OpenClaw learning/feedback paths, and Telegram credentials are optional for approval/reporting. These are purpose-aligned but sensitive.
Persistence & Privilege
The skill can install or uninstall generated skills and can create a scheduled nightly review job. This is disclosed and user-directed, but it changes persistent agent behavior.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install skills-forge
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /skills-forge 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v0.4.3
Tighten evolution update path safety, add traversal regression checks, and reduce false-positive secret wording.
v0.4.2
Initial public preview with Telegram-gated installs, hidden evaluation, replay regression checks, release checks, and optional Somnia support.
元数据
Slug skills-forge
版本 0.4.3
许可证 MIT-0
累计安装 0
当前安装数 0
历史版本数 2
常见问题

Skill Forge 是什么?

Detect repeated capability gaps, convert recurring user needs into candidate skills, scaffold new OpenClaw-compatible skills, and validate them before instal... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 35 次。

如何安装 Skill Forge?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install skills-forge」即可一键安装,无需额外配置。

Skill Forge 是免费的吗?

是的,Skill Forge 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Skill Forge 支持哪些平台?

Skill Forge 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Skill Forge?

由 SheepXu(@sheepxux)开发并维护,当前版本 v0.4.3。

💬 留言讨论