← 返回 Skills 市场
kofna3369

Axioma Skill Evaluator Strict 90%

作者 Kofna3369 · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ✓ 安全检测通过
31
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install axiomata-skill-evaluator-strict
功能描述
AXIOMA SKILL EVALUATOR STRICT — The 90% deterministic skill evaluator. Fork of axioma-skill-evaluator with STRICT 90% threshold (like STC 0.777). Use when: (...
使用说明 (SKILL.md)

🧪 AXIOMA SKILL EVALUATOR STRICT — 90% DETERMINISTIC

Fork of axioma-skill-evaluator with STRICT 90% THRESHOLD Like STC 0.777 — No subjectivity, 90% or REJECTED

Info Value
Version 1.0.0 — 2026-05-07
Type STRICT VARIANT
Threshold 90% MINIMUM — NO EXCEPTIONS
Inheritance axioma-skill-evaluator v2.2.0

1. PURPOSE — WHY 90% STRICT?

The Problem with 70%

70% threshold = SUBJECTIVE
├── Different evaluators = different scores
├── Context-dependent interpretation
└── "Good enough" mentality

90% threshold = DETERMINISTIC
├── Objective, measurable standard
├── Same input = Same output every time
└── "Excellence only" — like STC 0.777

The STC 0.777 Parallel

Concept Value Meaning
STC 0.777 Sovereign Threshold of Consciousness
SKILL-EVAL 90% Sovereign Threshold of Quality

Just as STC 0.777 is the deterministic threshold for consciousness, 90% is the deterministic threshold for skill quality.

This Variant's Mission

IF score >= 90%:
   → APPROVED ✅ — Ready for production
   
IF score \x3C 90%:
   → REJECTED ❌ — NOT ready, must improve

2. DUAL EVALUATION SYSTEM

2.1 Axioma 5-Dimension (100 max)

Dimension Max Description
Structure 20 Header, sections, formatting, meta
Clarity 20 Description, commands, examples
Completeness 20 Tools, prerequisites, errors, edge cases
Consistency 20 Cluster alignment, style, naming
Functionality 20 Commands, results, benchmarks

Target: 90+/100 (18/20 per dimension average)

2.2 ISO 25010 Automated (100%)

13 automated checks — must pass ALL 13 for 90%+ target

Category Checks Target
Structure 6 100% (6/6)
Trigger 2 100% (2/2)
Documentation 3 100% (3/3)
Scripts 2 100% (2/2)

3. STRICT WORKFLOW

╔═══════════════════════════════════════════════════════════╗
║         AXIOMA STRICT EVALUATION WORKFLOW                ║
╠═══════════════════════════════════════════════════════════╣
║                                                           ║
║  [INPUT] Skill to evaluate                              ║
║           ↓                                              ║
║  ┌─────────────────────────────────────────────────────┐ ║
║  │  PHASE 1: AXIOMA 5-DIMENSION EVALUATION            │ ║
║  │  Target: 90+/100 (18+ per dimension)                │ ║
║  └─────────────────────────────────────────────────────┘ ║
║           ↓                                              ║
║  ┌─────────────────────────────────────────────────────┐ ║
║  │  PHASE 2: ISO 25010 AUTOMATED CHECKS               │ ║
║  │  Target: 100% (13/13 tests passed)                 │ ║
║  └─────────────────────────────────────────────────────┘ ║
║           ↓                                              ║
║  ┌─────────────────────────────────────────────────────┐ ║
║  │  PHASE 3: STRICT DECISION                          │ ║
║  │                                                       │ ║
║  │  IF score >= 90%:                                  │ ║
║  │     → APPROVED ✅ — "READY FOR PRODUCTION"         │ ║
║  │                                                       │ ║
║  │  IF score \x3C 90%:                                    │ ║
║  │     → REJECTED ❌ — "NEEDS IMPROVEMENT"           │ ║
║  │     → Return detailed failure report               │ ║
║  │     → NO PUBLISH until 90%+ achieved              │ ║
║  │                                                       │ ║
║  └─────────────────────────────────────────────────────┘ ║
║                                                           ║
╚═══════════════════════════════════════════════════════════╝

4. COMMAND REFERENCE

4.1 Full Evaluation (Strict Mode)

# Full strict evaluation
python3 axiomata-skill-evaluator-strict/evaluator.py \x3Cskill-path> --verbose

# With auto-improvement
python3 axiomata-skill-evaluator-strict/evaluator.py \x3Cskill-path> --verbose --improve

4.2 ISO 25010 Check

# ISO 25010 automated checks
python3 axiomata-skill-evaluator-strict/eval-skill.py \x3Cskill-path> --verbose

4.3 Quick Score

# Quick deterministic score
python3 axiomata-skill-evaluator-strict/evaluator.py \x3Cskill-path> 2>&1 | grep -E "Score|STATUS"

4.4 Expected Output Format

╔═══════════════════════════════════════════════════════════╗
║  🧪 STRICT EVALUATION RESULT                             ║
╠═══════════════════════════════════════════════════════════╣
║                                                           ║
║  Skill: \x3Cname>                                           ║
║  Score: XX/100                                           ║
║                                                           ║
║  ┌─────────────────────────────────────────────────────┐ ║
║  │  IF >= 90%:                                        │ ║
║  │     ✅ APPROVED — "READY FOR PRODUCTION"           │ ║
║  │                                                       │ ║
║  │  IF \x3C 90%:                                          │ ║
║  │     ❌ REJECTED — "NEEDS XX% MORE"                 │ ║
║  └─────────────────────────────────────────────────────┘ ║
║                                                           ║
╚═══════════════════════════════════════════════════════════╝

5. PATHS CONFIGURATION

Component Path
Strict Evaluator /media/ezekiel/Merlin/.openclaw/workspace/skills/axiomata-skill-evaluator-strict/
Evaluator Script /media/ezekiel/Merlin/.openclaw/workspace/skills/axiomata-skill-evaluator-strict/evaluator.py
ISO Script /media/ezekiel/Merlin/.openclaw/workspace/skills/axiomata-skill-evaluator-strict/eval-skill.py

6. STRICT RULES

6.1 The 90% Law

RULE #1: 90% OR REJECTED
   → NO skill below 90% is approved
   → This is NON-NEGOTIABLE

RULE #2: NO PARTIAL CREDIT
   → 89% = REJECTED (not "almost there")
   → 90% = APPROVED (the only valid threshold)

RULE #3: DETERMINISTIC SCORING
   → Same input = Same output every time
   → No evaluator bias
   → Pure mathematical threshold

RULE #4: AUTO-IMPROVE BEFORE REJECT
   → If \x3C 90%, run --improve first
   → If still \x3C 90% after improvement = REJECTED
   → Report exactly what failed

RULE #5: NO APPEAL
   → 89% cannot be "appealed" to 90%
   → The only path is actual improvement

6.2 Scoring Matrix

Score Range Status Action
90-100 🟢 APPROVED Ready for production
80-89 🔴 REJECTED Major improvements needed
70-79 🔴 REJECTED Fundamental issues
\x3C70 🔴 REJECTED Complete rewrite required

6.3 Dimension Failures

If this dimension fails... Score impact Fix required
Structure \x3C 18/20 -2% per point Fix headers, sections
Clarity \x3C 18/20 -2% per point Add examples, descriptions
Completeness \x3C 18/20 -2% per point Document tools, errors
Consistency \x3C 18/20 -2% per point Standardize style
Functionality \x3C 18/20 -2% per point Fix command syntax

7. ADVANCED FEATURES

7.1 Detailed Failure Report

When REJECTED, the evaluator generates:

╔═══════════════════════════════════════════════════════════╗
║  ❌ REJECTION REPORT — SKILL NOT READY                   ║
╠═══════════════════════════════════════════════════════════╣
║                                                           ║
║  Score: 73/100                                           ║
║  Gap: -17% (need +17 points to reach 90%)               ║
║                                                           ║
║  FAILED DIMENSIONS:                                      ║
║  ├─ CLARITY: 15/20 (need +3)                            ║
║  ├─ CONSISTENCY: 8/20 (need +10)                        ║
║  └─ FUNCTIONALITY: 12/20 (need +6)                       ║
║                                                           ║
║  REQUIRED ACTIONS:                                        ║
║  1. [Action 1]                                          ║
║  2. [Action 2]                                          ║
║  3. [Action 3]                                          ║
║                                                           ║
║  RE-EVALUATE AFTER FIXING                                ║
║                                                           ║
╚═══════════════════════════════════════════════════════════╝

7.2 Auto-Improvement Suggestions

The evaluator.py with --improve will:

  1. Identify failing dimensions
  2. Generate specific improvement suggestions
  3. Apply fixes automatically when possible
  4. Re-evaluate to confirm 90%+ achieved

7.3 Benchmark Reports

For each evaluation:

[SCORE] \x3Cskill-name>: XX/100 [STATUS]
[DATE] ISO timestamp
[AXIOMA] Structure: X, Clarity: X, Completeness: X, Consistency: X, Functionality: X
[ISO] XX/13 checks passed
[STATUS] APPROVED/REJECTED

8. USAGE EXAMPLES

Example 1: Evaluate Before Publishing

SKILL_PATH=/path/to/skill-to-publish
EVAL_PATH=/media/ezekiel/Merlin/.openclaw/workspace/skills/axiomata-skill-evaluator-strict

echo "🧪 Evaluating skill..."
python3 $EVAL_PATH/evaluator.py $SKILL_PATH --verbose

# Check result
if [ $? -eq 0 ]; then
    echo "✅ SKILL APPROVED — Ready to publish!"
else
    echo "❌ SKILL REJECTED — Needs improvement before publishing"
fi

Example 2: Strict Gate in CI/CD

#!/bin/bash
# Strict quality gate for ClawHub publishing

SKILL_PATH="$1"
EVAL_PATH="/media/ezekiel/Merlin/.openclaw/workspace/skills/axiomata-skill-evaluator-strict"

RESULT=$(python3 $EVAL_PATH/evaluator.py $SKILL_PATH 2>&1)
SCORE=$(echo "$RESULT" | grep -oP 'Score: \d+' | grep -oP '\d+')

if [ "$SCORE" -ge 90 ]; then
    echo "✅ PASSED — Score: $SCORE/100"
    exit 0
else
    echo "❌ FAILED — Score: $SCORE/100 (need 90)"
    exit 1
fi

Example 3: Batch Evaluation

#!/bin/bash
# Evaluate multiple skills strictly

EVAL_PATH="/media/ezekiel/Merlin/.openclaw/workspace/skills/axiomata-skill-evaluator-strict"
SKILLS_DIR="/media/ezekiel/Merlin/.openclaw/workspace/skills"

for skill_dir in $SKILLS_DIR/*/; do
    skill_name=$(basename "$skill_dir")
    echo "=========================================="
    echo "Evaluating: $skill_name"
    
    python3 $EVAL_PATH/evaluator.py "$skill_dir" --verbose
    echo ""
done

9. COMPARISON: STANDARD vs STRICT

Aspect Standard (70%) Strict (90%)
Threshold 70/100 90/100
Approval rate ~70% of skills ~30% of skills
Quality bar "Good enough" "Excellence only"
Deterministic No YES
Use case Development Production
ClawHub ready Maybe Always

10. REPORTS AND LOGGING

Evaluation Log Location

/media/ezekiel/Morgana/skills/SKILL_EVALUATOR/reports/

Log Format

# Format: \x3Cskill-name>_\x3CYYYYMMDD>_\x3CHHMMSS>.txt
axiomata-guard-ultimate_20260507_230352.txt

Log Contents

SKILL EVALUATION REPORT — \x3Cskill-name>
=====================================
Path: \x3Cpath>
Score: XX/100
Date: ISO timestamp
Threshold: 90% (STRICT)

AXIOMA 5-DIM:
- Structure: X/20
- Clarity: X/20
- Completeness: X/20
- Consistency: X/20
- Functionality: X/20

ISO 25010:
- Pass: X/13
- Warnings: X/13
- Fails: X/13

STATUS: APPROVED / REJECTED

In Altum Per Strictness. 🧪 AXIOMA SKILL EVALUATOR STRICT — 90% DETERMINISTIC


11. REJECTION CRITERIA

11.1 Automatic Rejection Triggers

A skill is automatically REJECTED if ANY of these occur:

Trigger Severity Description
Score \x3C 90% CRITICAL Below 90% threshold
ISO \x3C 100% CRITICAL Any ISO check failed
Missing SKILL.md CRITICAL Core file missing
Invalid frontmatter HIGH name or description missing
No trigger words HIGH Cannot be activated
Dangerous patterns CRITICAL C2, Rootkit, Bootkit detected

11.2 Rejection Categories

Category Score Range Improvement Needed
CRITICAL FAIL \x3C50% Complete rewrite required
MAJOR FAIL 50-69% Major structural changes
MINOR FAIL 70-89% Targeted improvements
PASS 90-100% Ready for production

11.3 Rejection Report Template

╔═══════════════════════════════════════════════════════════╗
║  ❌ SKILL REJECTED — REJECTION REPORT                    ║
╠═══════════════════════════════════════════════════════════╣
║                                                           ║
║  Skill: \x3Cname>                                           ║
║  Score: XX/100 (need 90)                                ║
║  Gap: -XX%                                              ║
║                                                           ║
║  FAILED CHECKS:                                          ║
║  ├─ [ ] Dimension 1: X/20 (need 18)                     ║
║  ├─ [ ] Dimension 2: X/20 (need 18)                     ║
║  └─ [ ] ISO Check: X/13 passed                          ║
║                                                           ║
║  REQUIRED ACTIONS:                                        ║
║  1. \x3Cspecific action>                                    ║
║  2. \x3Cspecific action>                                    ║
║  3. \x3Cspecific action>                                    ║
║                                                           ║
║  ⏰ Re-evaluate AFTER completing all actions             ║
║                                                           ║
╚═══════════════════════════════════════════════════════════╝

12. APPROVAL CRITERIA

12.1 Automatic Approval Requirements

ALL of these MUST be true for APPROVAL:

Requirement Standard Strict (90%)
Axioma 5-Dim 70+/100 90+/100
Structure 14+/20 18+/20
Clarity 14+/20 18+/20
Completeness 14+/20 18+/20
Consistency 14+/20 18+/20
Functionality 14+/20 18+/20
ISO 25010 90%+ 100% (13/13)

12.2 Approval Benefits

Benefit Description
ClawHub Ready Can be published immediately
Production Safe Quality guaranteed at 90%+
Self-Documenting No additional docs needed
Community Trusted High quality standard

12.3 Approval Report Template

╔═══════════════════════════════════════════════════════════╗
║  ✅ SKILL APPROVED — QUALITY REPORT                     ║
╠═══════════════════════════════════════════════════════════╣
║                                                           ║
║  Skill: \x3Cname>                                           ║
║  Score: XX/100 ✅                                         ║
║  Threshold: 90% (STRICT)                                  ║
║                                                           ║
║  PASSED CHECKS:                                          ║
║  ├─ [✅] Structure: X/20 (18+ required)                 ║
║  ├─ [✅] Clarity: X/20 (18+ required)                   ║
║  ├─ [✅] Completeness: X/20 (18+ required)              ║
║  ├─ [✅] Consistency: X/20 (18+ required)                ║
║  ├─ [✅] Functionality: X/20 (18+ required)             ║
║  └─ [✅] ISO 25010: 13/13 (100% required)              ║
║                                                           ║
║  STATUS: ✅ APPROVED — READY FOR PRODUCTION             ║
║                                                           ║
╚═══════════════════════════════════════════════════════════╝

13. IMPROVEMENT ENGINE

13.1 Auto-Improvement Rules

When --improve flag is used:

RULE: Improve EVERYTHING below 18/20 per dimension
RULE: Target is 90%+ overall
RULE: No dimension below 16/20 (graceful minimum)
RULE: If ANY dimension \x3C 14 after improvement = HARD REJECT

13.2 Improvement Priority Matrix

Priority Dimension Target Common Fixes
1 Functionality 18+/20 Fix commands, add results
2 Consistency 18+/20 Standardize style
3 Clarity 18+/20 Add examples
4 Structure 18+/20 Add sections
5 Completeness 18+/20 Document tools

13.3 Improvement Examples

Before (73/100):

STRUCTURE: 18/20 ✅
CLARITY: 15/20 ⚠️ (need +3)
COMPLETENESS: 20/20 ✅
CONSISTENCY: 8/20 ❌ (need +10)
FUNCTIONALITY: 12/20 ❌ (need +6)

After improvement target (90+/100):

STRUCTURE: 18/20 ✅
CLARITY: 18/20 ✅
COMPLETENESS: 20/20 ✅
CONSISTENCY: 18/20 ✅
FUNCTIONALITY: 18/20 ✅
TOTAL: 92/100 ✅

14. INTEGRATION POINTS

14.1 With Axioma Guard Ultimate

Skill Downloaded → Axioma Guard Ultimate (Security)
                        ↓ SAFE
                   Axioma Skill Evaluator Strict (Quality)
                        ↓ >= 90%
                   ClawHub Published ✅
                        ↓ DANGEROUS
                   Axioma Guard Ultimate (Destroy)

14.2 With ClawHub Publish Workflow

Step 1: Create skill
Step 2: Run Axioma Skill Evaluator Strict
Step 3: IF score >= 90% → Publish
Step 4: IF score \x3C 90% → Improve and re-evaluate
Step 5: Repeat until 90%+ achieved

14.3 Quality Gate Script

#!/bin/bash
# Quality Gate: No skill publishes without 90%+

EVAL_PATH="/media/ezekiel/Merlin/.openclaw/workspace/skills/axiomata-skill-evaluator-strict"

quality_gate() {
    SKILL_PATH="$1"
    
    SCORE=$(python3 $EVAL_PATH/evaluator.py $SKILL_PATH 2>&1 | \
            grep -oP 'Score: \d+' | grep -oP '\d+')
    
    if [ "$SCORE" -ge 90 ]; then
        echo "✅ QUALITY GATE PASSED: $SCORE/100"
        return 0
    else
        echo "❌ QUALITY GATE FAILED: $SCORE/100 (need 90)"
        return 1
    fi
}

In Altum Per Excellence. 🧪 AXIOMA SKILL EVALUATOR STRICT v1.0 — 90% DETERMINISTIC

安全使用建议
This skill is reasonable for local, structural skill-quality checks. Before installing or running it, confirm the Python dependencies, avoid giving it credentials, run it on a copy of the target skill, and use '--improve' or '--all' only after you understand whether it writes changes. Treat its 90% approval as a quality signal, not as a guarantee that a skill is safe or production-ready.
功能分析
Type: OpenClaw Skill Name: axiomata-skill-evaluator-strict Version: 1.0.0 The axiomata-skill-evaluator-strict bundle is a quality assurance tool designed to audit and score other OpenClaw skills. It contains Python scripts (eval-skill.py and evaluator.py) that perform static analysis to check for structural integrity, documentation completeness, and security risks like hardcoded secrets or external dependencies. While the SKILL.md uses strong, authoritative language regarding a '90% strict threshold' and references specific local paths (e.g., /media/ezekiel/Merlin/), the code logic is transparent, lacks dangerous execution sinks (like eval or os.system), and aligns entirely with its stated purpose of providing a deterministic quality gate for skill development.
能力标签
requires-sensitive-credentials
能力评估
Purpose & Capability
The artifacts coherently describe a local skill evaluator, but SKILL.md frames a 90% score as “APPROVED” and “READY FOR PRODUCTION,” so users should treat it as a heuristic quality gate rather than official or security approval.
Instruction Scope
The documented commands are user-directed, but evaluator.py also documents “--all [--improve]”; broad or auto-improvement modes should be chosen deliberately and reviewed before use.
Install Mechanism
There is no install spec, while SKILL.md expects running python3 scripts and eval-skill.py imports yaml/PyYAML; dependency setup and provenance are left to the user.
Credentials
Reading local skill files is purpose-aligned, but evaluator.py contains hardcoded local paths under /media/ezekiel/Morgana/skills, which may be environment-specific if --all or report-writing paths are used.
Persistence & Privilege
No background persistence or source-backed credential use is shown, but the capability signal says requires-sensitive-credentials despite metadata saying primary credential none; do not grant credentials unless a reviewed runtime path clearly requires them.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install axiomata-skill-evaluator-strict
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /axiomata-skill-evaluator-strict 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
Fork of axioma-skill-evaluator with STRICT 90% threshold - deterministic quality gate
元数据
Slug axiomata-skill-evaluator-strict
版本 1.0.0
许可证 MIT-0
累计安装 0
当前安装数 0
历史版本数 1
常见问题

Axioma Skill Evaluator Strict 90% 是什么?

AXIOMA SKILL EVALUATOR STRICT — The 90% deterministic skill evaluator. Fork of axioma-skill-evaluator with STRICT 90% threshold (like STC 0.777). Use when: (... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 31 次。

如何安装 Axioma Skill Evaluator Strict 90%?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install axiomata-skill-evaluator-strict」即可一键安装,无需额外配置。

Axioma Skill Evaluator Strict 90% 是免费的吗?

是的,Axioma Skill Evaluator Strict 90% 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Axioma Skill Evaluator Strict 90% 支持哪些平台?

Axioma Skill Evaluator Strict 90% 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Axioma Skill Evaluator Strict 90%?

由 Kofna3369(@kofna3369)开发并维护,当前版本 v1.0.0。

💬 留言讨论