OUA Unified Intelligence Assessment
Author: RafeYu8899 · v1.0.0 · MIT-0
58 total downloads · 0 favorites · 0 current installs · 1 version
Install in OpenClaw
/install oua-intelligence-test
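Description
OUA is a unified intelligence assessment framework that uses tests across 10 dimensions to evaluate an AI's language understanding, logical reasoning, coding ability, creativity, safety and ethics, and engineering reliability.
Safe Usage Recommendations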
This package appears coherent for running an offline assessment and generating HTML reports. Before running it:
- Inspect the Python script (already included) and the test bank to confirm it does only local I/O (it does).
- Run it in a sandbox or isolated environment (container/VM) if you will execute untrusted code.
- Do not feed production secrets or sensitive user data into its input JSON; the tool writes report files and could include those values in output.
- If you intend to rely on results for high-stakes decisions, review the scoring logic and weights to ensure they match your evaluation criteria.
- If you want extra assurance, verify the referenced GitHub repository and author identity, and run static checks (lint, safety scanners) on the code.
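The advice about keeping secrets out of the input JSON can be backed by a quick pre-flight check. The following is a minimal sketch, not part of the skill: the function name `find_sensitive_keys` and the key pattern are hypothetical, and a name-based scan only catches obviously labeled fields, not secrets embedded in free text.

```python
import re
from typing import Any, List

# Hypothetical pattern: key names that commonly hold credentials.
SENSITIVE_KEY = re.compile(r"secret|token|password|api[_-]?key", re.IGNORECASE)


def find_sensitive_keys(node: Any, path: str = "") -> List[str]:
    """Return the paths of dict keys whose names look like credentials."""
    hits: List[str] = []
    if isinstance(node, dict):
        for key, value in node.items():
            child = f"{path}.{key}" if path else str(key)
            if SENSITIVE_KEY.search(str(key)):
                hits.append(child)
            # Recurse so nested objects are checked too.
            hits.extend(find_sensitive_keys(value, child))
    elif isinstance(node, list):
        for index, value in enumerate(node):
            hits.extend(find_sensitive_keys(value, f"{path}[{index}]"))
    return hits
```

Running this over the parsed input (for example, the result of `json.load`) before scoring gives a list of suspicious paths to review or strip; an empty list does not prove the input is safe.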
Functional Analysis
Type: OpenClaw Skill
Name: oua-intelligence-test
Version: 1.0.0
The bundle implements the OpenClaw Unified Assessment (OUA), a framework for evaluating AI capabilities across 10 dimensions. The core components include a Python scoring engine (scripts/score_test.py) that generates HTML reports and a comprehensive test bank (references/test-bank.md). While the test bank contains examples of prompt injection and adversarial queries, these are clearly defined as evaluation data for testing other AI models rather than instructions for the agent itself. The code performs standard data processing and file I/O consistent with its stated purpose, with no evidence of malicious intent or data exfiltration.
Capability Assessment
Purpose & Capability
Name/description (OUA unified assessment) align with the included assets: SKILL.md (framework + tests) and scripts/score_test.py (scoring and HTML report generation). The code implements weighting, per-dimension scoring, grade/quadrant logic, and report generation, all of which are expected for a benchmarking tool.
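To make "weighting, per-dimension scoring, and grade logic" concrete, here is a minimal sketch of how such a scorer might combine dimension scores. It is not taken from scripts/score_test.py: the weights, the 0-100 scale, the letter cut-offs, and the names `WEIGHTS`, `weighted_score`, and `grade` are all assumptions chosen for illustration.

```python
from typing import Dict

# Hypothetical weights; the real values in scripts/score_test.py may differ.
WEIGHTS: Dict[str, float] = {
    "language_understanding": 0.5,
    "logical_reasoning": 0.3,
    "code_generation": 0.2,
}


def weighted_score(scores: Dict[str, float], weights: Dict[str, float]) -> float:
    """Return the weight-normalized average of per-dimension scores (0-100)."""
    total_weight = sum(weights[name] for name in scores)
    if total_weight == 0:
        raise ValueError("weights sum to zero")
    return sum(scores[name] * weights[name] for name in scores) / total_weight


def grade(score: float) -> str:
    """Map a 0-100 score to a letter grade using illustrative cut-offs."""
    for cutoff, letter in ((90, "A"), (80, "B"), (70, "C"), (60, "D")):
        if score >= cutoff:
            return letter
    return "F"
```

Normalizing by the weights actually used means a partial run (fewer dimensions scored) still yields a score on the same scale. Anyone relying on the results should compare the real weights and cut-offs in the bundled script against their own criteria.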
Instruction Scope
SKILL.md provides test definitions, trigger phrases and guidance for running assessments; it does not instruct the agent to read unrelated system files, access secrets, or transmit data externally. The test bank contains prompts that intentionally include harmful examples (to test refusal), but those are test cases, not active exfiltration instructions.
Install Mechanism
There is no install specification (instruction-only skill with a bundled Python script). No downloads, package installs, or extracted archives are declared. Risk is limited to running the included Python script locally.
Credentials
The skill declares no required environment variables, no primary credential, and no config paths. The code reads JSON input and writes HTML reports locally; it does not request or use credentials or external tokens.
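The "reads JSON input, writes HTML locally" pattern described here can be sketched in a few lines of standard-library Python. This is an illustration under stated assumptions, not the skill's code: the input schema (`model` and `scores` keys) and the function name `render_report` are hypothetical.

```python
import html
import json
from pathlib import Path


def render_report(input_path: Path, output_path: Path) -> None:
    """Read a JSON score file and write a minimal HTML table next to it."""
    # Assumed schema: {"model": "<name>", "scores": {"<dimension>": <number>}}
    data = json.loads(input_path.read_text(encoding="utf-8"))
    rows = "".join(
        f"<tr><td>{html.escape(str(name))}</td><td>{float(score):.1f}</td></tr>"
        for name, score in data["scores"].items()
    )
    title = html.escape(str(data.get("model", "unknown")))
    page = (
        "<!DOCTYPE html><html><head><meta charset='utf-8'>"
        f"<title>{title}</title></head>"
        f"<body><table>{rows}</table></body></html>"
    )
    # Only local file I/O: no network access, no environment variables.
    output_path.write_text(page, encoding="utf-8")
```

Note that every value from the input ends up in the report file, which is why sensitive data should stay out of the input; escaping with `html.escape` only prevents markup injection, it does not hide the values.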
Persistence & Privilege
always is false and the skill is user-invocable. It does not request permanent presence or attempt to modify other skills or system-wide agent settings. The script reads/writes local files only (input JSON / output HTML).
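How to Use
- Make sure OpenClaw is installed (locally or via Docker)
- Enter the install command in the chat box: /install oua-intelligence-test
- Once installation completes, invoke the Skill by name or trigger it with /oua-intelligence-test
- Provide the required inputs according to the Skill's parameter description to receive structured output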
Version History
v1.0.0
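OUA v1.0.0: initial release of the OpenClaw comprehensive 10-dimension AI capability evaluation framework
- Combines OIT (an 8-dimension test of the intelligence ceiling) and LLI (a 2-dimension test of the engineering floor) to assess AI capability across 10 dimensions
- Covers language understanding, logical reasoning, domain knowledge, code generation, creativity, context memory, tool use, safety and ethics, engineering implementation, and system robustness
- Supports interactive scoring and visual HTML reports (with a radar chart and four-quadrant analysis)
- Defines explicit scoring criteria and a dual-track score system (OIT and LLI are scored separately)
- Includes detailed dimension descriptions and scoring explanations, suited to side-by-side comparison of multiple models and as a reference for development improvements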
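The dual-track scoring and four-quadrant analysis listed above might look roughly like the sketch below. Several things here are assumptions rather than facts from the release notes: that the first eight listed dimensions form the OIT track and the last two form the LLI track, that each track is a plain mean on a 0-100 scale, that the quadrant threshold is 60, and the quadrant labels themselves.

```python
from statistics import mean
from typing import Dict, Tuple

# Assumed split: first eight listed dimensions -> OIT, last two -> LLI.
OIT_DIMENSIONS = (
    "language_understanding", "logical_reasoning", "domain_knowledge",
    "code_generation", "creativity", "context_memory",
    "tool_use", "safety_ethics",
)
LLI_DIMENSIONS = ("engineering_implementation", "system_robustness")


def dual_track(scores: Dict[str, float]) -> Tuple[float, float]:
    """Return (OIT, LLI) as separate track averages; they are never merged."""
    oit = mean(scores[name] for name in OIT_DIMENSIONS)
    lli = mean(scores[name] for name in LLI_DIMENSIONS)
    return oit, lli


def quadrant(oit: float, lli: float, threshold: float = 60.0) -> str:
    """Place a model in one of four quadrants (labels are illustrative)."""
    ceiling = "high ceiling" if oit >= threshold else "low ceiling"
    floor = "high floor" if lli >= threshold else "low floor"
    return f"{ceiling}, {floor}"
```

Keeping the two tracks separate means a model with strong reasoning but unreliable engineering behavior is not hidden behind a single blended number, which appears to be the point of scoring OIT and LLI independently.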
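FAQ
What is OUA Unified Intelligence Assessment?
OUA is a unified intelligence assessment framework that uses tests across 10 dimensions to evaluate an AI's language understanding, logical reasoning, coding ability, creativity, safety and ethics, and engineering reliability. It is an AI Agent Skill plugin for Claude Code / OpenClaw, with 58 downloads to date.
How do I install OUA Unified Intelligence Assessment?
Run the command /install oua-intelligence-test in the OpenClaw or Claude Code chat box to install it in one step; no additional configuration is needed.
Is OUA Unified Intelligence Assessment free?
Yes. OUA Unified Intelligence Assessment is completely free and released under the MIT-0 license, so it can be freely downloaded, installed, and used.
Which platforms does OUA Unified Intelligence Assessment support?
OUA Unified Intelligence Assessment is cross-platform and can be used in any environment where OpenClaw / Claude Code is deployed.
Who developed OUA Unified Intelligence Assessment?
It is developed and maintained by RafeYu8899 (@rafeyu8899); the current version is v1.0.0.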