← 返回 Skills 市场
110
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install zouroboros-bench
功能描述
Benchmark harness for AI memory systems. Evaluates LongMemEval, LoCoMo, and ConvoMem datasets against any memory backend via the zouroboros-memory CLI. Inclu...
安全使用建议
What to consider before installing/running:
- The tool is intended for benchmarking memory backends and mostly does what it claims, but the runtime will by default open and may write to a persistent memory DB at ~/.zouroboros/memory.db (or whatever ZOUROBOROS_MEMORY_DB points to). If you have an existing Zouroboros memory DB you care about, the benchmark may modify it (it can store 'mimir-correction-*' facts).
- The code calls external services: OpenAI (api.openai.com) if you set OPENAI_API_KEY, a configurable 'Mimir' gate endpoint (MIMIR_GATE_URL, default http://localhost:7820), and an Ollama local model URL. Running with OPENAI_API_KEY will send prompts and model responses (potentially dataset or memory contents) to OpenAI's API. Ensure you are comfortable with that data leaving your machine.
- Mitigations: run the benchmark in an isolated environment (container, VM, or throwaway account), set ZOUROBOROS_MEMORY_DB to a temporary path so the tool does not touch your real home DB, and avoid providing OPENAI_API_KEY if you do not want cloud uploads — use an offline Ollama instance instead. Inspect or run the package source locally before npm installing globally. Review and, if necessary, modify dist/adapters/mimir-judge.js and other adapters if you want to disable the code path that writes to the memory system.
- If you need a simple read-only benchmark run: ensure ZOUROBOROS_MEMORY_CLI points to a no-op or stub binary or use ZOUROBOROS_MEMORY_DB pointing to an isolated SQLite file; do not provide OPENAI_API_KEY and set MIMIR_GATE_URL to localhost or an internal endpoint you control.
- If you are not comfortable with these behaviors or cannot run in an isolated environment, treat this skill as potentially risky and avoid giving it access to your real memory DB or OpenAI credentials.
功能分析
Type: OpenClaw Skill
Name: zouroboros-bench
Version: 1.0.0
The bundle is a benchmark harness for AI memory systems that utilizes high-risk execution patterns, specifically the use of `execSync` to construct shell commands with potentially unsanitized inputs in `mimir-judge.ts` and several adapter files (e.g., `convomem-adapter.js`). It interacts with a local SQLite database at `~/.zouroboros/memory.db` and performs network requests to `api.openai.com` and local endpoints like `http://localhost:11434` (Ollama) and `http://localhost:7820` (Mimir Gate). While these actions are plausibly necessary for benchmarking and evaluating memory backends, the implementation creates a significant attack surface for shell injection and unauthorized local data manipulation via the `zouroboros-memory` CLI.
能力标签
能力评估
Purpose & Capability
The skill is a Node-based benchmark harness that calls an external memory CLI and cloud/local LLM endpoints — this is consistent with a memory benchmarking tool. However, the code defaults to reading/writing a persistent local memory DB at ~/.zouroboros/memory.db (via better-sqlite3 or the memory CLI) and will attempt to store 'mimir-correction' facts back into the memory system. The SKILL metadata did not declare any required config paths or warn that it may modify a user's persistent memory DB, which is surprising for some users.
Instruction Scope
SKILL.md instructs installation and running npx commands and documents env vars like ZOUROBOROS_MEMORY_CLI, ZOUROBOROS_MEMORY_DB, OPENAI_API_KEY, and OLLAMA_URL. The runtime code, however, will: (1) read local dataset files and the user's home memory DB by default, (2) invoke external binaries (zouroboros-memory) via execSync, (3) send data to api.openai.com and to a GATE_URL endpoint (MIMIR_GATE_URL) and to an Ollama URL — potentially transmitting benchmark contents and memory content to remote services. The README does not explicitly warn that dataset or memory contents may be transmitted to third-party services or that the tool will write corrections back into a persistent memory DB.
Install Mechanism
Install is a normal npm package (zouroboros-bench). No downloadable archives or obscure URLs were used. Code is included in the package (dist/). This is an expected install mechanism for a Node.js tool.
Credentials
The registry metadata declares only OPENAI_API_KEY as required (which the code uses). The code also respects other env vars (ZOUROBOROS_MEMORY_CLI, ZOUROBOROS_MEMORY_DB, OLLAMA_URL, MIMIR_GATE_URL, ZO_ANSWER_MODEL). The main concern: the tool accesses a default memory DB path in the user's home directory (ZO_MEMORY_DB default -> ~/.zouroboros/memory.db) and may write entries into that DB via the memory CLI. That means the skill can access and mutate persistent, possibly sensitive local data without the SKILL manifest declaring a required config path or explicitly warning the user.
Persistence & Privilege
The skill does not request 'always: true' and doesn't modify other skills, but it does persist corrections into a memory backend (via execSync of the memory CLI) and reads a local SQLite DB by default. That gives it persistent side effects on the user's memory store, which the SKILL.md/metadata do not prominently warn about. Users should treat this as a write-capable tool for local persistent state.
如何使用
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install zouroboros-bench - 安装完成后,直接呼叫该 Skill 的名称或使用
/zouroboros-bench触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
Initial ClawHub release. Benchmark harness for AI memory systems with LongMemEval, LoCoMo, ConvoMem datasets and Mimir judge.
元数据
常见问题
Zouroboros Bench 是什么?
Benchmark harness for AI memory systems. Evaluates LongMemEval, LoCoMo, and ConvoMem datasets against any memory backend via the zouroboros-memory CLI. Inclu... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 110 次。
如何安装 Zouroboros Bench?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install zouroboros-bench」即可一键安装,无需额外配置。
Zouroboros Bench 是免费的吗?
是的,Zouroboros Bench 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。
Zouroboros Bench 支持哪些平台?
Zouroboros Bench 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 Zouroboros Bench?
由 marlandoj(@marlandoj)开发并维护,当前版本 v1.0.0。
推荐 Skills