← 返回 Skills 市场
rsavitt

SWARM Safety

作者 rsavitt · GitHub ↗ · v1.5.0
cross-platform ✓ 安全检测通过
1217
总下载
2
收藏
1
当前安装
2
版本数
在 OpenClaw 中安装
/install swarm-2
功能描述
SWARM: System-Wide Assessment of Risk in Multi-agent systems. 38 agent types, 29 governance levers, 55 scenarios. Study emergent risks, phase transitions, and governance cost paradoxes.
安全使用建议
This appears coherent for a local simulation framework, but follow standard precautions before installing or running: (1) run pip installs in an isolated virtualenv or container; (2) review the package source on the referenced GitHub repo before installing, especially if you plan to use extras like [llm] or [api]; (3) keep the API bound to localhost and behind firewalls — do not bind to 0.0.0.0 on untrusted networks; (4) never submit real API keys, credentials, or PII to scenarios; and (5) be aware that installing Python packages can execute install scripts, so inspect setup metadata if you require a higher assurance level.
功能分析
Type: OpenClaw Skill Name: swarm-2 Version: 1.5.0 The OpenClaw AgentSkills skill bundle for 'swarm-safety' is designed as a research framework for studying AI safety and emergent risks in multi-agent systems. The `SKILL.md` and `skill.json` files describe its functionality, installation, and API usage. All installation methods are standard (pip, git clone). The API explicitly binds to `127.0.0.1` (localhost) by default and warns against exposing the unauthenticated development API to untrusted networks, indicating security awareness rather than malicious intent. There is no evidence of data exfiltration, unauthorized command execution, persistence mechanisms, or malicious prompt injection attempts against the OpenClaw agent. The 'adversarial' and 'deceptive' agents mentioned are part of the *simulated* environment, not the skill itself acting maliciously on the host system.
能力评估
Purpose & Capability
Name/description (multi-agent safety simulation) align with the provided instructions and metadata: it documents agent types, scenarios, governance levers, and CLI/API usage. Nothing requested (no env vars, no unusual binaries) is disproportionate to a simulation/testing framework.
Instruction Scope
SKILL.md stays on-topic: it explains installation (pip/git), local API startup (uvicorn binding to 127.0.0.1), CLI usage, and curl-based local endpoints. It explicitly warns not to expose the dev API and not to submit real API keys/PII. It does not instruct reading unrelated system files or exfiltrating data.
Install Mechanism
Install instructions use pip and an upstream GitHub repository — standard, low-to-moderate risk for Python packages. No arbitrary binary downloads, no URL shorteners or personal IP-hosted archives are used in the instructions.
Credentials
The skill declares no required environment variables, credentials, or config paths. The SKILL.md notes the API will return agent api_keys when run locally (expected behavior for a simulation server) but does not request unrelated secrets.
Persistence & Privilege
No 'always' flag, default autonomous invocation is allowed (normal). The skill is instruction-only and does not request persistent or elevated system privileges or modify other skills' configurations.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install swarm-2
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /swarm-2 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.5.0
Update to v1.5.0: 38 agent types, 29 governance levers, 55 scenarios, 8 framework bridges, phase transition findings, governance cost paradox study
v0.1.0
- Initial release of the SWARM Safety Skill for simulating and studying multi-agent emergent risks. - Provides Python, CLI, and REST API interfaces for agent and scenario management. - Supports various agent types (Honest, Opportunistic, Deceptive, Adversarial, LLM) and governance mechanisms. - Includes comprehensive documentation, example scenarios, and research-use guidelines. - Offers soft probabilistic labeling and four key risk/welfare metrics for environment analysis.
元数据
Slug swarm-2
版本 1.5.0
许可证
累计安装 1
当前安装数 1
历史版本数 2
常见问题

SWARM Safety 是什么?

SWARM: System-Wide Assessment of Risk in Multi-agent systems. 38 agent types, 29 governance levers, 55 scenarios. Study emergent risks, phase transitions, and governance cost paradoxes. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 1217 次。

如何安装 SWARM Safety?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install swarm-2」即可一键安装,无需额外配置。

SWARM Safety 是免费的吗?

是的,SWARM Safety 完全免费(开源免费),可自由下载、安装和使用。

SWARM Safety 支持哪些平台?

SWARM Safety 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 SWARM Safety?

由 rsavitt(@rsavitt)开发并维护,当前版本 v1.5.0。

💬 留言讨论