← 返回 Skills 市场
retrodigio

Red Team

作者 retrodigio · GitHub ↗ · v1.0.0
cross-platform ⚠ suspicious
438
总下载
0
收藏
2
当前安装
1
版本数
在 OpenClaw 中安装
/install red-team
功能描述
Adversarial multi-agent debate engine for stress-testing decisions, ideas, and strategies. Orchestrates multiple AI agents with conflicting worldviews (bull,...
安全使用建议
This skill appears to do what it says, but take these precautions before installing or running it: - Understand where data goes: the script calls external model CLIs (Claude/Codex/Gemini) and will send the question, persona system prompts, and any context file you provide to those third-party services. Do not include secrets, private credentials, or confidential documents in the context file unless you trust the target model/provider and account. - Trust the CLIs: you must install and run vendor CLIs locally; ensure those packages are from official sources and that your account/subscription is intended to be used this way. - Inspect custom personas and context files: custom persona JSON or other inputs are merged into prompts. Only use custom persona files from trusted authors — a malicious custom persona could craft prompts that produce undesirable outputs or leak data to the model. - Review the full script: part of the Python file was truncated in the package listing; if you need high assurance, open and read the entire scripts/red-team.py to confirm there are no hidden network calls, logging to unexpected endpoints, or file exfiltration code beyond calls to the recognized CLIs. - Sandbox if needed: run first in an isolated environment or with non-sensitive dummy data to observe behavior and network traffic. If you want, I can (1) walk through the full red-team.py file line-by-line, (2) point out exactly where user data is injected into prompts, or (3) suggest safer invocation patterns (e.g., redact secrets, run in an isolated account) — tell me which you prefer.
功能分析
Type: OpenClaw Skill Name: red-team Version: 1.0.0 The skill is classified as suspicious due to the significant prompt injection surface against the underlying Large Language Models (LLMs). User-controlled inputs, particularly the `system` prompts defined in custom persona JSON files and the `question` and `context` arguments, are directly fed to the LLMs via `subprocess.run` calls in `scripts/red-team.py`. While the script itself does not exhibit malicious host-system behavior (e.g., data exfiltration, unauthorized command execution, or persistence), this design allows a malicious user to manipulate the LLM's behavior, potentially leading to the generation of harmful content, ignoring instructions, or other unintended AI actions. The broad file read/write capabilities (`--context-file`, `--output`) are used for the stated purpose and do not show malicious intent.
能力评估
Purpose & Capability
Name/description (adversarial debate engine) align with the included Python script and persona library. Requiring a local AI CLI (claude/codex/gemini) is expected for this functionality; no unrelated credentials or system accesses are requested.
Instruction Scope
SKILL.md instructs running the included script, selecting personas, and optionally feeding a context file. That context and all persona text are placed into prompts and sent to external model CLIs — so any sensitive data in the context will be transmitted to those third parties. A pre-scan flagged a 'system-prompt-override' pattern; the skill legitimately uses system prompts to define personas, but this pattern is worth noting because it can be abused in other contexts.
Install Mechanism
No install spec is included; the skill is instruction- and script-based. The README suggests installing vendor CLIs with npm (well-known packages), which is low/moderate risk and expected. There are no downloads from arbitrary URLs or archive extraction in the package itself.
Credentials
The skill declares no required environment variables or credentials; it relies on the user's installed model CLIs and their existing authenticated subscriptions. This is proportional to its purpose. Users should note those CLIs use their account tokens and will accept the prompts constructed by the script.
Persistence & Privilege
The skill is not always-enabled and does not request system-wide persistence or modifications to other skills. It runs on demand and does not declare elevated platform privileges.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install red-team
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /red-team 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
Initial release: adversarial multi-agent debate engine with 12 personas, multi-backend support (claude/codex/gemini)
元数据
Slug red-team
版本 1.0.0
许可证
累计安装 2
当前安装数 2
历史版本数 1
常见问题

Red Team 是什么?

Adversarial multi-agent debate engine for stress-testing decisions, ideas, and strategies. Orchestrates multiple AI agents with conflicting worldviews (bull,... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 438 次。

如何安装 Red Team?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install red-team」即可一键安装,无需额外配置。

Red Team 是免费的吗?

是的,Red Team 完全免费(开源免费),可自由下载、安装和使用。

Red Team 支持哪些平台?

Red Team 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Red Team?

由 retrodigio(@retrodigio)开发并维护,当前版本 v1.0.0。

💬 留言讨论