← Back to Skills Marketplace

Skill Sandbox

Name: Skill Sandbox
Author: trypto1019

by ArcSelf · GitHub ↗ · v1.1.0

darwinlinux ⚠ suspicious

869

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install arc-skill-sandbox

Description

Test untrusted skills in an isolated environment before installing. Monitors network access, filesystem writes, environment variable reads, and subprocess ca...

README (SKILL.md)

Skill Sandbox

Run untrusted skills in a monitored environment. See exactly what they do before giving them access to your real system.

Why This Exists

ClawHub has hundreds of skills. Some are malicious. Even after scanning with arc-skill-scanner, you can't catch everything with static analysis. The sandbox lets you run a skill's scripts and observe their behavior at runtime — what network calls they make, what files they access, what environment variables they read.

Commands

Sandbox a skill directory

python3 {baseDir}/scripts/sandbox.py run --path ~/.openclaw/skills/some-skill/

Run a specific script in sandbox

python3 {baseDir}/scripts/sandbox.py run --script ~/.openclaw/skills/some-skill/scripts/main.py

Run with network monitoring

python3 {baseDir}/scripts/sandbox.py run --path ~/.openclaw/skills/some-skill/ --monitor-network

Run with fake environment variables

python3 {baseDir}/scripts/sandbox.py run --path ~/.openclaw/skills/some-skill/ --fake-env

Run with a time limit

python3 {baseDir}/scripts/sandbox.py run --path ~/.openclaw/skills/some-skill/ --timeout 30

Generate a safety report

python3 {baseDir}/scripts/sandbox.py report --path ~/.openclaw/skills/some-skill/

What It Monitors

Filesystem Access

Files opened (read/write)
Directories created
File deletions
Permission changes

Environment Variables

Which env vars are read
Whether sensitive keys are accessed (API keys, tokens, passwords)
Option to inject fake values to see what the skill does with them

Network Activity

Outbound HTTP/HTTPS requests (URLs, methods, payloads)
DNS lookups
Socket connections
FTP, SMTP, and other protocols

Process Execution

Subprocess calls
Shell commands
Dynamic imports

Safety Modes

observe (default) — Run the skill and log everything it does. No restrictions.
restricted — Block network access and filesystem writes outside a temp directory.
honeypot — Provide fake credentials and endpoints to see if the skill tries to exfiltrate.

Output

The sandbox produces a JSON report with:

All filesystem operations (reads, writes, deletes)
All environment variable accesses
All network connections attempted
All subprocess calls
Warnings for suspicious patterns
A safety verdict (SAFE / SUSPICIOUS / DANGEROUS)

Integration

Combine with the workflow orchestrator for automated pre-install checks:

scan skill → sandbox run → review report → install if safe → audit log

Limitations

Python skills only (JavaScript/shell support planned)
Cannot catch all evasion techniques (obfuscated or delayed execution)
Network monitoring requires the skill to use standard Python libraries
Not a true OS-level sandbox (use Docker for that level of isolation)

Usage Guidance

This tool mostly does what it says (it checks Python skill code and emits a report), but be cautious: 1) 'observe' mode runs the skill with your real environment and no restrictions — that can let a malicious skill read or exfiltrate secrets; prefer using --fake-env or the 'restricted' mode. 2) The script's monitoring is limited: running the skill as a subprocess means the monitor cannot always see every env read or low-level socket operation, and the tool itself warns it is not an OS-level sandbox. 3) Do not rely solely on the sandbox verdict — inspect the generated JSON report and the skill's source yourself, and consider running untrusted skills inside a real container/VM (Docker) or isolated machine for high-risk cases. 4) If you need stronger guarantees (preventing native code, ctypes, exec/fork, raw sockets, or kernel-level side-channels), use Docker/VM-based isolation rather than this Python-based sandbox.

Capability Analysis

Type: OpenClaw Skill Name: arc-skill-sandbox Version: 1.1.0 This skill bundle implements a security sandbox designed to monitor and analyze the behavior of other untrusted skills. All observed 'risky' capabilities, such as logging filesystem operations, environment variable access, network calls, and subprocess execution (in `scripts/sandbox.py`), are explicitly part of its stated purpose to detect malicious activity in *other* skills. The inclusion of fake credentials (e.g., `sk-fake-sandbox-key-do-not-use`) is a honeypot mechanism to test the sandboxed skill's intent, not an attempt to steal real credentials. The `SKILL.md` documentation clearly outlines its defensive purpose, and the code demonstrates robust isolation techniques (subprocess execution, sanitized environment) to prevent sandbox escapes. There is no evidence of intentional harmful behavior by this skill itself.

Capability Assessment

ℹ Purpose & Capability

Name/description match what the package contains: a Python script for running and analyzing other Python skill code. Requiring python3 and referencing skill directories in the user's home is consistent with a tool that runs and inspects skill code.

⚠ Instruction Scope

SKILL.md claims comprehensive monitoring of filesystem, env reads, network and subprocesses. The implementation mixes subprocess isolation and in-process monitoring primitives; when it runs the skill as a separate subprocess (its stated default safe approach) it cannot directly observe which environment keys the child process reads unless additional instrumentation is injected into the child. The code also logs the injection of fake credentials as warnings immediately (monitor.warn on injection), which could be misleading because injection ≠ access. The default 'observe' mode runs the skill with no restrictions and could allow real secrets or network exfiltration — this conflicts with the 'safe' implication in the description.

✓ Install Mechanism

No install spec; instruction-only plus a Python script. This is low-risk from an installation standpoint (nothing downloaded or auto-installed).

⚠ Credentials

The skill requests no external credentials and only requires python3, which is proportional. However, because SKILL.md's default observe mode runs a skill with the host environment and the tool logs environment variable access only under certain modes, a user following the docs could inadvertently expose real credentials or files. The script injects a fixed set of 'fake' keys (OpenAI, Anthropic, GitHub, AWS, etc.) which is sensible for testing, but injection is logged as a warning immediately regardless of whether the child actually reads them, creating potential false positives or misleading traces.

✓ Persistence & Privilege

No always:true or other elevated persistence requested. The skill is user-invocable and not force-included. Normal autonomous invocation is allowed by platform defaults but is not a new concern here.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install arc-skill-sandbox
After installation, invoke the skill by name or use /arc-skill-sandbox
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.1.0

arc-skill-sandbox 1.1.0 - Updated scripts/sandbox.py (details not included). - No user-facing documentation changes.

v1.0.1

- Updated scripts/sandbox.py. - No changes to documentation or user-facing features. - Version bump to 1.0.1.

v1.0.0

Initial release of skill-sandbox: safely test untrusted skills in a monitored environment. - Run and monitor skills before installing to protect your agent’s data and credentials. - Tracks filesystem changes, environment variable access, network activity, and subprocess calls. - Offers multiple safety modes (observe, restricted, honeypot) to control and observe behavior. - Produces detailed JSON safety reports highlighting suspicious or risky actions. - Python-only for now; additional language support planned in future versions.

Metadata

Slug arc-skill-sandbox

Version 1.1.0

License —

All-time Installs 0

Active Installs 0

Total Versions 3

Frequently Asked Questions

What is Skill Sandbox?

Test untrusted skills in an isolated environment before installing. Monitors network access, filesystem writes, environment variable reads, and subprocess ca... It is an AI Agent Skill for Claude Code / OpenClaw, with 869 downloads so far.

How do I install Skill Sandbox?

Run "/install arc-skill-sandbox" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Skill Sandbox free?

Yes, Skill Sandbox is completely free (open-source). You can download, install and use it at no cost.

Which platforms does Skill Sandbox support?

Skill Sandbox is cross-platform and runs anywhere OpenClaw / Claude Code is available (darwin, linux).

Who created Skill Sandbox?

It is built and maintained by ArcSelf (@trypto1019); the current version is v1.1.0.

More Skills

Skill Sandbox

Skill Sandbox

Why This Exists

Commands

Sandbox a skill directory

Run a specific script in sandbox

Run with network monitoring

Run with fake environment variables

Run with a time limit

Generate a safety report

What It Monitors

Filesystem Access

Environment Variables

Network Activity

Process Execution

Safety Modes

Output

Integration

Limitations

What is Skill Sandbox?

How do I install Skill Sandbox?

Is Skill Sandbox free?

Which platforms does Skill Sandbox support?

Who created Skill Sandbox?

💬 Comments