← Back to Skills Marketplace
remixmm

Untrusted Binary Sandbox

by remixmm · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ✓ Security Clean
55
Downloads
1
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install untrusted-binary-sandbox
Description
Use when asked to safely inspect, sandbox, detonate, run, or dynamically observe untrusted release artifacts, closed-source binaries, JARs, installers, walle...
README (SKILL.md)

Untrusted Binary Sandbox

Purpose

Use this skill to plan and execute controlled behavioral observation of untrusted software without trusting the software, its installer, or its documentation. Treat wallet, private-key, trading, MEV, arbitrage, and closed-source crypto tools as high-risk by default.

This skill does not prove safety. It reduces uncertainty for the exact artifact, configuration, runtime, and observation window tested.

Non-Negotiables

  • Do not run downloaded binaries, JARs, install scripts, release artifacts, or curl | sh flows unless the user explicitly approves dynamic execution.
  • Prefer a sacrificial VM/VPS over a production host. Never mount production secrets, SSH keys, browser profiles, Docker socket, cloud credentials, or funded wallets.
  • Never use a real private key during observation. Use invalid keys, empty throwaway wallets, or a tiny-fund wallet only in the final live tier after explicit approval.
  • Pin artifact identity before execution: source URL, version, size, SHA-256, timestamp, and any signature/digest metadata.
  • Keep confirmed findings separate from risk signals and unknowns. Report confidence level.

Workflow

  1. Scope the sample: Identify the artifact, claimed purpose, required command, required secrets, network endpoints, supported OS/CPU, and whether the user is asking for static analysis, dynamic observation, or production feasibility.

  2. Static triage first: Hash the artifact, inspect archive contents, review scripts/configs, run file, readelf, objdump, strings, jar tf, or language-specific archive inspection as appropriate. Avoid ldd on untrusted ELF files; use readelf -d or objdump -p instead.

  3. Choose the lowest sufficient tier:

    • Tier 0: no-run static analysis only.
    • Tier 1: offline Docker run with network_mode: none.
    • Tier 2: internal-only network with mock RPC/API/DNS sinks.
    • Tier 3: controlled egress with allowlisted endpoints and packet/DNS logging.
    • Tier 4: sacrificial VM/VPS with tiny-fund throwaway wallet for live behavior, only after prior tiers are clean.
  4. Build containment: Use non-root users, read_only root filesystem, cap_drop: [ALL], no-new-privileges, PID/CPU/memory limits, tmpfs for scratch, no host networking, no privileged mode, no host path mounts except read-only artifacts/config and writable logs.

  5. Instrument before execution: Capture process tree, syscalls, opened files, filesystem writes, DNS queries, outbound connections, child processes, resource use, and any downloaded or modified files. Prefer strace inside Linux containers and VM-level packet capture when network is enabled.

  6. Execute briefly and repeat: Start with short timeouts such as 60-180 seconds. Repeat with a known-bad config, mock RPC, then controlled egress. Preserve logs and hashes after each run.

  7. Report with judgment: Summarize confirmed behavior, suspicious signals, unknowns, confidence, and a go/no-go recommendation. For wallet software, include a separate private-key exposure assessment.

Stop Criteria

Stop and report immediately if the sample:

  • Reads SSH keys, shell history, browser profiles, cloud credentials, .env files, unrelated wallet paths, or host-sensitive directories.
  • Writes cron/systemd/profile startup hooks, modifies shell init files, changes file permissions broadly, or downloads and executes new payloads.
  • Requires root, host networking, Docker socket, privileged mode, broad host mounts, or disabled sandboxing without a clear technical need.
  • Connects to endpoints not declared in config/docs, especially paste sites, generic file hosts, Telegram bot APIs, raw IPs, or newly registered domains.
  • Attempts to exfiltrate key material or sends transactions before the expected safe stage.

Solana And Wallet-Specific Rules

  • Never mount ~/.config/solana/id.json or any funded wallet.
  • Prefer invalid key material or an empty throwaway keypair. Do not print, transform, or store private keys in chat.
  • Use a local mock RPC in Tier 2. In Tier 3, allow only explicit RPC/Jito/sender endpoints.
  • Treat "encrypt your private key with this binary" as high-risk behavior, not as a safety feature.
  • Measure failed transactions, priority-fee burn, Jito tips, and RPC rate-limit pressure separately from malware risk.

Resources

  • Read references/docker-sandbox.md when building the actual Docker or VM containment plan.
  • Read references/observation-checklist.md when preparing the final evidence report.
  • Use scripts/scaffold_sandbox.py to generate a local sandbox scaffold. It creates Docker Compose files and a mock Solana RPC service, but does not execute the sample by default.

Appropriate for: In-depth documentation, API references, database schemas, comprehensive guides, or any detailed information that Codex should reference while working.

assets/

Files not intended to be loaded into context, but rather used within the output Codex produces.

Examples from other skills:

  • Brand styling: PowerPoint template files (.pptx), logo files
  • Frontend builder: HTML/React boilerplate project directories
  • Typography: Font files (.ttf, .woff2)

Appropriate for: Templates, boilerplate code, document templates, images, icons, fonts, or any files meant to be copied or used in the final output.


Not every skill requires all three types of resources.

Usage Guidance
Install only if you intend to analyze untrusted software. Follow the skill's own guardrails: do not run samples without explicit approval, use a disposable VM or tightly restricted container, never mount real wallets or secrets, and review any generated Docker files before replacing the default /bin/false command with a real sample command.
Capability Tags
cryptofinancial-authorityrequires-walletcan-sign-transactionsrequires-sensitive-credentials
Capability Assessment
Purpose & Capability
The stated purpose and artifacts align: the skill provides staged static and dynamic analysis guidance for untrusted binaries, wallet tools, and trading bots, with explicit containment and evidence-reporting steps.
Instruction Scope
Instructions are bounded by user approval for dynamic execution, staged risk tiers, short timeouts, explicit artifact identification, and clear stop criteria for secret access, persistence, unexpected network activity, or transaction behavior.
Install Mechanism
The package includes an executable Python scaffold generator that writes Docker Compose files, a Dockerfile, a runbook, and a mock Solana RPC service; it does not execute the sample by default, though --force can overwrite files in the chosen output directory.
Credentials
Docker, VM, tracing, mock RPC, and controlled network guidance are high-impact techniques but proportionate to malware-style observation and are framed with non-root, read-only, no-privilege, no-host-secret safeguards.
Persistence & Privilege
No hidden persistence, privilege escalation, credential harvesting, or background worker behavior was found; the skill explicitly warns against privileged Docker, host networking, Docker socket mounts, funded wallets, and startup hooks.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install untrusted-binary-sandbox
  3. After installation, invoke the skill by name or use /untrusted-binary-sandbox
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release: staged sandbox workflow for untrusted binaries, wallet/trading bots, Docker scaffolding, mock RPC, and evidence reporting.
Metadata
Slug untrusted-binary-sandbox
Version 1.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is Untrusted Binary Sandbox?

Use when asked to safely inspect, sandbox, detonate, run, or dynamically observe untrusted release artifacts, closed-source binaries, JARs, installers, walle... It is an AI Agent Skill for Claude Code / OpenClaw, with 55 downloads so far.

How do I install Untrusted Binary Sandbox?

Run "/install untrusted-binary-sandbox" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Untrusted Binary Sandbox free?

Yes, Untrusted Binary Sandbox is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Untrusted Binary Sandbox support?

Untrusted Binary Sandbox is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Untrusted Binary Sandbox?

It is built and maintained by remixmm (@remixmm); the current version is v1.0.0.

💬 Comments