wanng-ide

Arxiv Gamedevbench Evaluating Agentic Capabili

by WANGJUNJIE · GitHub ↗ · v1.0.0
cross-platform ✓ Security Clean
Downloads: 665
Stars: 0
Active Installs: 0
Versions: 1
Install in OpenClaw
/install arxiv-gamedevbench-evaluating-agentic-capabili
Description
Learned from arXiv paper GameDevBench: Evaluating Agentic Capabilities Through Game Development. Use this skill to scaffold Node.js experiments based on the...
README (SKILL.md)

arxiv-gamedevbench-evaluating-agentic-capabili

Source

  • Paper key: 44f3ad505bee7a5c25a60d2a3686cb7e
  • Title: GameDevBench: Evaluating Agentic Capabilities Through Game Development
  • Categories: cs.AI,cs.CL,cs.SE

Learned insight

Despite rapid progress on coding agents, progress on their multimodal counterparts has lagged behind. A key challenge is the scarcity of evaluation testbeds that combine the complexity of software development with the need for deep multimodal understanding. Game development provides such a testbed as agents must navigate large, dense codebases while manipulating intrinsically multimodal assets such as shaders, sprites, and animations within a visual game scene. We present GameDevBench, the first

Node.js implementation entry

node {baseDir}/scripts/run.js

Usage Guidance
This skill appears coherent and low-risk: it’s an auto-generated Node.js scaffold that only prints a paper summary and a TODO. Before running, inspect the bundled files (already included) to confirm there are no added network calls or secret reads. Execute in a sandbox or isolated environment if you want additional caution. If you plan to extend the scaffold, review package.json before adding dependencies and avoid running it with elevated privileges.
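The inspection step above can be partly automated. The following is a minimal sketch, not part of the skill bundle: `scanSource` and `RISKY_PATTERNS` are hypothetical names, and the pattern list is an assumption about what counts as a network call or secret read. It covers only CommonJS `require` calls, `fetch`, and `process.env`, so it supplements a manual read of the files rather than replacing it.

```javascript
// Hypothetical helper (not shipped with the skill): flags lines in a source
// string that match patterns often associated with network access, process
// spawning, or reads of environment variables.
const RISKY_PATTERNS = [
  { label: 'environment access', regex: /process\.env/ },
  { label: 'network module', regex: /require\(\s*['"](?:node:)?(?:https?|net|dgram|dns)['"]\s*\)/ },
  { label: 'fetch call', regex: /\bfetch\s*\(/ },
  { label: 'child process', regex: /require\(\s*['"](?:node:)?child_process['"]\s*\)/ },
];

// Returns one finding per (line, pattern) match, with 1-based line numbers.
function scanSource(source) {
  const findings = [];
  source.split(/\r?\n/).forEach((text, index) => {
    for (const { label, regex } of RISKY_PATTERNS) {
      if (regex.test(text)) {
        findings.push({ line: index + 1, label, text: text.trim() });
      }
    }
  });
  return findings;
}

// Inline sample standing in for a file's contents (illustrative only).
const sample = [
  "console.log('paper summary');",
  "const https = require('https');",
  "const token = process.env.API_TOKEN;",
].join('\n');

console.log(scanSource(sample));
```

To apply it to the real files, read each one with `fs.readFileSync(path, 'utf8')` and pass the result to `scanSource`. An empty result means only that none of these specific patterns matched.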
Capability Analysis
Type: OpenClaw Skill
Name: arxiv-gamedevbench-evaluating-agentic-capabili
Version: 1.0.0

The skill bundle is a simple Node.js scaffold that prints information about an arXiv paper. All files, including `SKILL.md`, `index.js`, and `scripts/run.js`, contain only benign code and documentation. There are no indications of data exfiltration, malicious execution, persistence mechanisms, prompt injection attempts against the agent, or obfuscation. The `SKILL.md` explicitly requires the `node` binary, which is appropriate for a Node.js skill, and the `scripts/run.js` file clearly states it's a 'runnable scaffold' with a 'TODO' for further implementation.
Capability Assessment
Purpose & Capability
The name and description claim Node.js scaffolding for the GameDevBench paper, and the skill requires only the node binary. The included files (index.js, scripts/run.js, package.json, paper.json) are consistent with that purpose.
Instruction Scope
SKILL.md instructs running scripts/run.js. The included run.js only logs metadata and a truncated abstract; there are no instructions to read unrelated files, access environment variables, or send data to external endpoints.
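For orientation, a scaffold with the behavior described above might look roughly like the sketch below. This is an illustration under stated assumptions, not the bundled `scripts/run.js`: the record's field names, the `truncate` and `buildSummary` helpers, the 60-character limit, and the TODO wording are all hypothetical. The key, title, and categories are copied from the Source section.

```javascript
// Hypothetical paper record. The field names are assumptions and may not
// match the schema of the bundled paper.json.
const paper = {
  key: '44f3ad505bee7a5c25a60d2a3686cb7e',
  title: 'GameDevBench: Evaluating Agentic Capabilities Through Game Development',
  categories: ['cs.AI', 'cs.CL', 'cs.SE'],
  abstract: 'Despite rapid progress on coding agents, progress on their multimodal counterparts has lagged behind.',
};

// Cuts text to maxLength characters and marks the cut with an ellipsis.
function truncate(text, maxLength) {
  if (text.length <= maxLength) return text;
  return text.slice(0, maxLength) + '...';
}

// Builds the lines to print: metadata, a truncated abstract, and a TODO.
function buildSummary(record, maxAbstractLength = 60) {
  return [
    `Paper key: ${record.key}`,
    `Title: ${record.title}`,
    `Categories: ${record.categories.join(',')}`,
    `Abstract: ${truncate(record.abstract, maxAbstractLength)}`,
    'TODO: implement the experiment',
  ];
}

// Prints to stdout only: no file reads, environment access, or network calls.
function main() {
  for (const line of buildSummary(paper)) console.log(line);
}

main();
```

Keeping the summary construction in a pure function separate from `main()` makes the output easy to check without capturing stdout.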
Install Mechanism
No install spec provided (instruction-only). There are local code files bundled, but no downloads, package installs, or extract steps. package.json has no dependencies, so nothing is pulled at runtime beyond node.
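The claim that package.json pulls in nothing can be re-checked after any edit to the manifest. The sketch below is one possible check, with hypothetical helper names (`listDeclaredPackages`, `hasInstallHooks`) and made-up example manifests. The field names it inspects are the standard npm dependency sections and install lifecycle scripts.

```javascript
// Standard package.json sections that can cause packages to be installed.
const DEPENDENCY_FIELDS = [
  'dependencies',
  'devDependencies',
  'optionalDependencies',
  'peerDependencies',
];

// Returns the names of all packages declared in any dependency section.
function listDeclaredPackages(manifest) {
  const names = [];
  for (const field of DEPENDENCY_FIELDS) {
    const section = manifest[field];
    if (section && typeof section === 'object') {
      names.push(...Object.keys(section));
    }
  }
  return names;
}

// Returns true if the manifest defines scripts that npm runs during install.
function hasInstallHooks(manifest) {
  const scripts = manifest.scripts || {};
  return ['preinstall', 'install', 'postinstall'].some((hook) => hook in scripts);
}

// Illustrative manifests (assumed content, not the bundled package.json).
const cleanManifest = { name: 'example-skill', version: '1.0.0' };
const extendedManifest = {
  name: 'example-skill',
  version: '1.0.1',
  dependencies: { 'left-pad': '^1.3.0' },
  scripts: { postinstall: 'node setup.js' },
};

console.log(listDeclaredPackages(cleanManifest), hasInstallHooks(cleanManifest));
console.log(listDeclaredPackages(extendedManifest), hasInstallHooks(extendedManifest));
```

To check the real manifest, parse it with `JSON.parse(fs.readFileSync('package.json', 'utf8'))` and pass the result to both functions.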
Credentials
The skill declares no required environment variables or credentials and the code does not access process.env; requested privileges are minimal and appropriate for a local Node.js scaffold.
Persistence & Privilege
`always` is false and the skill does not persist configuration or modify other skills/system settings. It only exposes a main() that prints to stdout.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install arxiv-gamedevbench-evaluating-agentic-capabili
  3. After installation, invoke the skill by name or use /arxiv-gamedevbench-evaluating-agentic-capabili
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
- Initial release of arxiv-gamedevbench-evaluating-agentic-capabili skill.
- Implements scaffolding for Node.js experiments inspired by the "GameDevBench" paper.
- Focuses on evaluating agentic capabilities in game development, including code and multimodal asset manipulation.
- Requires Node.js runtime for operation.
Metadata
Slug arxiv-gamedevbench-evaluating-agentic-capabili
Version 1.0.0
License
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is Arxiv Gamedevbench Evaluating Agentic Capabili?

Learned from arXiv paper GameDevBench: Evaluating Agentic Capabilities Through Game Development. Use this skill to scaffold Node.js experiments based on the... It is an AI Agent Skill for Claude Code / OpenClaw, with 665 downloads so far.

How do I install Arxiv Gamedevbench Evaluating Agentic Capabili?

Run "/install arxiv-gamedevbench-evaluating-agentic-capabili" in the OpenClaw or Claude Code chat to install it in one step; no extra setup is required.

Is Arxiv Gamedevbench Evaluating Agentic Capabili free?

Yes, Arxiv Gamedevbench Evaluating Agentic Capabili is completely free (open-source). You can download, install and use it at no cost.

Which platforms does Arxiv Gamedevbench Evaluating Agentic Capabili support?

Arxiv Gamedevbench Evaluating Agentic Capabili is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created Arxiv Gamedevbench Evaluating Agentic Capabili?

It is built and maintained by WANGJUNJIE (@wanng-ide); the current version is v1.0.0.
