← Back to Skills Marketplace

Arxiv Gamedevbench Evaluating Agentic Capabili

Name: Arxiv Gamedevbench Evaluating Agentic Capabili
Author: wanng-ide

by WANGJUNJIE · GitHub ↗ · v1.0.0

cross-platform ✓ Security Clean

665

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install arxiv-gamedevbench-evaluating-agentic-capabili

Description

Learned from arXiv paper GameDevBench: Evaluating Agentic Capabilities Through Game Development. Use this skill to scaffold Node.js experiments based on the...

README (SKILL.md)

arxiv-gamedevbench-evaluating-agentic-capabili

Source

Paper key: 44f3ad505bee7a5c25a60d2a3686cb7e
Title: GameDevBench: Evaluating Agentic Capabilities Through Game Development
Categories: cs.AI,cs.CL,cs.SE

Learned insight

Despite rapid progress on coding agents, progress on their multimodal counterparts has lagged behind. A key challenge is the scarcity of evaluation testbeds that combine the complexity of software development with the need for deep multimodal understanding. Game development provides such a testbed as agents must navigate large, dense codebases while manipulating intrinsically multimodal assets such as shaders, sprites, and animations within a visual game scene. We present GameDevBench, the first

Node.js implementation entry

node {baseDir}/scripts/run.js

Usage Guidance

This skill appears coherent and low-risk: it’s an auto-generated Node.js scaffold that only prints a paper summary and a TODO. Before running, inspect the bundled files (already included) to confirm there are no added network calls or secret reads. Execute in a sandbox or isolated environment if you want additional caution. If you plan to extend the scaffold, review package.json before adding dependencies and avoid running it with elevated privileges.

Capability Analysis

Type: OpenClaw Skill Name: arxiv-gamedevbench-evaluating-agentic-capabili Version: 1.0.0 The skill bundle is a simple Node.js scaffold that prints information about an arXiv paper. All files, including `SKILL.md`, `index.js`, and `scripts/run.js`, contain only benign code and documentation. There are no indications of data exfiltration, malicious execution, persistence mechanisms, prompt injection attempts against the agent, or obfuscation. The `SKILL.md` explicitly requires the `node` binary, which is appropriate for a Node.js skill, and the `scripts/run.js` file clearly states it's a 'runnable scaffold' with a 'TODO' for further implementation.

Capability Assessment

✓ Purpose & Capability

Name/description claim Node.js scaffolding for the GameDevBench paper and the skill only requires the node binary; included files (index.js, scripts/run.js, package.json, paper.json) are consistent with that purpose.

✓ Instruction Scope

SKILL.md instructs running scripts/run.js. The included run.js only logs metadata and a truncated abstract; there are no instructions to read unrelated files, access environment variables, or send data to external endpoints.

✓ Install Mechanism

No install spec provided (instruction-only). There are local code files bundled, but no downloads, package installs, or extract steps. package.json has no dependencies, so nothing is pulled at runtime beyond node.

✓ Credentials

The skill declares no required environment variables or credentials and the code does not access process.env; requested privileges are minimal and appropriate for a local Node.js scaffold.

✓ Persistence & Privilege

always is false and the skill does not persist configuration or modify other skills/system settings. It only exposes a main() that prints to stdout.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install arxiv-gamedevbench-evaluating-agentic-capabili
After installation, invoke the skill by name or use /arxiv-gamedevbench-evaluating-agentic-capabili
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.0

- Initial release of arxiv-gamedevbench-evaluating-agentic-capabili skill. - Implements scaffolding for Node.js experiments inspired by the "GameDevBench" paper. - Focuses on evaluating agentic capabilities in game development, including code and multimodal asset manipulation. - Requires Node.js runtime for operation.

Metadata

Slug arxiv-gamedevbench-evaluating-agentic-capabili

Version 1.0.0

License —

All-time Installs 0

Active Installs 0

Total Versions 1

Frequently Asked Questions

What is Arxiv Gamedevbench Evaluating Agentic Capabili?

Learned from arXiv paper GameDevBench: Evaluating Agentic Capabilities Through Game Development. Use this skill to scaffold Node.js experiments based on the... It is an AI Agent Skill for Claude Code / OpenClaw, with 665 downloads so far.

How do I install Arxiv Gamedevbench Evaluating Agentic Capabili?

Run "/install arxiv-gamedevbench-evaluating-agentic-capabili" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Arxiv Gamedevbench Evaluating Agentic Capabili free?

Yes, Arxiv Gamedevbench Evaluating Agentic Capabili is completely free (open-source). You can download, install and use it at no cost.

Which platforms does Arxiv Gamedevbench Evaluating Agentic Capabili support?

Arxiv Gamedevbench Evaluating Agentic Capabili is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Arxiv Gamedevbench Evaluating Agentic Capabili?

It is built and maintained by WANGJUNJIE (@wanng-ide); the current version is v1.0.0.

More Skills