Arxiv Gamedevbench Evaluating Agentic Capabili
/install arxiv-gamedevbench-evaluating-agentic-capabili
arxiv-gamedevbench-evaluating-agentic-capabili
Source
- Paper key: 44f3ad505bee7a5c25a60d2a3686cb7e
- Title: GameDevBench: Evaluating Agentic Capabilities Through Game Development
- Categories: cs.AI,cs.CL,cs.SE
Learned insight
Despite rapid progress on coding agents, progress on their multimodal counterparts has lagged behind. A key challenge is the scarcity of evaluation testbeds that combine the complexity of software development with the need for deep multimodal understanding. Game development provides such a testbed as agents must navigate large, dense codebases while manipulating intrinsically multimodal assets such as shaders, sprites, and animations within a visual game scene. We present GameDevBench, the first
Node.js implementation entry
node {baseDir}/scripts/run.js
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install arxiv-gamedevbench-evaluating-agentic-capabili - 安装完成后,直接呼叫该 Skill 的名称或使用
/arxiv-gamedevbench-evaluating-agentic-capabili触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
Arxiv Gamedevbench Evaluating Agentic Capabili 是什么?
Learned from arXiv paper GameDevBench: Evaluating Agentic Capabilities Through Game Development. Use this skill to scaffold Node.js experiments based on the... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 665 次。
如何安装 Arxiv Gamedevbench Evaluating Agentic Capabili?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install arxiv-gamedevbench-evaluating-agentic-capabili」即可一键安装,无需额外配置。
Arxiv Gamedevbench Evaluating Agentic Capabili 是免费的吗?
是的,Arxiv Gamedevbench Evaluating Agentic Capabili 完全免费(开源免费),可自由下载、安装和使用。
Arxiv Gamedevbench Evaluating Agentic Capabili 支持哪些平台?
Arxiv Gamedevbench Evaluating Agentic Capabili 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 Arxiv Gamedevbench Evaluating Agentic Capabili?
由 WANGJUNJIE(@wanng-ide)开发并维护,当前版本 v1.0.0。