← 返回 Skills 市场
tiktokdad

OpenClaw VLN Planner

作者 TIKTOKDAD · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ✓ 安全检测通过
131
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install openclaw-vln-planner
功能描述
Plan the next high-level navigation step for a robot from a user navigation instruction, one current image, and a sequence of historical images. Use when the...
安全使用建议
This skill appears to do what it says: it will read local camera frame files, base64-encode them, and POST them (with the model API key from a YAML config) to whatever OpenAI-compatible gateway you configure. Before installing: (1) confirm you trust the gateway endpoint and operator because camera images and any scene data will be transmitted; (2) store the API key securely (the example uses a config file rather than env vars) and update the manifest if you need policy/audit visibility; (3) keep executor in dry_run while testing and review/replace the placeholder execute_* functions so the planner cannot command hardware until you've integrated a vetted execution bridge; (4) if you require stricter telemetry controls, inspect/modify image_to_data_url and build_messages to avoid sending raw images or to anonymize them. The small manifest omission (no declared required config path/credential) is not malicious but worth correcting for clarity and safety.
功能分析
Type: OpenClaw Skill Name: openclaw-vln-planner Version: 1.0.0 The skill bundle is a legitimate vision-language navigation (VLN) planner designed to convert camera observations and user instructions into robot movement commands. The Python bridge (`scripts/vln_bridge.py`) implements standard multimodal LLM integration using the `requests` library and includes robust validation logic, such as movement distance/angle bounds and safety fallbacks. No evidence of data exfiltration, malicious execution, or harmful prompt injection was found; the file access and network activity are strictly aligned with the stated purpose of processing image frames and communicating with an AI gateway.
能力评估
Purpose & Capability
Name/description, SKILL.md, config, and vln_bridge.py all align: the planner builds a prompt from historical+current images and an instruction, queries a multimodal model, parses JSON, validates bounds, and forwards a mid-level action. Network access to a model and reading image files are expected for this purpose.
Instruction Scope
Runtime instructions and the Python bridge explicitly read image files, load a YAML config containing model base_url/api_key/model_id, and send base64-encoded images to the configured OpenAI-compatible gateway. This behavior is necessary for a multimodal planner but means camera frames (potentially sensitive) are transmitted to an external service. SKILL.md and code limit outputs to pure JSON and define safety fallbacks.
Install Mechanism
This is an instruction-only skill with a small example Python bridge; there is no install spec, no external downloads, and only a minimal requirements.txt (requests, PyYAML). No extraction from arbitrary URLs or package installs are present.
Credentials
The package does not declare required env vars or primary credentials, but the bridge requires a model base_url and api_key in a YAML config file (config/vln-config.yaml). Expect to provide credentials to the model gateway; that is proportional to the task but the manifest omission of a required config/credential declaration is a small inconsistency to be aware of.
Persistence & Privilege
The skill does not request persistent/system privileges, does not set always:true, and has no install actions that modify other skills or system-wide settings. The bridge runs as a standalone script and prints dry-run execution by default.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install openclaw-vln-planner
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /openclaw-vln-planner 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
Initial release of OpenClaw VLN Planner. - Introduces a high-level vision-language navigation planner for robots based on visual observations and natural-language user instructions. - Outputs a single, validated mid-level navigation action as pure JSON (MOVE_FORWARD, TURN_LEFT, TURN_RIGHT, STOP) for each input. - Integrates with any OpenAI-compatible multimodal gateway for action prediction using current and historical frames. - Includes strict safety fallback rules—defaults to STOP on uncertainty, parse failure, or safety concerns. - Provides clear input requirements, output contract, and runtime configuration via external YAML. - Bundles schema reference, config template, and a bridge script for execution.
元数据
Slug openclaw-vln-planner
版本 1.0.0
许可证 MIT-0
累计安装 0
当前安装数 0
历史版本数 1
常见问题

OpenClaw VLN Planner 是什么?

Plan the next high-level navigation step for a robot from a user navigation instruction, one current image, and a sequence of historical images. Use when the... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 131 次。

如何安装 OpenClaw VLN Planner?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install openclaw-vln-planner」即可一键安装,无需额外配置。

OpenClaw VLN Planner 是免费的吗?

是的,OpenClaw VLN Planner 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

OpenClaw VLN Planner 支持哪些平台?

OpenClaw VLN Planner 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 OpenClaw VLN Planner?

由 TIKTOKDAD(@tiktokdad)开发并维护,当前版本 v1.0.0。

💬 留言讨论