← Back to Skills Marketplace

NVIDIA Kimi Vision

Name: NVIDIA Kimi Vision
Author: miladnoo

by Quiz Due · GitHub ↗ · v1.0.3

cross-platform ⚠ suspicious

892

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install nvidia-kimi-vision

Description

Analyze images using NVIDIA Kimi K2.5 vision model via NVIDIA NIM API. Perfect for adding vision to non-vision models like MiniMax M2.5, GLM-5, or any model...

Usage Guidance

This skill appears to do what it says: it base64-encodes an image and sends it to NVIDIA's inference API using a Kimi model. Before installing or running it: 1) Inspect the script (it is included) and confirm the endpoint and model match the NVIDIA service you intend to use. 2) Install the Python requests package (preferably in a virtualenv). 3) Create a dedicated/limited NVIDIA API key (do not reuse sensitive production keys) and consider storing it in a secure secrets manager rather than a plaintext file; if you must use the file, restrict its permissions (chmod 600). 4) Be aware of a minor bug: extension parsing in the script expects bare extensions (e.g., 'jpg') but uses os.path.splitext which returns '.jpg' so the MIME mapping falls back to image/jpeg; this is a correctness issue, not an obvious security issue. 5) Avoid sending images that contain highly sensitive data unless you trust the endpoint and key. If you want stronger assurance, run the script in an isolated environment and/or modify it to read the API key from an OS credential store.

Capability Analysis

Type: OpenClaw Skill Name: nvidia-kimi-vision Version: 1.0.3 The `scripts/analyze_image.py` file contains an information disclosure vulnerability. The `get_api_key` function, when provided with a third command-line argument, attempts to read that argument as a file path if it exists. This allows an attacker to specify a path to an arbitrary sensitive file (e.g., `~/.ssh/id_rsa`, `/etc/passwd`). The content of this file is then used as the API key and sent in the `Authorization` header to NVIDIA's API endpoint (`https://integrate.api.nvidia.com`). While the data is sent to a legitimate third party (NVIDIA), this constitutes an unauthorized information disclosure risk.

Capability Assessment

✓ Purpose & Capability

Name/description match the actual behavior: the script encodes an image and POSTs it to an NVIDIA inference endpoint (integrate.api.nvidia.com) using a Kimi model. Required runtime (python3 and the requests library) is consistent with the code.

⚠ Instruction Scope

The SKILL.md and script instruct reading a user API key from ~/.config/nvidia-kimi-api-key (or accepting it inline). The skill does not read any other system files, but it references a config file path that is not declared in the registry metadata (requires.config). That mismatch should be declared so users know the skill will read a file from their home directory.

ℹ Install Mechanism

This is an instruction-only skill with no install spec (lowest risk). The SKILL.md metadata contains an 'install' entry labeled 'Install Python dependencies (requests)' but provides no command; the included script requires the requests package, so users must install it manually or the skill will fail.

ℹ Credentials

No environment variables or cloud credentials are required by the registry metadata; the script instead expects an NVIDIA API key stored in a user config file or passed inline. Storing the key as plaintext in ~/.config/nvidia-kimi-api-key is functional but not ideal—this is proportional to the skill's purpose but should be noted as a security consideration.

✓ Persistence & Privilege

The skill does not request permanent/always-on presence, does not modify other skills or system-wide settings, and only reads/writes the user-provided config path. Autonomous invocation is enabled by default but is not combined with broad or unexplained privileges.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install nvidia-kimi-vision
After installation, invoke the skill by name or use /nvidia-kimi-vision
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.3

Republished

v1.0.2

Added clear setup instructions

v1.0.1

Added SEO keywords for non-vision models

v1.0.0

Initial release

Metadata

Slug nvidia-kimi-vision

Version 1.0.3

License —

All-time Installs 2

Active Installs 2

Total Versions 4

Frequently Asked Questions

What is NVIDIA Kimi Vision?

Analyze images using NVIDIA Kimi K2.5 vision model via NVIDIA NIM API. Perfect for adding vision to non-vision models like MiniMax M2.5, GLM-5, or any model... It is an AI Agent Skill for Claude Code / OpenClaw, with 892 downloads so far.

How do I install NVIDIA Kimi Vision?

Run "/install nvidia-kimi-vision" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is NVIDIA Kimi Vision free?

Yes, NVIDIA Kimi Vision is completely free (open-source). You can download, install and use it at no cost.

Which platforms does NVIDIA Kimi Vision support?

NVIDIA Kimi Vision is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created NVIDIA Kimi Vision?

It is built and maintained by Quiz Due (@miladnoo); the current version is v1.0.3.

More Skills