yolo-vision-tools

Name: yolo-vision-tools
Author: ruoyu05

Description

Use Ultralytics YOLO to perform computer vision tasks, such as detecting people or objects in images and videos, classifying images, estimating human poses,...

Usage Guidance

What to consider before installing/using: - Review code: The repository includes scripts that run shell commands and probe the filesystem (searching for other Python interpreters and cached models). This is expected for an environment checker but read the scripts if you want reassurance. - Run in an isolated environment: Use a disposable VM or dedicated virtualenv/conda environment if you are worried about exposing environment details or interfering with system Python installs. - No secrets requested: The skill does not ask for API keys or credentials. Still, the scripts can reveal local paths, installed package versions, and GPU details — treat that information as potentially sensitive on shared machines. - Network behavior: SKILL.md examples show loading images from URLs and installing ultralytics with pip; when you run the skill, it may download models or fetch remote images if you pass URLs. Only use trusted model/image URLs. - Subprocess/shell usage: The check script uses subprocess.run with shell execution and executes other python interpreters to check for ultralytics. This is normal for a diagnostic tool but increases what the script can inspect. Avoid running it on systems where arbitrary command output must remain private. - Mitigations: Inspect/modify scripts to remove any checks you don't want, run them with restricted permissions, or run only the functions you trust. Prefer running 'pip install -U ultralytics' yourself and invoking well-known commands (yolo checks) rather than running every diagnostic script unmodified. If you want, I can point out the specific lines that probe your home directory, check other python interpreters, or execute shell commands so you can audit them more easily.

Capability Analysis

Type: OpenClaw Skill Name: yolo-vision-tools Version: 1.2.3 The yolo-vision-tools bundle is a comprehensive and well-structured set of utilities for the Ultralytics YOLO computer vision framework. It includes legitimate scripts for environment diagnostics (check_environment.py), dataset conversion (dataset_tools.py), and model management (model_utils.py). While the scripts utilize high-privilege operations such as subprocess execution and file system manipulation, these actions are strictly aligned with the stated purpose of managing machine learning environments and processing vision data. No evidence of data exfiltration, malicious prompt injection, or unauthorized persistence was found.

Capability Assessment

✓ Purpose & Capability

The name/description (YOLO vision tools) aligns with the included code and docs: model selection, dataset conversion, training helpers, and environment checks. The files and functions are directly relevant to the stated computer-vision tasks.

ℹ Instruction Scope

SKILL.md instructs installing ultralytics and running environment checks; the provided check_environment.py enumerates Python environments, executes external python interpreters, runs shell commands (via subprocess), and inspects ~/.cache/ultralytics for models. Those actions are reasonable for diagnosing YOLO setups but do extend beyond pure inference (they probe filesystem and other Python installs).

✓ Install Mechanism

There is no install spec; this is instruction- and script-based (no remote downloads baked into an installer). That reduces supply-chain risk. The SKILL.md suggests installing ultralytics via pip, which is expected for this purpose.

ℹ Credentials

The skill does not request credentials or environment variables. However, scripts read user paths (home directory caches, virtualenv locations), execute local Python interpreters, and run system commands (which may reveal environment details). These accesses are justifiable for environment checks but are broader than a minimal inference-only tool.

✓ Persistence & Privilege

The skill does not request persistent privileges, does not set always: true, and does not declare writing to other skills' config. It will save outputs to workspace/yolo-vision or user-specified project folders (normal behavior).

Version History

v1.2.3

**Version 1.2.3 Changelog** - Clarified where output images and videos are saved by default, specifying the `yolo-vision` folder. - Added instructions and examples for setting a custom output directory in both Python and CLI interfaces. - Updated comments and examples to reflect output directory changes for all major tasks (detection, segmentation, pose). - No breaking changes; usage and core functionality remain the same.

v1.2.2

**1.2.2 Changelog** - Added detailed, multilingual trigger phrases for all major YOLO vision tasks and model/environment FAQs to the description. - Clarified supported computer vision workflows: detection, segmentation, classification, pose estimation, and tracking. - Enhanced documentation for use cases—including both English and Chinese support—for more accurate skill triggering.

v1.2.1

- Fixed some script bugs.

v1.2.0

**YOLO Vision Tools v1.2.0 Changelog** - Completely restructured and streamlined documentation for clarity, more concise quick start, and simplified model/task selection workflows. - Replaced or consolidated previous references into new, focused guides: added resources for installation, environment checks, model names, task types, parameter reference, and dataset preparation. - Introduced a suite of Python utility scripts for environment checking, configuration/template generation, dataset handling, model selection, training, and quick functional testing. - Removed multiple legacy, redundant, or overlapping reference files and integrated key best practices directly into the main skill documentation. - Updated official description, task examples, and recommendations to align with the latest YOLO26 release and evolving best practices.

v1.1.2

- Update runtime security privacy guidelines.

v1.1.1

- 简化并更新了许可提醒内容。 - 优化了视频发送兼容性说明，明确要求输出MP4必须为H.264编码，并提供了检查与转换命令。 - 调整视频处理相关说明，强调高画质及避免MPEG-4编码问题。 - 更新了技能维护信息，补充了ClawHub平台相关链接。 - 精简了部分说明文字，内容更聚焦、条理更清晰。

v1.1.0

- Major documentation cleanup and structure update. - Merged and refactored multiple reference files for clearer organization (e.g. "quickstart.md" to "quick_start.md", "environment.md" to "install_environment.md", etc.). - Removed all standalone Python and deployment scripts from the package; replaced script instructions with documentation links. - Added new sections and files for security/privacy guidelines and clarified video processing, batch processing, and troubleshooting instructions. - Simplified usage guidance, clarified preferred models (YOLO26 series), and improved consistency in licensing reminders. - Streamlined response flow and best practice advice for users and AI assistants.

v1.0.1

**v1.0.1 Changelog (major restructuring)** - Added comprehensive reference documentation covering quickstart, model selection, deployment, troubleshooting, and YOLO-SAM integration. - Introduced multiple utility scripts for batch processing, visualization, web service deployment, Docker deployment, YOLO-SAM integration, and video conversion. - Added strict license compliance reminders: every use must notify users of YOLO (AGPL-3.0) and SAM license requirements before any guidance. - Replaced monolithic usage/readme files with modular references and scripts; removed redundant files. - Incorporated workflows for automatic video format conversion and detailed code examples for all supported YOLO and new SAM-based tasks. - Clarified that the default and only recommended model family is YOLO26; legacy version fallback and ambiguous usage no longer supported.

v1.0.0

yolo-vision-tools 1.0.0 - Initial Release - Provides a full suite of YOLO vision tasks: object detection, instance segmentation, image classification, pose estimation, and object tracking. - Strictly supports yolo26 series models (latest as of 2026); prevents automatic downgrade to earlier YOLO versions. - Automatic model alias support and mapping from older version names to yolo26 equivalents. - Integrates with OpenClaw for streamlined image/video analysis via chat commands. - Includes easy installation steps, GPU acceleration options, and extensive usage examples. - Offers clear error handling and automatic model version correction if old YOLO models are requested.

Metadata

Slug yolo-vision-tools

Version 1.2.3

License MIT-0

All-time Installs 2

Active Installs 2

Total Versions 9

Frequently Asked Questions

What is yolo-vision-tools?

Use Ultralytics YOLO to perform computer vision tasks, such as detecting people or objects in images and videos, classifying images, estimating human poses,... It is an AI Agent Skill for Claude Code / OpenClaw, with 562 downloads so far.

How do I install yolo-vision-tools?

Run "/install yolo-vision-tools" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is yolo-vision-tools free?

Yes, yolo-vision-tools is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does yolo-vision-tools support?

yolo-vision-tools is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created yolo-vision-tools?

It is built and maintained by Ruoyu (@ruoyu05); the current version is v1.2.3.

More Skills

What is yolo-vision-tools?

How do I install yolo-vision-tools?

Is yolo-vision-tools free?

Which platforms does yolo-vision-tools support?

Who created yolo-vision-tools?

💬 Comments