← 返回 Skills 市场
mingo-318

Image Deduplicator

作者 Mingo_318 · GitHub ↗ · v1.0.0
cross-platform ⚠ suspicious
322
总下载
0
收藏
1
当前安装
1
版本数
在 OpenClaw 中安装
/install image-deduplicator
功能描述
Detect and remove exact or similar duplicate images in folders using perceptual and MD5 hashing with configurable similarity and actions.
使用说明 (SKILL.md)

Image Deduplicator

Find and remove duplicate or similar images in a folder using perceptual hashing. Use when user wants to clean up duplicate images, find near-duplicates, or deduplicate an image dataset.

Features

  • Exact Duplicates: Find images with identical content
  • Similar Images: Detect visually similar images (threshold configurable)
  • Hash-based: Fast MD5 hashing for exact duplicates
  • Perceptual Hash: pHash for finding similar images
  • Batch Processing: Process large image folders
  • Multiple Actions: List, delete, or move duplicates

Usage

# Find exact duplicates
python scripts/dedupe.py scan /path/to/images/

# Find similar images (90% similarity)
python scripts/dedupe.py scan /path/to/images/ --threshold 90

# Delete duplicates (keeps first occurrence)
python scripts/dedupe.py scan /path/to/images/ --action delete

# Move duplicates to a folder
python scripts/dedupe.py scan /path/to/images/ --action move --output /path/to/dupes/

Examples

$ python scripts/dedupe.py scan ./images/

Scanning images...
Found 150 images
Computing hashes...
Found 5 duplicate groups:

Group 1 (3 files):
  ./images/photo1.jpg
  ./images/photo1_copy.jpg
  ./images/photo1_final.jpg

Group 2 (2 files):
  ./images/screenshot.png
  ./images/screenshot (1).png

Total: 5 duplicate groups, 8 duplicate files

Installation

pip install pillow imagehash

Options

  • --threshold: Similarity threshold (0-100), default: 100 (exact)
  • --action: What to do with duplicates (list, delete, move)
  • --output: Output folder for --action move
  • --extensions: File extensions to scan (default: jpg,jpeg,png,bmp)
安全使用建议
This skill is coherent with its stated purpose (finding and removing duplicate image files) and does not request credentials or network access. However, the documentation claims a configurable similarity threshold (e.g., 90% similarity) while the included script does not implement a similarity-distance comparison — it only groups images whose perceptual hash strings are identical. Before using destructive actions (delete/move): 1) run with --action list to review groups, 2) back up your images or test on a small subset, 3) inspect or modify the script if you need true similarity (implement Hamming-distance comparisons of phash values), and 4) ensure pillow and imagehash are installed in a controlled environment. If you rely on the threshold feature, treat the current implementation as incomplete and contact the author or modify the code to compute and compare hash distances rather than string equality.
功能分析
Type: OpenClaw Skill Name: image-deduplicator Version: 1.0.0 The skill bundle provides a legitimate utility for finding and removing duplicate images using MD5 and perceptual hashing (pHash). The Python script `scripts/dedupe.py` performs standard file operations such as directory traversal, hashing, and file deletion/movement based on explicit user commands, and it includes a confirmation prompt before deleting files.
能力评估
Purpose & Capability
Name and description match the provided code and SKILL.md: the script scans folders, computes MD5 and perceptual hashes, and can list/delete/move duplicate files. The required capabilities (none) are proportionate to the task.
Instruction Scope
SKILL.md claims a configurable similarity threshold (e.g., 90%) for finding similar images. The script only uses the threshold to decide whether to call perceptual hashing (use_phash when threshold < 100) but then groups files by exact hash string equality. There is no comparison of Hamming distance or other similarity measure to support a numeric threshold. This is a functional mismatch: the instructions promise near-duplicate detection by threshold but the code only finds exact hash matches (or identical perceptual hash strings).
Install Mechanism
There is no install spec (instruction-only install). SKILL.md recommends 'pip install pillow imagehash', which is reasonable. No remote downloads or obscure installers are used. Because the code is included, nothing is written to disk by an installer beyond normal package installation.
Credentials
The skill requests no environment variables, no credentials, and references only local file system paths supplied by the user. This is proportionate to an image dedupe tool.
Persistence & Privilege
always is false and the skill does not request persistent system privileges or modify other skill configurations. It runs as a user-invoked CLI script and performs local file operations only.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install image-deduplicator
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /image-deduplicator 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
- Initial release of Image Deduplicator. - Find and remove exact or similar images using MD5 and perceptual hash. - Configurable similarity threshold for near-duplicate detection. - Supports batch processing and large image folders. - Multiple actions for duplicates: list, delete, or move. - Easy CLI usage with clear options for customizing scans.
元数据
Slug image-deduplicator
版本 1.0.0
许可证
累计安装 1
当前安装数 1
历史版本数 1
常见问题

Image Deduplicator 是什么?

Detect and remove exact or similar duplicate images in folders using perceptual and MD5 hashing with configurable similarity and actions. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 322 次。

如何安装 Image Deduplicator?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install image-deduplicator」即可一键安装,无需额外配置。

Image Deduplicator 是免费的吗?

是的,Image Deduplicator 完全免费(开源免费),可自由下载、安装和使用。

Image Deduplicator 支持哪些平台?

Image Deduplicator 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Image Deduplicator?

由 Mingo_318(@mingo-318)开发并维护,当前版本 v1.0.0。

💬 留言讨论