← Back to Skills Marketplace

Image Duplication Detector

Name: Image Duplication Detector
Author: aipoch-ai

by AIpoch · GitHub ↗ · v1.0.0 · MIT-0

cross-platform ✓ Security Clean

136

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install image-duplication-detector

Description

Detect image duplication and tampering in manuscript figures using computer vision algorithms

README (SKILL.md)

Image Duplication Detector

ID: 195

Description

Uses Computer Vision (CV) algorithms to scan all images in paper manuscripts to detect potential duplication or local tampering (PS traces).

Usage

# Scan single PDF file
python scripts/main.py --input paper.pdf --output report.json

# Scan image folder
python scripts/main.py --input ./images/ --output report.json

# Specify similarity threshold (default 0.85)
python scripts/main.py --input paper.pdf --threshold 0.90 --output report.json

# Enable tampering detection
python scripts/main.py --input paper.pdf --detect-tampering --output report.json

# Generate visualization report
python scripts/main.py --input paper.pdf --visualize --output report.json

Parameters

Parameter	Type	Default	Required	Description
`--input`	string	-	Yes	Input PDF file or image folder path
`--output`	string	report.json	No	Output report path
`--threshold`	float	0.85	No	Similarity threshold (0-1), higher is stricter
`--detect-tampering`	flag	false	No	Enable tampering/PS trace detection
`--visualize`	flag	false	No	Generate visualization comparison images
`--temp-dir`	string	./temp	No	Temporary file directory

Output Format

{
  "summary": {
    "total_images": 12,
    "duplicates_found": 2,
    "tampering_detected": 1,
    "processing_time": "3.5s"
  },
  "duplicates": [
    {
      "group_id": 1,
      "similarity": 0.98,
      "images": [
        {"page": 2, "index": 1, "path": "..."},
        {"page": 5, "index": 3, "path": "..."}
      ]
    }
  ],
  "tampering": [
    {
      "image": "page_3_img_2.png",
      "suspicious_regions": [
        {"x": 120, "y": 80, "width": 50, "height": 50, "confidence": 0.92}
      ]
    }
  ]
}

Requirements

opencv-python>=4.8.0
numpy>=1.24.0
Pillow>=10.0.0
PyPDF2>=3.0.0
pdf2image>=1.16.0
imagehash>=4.3.0
scikit-image>=0.21.0
matplotlib>=3.7.0

Algorithm Details

Duplication Detection

Perceptual Hashing: Uses pHash, dHash, aHash combination to detect visually similar images
Feature Matching: ORB feature point matching to verify similarity
SSIM: Structural similarity index as auxiliary verification

Tampering Detection

ELA (Error Level Analysis): Detects JPEG compression level inconsistencies
Noise Analysis: Noise pattern anomaly detection
Copy-Move Detection: Copy-move forgery detection
Lighting Inconsistency: Lighting consistency analysis

Example

from scripts.main import ImageDuplicationDetector

detector = ImageDuplicationDetector(
    threshold=0.85,
    detect_tampering=True
)

results = detector.scan("paper.pdf")
detector.save_report(results, "report.json")

Notes

Supports PDF, PNG, JPG, TIFF formats
Large files recommended for batch processing
Tampering detection may produce false positives, manual review recommended

Risk Assessment

Risk Indicator	Assessment	Level
Code Execution	Python/R scripts executed locally	Medium
Network Access	No external API calls	Low
File System Access	Read input files, write output files	Medium
Instruction Tampering	Standard prompt guidelines	Low
Data Exposure	Output files saved to workspace	Low

Security Checklist

No hardcoded credentials or API keys
No unauthorized file system access (../)
Output does not expose sensitive information
Prompt injection protections in place
Input file paths validated (no ../ traversal)
Output directory restricted to workspace
Script execution in sandboxed environment
Error messages sanitized (no stack traces exposed)
Dependencies audited

Prerequisites

# Python dependencies
pip install -r requirements.txt

Evaluation Criteria

Success Metrics

Successfully executes main functionality
Output meets quality standards
Handles edge cases gracefully
Performance is acceptable

Test Cases

Basic Functionality: Standard input → Expected output
Edge Case: Invalid input → Graceful error handling
Performance: Large dataset → Acceptable processing time

Lifecycle Status

Current Stage: Draft
Next Review Date: 2026-03-06
Known Issues: None
Planned Improvements:
- Performance optimization
- Additional feature support

Usage Guidance

This skill appears to implement what it claims, but take these precautions before installing or running it: - Dependency hygiene: requirements.txt uses nonstandard names ('cv2', 'pil') and may be missing opencv-contrib (SIFT). Fix and pin package names (opencv-python, opencv-contrib-python, Pillow, proper versions) before pip install -r requirements.txt. - Run in an isolated environment (virtualenv, container) to avoid affecting your system Python and to contain file I/O. - Test on non-sensitive sample data first. The tool reads input files and writes temporary files and reports; ensure your input does not contain confidential images you cannot expose. - Audit the full script tail (file was truncated in the review) to confirm there are no unexpected network calls, telemetry, or external endpoints before trusting it with sensitive datasets. - Verify temp file handling and deletion (code deletes some temp files but double-check behavior on errors) and consider running with a controlled temp-dir. - Be aware of false positives in tampering detection; manual review of flagged regions is recommended. If you want, I can: (1) produce a corrected requirements.txt with appropriate package names and versions, (2) scan the rest of the script if you provide the truncated portion, or (3) suggest a safe containerized run command.

Capability Analysis

Type: OpenClaw Skill Name: image-duplication-detector Version: 1.0.0 The skill bundle is a legitimate tool for detecting image duplication and tampering in manuscripts using computer vision. The implementation in `scripts/main.py` uses standard libraries (OpenCV, PIL, imagehash) to perform Perceptual Hashing, Error Level Analysis (ELA), and feature matching (ORB/SIFT). No evidence of malicious intent, data exfiltration, or prompt injection was found; file system operations are restricted to reading inputs and writing reports/temporary files as described in `SKILL.md`.

Capability Assessment

✓ Purpose & Capability

Name/description align with the included Python implementation: perceptual hashing, ORB/SIFT feature matching, ELA and copy-move detection. The requested operations (reading PDFs/images, writing reports, using CV libraries) are consistent with the stated purpose.

✓ Instruction Scope

SKILL.md instructs only local processing of supplied PDFs/images and saving a report/visualizations. There are no instructions to read unrelated system files, access environment secrets, or call external endpoints. The code creates temporary files and writes outputs to workspace as expected.

ℹ Install Mechanism

No install spec is provided (lower risk), but requirements.txt contains inconsistent/invalid package names (e.g., 'cv2' and 'pil' instead of 'opencv-python' and 'Pillow') and the code uses SIFT (usually in opencv-contrib). This can lead to installation surprises; dependencies should be pinned and corrected.

✓ Credentials

The skill declares no required environment variables, no credentials, and no config paths. The code only needs filesystem access to input/output/temp directories, which is proportionate to its functionality.

✓ Persistence & Privilege

always is false and the skill does not request persistent privileges or modify other skills/system-wide settings. It runs locally when invoked and does not enable autonomous always-on behavior.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install image-duplication-detector
After installation, invoke the skill by name or use /image-duplication-detector
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.0

Initial release of Image Duplication Detector. - Detects image duplication and possible tampering in manuscript figures using computer vision. - Supports PDF, PNG, JPG, and TIFF input formats; outputs JSON summary and detailed results. - Offers command-line options for similarity threshold, tampering detection, and result visualization. - Utilizes perceptual hashing, feature matching, and multiple image forensics methods (ELA, copy-move, noise, lighting analysis). - Includes risk assessment, security and quality criteria, and documented usage instructions.

Metadata

Slug image-duplication-detector

Version 1.0.0

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 1

Frequently Asked Questions

What is Image Duplication Detector?

Detect image duplication and tampering in manuscript figures using computer vision algorithms. It is an AI Agent Skill for Claude Code / OpenClaw, with 136 downloads so far.

How do I install Image Duplication Detector?

Run "/install image-duplication-detector" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Image Duplication Detector free?

Yes, Image Duplication Detector is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Image Duplication Detector support?

Image Duplication Detector is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Image Duplication Detector?

It is built and maintained by AIpoch (@aipoch-ai); the current version is v1.0.0.

More Skills

Image Duplication Detector

Image Duplication Detector

Description

Usage

Parameters

Output Format

Requirements

Algorithm Details

Duplication Detection

Tampering Detection

Example

Notes

Risk Assessment

Security Checklist

Prerequisites

Evaluation Criteria

Success Metrics

Test Cases

Lifecycle Status

What is Image Duplication Detector?

How do I install Image Duplication Detector?

Is Image Duplication Detector free?

Which platforms does Image Duplication Detector support?

Who created Image Duplication Detector?

💬 Comments