← Back to Skills Marketplace
aipoch-ai

Image Duplication Detector

by AIpoch · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ✓ Security Clean
136
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install image-duplication-detector
Description
Detect image duplication and tampering in manuscript figures using computer vision algorithms
README (SKILL.md)

Image Duplication Detector

ID: 195

Description

Uses Computer Vision (CV) algorithms to scan all images in paper manuscripts to detect potential duplication or local tampering (PS traces).

Usage

# Scan single PDF file
python scripts/main.py --input paper.pdf --output report.json

# Scan image folder
python scripts/main.py --input ./images/ --output report.json

# Specify similarity threshold (default 0.85)
python scripts/main.py --input paper.pdf --threshold 0.90 --output report.json

# Enable tampering detection
python scripts/main.py --input paper.pdf --detect-tampering --output report.json

# Generate visualization report
python scripts/main.py --input paper.pdf --visualize --output report.json

Parameters

Parameter Type Default Required Description
--input string - Yes Input PDF file or image folder path
--output string report.json No Output report path
--threshold float 0.85 No Similarity threshold (0-1), higher is stricter
--detect-tampering flag false No Enable tampering/PS trace detection
--visualize flag false No Generate visualization comparison images
--temp-dir string ./temp No Temporary file directory

Output Format

{
  "summary": {
    "total_images": 12,
    "duplicates_found": 2,
    "tampering_detected": 1,
    "processing_time": "3.5s"
  },
  "duplicates": [
    {
      "group_id": 1,
      "similarity": 0.98,
      "images": [
        {"page": 2, "index": 1, "path": "..."},
        {"page": 5, "index": 3, "path": "..."}
      ]
    }
  ],
  "tampering": [
    {
      "image": "page_3_img_2.png",
      "suspicious_regions": [
        {"x": 120, "y": 80, "width": 50, "height": 50, "confidence": 0.92}
      ]
    }
  ]
}

Requirements

opencv-python>=4.8.0
numpy>=1.24.0
Pillow>=10.0.0
PyPDF2>=3.0.0
pdf2image>=1.16.0
imagehash>=4.3.0
scikit-image>=0.21.0
matplotlib>=3.7.0

Algorithm Details

Duplication Detection

  • Perceptual Hashing: Uses pHash, dHash, aHash combination to detect visually similar images
  • Feature Matching: ORB feature point matching to verify similarity
  • SSIM: Structural similarity index as auxiliary verification

Tampering Detection

  • ELA (Error Level Analysis): Detects JPEG compression level inconsistencies
  • Noise Analysis: Noise pattern anomaly detection
  • Copy-Move Detection: Copy-move forgery detection
  • Lighting Inconsistency: Lighting consistency analysis

Example

from scripts.main import ImageDuplicationDetector

detector = ImageDuplicationDetector(
    threshold=0.85,
    detect_tampering=True
)

results = detector.scan("paper.pdf")
detector.save_report(results, "report.json")

Notes

  • Supports PDF, PNG, JPG, TIFF formats
  • Large files recommended for batch processing
  • Tampering detection may produce false positives, manual review recommended

Risk Assessment

Risk Indicator Assessment Level
Code Execution Python/R scripts executed locally Medium
Network Access No external API calls Low
File System Access Read input files, write output files Medium
Instruction Tampering Standard prompt guidelines Low
Data Exposure Output files saved to workspace Low

Security Checklist

  • No hardcoded credentials or API keys
  • No unauthorized file system access (../)
  • Output does not expose sensitive information
  • Prompt injection protections in place
  • Input file paths validated (no ../ traversal)
  • Output directory restricted to workspace
  • Script execution in sandboxed environment
  • Error messages sanitized (no stack traces exposed)
  • Dependencies audited

Prerequisites

# Python dependencies
pip install -r requirements.txt

Evaluation Criteria

Success Metrics

  • Successfully executes main functionality
  • Output meets quality standards
  • Handles edge cases gracefully
  • Performance is acceptable

Test Cases

  1. Basic Functionality: Standard input → Expected output
  2. Edge Case: Invalid input → Graceful error handling
  3. Performance: Large dataset → Acceptable processing time

Lifecycle Status

  • Current Stage: Draft
  • Next Review Date: 2026-03-06
  • Known Issues: None
  • Planned Improvements:
    • Performance optimization
    • Additional feature support
Usage Guidance
This skill appears to implement what it claims, but take these precautions before installing or running it: - Dependency hygiene: requirements.txt uses nonstandard names ('cv2', 'pil') and may be missing opencv-contrib (SIFT). Fix and pin package names (opencv-python, opencv-contrib-python, Pillow, proper versions) before pip install -r requirements.txt. - Run in an isolated environment (virtualenv, container) to avoid affecting your system Python and to contain file I/O. - Test on non-sensitive sample data first. The tool reads input files and writes temporary files and reports; ensure your input does not contain confidential images you cannot expose. - Audit the full script tail (file was truncated in the review) to confirm there are no unexpected network calls, telemetry, or external endpoints before trusting it with sensitive datasets. - Verify temp file handling and deletion (code deletes some temp files but double-check behavior on errors) and consider running with a controlled temp-dir. - Be aware of false positives in tampering detection; manual review of flagged regions is recommended. If you want, I can: (1) produce a corrected requirements.txt with appropriate package names and versions, (2) scan the rest of the script if you provide the truncated portion, or (3) suggest a safe containerized run command.
Capability Analysis
Type: OpenClaw Skill Name: image-duplication-detector Version: 1.0.0 The skill bundle is a legitimate tool for detecting image duplication and tampering in manuscripts using computer vision. The implementation in `scripts/main.py` uses standard libraries (OpenCV, PIL, imagehash) to perform Perceptual Hashing, Error Level Analysis (ELA), and feature matching (ORB/SIFT). No evidence of malicious intent, data exfiltration, or prompt injection was found; file system operations are restricted to reading inputs and writing reports/temporary files as described in `SKILL.md`.
Capability Assessment
Purpose & Capability
Name/description align with the included Python implementation: perceptual hashing, ORB/SIFT feature matching, ELA and copy-move detection. The requested operations (reading PDFs/images, writing reports, using CV libraries) are consistent with the stated purpose.
Instruction Scope
SKILL.md instructs only local processing of supplied PDFs/images and saving a report/visualizations. There are no instructions to read unrelated system files, access environment secrets, or call external endpoints. The code creates temporary files and writes outputs to workspace as expected.
Install Mechanism
No install spec is provided (lower risk), but requirements.txt contains inconsistent/invalid package names (e.g., 'cv2' and 'pil' instead of 'opencv-python' and 'Pillow') and the code uses SIFT (usually in opencv-contrib). This can lead to installation surprises; dependencies should be pinned and corrected.
Credentials
The skill declares no required environment variables, no credentials, and no config paths. The code only needs filesystem access to input/output/temp directories, which is proportionate to its functionality.
Persistence & Privilege
always is false and the skill does not request persistent privileges or modify other skills/system-wide settings. It runs locally when invoked and does not enable autonomous always-on behavior.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install image-duplication-detector
  3. After installation, invoke the skill by name or use /image-duplication-detector
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release of Image Duplication Detector. - Detects image duplication and possible tampering in manuscript figures using computer vision. - Supports PDF, PNG, JPG, and TIFF input formats; outputs JSON summary and detailed results. - Offers command-line options for similarity threshold, tampering detection, and result visualization. - Utilizes perceptual hashing, feature matching, and multiple image forensics methods (ELA, copy-move, noise, lighting analysis). - Includes risk assessment, security and quality criteria, and documented usage instructions.
Metadata
Slug image-duplication-detector
Version 1.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is Image Duplication Detector?

Detect image duplication and tampering in manuscript figures using computer vision algorithms. It is an AI Agent Skill for Claude Code / OpenClaw, with 136 downloads so far.

How do I install Image Duplication Detector?

Run "/install image-duplication-detector" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Image Duplication Detector free?

Yes, Image Duplication Detector is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Image Duplication Detector support?

Image Duplication Detector is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Image Duplication Detector?

It is built and maintained by AIpoch (@aipoch-ai); the current version is v1.0.0.

💬 Comments