Dedupe
/install dedupe
Dedupe — Data Deduplication Reference
Quick-reference skill for deduplication strategies, algorithms, and data quality patterns.
When to Use
- Removing duplicate rows from datasets or databases
- Deduplicating files in storage systems
- Implementing fuzzy matching for near-duplicate detection
- Choosing between exact and probabilistic dedup methods
- Building ETL pipelines with deduplication stages
Commands
intro
scripts/script.sh intro
Overview of deduplication — types, strategies, and tradeoffs.
exact
scripts/script.sh exact
Exact deduplication — hash-based, key-based, and sorting approaches.
fuzzy
scripts/script.sh fuzzy
Fuzzy deduplication — similarity measures, blocking, and record linkage.
files
scripts/script.sh files
File-level deduplication — fdupes, jdupes, rdfind, and storage dedup.
algorithms
scripts/script.sh algorithms
Dedup algorithms — bloom filters, HyperLogLog, MinHash, SimHash.
sql
scripts/script.sh sql
SQL deduplication patterns — ROW_NUMBER, DISTINCT, GROUP BY strategies.
cli
scripts/script.sh cli
Command-line dedup tools — sort, uniq, awk, and stream processing.
checklist
scripts/script.sh checklist
Deduplication quality checklist and validation steps.
help
scripts/script.sh help
version
scripts/script.sh version
Configuration
| Variable | Description |
|---|---|
DEDUPE_DIR |
Data directory (default: ~/.dedupe/) |
Powered by BytesAgain | bytesagain.com | [email protected]
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install dedupe - After installation, invoke the skill by name or use
/dedupe - Provide required inputs per the skill's parameter spec and get structured output
What is Dedupe?
Deduplication reference — exact matching, fuzzy matching, hash-based dedup, bloom filters, and data quality. Use when removing duplicate records, files, or d... It is an AI Agent Skill for Claude Code / OpenClaw, with 178 downloads so far.
How do I install Dedupe?
Run "/install dedupe" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is Dedupe free?
Yes, Dedupe is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does Dedupe support?
Dedupe is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created Dedupe?
It is built and maintained by bytesagain4 (@xueyetianya); the current version is v1.0.0.