Data Cleaner Skill
by shenghoo123-png · v1.1.0 · MIT-0
271 Downloads · 0 Stars · 1 Active Installs · 4 Versions
Install in OpenClaw
/install data-cleaner-tool
Description
AI-powered tool for cleaning Excel/CSV data by removing duplicates, handling missing values, standardizing formats, detecting outliers, and batch processing...
README (SKILL.md)
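Data Cleaner Pro - Data Cleaning Tool
Overview
An AI-driven data cleaning tool that automatically handles duplicates, missing values, formatting errors, and similar problems in Excel/CSV data.
Use cases: cleaning operations reports, financial data, and customer lists
Features
- ✅ Smart deduplication - keep the latest or earliest record
- ✅ Missing-value handling - fill, drop, or predict
- ✅ Format standardization - phone numbers, email addresses, dates, and more
- ✅ Outlier detection - statistical and rule-based methods
- ✅ Batch processing - clean multiple files at once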
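To make the deduplication, missing-value, and outlier features more concrete, here is a minimal pandas sketch. It is not the skill's actual code: the function names, the column names (`customer_id`, `updated_at`, `amount`), and the choice of the IQR rule as the "statistical method" are all assumptions made for illustration, and the "predict" fill strategy is not covered.

```python
import pandas as pd


def dedup_keep(df, key, order_by, keep="latest"):
    """Drop duplicate rows by `key`, keeping the latest or earliest row by `order_by`."""
    ordered = df.sort_values(order_by, kind="stable")
    pandas_keep = "last" if keep == "latest" else "first"
    # sort_index() restores the original row order after deduplication
    return ordered.drop_duplicates(subset=key, keep=pandas_keep).sort_index()


def fill_numeric_mean(df):
    """Fill missing values in numeric columns with each column's mean."""
    out = df.copy()
    numeric_cols = out.select_dtypes(include="number").columns
    out[numeric_cols] = out[numeric_cols].fillna(out[numeric_cols].mean())
    return out


def iqr_outliers(series, k=1.5):
    """Return a boolean mask for values outside [Q1 - k*IQR, Q3 + k*IQR]."""
    q1, q3 = series.quantile(0.25), series.quantile(0.75)
    iqr = q3 - q1
    return (series < q1 - k * iqr) | (series > q3 + k * iqr)


# Hypothetical sample data: customer 1 appears twice, one amount is missing.
df = pd.DataFrame({
    "customer_id": [1, 1, 2, 3, 4, 5],
    "updated_at": pd.to_datetime([
        "2024-01-01", "2024-03-01", "2024-02-01",
        "2024-02-05", "2024-02-07", "2024-02-09",
    ]),
    "amount": [10.0, 12.0, None, 11.0, 13.0, 500.0],
})

deduped = dedup_keep(df, key="customer_id", order_by="updated_at")
filled = fill_numeric_mean(deduped)
flags = iqr_outliers(deduped["amount"].dropna())
```

Note that a mean fill is sensitive to extreme values such as the 500.0 above, so in practice it may be worth flagging outliers before filling, or filling with the median instead.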
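Usage
Trigger phrases
- "清洗数据" (clean data)
- "去重处理" (deduplicate)
- "整理Excel" (tidy up Excel)
Input
- Path to a CSV/Excel file
- Cleaning rules (optional; defaults to smart mode)
Output
- The cleaned file
- A cleaning report (xx rows processed, xx rows removed)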
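The exact format of the cleaning report is not documented here. As a rough sketch of how the "rows processed / rows removed" counts could be tracked across cleaning steps, the following uses a hypothetical `CleaningReport` class; the class, its method names, and the summary wording are assumptions, not the tool's real output.

```python
from dataclasses import dataclass, field


@dataclass
class CleaningReport:
    """Accumulates how many rows each cleaning step removed (hypothetical helper)."""
    rows_in: int
    steps: list = field(default_factory=list)  # list of (step name, rows removed)

    def record(self, step, rows_before, rows_after):
        self.steps.append((step, rows_before - rows_after))

    @property
    def rows_removed(self):
        return sum(removed for _, removed in self.steps)

    def summary(self):
        return f"Processed {self.rows_in} rows, removed {self.rows_removed} rows"


report = CleaningReport(rows_in=100)
report.record("dedup", rows_before=100, rows_after=92)
report.record("drop_missing", rows_before=92, rows_after=90)
print(report.summary())
```

Recording row counts per step rather than only a final total makes it easier to see which rule was responsible for a deletion.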
Tech stack
- pandas: data processing
- Python: core logic
- openpyxl: Excel support
Install dependencies
pip install pandas openpyxl
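With these two dependencies installed, reading and writing both file types can be handled by pandas alone. The sketch below is an illustration under assumptions: `load_table`, `save_table`, and `cleaned_path` are hypothetical helper names, and the `_cleaned` output naming is a guess at the convention rather than the script's confirmed behavior.

```python
from pathlib import Path

import pandas as pd


def load_table(path):
    """Load a CSV or Excel file into a DataFrame, chosen by file extension."""
    path = Path(path)
    suffix = path.suffix.lower()
    if suffix == ".csv":
        return pd.read_csv(path)
    if suffix in {".xlsx", ".xlsm"}:
        # openpyxl is the engine pandas uses for .xlsx/.xlsm workbooks
        return pd.read_excel(path, engine="openpyxl")
    raise ValueError(f"Unsupported file type: {suffix}")


def cleaned_path(path):
    """Build an output path next to the input, e.g. data.csv -> data_cleaned.csv."""
    path = Path(path)
    return path.with_name(f"{path.stem}_cleaned{path.suffix}")


def save_table(df, path):
    """Write a DataFrame back out in the format implied by the extension."""
    path = Path(path)
    if path.suffix.lower() == ".csv":
        df.to_csv(path, index=False)
    else:
        df.to_excel(path, index=False, engine="openpyxl")
```

Writing to a separate output path leaves the original file untouched, which makes it easy to compare before and after.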
Usage examples
# Smart cleaning
python scripts/clean_data.py data.csv
# Custom rules
python scripts/clean_data.py data.csv --dedup --fillna mean --fix-phone
# Batch processing
python scripts/batch_clean.py folder/
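If a batch script is not available in your copy of the package, the single-file command can be looped over a folder with a small wrapper. This is a sketch of one possible wrapper, not a reproduction of `scripts/batch_clean.py`. The helper names, the set of supported extensions, and the rule of skipping files whose names already end in `_cleaned` are assumptions.

```python
import subprocess
import sys
from pathlib import Path

SUPPORTED = {".csv", ".xlsx"}  # assumed set of input extensions


def find_inputs(folder):
    """List cleanable files in `folder`, skipping outputs from earlier runs."""
    return sorted(
        p for p in Path(folder).iterdir()
        if p.is_file()
        and p.suffix.lower() in SUPPORTED
        and not p.stem.endswith("_cleaned")
    )


def build_command(path, script="scripts/clean_data.py", extra_args=()):
    """Build the same command line shown in the single-file usage example."""
    return [sys.executable, script, str(path), *extra_args]


def run_batch(folder, extra_args=()):
    """Run the cleaner once per file and collect each exit code by file name."""
    results = {}
    for path in find_inputs(folder):
        proc = subprocess.run(build_command(path, extra_args=extra_args))
        results[path.name] = proc.returncode
    return results
```

For example, `run_batch("folder/", extra_args=("--dedup",))` would invoke the cleaner with `--dedup` on every matching file and return a mapping from file name to exit code, so failures can be reviewed after the run.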
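Terms of use
- Free to use
- Feedback, bug reports, and suggestions are welcome
Who it's for
- Operations staff: organizing user data
- Finance staff: cleaning billing data
- Administrative staff: organizing employee rosters
- Sales staff: deduplicating customer lists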
Usage Guidance
This package appears to be a straightforward local data cleaner. It does not request credentials or make network calls, so risk is low. Before installing or running it:
1) The README/SKILL.md mention batch_clean.py, but that file is missing, so the batch-processing example is incomplete.
2) The script is deterministic (pandas/regex rules) and does not call any external AI service, despite the 'AI-driven' wording.
3) There are small mismatches between code and documentation (e.g., --fix-date handling in the CLI and the function signature); these are bugs, not malicious behavior.
4) Sanitize or review outputs if your data may contain untrusted spreadsheet formulas. The tool writes CSV/Excel but does not sanitize leading characters (e.g., =, +, -, @), which is a CSV/Excel formula-injection risk when the file is opened in a spreadsheet app.
5) Run it first on non-sensitive test data or in a sandbox, and inspect the generated _cleaned files to confirm behavior.
If you need batch processing, request the missing batch script from the author or implement your own wrapper.
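For the formula-injection concern, one common mitigation is to prefix risky cells with a single quote so that spreadsheet apps treat them as text. The sketch below applies that idea to string cells before they are written out. The function names are hypothetical, and this is not something the skill itself does.

```python
FORMULA_PREFIXES = ("=", "+", "-", "@")  # leading characters named in the guidance


def neutralize_formula(value):
    """Prefix strings that start with a formula character; leave other values as-is."""
    if isinstance(value, str) and value.startswith(FORMULA_PREFIXES):
        return "'" + value
    return value


def sanitize_rows(rows):
    """Apply neutralize_formula to every cell in a list of rows."""
    return [[neutralize_formula(cell) for cell in row] for row in rows]


rows = [["name", "note"], ["Alice", "=HYPERLINK(\"http://example.com\")"], ["Bob", -5]]
safe_rows = sanitize_rows(rows)
```

Only strings are touched, so a numeric `-5` passes through unchanged, but a negative number stored as the text `"-5"` would also be prefixed. Whether that trade-off is acceptable depends on how the data is typed.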
Capability Assessment
Purpose & Capability
The name/description match the included code: a Python/pandas-based data cleaner. The README and SKILL.md mention batch processing and a batch_clean.py example, but no batch_clean.py is included in the package — a minor inconsistency. Overall, required tools (pandas/openpyxl) are appropriate for the stated purpose.
Instruction Scope
Runtime instructions and examples only run local Python scripts on local files and request installing pandas/openpyxl. They do not instruct reading unrelated system files or exporting data to external endpoints. One mismatch: the SKILL.md and README call this an 'AI-driven' tool, but the provided script is heuristic/deterministic (no external model/API calls).
Install Mechanism
No install spec; dependencies are installed via standard pip (pandas, openpyxl) as documented. No downloads from untrusted URLs or archive extraction are present.
Credentials
The skill requests no environment variables, no credentials, and no special config paths — proportional to a local data-cleaning tool.
Persistence & Privilege
Skill is not marked always:true and is user-invocable (normal). It does not request persistent system-wide privileges or modify other skills/config.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat: /install data-cleaner-tool
- After installation, invoke the skill by name or use /data-cleaner-tool
- Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.1.0
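Added date format standardization.
v1.0.2
Fixed the pandas fillna warning and improved the phone number completion logic.
v1.0.1
Removed pricing information; the tool is free to use.
v1.0.0
Initial release of Data Cleaner Pro (data cleaning tool).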
- AI-driven tool for cleaning duplicate, missing, and malformed data in Excel/CSV files
- Features include intelligent deduplication, missing value handling (fill/delete/predict), data format standardization, and outlier detection
- Supports batch processing of multiple files
- Simple command-line usage with customizable cleaning rules
- Provides a summary report of cleaned records and deletions
- Tiered pricing with free, standard, and professional plans
Frequently Asked Questions
What is Data Cleaner Skill?
AI-powered tool for cleaning Excel/CSV data by removing duplicates, handling missing values, standardizing formats, detecting outliers, and batch processing... It is an AI Agent Skill for Claude Code / OpenClaw, with 271 downloads so far.
How do I install Data Cleaner Skill?
Run "/install data-cleaner-tool" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is Data Cleaner Skill free?
Yes, Data Cleaner Skill is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does Data Cleaner Skill support?
Data Cleaner Skill is cross-platform and runs anywhere OpenClaw / Claude Code is available.
Who created Data Cleaner Skill?
It is built and maintained by shenghoo123-png (@shenghoo123-png); the current version is v1.1.0.