← Back to Skills Marketplace
atlasnexusops

Data Toolkit

by AtlasNexusOps · GitHub ↗ · v1.0.1 · MIT-0
cross-platform ✓ Security Clean
81
Downloads
0
Stars
0
Active Installs
2
Versions
Install in OpenClaw
/install data-toolkit
Description
Complete data conversion, validation, and cleaning toolkit. Convert between JSON/CSV/YAML/XML, validate schemas, clean duplicates and nulls. Essential utilit...
README (SKILL.md)

Data Toolkit

Complete data processing utilities for OpenClaw agents.

Features

Converters

  • JSON ↔ CSV - Bidirectional conversion with schema inference
  • JSON ↔ YAML - Clean formatting, comment preservation
  • JSON ↔ XML - Configurable root elements and attributes
  • CSV ↔ YAML - Direct conversion without intermediate steps
  • Multi-format batch conversion - Process entire directories

Validators

  • JSON Schema validation - Validate against JSON Schema specs
  • CSV structure validation - Check headers, columns, data types
  • Data type inference - Automatic type detection and validation
  • Custom rules - Define business logic validations

Cleaners

  • Duplicate removal - Smart deduplication with configurable keys
  • Null/empty handling - Remove or replace null values
  • Data normalization - Standardize formats (dates, numbers, strings)
  • Whitespace cleanup - Trim, collapse multiple spaces
  • Column operations - Remove, rename, reorder columns

Get Data Toolkit

🛒 Gumroad (€10): https://nexusatlas.gumroad.com/l/bsyacx
📦 ClawHub: https://clawhub.ai/skills/data-toolkit

MIT License — Python 3.8+, zero dependencies.

Usage

Convert Data

# JSON to CSV
./src/convert.py --input data.json --output data.csv --format csv

# CSV to JSON
./src/convert.py --input data.csv --output data.json --format json

# JSON to YAML
./src/convert.py --input data.json --output data.yaml --format yaml

# XML to JSON
./src/convert.py --input data.xml --output data.json --format json

# Batch conversion
./src/convert.py --input-dir ./raw --output-dir ./processed --format json

Validate Data

# Validate against JSON schema
./src/validate.py --input data.json --schema schema.json

# Validate CSV structure
./src/validate.py --input data.csv --check-headers --check-types

# Custom validation rules
./src/validate.py --input data.json --rules validation-rules.yaml

Clean Data

# Remove duplicates
./src/clean.py --input data.json --dedupe --key id

# Handle nulls
./src/clean.py --input data.csv --remove-nulls
./src/clean.py --input data.csv --replace-nulls "N/A"

# Normalize data
./src/clean.py --input data.json --normalize dates,numbers,strings

# Full cleanup pipeline
./src/clean.py --input messy.csv --dedupe --remove-nulls --normalize all --output clean.csv

API Usage (Python)

from data_toolkit import convert, validate, clean

# Convert
convert.json_to_csv('input.json', 'output.csv')
convert.csv_to_yaml('input.csv', 'output.yaml')

# Validate
is_valid = validate.json_schema('data.json', 'schema.json')
errors = validate.csv_structure('data.csv')

# Clean
clean.remove_duplicates('data.json', key='id')
clean.normalize_dates('data.csv', format='ISO8601')

Examples

See examples/ directory for complete workflows:

  • examples/etl-pipeline.sh - Full ETL workflow
  • examples/api-data-processing.py - API response processing
  • examples/batch-conversion.sh - Bulk file conversion

Installation

Dependencies are minimal and common:

  • Python 3.8+
  • PyYAML
  • pandas (optional, for advanced CSV operations)
pip install pyyaml pandas

Requirements

  • Node.js (for JSON/YAML parsing)
  • Python 3.8+
  • 10MB disk space

License

MIT

Support

Issues: https://github.com/forge-agent/data-toolkit Docs: See docs/ directory

Usage Guidance
Before installing, be comfortable running local Python scripts on your data. Use explicit output paths or backups to avoid overwriting originals, and install any Python dependencies from trusted sources.
Capability Analysis
Type: OpenClaw Skill Name: data-toolkit Version: 1.0.1 The data-toolkit skill is a standard set of utility scripts for data cleaning, format conversion, and validation (JSON, CSV, YAML, XML). The Python scripts (clean.py, convert.py, validate.py) use standard libraries and safe practices like yaml.safe_load(), with no evidence of network activity, data exfiltration, or malicious execution. The documentation in SKILL.md is consistent with the code and contains no prompt-injection attempts.
Capability Assessment
Purpose & Capability
The included Python scripts align with the stated conversion, validation, and cleaning purpose. Some documentation/metadata is inconsistent or broader than the provided files, such as Node being required while the shown code is Python-based and referenced examples/docs/API package files not being present.
Instruction Scope
The documented commands are user-directed and purpose-aligned, but cleaning operations can overwrite the input file if no output path is provided.
Install Mechanism
There is no registry install spec, while SKILL.md instructs users to manually install unpinned Python packages. This is common for a Python utility but should be done from trusted package sources.
Credentials
The skill requests node and python3, but the provided source shown is Python and does not clearly use Node. No credentials, network access, or privileged system access are requested.
Persistence & Privilege
No background persistence, elevated privileges, or credential use is shown. The scripts do create or overwrite local data files as part of their intended function.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install data-toolkit
  3. After installation, invoke the skill by name or use /data-toolkit
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.1
Added Gumroad purchase link
v1.0.0
Initial release: JSON/CSV/YAML/XML conversion, schema validation, deduplication, null handling, data cleaning.
Metadata
Slug data-toolkit
Version 1.0.1
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 2
Frequently Asked Questions

What is Data Toolkit?

Complete data conversion, validation, and cleaning toolkit. Convert between JSON/CSV/YAML/XML, validate schemas, clean duplicates and nulls. Essential utilit... It is an AI Agent Skill for Claude Code / OpenClaw, with 81 downloads so far.

How do I install Data Toolkit?

Run "/install data-toolkit" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Data Toolkit free?

Yes, Data Toolkit is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Data Toolkit support?

Data Toolkit is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Data Toolkit?

It is built and maintained by AtlasNexusOps (@atlasnexusops); the current version is v1.0.1.

💬 Comments