← Back to Skills Marketplace
mike47512

Etl Design

by mike47512 · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ✓ Security Clean
176
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install etl-design
Description
Deep ETL/ELT design workflow—extract patterns, transforms, loading strategies, idempotency, validation, and reconciliation. Use when designing batch data flo...
README (SKILL.md)

ETL Design

ETL is correctness under change: schema drift, partial loads, retries, and reconciliation with upstream systems.

When to Offer This Workflow

Trigger conditions:

  • Batch loads into warehouse or data lake
  • Choosing between CDC, snapshots, and incremental watermarks
  • Missing rows, duplicates, or inconsistent aggregates downstream

Initial offer:

Use six stages: (1) source contract, (2) extract strategy, (3) transform rules, (4) load & dedupe, (5) validation, (6) operations & backfill). Confirm batch window and SLA.


Stage 1: Source Contract

Goal: Document schema, primary keys, change indicators (updated_at, CDC log position), and access constraints (rate limits, read replicas).


Stage 2: Extract Strategy

Goal: Full dump vs incremental watermark vs CDC—trade freshness, source load, and complexity.

Practices

  • CDC for large sources; snapshots for small or infrequent tables

Stage 3: Transform Rules

Goal: Deterministic transforms; surrogate keys; business rules versioned; handling of deletes (tombstones vs hard deletes).


Stage 4: Load & Dedupe

Goal: Upsert keys; partitions; rerunnable jobs with same batch id producing the same outcome (idempotent load).


Stage 5: Validation

Goal: Row counts, checksums, key uniqueness, referential checks; alert on threshold breaches.


Stage 6: Operations & Backfill

Goal: Replay by date range; monitor lag; dead-letter or quarantine bad rows with reason codes.


Final Review Checklist

  • Source contract and keys documented
  • Extract mode matches SLA and source constraints
  • Transforms deterministic and versioned
  • Idempotent load strategy
  • Validation and reconciliation defined

Tips for Effective Guidance

  • Plan for late-arriving facts and slowly changing dimensions in analytics paths.
  • Pair with data-pipelines for orchestration and monitoring.

Handling Deviations

  • Near-real-time: document micro-batch or streaming semantics separately.
Usage Guidance
This skill is high-level design advice and appears safe to install. It won't by itself access your data or systems because it has no installs, code, or credential requirements. Before using it in an agent that also has connectors or other skills, confirm those other skills do not grant the agent access to production data sources or secrets — the ETL guidance may prompt actions that require those connectors, and you should control credential access separately.
Capability Analysis
Type: OpenClaw Skill Name: etl-design Version: 1.0.0 The skill bundle is a purely conceptual framework for guiding users through ETL/ELT design processes. It contains no executable code, external dependencies, or suspicious instructions, focusing entirely on standard data engineering best practices such as idempotency, schema validation, and extraction strategies as documented in SKILL.md.
Capability Assessment
Purpose & Capability
The name and description match the SKILL.md content: a six-stage ETL/ELT design workflow. Nothing in the skill requests unrelated resources or capabilities.
Instruction Scope
SKILL.md provides high-level design guidance (source contract, extract, transforms, load/dedupe, validation, ops/backfill). It does not instruct the agent to read system files, access environment variables, or transmit data to external endpoints.
Install Mechanism
No install spec and no code files; this is instruction-only so nothing is written to disk or downloaded during install.
Credentials
No required environment variables, credentials, or config paths are declared; the guidance is purely conceptual and does not demand secrets or external service access.
Persistence & Privilege
Skill is not always-enabled and uses default invocation behavior; it does not request persistent or elevated platform privileges.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install etl-design
  3. After installation, invoke the skill by name or use /etl-design
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release of the etl-design skill, providing a structured ETL/ELT workflow for robust data pipeline design. - Introduces a six-stage ETL design framework: source contract, extract strategy, transform rules, load & dedupe, validation, and operations & backfill. - Covers best practices for handling schema drift, partial loads, retries, and data reconciliation. - Includes detailed guidance for batch data loads, choosing extract modes, and ensuring idempotency and correctness. - Provides a comprehensive final review checklist and operational tips. - Clarifies when to use this workflow and considerations for deviations like near-real-time scenarios.
Metadata
Slug etl-design
Version 1.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is Etl Design?

Deep ETL/ELT design workflow—extract patterns, transforms, loading strategies, idempotency, validation, and reconciliation. Use when designing batch data flo... It is an AI Agent Skill for Claude Code / OpenClaw, with 176 downloads so far.

How do I install Etl Design?

Run "/install etl-design" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Etl Design free?

Yes, Etl Design is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Etl Design support?

Etl Design is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Etl Design?

It is built and maintained by mike47512 (@mike47512); the current version is v1.0.0.

💬 Comments