← 返回 Skills 市场
mike47512

Etl Design

作者 mike47512 · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ✓ 安全检测通过
176
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install etl-design
功能描述
Deep ETL/ELT design workflow—extract patterns, transforms, loading strategies, idempotency, validation, and reconciliation. Use when designing batch data flo...
使用说明 (SKILL.md)

ETL Design

ETL is correctness under change: schema drift, partial loads, retries, and reconciliation with upstream systems.

When to Offer This Workflow

Trigger conditions:

  • Batch loads into warehouse or data lake
  • Choosing between CDC, snapshots, and incremental watermarks
  • Missing rows, duplicates, or inconsistent aggregates downstream

Initial offer:

Use six stages: (1) source contract, (2) extract strategy, (3) transform rules, (4) load & dedupe, (5) validation, (6) operations & backfill). Confirm batch window and SLA.


Stage 1: Source Contract

Goal: Document schema, primary keys, change indicators (updated_at, CDC log position), and access constraints (rate limits, read replicas).


Stage 2: Extract Strategy

Goal: Full dump vs incremental watermark vs CDC—trade freshness, source load, and complexity.

Practices

  • CDC for large sources; snapshots for small or infrequent tables

Stage 3: Transform Rules

Goal: Deterministic transforms; surrogate keys; business rules versioned; handling of deletes (tombstones vs hard deletes).


Stage 4: Load & Dedupe

Goal: Upsert keys; partitions; rerunnable jobs with same batch id producing the same outcome (idempotent load).


Stage 5: Validation

Goal: Row counts, checksums, key uniqueness, referential checks; alert on threshold breaches.


Stage 6: Operations & Backfill

Goal: Replay by date range; monitor lag; dead-letter or quarantine bad rows with reason codes.


Final Review Checklist

  • Source contract and keys documented
  • Extract mode matches SLA and source constraints
  • Transforms deterministic and versioned
  • Idempotent load strategy
  • Validation and reconciliation defined

Tips for Effective Guidance

  • Plan for late-arriving facts and slowly changing dimensions in analytics paths.
  • Pair with data-pipelines for orchestration and monitoring.

Handling Deviations

  • Near-real-time: document micro-batch or streaming semantics separately.
安全使用建议
This skill is high-level design advice and appears safe to install. It won't by itself access your data or systems because it has no installs, code, or credential requirements. Before using it in an agent that also has connectors or other skills, confirm those other skills do not grant the agent access to production data sources or secrets — the ETL guidance may prompt actions that require those connectors, and you should control credential access separately.
功能分析
Type: OpenClaw Skill Name: etl-design Version: 1.0.0 The skill bundle is a purely conceptual framework for guiding users through ETL/ELT design processes. It contains no executable code, external dependencies, or suspicious instructions, focusing entirely on standard data engineering best practices such as idempotency, schema validation, and extraction strategies as documented in SKILL.md.
能力评估
Purpose & Capability
The name and description match the SKILL.md content: a six-stage ETL/ELT design workflow. Nothing in the skill requests unrelated resources or capabilities.
Instruction Scope
SKILL.md provides high-level design guidance (source contract, extract, transforms, load/dedupe, validation, ops/backfill). It does not instruct the agent to read system files, access environment variables, or transmit data to external endpoints.
Install Mechanism
No install spec and no code files; this is instruction-only so nothing is written to disk or downloaded during install.
Credentials
No required environment variables, credentials, or config paths are declared; the guidance is purely conceptual and does not demand secrets or external service access.
Persistence & Privilege
Skill is not always-enabled and uses default invocation behavior; it does not request persistent or elevated platform privileges.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install etl-design
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /etl-design 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
Initial release of the etl-design skill, providing a structured ETL/ELT workflow for robust data pipeline design. - Introduces a six-stage ETL design framework: source contract, extract strategy, transform rules, load & dedupe, validation, and operations & backfill. - Covers best practices for handling schema drift, partial loads, retries, and data reconciliation. - Includes detailed guidance for batch data loads, choosing extract modes, and ensuring idempotency and correctness. - Provides a comprehensive final review checklist and operational tips. - Clarifies when to use this workflow and considerations for deviations like near-real-time scenarios.
元数据
Slug etl-design
版本 1.0.0
许可证 MIT-0
累计安装 0
当前安装数 0
历史版本数 1
常见问题

Etl Design 是什么?

Deep ETL/ELT design workflow—extract patterns, transforms, loading strategies, idempotency, validation, and reconciliation. Use when designing batch data flo... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 176 次。

如何安装 Etl Design?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install etl-design」即可一键安装,无需额外配置。

Etl Design 是免费的吗?

是的,Etl Design 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Etl Design 支持哪些平台?

Etl Design 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Etl Design?

由 mike47512(@mike47512)开发并维护,当前版本 v1.0.0。

💬 留言讨论