← Back to Skills Marketplace
hanxueyuan

Databricks Cloud

by hanxueyuan · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ✓ Security Clean
19
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install databricks-cloud
Description
Databricks Cloud unifies data engineering, warehousing, machine learning, and AI on cloud infrastructure with a consumption-based SaaS model.
README (SKILL.md)

Databricks

历史时间线

  • 2013: 由Apache Spark创始人Matei Zaharia及UC Berkeley AMPLab团队成员在旧金山创立,初始愿景是让Spark在企业中更易用
  • 2014: 发布首个托管Spark服务,定位为"Spark as a Service",解决企业自行部署Spark集群的运维痛点
  • 2016: 推出Databricks Delta(后更名为Delta Lake),引入ACID事务到数据湖,解决数据湖的可靠性问题
  • 2019: 发布Delta Lake开源版本,同时推出MLflow实验管理工具,构建完整MLOps生态
  • 2021: 提出"Lakehouse"架构概念,融合数据仓库的结构化查询能力与数据湖的灵活性
  • 2022: 收购8080 Labs(munity.ai),推出DBRX大语言模型训练基础设施
  • 2023: 发布Unity Catalog实现跨工作区统一治理,估值达430亿美元,提交IPO申请
  • 2024: Databricks IQ(AI助手)和Lakeflow产品发布,年经常性收入突破20亿美元

商业模式

Databricks采用consumption-based pricing(按消耗付费)的云SaaS模式。客户在AWS/Azure/GCP上运行Databricks工作区,按DBU(Databricks Unit)计费。收入来自两部分:DBU消耗费用(支付给Databricks)和底层云基础设施费用(支付给云厂商)。这种模式与云厂商深度绑定,形成共生关系——云厂商获得IaaS收入,Databricks获得平台收入。企业版还包含高级安全治理、实时协作和专属支持。

护城河分析

技术护城河: Delta Lake格式已成为行业事实标准,一旦被企业采用,迁移成本极高。Spark生态系统的深度优化让Databricks在大规模数据处理上保持性能优势。 网络效应: Notebook协作环境形成团队粘性,数据工程师、科学家和分析师在同一平台上协作,替换意味着整个团队工作流重建。 生态壁垒: MLflow、Delta Sharing、Unity Catalog构成完整的data+AI工具链,竞争者很难在单一产品线上同时匹敌。 云伙伴关系: 与AWS、Azure、GCP的marketplace集成降低了采购门槛,三大云厂商均将其作为首选数据平台推荐。

关键数据

  • 估值: ~430亿美元(2023年最后一轮)
  • ARR: 超过20亿美元(2024年)
  • 客户数: 超过10,000家企业客户
  • 员工数: 约5,000人
  • 融资总额: 超过35亿美元
  • 核心产品: Data Engineering, Data Warehousing, Machine Learning, AI/LLM

有趣事实

Databricks的创始人团队几乎就是Apache Spark的原作者。他们最初在伯克利的一个研究项目中创造了Spark,后来意识到企业需要托管服务才能真正释放其价值。有趣的是,他们选择"湖仓一体"这个概念,本质上是在说"为什么要在数据仓库和数据湖之间做选择?"——这直接挑战了Snowflake等纯仓库厂商的定位。

Usage Guidance
This skill appears safe from an agentic-security perspective. Treat it as general reference content about Databricks rather than an official Databricks integration.
Capability Analysis
Type: OpenClaw Skill Name: databricks-cloud Version: 1.0.0 The skill bundle contains purely informational content regarding the history, business model, and market position of Databricks. There is no executable code, no network activity, and no instructions that could be interpreted as prompt injection or malicious behavior. All files (_meta.json, SKILL.md) are consistent with a documentation-only skill.
Capability Assessment
Purpose & Capability
The content is consistent with the stated purpose: it provides background, business model, moat analysis, and key facts about Databricks.
Instruction Scope
The skill contains informational text only and does not instruct the agent to override user intent, take autonomous actions, or use tools.
Install Mechanism
There is no install spec, no code files, no package dependencies, and no required binaries.
Credentials
The skill declares no environment variables, credentials, config paths, network access, or local file access.
Persistence & Privilege
No persistence, background behavior, privilege use, memory storage, or account access is described in the artifacts.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install databricks-cloud
  3. After installation, invoke the skill by name or use /databricks-cloud
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Databricks-cloud skill initial release: - Introduces a comprehensive overview of Databricks’ evolution, from its Apache Spark roots to its lakehouse architecture leadership. - Details major milestones, including key product launches (Delta Lake, MLflow, Lakehouse, Unity Catalog, Databricks IQ) and strategic acquisitions. - Explains Databricks’ consumption-based cloud SaaS business model and deep integration with major cloud providers. - Analyzes competitive moats: technical standards (Delta Lake), network effects, ecosystem completeness, and cloud partnerships. - Shares key company statistics (valuation, ARR, customers, employees) and industry insights.
Metadata
Slug databricks-cloud
Version 1.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is Databricks Cloud?

Databricks Cloud unifies data engineering, warehousing, machine learning, and AI on cloud infrastructure with a consumption-based SaaS model. It is an AI Agent Skill for Claude Code / OpenClaw, with 19 downloads so far.

How do I install Databricks Cloud?

Run "/install databricks-cloud" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Databricks Cloud free?

Yes, Databricks Cloud is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Databricks Cloud support?

Databricks Cloud is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Databricks Cloud?

It is built and maintained by hanxueyuan (@hanxueyuan); the current version is v1.0.0.

💬 Comments