← Back to Skills Marketplace
ivangdavila

Google Colab

by Iván · GitHub ↗ · v1.0.0
darwinlinuxwin32 ✓ Security Clean
331
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install google-colab
Description
Run Google Colab notebooks for Python and machine learning with reproducible runtimes, data pipelines, debugging workflows, and experiment discipline.
README (SKILL.md)

Setup

On first use, read setup.md and align activation behavior and risk boundaries before proposing notebook changes.

When to Use

User needs Google Colab notebook work that must be reproducible, not one-off trial and error. Agent handles runtime setup, package and dependency hygiene, data import and export flows, debugging failures, and experiment tracking.

Architecture

Memory lives in ~/google-colab/. See memory-template.md for setup and status values.

~/google-colab/
|-- memory.md                   # Activation preferences, constraints, and current goals
|-- notebooks.md                # Notebook registry with owners and objective per notebook
|-- runtimes.md                 # Runtime choices, dependency pins, and restart history
|-- datasets.md                 # Data source map, mount paths, and validation notes
|-- incidents.md                # Error timelines, root causes, and fixes
`-- experiments.md              # Hypotheses, metrics, and reproducibility evidence

Quick Reference

Use the smallest relevant file for the active task.

Topic File
Setup and activation behavior setup.md
Memory and local templates memory-template.md
Notebook structure and cell contracts notebook-architecture.md
Runtime setup, pinning, and restart recovery runtime-playbook.md
Data import, export, and schema checks data-io-patterns.md
Debugging triage and failure recovery debugging-runbook.md
Experiment log format and promotion rules experiment-log-template.md

Requirements

  • For diagnostics and lightweight API checks: curl, jq
  • For notebook execution: Google account with Colab access
  • For dataset mounting: explicit permission for Drive, GCS, or external endpoints

Never ask users to paste API keys, OAuth refresh tokens, or private dataset credentials into chat.

Data Storage

Local operational notes stay in ~/google-colab/:

  • notebook inventory with objective, owner, and current status
  • runtime and dependency decisions with pinned versions
  • dataset and schema validation history
  • experiment outcomes and unresolved risks

Core Rules

1. Start with Objective, Constraints, and Exit Criteria

Before writing notebook steps, identify:

  • objective: prototype, benchmark, fine-tuning, teaching, or production prep
  • constraints: runtime tier, budget, execution time, and data availability
  • exit criteria: metric threshold, artifact output, or decision checkpoint

Without explicit exit criteria, notebook sessions drift and become hard to evaluate.

2. Design Notebook Cells as Contracts

Each cell should have a contract:

  • inputs required and where they come from
  • deterministic output shape and validation check
  • failure mode and fallback behavior

Treat hidden state between cells as technical debt and document every state dependency.

3. Pin Runtime and Dependency State

Any runnable plan must define:

  • Python version and runtime class (CPU, T4, L4, A100 when relevant)
  • pinned package versions for non-standard libraries
  • rehydration steps after runtime disconnect or restart

Never assume a fresh runtime matches previous package state.

4. Validate Data Paths and Schema Before Training or Evaluation

Before expensive operations:

  • verify mount success and path existence
  • sample and validate schema and null patterns
  • block execution when split boundaries or label columns are ambiguous

Fast schema checks prevent long failed runs and invalid metrics.

5. Make Cost and Time Guardrails Explicit

For any medium or high-cost run:

  • estimate runtime duration and checkpoint intervals
  • define early-stop conditions and budget cutoff
  • recommend smaller dry-run dataset before full execution

No full-scale run should start without budget and cutoff rules.

6. Triage Failures by Layer, Not by Guessing

Debug in layers:

  1. runtime health and package import layer
  2. data loading and preprocessing layer
  3. model logic and training loop layer
  4. evaluation and artifact export layer

Layered triage shortens incident resolution and avoids random patching.

7. Log Reproducibility Evidence in Every Significant Run

For each meaningful run, capture:

  • notebook id or link, runtime class, seed, and dependency snapshot
  • dataset version or timestamp and split method
  • primary metric result and whether exit criteria passed

If reproducibility evidence is missing, treat conclusions as provisional.

Common Traps

  • Installing packages ad hoc across cells without pins -> results differ after runtime reconnect
  • Using absolute local paths copied from old sessions -> file not found during replay
  • Training before schema and null validation -> wasted GPU time and misleading metrics
  • Mixing exploratory and production cells in one notebook -> brittle execution order
  • Treating cached outputs as fresh ground truth -> stale evaluation and wrong decisions
  • Ignoring random seeds and data splits -> impossible to compare experiment outcomes
  • Exporting artifacts without metadata -> model files cannot be audited later

External Endpoints

Endpoint Data Sent Purpose
https://colab.research.google.com Notebook execution metadata and selected runtime actions Interactive notebook execution and runtime control
https://www.googleapis.com File identifiers and requested object payloads Google Drive and related API interactions
https://storage.googleapis.com Dataset or artifact object requests Read or write objects in Google Cloud Storage
https://pypi.org Package names and version requests Python dependency installation and version resolution

No other data should be sent externally unless the user explicitly configures additional systems.

Security & Privacy

Data that leaves your machine:

  • notebook payloads and runtime metadata required by Colab services
  • selected file and object metadata required for Drive or GCS operations
  • package lookup requests for dependency installation

Data that stays local:

  • workflow memory and decision logs under ~/google-colab/
  • incident notes, experiment summaries, and validation evidence

This skill does NOT:

  • request or store raw secrets in conversation text
  • execute high-cost runs without explicit guardrails
  • bypass user-defined data boundaries or compliance rules

Trust

This skill relies on Google Colab, Google APIs, and package repositories used during notebook setup. Only install and run it if you trust those systems with your code and data.

Related Skills

Install with clawhub install \x3Cslug> if user confirms:

  • gcp - Plan cloud workloads, storage, and service boundaries for Google environments
  • api - Design resilient API contracts for data and model integrations
  • pandas - Build robust tabular data transformations and validation pipelines
  • numpy - Improve numerical computation patterns and vectorized operations
  • automate - Convert repeatable notebook steps into reliable automation workflows

Feedback

  • If useful: clawhub star google-colab
  • Stay updated: clawhub sync
Usage Guidance
This skill appears coherent and limited to helping plan and manage Colab notebooks. Before enabling it: review setup.md and decide whether you want the skill to create ~/google-colab/ and store logs there; confirm activation preferences and whether the agent should act proactively; do not paste secrets into chat (the skill itself warns against this); and be aware that for actual Drive/GCS mounts you'll still need to grant those services their normal permissions through Google (the skill does not request credentials). If you want stricter limits, decline automatic activation and only invoke the skill explicitly when needed.
Capability Analysis
Type: OpenClaw Skill Name: google-colab Version: 1.0.0 The Google Colab skill bundle provides a structured framework for managing notebooks with an emphasis on reproducibility, cost guardrails, and data validation. It uses local file-based storage in `~/google-colab/` for state management and includes standard setup commands in `setup.md` that follow secure permission practices (chmod 700/600). All external endpoints (colab.research.google.com, googleapis.com, pypi.org) are appropriate for the skill's stated purpose, and there is no evidence of malicious intent or data exfiltration.
Capability Assessment
Purpose & Capability
Name and description (Colab notebook reproducibility, runtimes, data pipelines, experiment logs) match the content of SKILL.md and the included templates. Required binaries (curl, jq) are reasonable for diagnostics and lightweight API checks described in the instructions.
Instruction Scope
Runtime instructions focus on notebook design, dependency pinning, data validation, and local bookkeeping under ~/google-colab/. The setup flow explicitly creates a local workspace and templates and instructs the agent to consult setup.md before acting. There are no instructions to read unrelated system files or to ask for secrets in chat; external endpoints listed (colab.research.google.com and www.googleapis.com) align with expected Colab/Drive interactions.
Install Mechanism
No install spec and no code files (instruction-only) — lowest-risk pattern. Nothing is downloaded from arbitrary URLs or extracted to disk by an installer.
Credentials
The skill requests no environment variables or credentials. It references that a Google account and explicit permission to mount Drive/GCS are required for full functionality, but it does not request or instruct users to paste API keys or tokens. Local file creation under the user's home is justified by the skill's bookkeeping purpose.
Persistence & Privilege
always is false and model invocation is allowed (platform default). The skill writes only to its own directory ~/google-colab/ and does not modify other skills or system-wide agent configurations. Creation of files and setting of permissions (mkdir/touch/chmod) is described and scoped to the skill's workspace.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install google-colab
  3. After installation, invoke the skill by name or use /google-colab
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release with notebook architecture patterns, runtime recovery playbooks, data IO safeguards, and reproducible experiment logs.
Metadata
Slug google-colab
Version 1.0.0
License
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is Google Colab?

Run Google Colab notebooks for Python and machine learning with reproducible runtimes, data pipelines, debugging workflows, and experiment discipline. It is an AI Agent Skill for Claude Code / OpenClaw, with 331 downloads so far.

How do I install Google Colab?

Run "/install google-colab" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Google Colab free?

Yes, Google Colab is completely free (open-source). You can download, install and use it at no cost.

Which platforms does Google Colab support?

Google Colab is cross-platform and runs anywhere OpenClaw / Claude Code is available (darwin, linux, win32).

Who created Google Colab?

It is built and maintained by Iván (@ivangdavila); the current version is v1.0.0.

💬 Comments