功能描述

Persist and retrieve structured data using the Lance columnar format. Use when you need to store, query, or analyze data across sessions — such as saving ski...

使用说明 (SKILL.md)

Lance Store

Name: Lance Store
Author: vitorhugoze

Installation

python3 -m pip install -r requirements.txt

A persistent data store using the Lance columnar format for fast ML data access.

Quick Start

# List all datasets and their metadata
python3 scripts/command.py list-datasets-info

# Create a dataset
python3 scripts/command.py create-dataset \x3Cname> \x3Cfield1> \x3Cfield2> ...

# Append data
python3 scripts/command.py append-to-dataset \x3Cname> \x3Cvalue1> \x3Cvalue2> ...

# Read all records from a dataset
python3 scripts/command.py read-dataset \x3Cname>

Note: list-datasets-info shows dataset metadata (schema, field types, record count) — it does not return the actual data rows. Use read-dataset to retrieve records.

Storage Location

DataSets are created and stored on the current path '.'

Critical Behavior: Data Type Strictness

⚠️ Lance is strict about data types — they CANNOT change after the first record

When you append the first record to a dataset, Lance infers the data type for each field. All subsequent records MUST use the same types.

Example — this FAILS:

# First record: age as STRING
append-to-dataset users "John" "25" "[email protected]"

# Second record: age as INTEGER (will FAIL!)
append-to-dataset users "Jane" 30 "[email protected]"
# Error: `age` should have type large_string but type was int64

Correct approach — maintain consistent types:

# First record: age as STRING
append-to-dataset users "John" "25" "[email protected]"

# Second record: age as STRING
append-to-dataset users "Jane" "30" "[email protected]"

Why This Matters

Unlike traditional databases that may coerce types, Lance rejects type mismatches. If you store numbers as strings initially, you must always pass strings. Plan your schema carefully.

Initialization Workflow

When starting a session, always initialize by listing existing datasets first:

# This command returns ALL datasets with their structure
python3 scripts/command.py list-datasets-info

Example output:

{
    "skill": "lance",
    "operation": "list_datasets_info",
    "status": "success",
    "data": [
        {
            "dataset_name": "users",
            "path": "/data/users",
            "fields": ["name", "age", "email"],
            "field_types": {
                "_id": "large_string",
                "_updated_at": "timestamp[us]",
                "name": "large_string",
                "age": "large_string",
                "email": "large_string"
            },
            "record_count": 2,
            "columns": ["id", "_updated_at", "name", "age", "email"],
            "last_updated": "2026-03-21T17:57:44.595628"
        }
    ],
    "error": null
}

Understanding `field_types`

State	Meaning
`{}` (empty)	Dataset exists but no records yet — types not yet defined
populated	Types are locked — appends must match

Important: If field_types is empty, the first append will define types. Be deliberate about the first record's types.

Commands Reference

Create Dataset

python3 scripts/command.py create-dataset \x3Cname> \x3Cfield1> \x3Cfield2> ...

Creates a metadata entry. Fields have no types until first append.

Append Record

python3 scripts/command.py append-to-dataset \x3Cname> \x3Cvalue1> \x3Cvalue2> ...

Appends one record. Types are inferred from first record.

Batch Append

python3 scripts/command.py batch-append-to-dataset \x3Cname> '\x3Cjson-array>'

Example: batch-append-to-dataset users '[["Alice", "22", "[email protected]"], ["Bob", "35", "[email protected]"]]'

Update Record

python3 scripts/command.py update-dataset-record \x3Cname> \x3Crecord_id> \x3Cvalue1> \x3Cvalue2> ...

Updates fields for a specific record by ID.

Delete Record

python3 scripts/command.py delete-dataset-record \x3Cname> \x3Crecord_id>

List All Datasets

python3 scripts/command.py list-datasets

Get Dataset Info

python3 scripts/command.py get-dataset-info \x3Cname>

Returns schema, field types (if data exists), and record count.

List All Datasets with Full Info

python3 scripts/command.py list-datasets-info

Recommended for initialization. Returns all datasets with complete metadata.

Get Dataset Path

python3 scripts/command.py get-dataset-path-info \x3Cname>

Backup Dataset

python3 scripts/command.py backup-dataset \x3Cname> \x3Cbackup_path>

Count Records

python3 scripts/command.py count-records \x3Cname>

Read All Records

Returns all records from the dataset as a list of objects.

python3 scripts/command.py read-dataset \x3Cname>

Drop Dataset

Requires confirmation if have not created a backup beforehand.

Delete the entire dataset and its metadata.

python3 scripts/command.py drop-dataset \x3Cname>

Internal fields available in every dataset:

Field	Type	Description
`_id`	string	UUID — unique record identifier
`_updated_at`	timestamp	When the record was last inserted or updated

List Records (Paginated)

python3 scripts/command.py list-records \x3Cname> --limit 10 --offset 0

Returns records with optional pagination.

Get Single Record

python3 scripts/command.py get-record \x3Cname> \x3Crecord_id>

Retrieves a specific record by its UUID.

Get Dataset Info

python3 scripts/command.py get-dataset-info \x3Cname>

Returns schema, field types (if data exists), and record count.

Response Format

All commands return JSON:

{
  "skill": "lance",
  "operation": "\x3Coperation_name>",
  "status": "success|error",
  "data": \x3Cresult_data_or_null>,
  "error": \x3Cerror_message_or_null>
}

Internal Fields

Every dataset automatically includes:

_id — UUID for each record
_updated_at — timestamp of last insert/update

These are managed automatically — when appending, only provide your defined fields.

Data Type Inference

Lance infers types from the first record:

Python Type	Lance Type
`"string"`	`large_string`
`25` (int)	`int64`
`25.5` (float)	`float64`
`True`/`False`	`bool`

CLI caveat: When passing via command line, all values are strings. To ensure integer types, initialize with actual integers in a script rather than CLI.

Tips

Initialize at session start: Run list-datasets-info to understand what data already exists
Plan your schema: First record determines types for the entire dataset
Use batch append when adding multiple records: More efficient than individual appends

Requirements

Dependencies are declared in frontmatter (metadata.openclaw.install) and handled by the OpenClaw install system via uv. The Python packages required are:

pylance — The Lance columnar format library.

⚠️ Naming note: Despite the PyPI package being named pylance, the library is imported as import lance in Python code. This is the official Lance project naming convention — it is NOT the VS Code "pylance" language server. See lance.org for details.
pandas — Data manipulation

安全使用建议

Before installing or running this skill, verify the package and installer mapping: 1) Check PyPI (or the source the platform will use) to confirm that the package named 'pylance' actually provides the 'lance' Python module expected by the code — if 'pylance' is a typo or a different package, the skill may fail or install an unrelated package. 2) Ask the publisher/developer to explain the 'uv' installer kind (how it resolves packages) and why 'pylance' is used instead of a package named 'lance'. 3) Run this skill in an isolated/sandbox environment (not on a production host) and inspect what packages are installed and the installed package source. 4) If you proceed, run small tests in a temporary directory to confirm datasets are created only where you expect. If the developer confirms that 'pylance' is intentionally the correct PyPI package for the Lance runtime and explains the installer, the concerns here would likely be resolved and the skill could be considered coherent.

功能分析

Type: OpenClaw Skill Name: lance-store Version: 1.0.12 The lance-store skill provides a legitimate interface for persisting and querying structured data using the Lance columnar format. The code is well-structured and includes proactive security measures, such as explicit path traversal validation in scripts/manage.py (validate_dataset_name and validate_backup_path) to ensure file operations remain within the intended directory. No evidence of data exfiltration, malicious execution, or harmful prompt injection was found.

能力评估

ℹ Purpose & Capability

The code (scripts/*.py) implements local dataset creation, append/read/update/delete, metadata management, and uses a 'lance' Python module to operate on local files — this matches the skill's stated purpose of persisting structured data in Lance format. However, the install/requirements references a PyPI package named 'pylance' (and the README/requirements.txt comment insists 'pylance' is the package that provides the 'lance' module). That naming mismatch is unexpected and reduces confidence that the install instructions map cleanly to the runtime imports.

✓ Instruction Scope

SKILL.md and the CLI scripts operate on the current working directory only, list and read local 'metadata.lance', and provide explicit commands; there are no instructions to read unrelated system files, environment variables, or to transmit data to external endpoints. The code includes input validation (dataset name / backup path checks) to mitigate path traversal.

⚠ Install Mechanism

The registry install block and requirements.txt ask for 'pylance' via an installer labelled 'uv' and for 'pandas'. 'pylance' is an unusual name for a Lance-format runtime dependency (it is also the name of a Microsoft VSCode language server), so this could be a naming mistake, a confusing wrapper, or an incorrect package. The install mechanism 'uv' is non-standard/ambiguous in this context. These inconsistencies could cause the installed packages not to provide the expected 'lance' module, or — in the worst case — install an unrelated package. No arbitrary download URLs are present, which reduces highest-risk concerns, but the package-name ambiguity is a real red flag to verify.

✓ Credentials

The skill requests no environment variables, no external credentials, and no config paths. All filesystem access is scoped to the current working directory; the code does read and write local dataset directories and metadata files, which is appropriate for a storage skill.

✓ Persistence & Privilege

always: false and the skill does not request permanent elevated platform privileges. It creates and modifies files under the current working directory (data and metadata files) but does not modify other skills, global config, or system-wide settings.

版本历史

v1.0.12

- Added Openclaw metadata for installation, specifying dependencies and install method. - Updated the description to fix a typo ("Afther" → "After"). - No code or functionality changes; documentation and dependency management improvement only.

v1.0.10

Version 1.0.10 of lance-store - Version bump from 1.0.9 to 1.0.10. - No file changes detected; documentation remains unchanged except for version metadata.

v1.0.9

lance-store version 1.0.9 - Improved documentation for Python package requirements. - Clarified the distinction between the Lance project's PyPI package `pylance` and the unrelated VS Code "pylance" language server. - No functional or code changes in this release. Documentation updates only.

v1.0.8

No user-facing changes in this release (version bump only).

v1.0.7

- Requirements and installation steps updated: now install dependencies with `pip install -r requirements.txt` instead of using npx. - The environment variable configuration and reference to LANCE_DATA_PATH were removed; datasets are always stored in the current directory. - SQL querying and related DuckDB integration instructions were removed for simplicity; focus is now on basic data storage and retrieval. - Documentation trimmed and restructured for clarity, with updated quick start and concise usage tips. - Requirements are now explicitly listed; `pylance`, `pandas`, `duckdb`, and `argparse` are required.

v1.0.6

- Expanded and clarified documentation in SKILL.md to detail data type strictness, important usage notes, and initialization workflow. - Added comprehensive command-line usage instructions and examples for all major dataset operations. - Improved guidance for schema definition, data type consistency, and best practices for persistent data storage using the Lance columnar format. - Provided detailed SQL query examples and explanations for internal dataset fields. - Emphasized the use of the LANCE_DATA_PATH environment variable for custom storage configuration. - Highlighted behavior differences from traditional databases regarding type enforcement.

元数据

Slug lance-store

版本 1.0.12

许可证 MIT-0

累计安装 0

当前安装数 0

历史版本数 6

常见问题

Lance Store 是什么？

Persist and retrieve structured data using the Lance columnar format. Use when you need to store, query, or analyze data across sessions — such as saving ski... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件，目前累计下载 124 次。

如何安装 Lance Store？

在 OpenClaw 或 Claude Code 对话框中运行命令「/install lance-store」即可一键安装，无需额外配置。

Lance Store 是免费的吗？

是的，Lance Store 完全免费，采用 MIT-0 许可证，可自由下载、安装和使用。

Lance Store 支持哪些平台？

Lance Store 跨平台运行，可在任意部署了 OpenClaw / Claude Code 的环境中使用（cross-platform）。

谁开发了 Lance Store？

由 vitorhugoze（@vitorhugoze）开发并维护，当前版本 v1.0.12。

Lance Store

Lance Store

Installation

Quick Start

Storage Location

Critical Behavior: Data Type Strictness

Why This Matters

Initialization Workflow

Understanding field_types

Commands Reference

Create Dataset

Append Record

Batch Append

Update Record

Delete Record

List All Datasets

Get Dataset Info

List All Datasets with Full Info

Get Dataset Path

Backup Dataset

Count Records

Read All Records

Drop Dataset

Requires confirmation if have not created a backup beforehand.

List Records (Paginated)

Get Single Record

Get Dataset Info

Response Format

Internal Fields

Data Type Inference

Tips

Requirements

Lance Store 是什么？

如何安装 Lance Store？

Lance Store 是免费的吗？

Lance Store 支持哪些平台？

谁开发了 Lance Store？

💬 留言讨论

Understanding `field_types`