← Back to Skills Marketplace

Zotero Vectorize

Name: Zotero Vectorize
Author: yckbz

by yckbz · GitHub ↗ · v0.1.0 · MIT-0

cross-platform ✓ Security Clean

294

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install zotero-vectorize

Description

Build and maintain a cross-platform local Zotero semantic index using metadata embeddings and PDF full-text chunk embeddings. Use when the user asks to vecto...

Usage Guidance

This bundle appears to do what it says: it reads your local Zotero DB/storage, snapshots the DB, extracts PDF text, and builds local JSON vector stores. Before running: (1) review/confirm the output directory to avoid overwriting files; (2) run in a Python virtualenv and install the listed packages; (3) expect the embedding model to be downloaded (may be large and require network access) — you can pre-download models if offline; (4) close Zotero while snapshotting to avoid SQLite lock errors; (5) the safety rule to obtain user confirmation before applying incremental updates is procedural (the apply script does not itself prompt), so only run apply_incremental_updates after reviewing the check_incremental_updates report. If you need the skill to never download models automatically, inspect or modify get_embedding_model/encode_texts to load local models only.

Capability Analysis

Type: OpenClaw Skill Name: zotero-vectorize Version: 0.1.0 The zotero-vectorize skill bundle is a legitimate tool designed to create a local semantic index of a Zotero library using embeddings. The scripts (e.g., build_metadata_vectors.py, build_fulltext_vectors.py) and the shared library (zotero_vectorize_lib.py) perform standard file operations, SQLite database snapshotting, and text extraction using well-known libraries like PyMuPDF and sentence-transformers. The instructions in SKILL.md emphasize safety, such as treating the Zotero database as read-only and requiring user confirmation before applying incremental updates, with no evidence of malicious intent, data exfiltration, or prompt injection.

Capability Assessment

✓ Purpose & Capability

Name/description match the included scripts and references: the code reads a Zotero SQLite DB and storage, extracts metadata and PDF text, creates embeddings, and writes local vector store files. Environment variables referenced (ZOTERO_DATA_DIR, ZOTERO_DB, ZOTERO_STORAGE, ZOTERO_VECTORS_DIR) are appropriate for locating Zotero data and output and are optional. No unrelated credentials, binaries, or config paths are requested.

✓ Instruction Scope

SKILL.md describes and limits runtime behavior to detecting Zotero paths, snapshotting the DB, extracting text, creating embeddings, verifying counts, and backing up/writing vector store files. The included scripts follow that flow and do not attempt to read unrelated system files or contact external endpoints (beyond embedding/model loading which is expected). Note: the workflow relies on creating DB snapshots and writing backups/output files; the 'ask for user confirmation before applying updates' is a process rule (scripts don't prompt interactively) — the agent/user must run check_incremental_updates first and then explicitly run apply_incremental_updates.

✓ Install Mechanism

No install spec is included (instruction-only installation), and all code is provided. Dependencies are standard Python packages for embedding and PDF extraction (sentence-transformers, torch, PyMuPDF, numpy). There are no downloads from obscure URLs in the install phase. Runtime model downloads (HuggingFace/SentenceTransformers) may occur when loading embeddings — this is expected for the purpose, but is a network operation to be aware of.

✓ Credentials

The skill requests no secrets or privileged environment variables. It optionally reads path-related env vars for convenience (ZOTERO_*), which are coherent with its function. There are no unrelated tokens, passwords, or cloud credentials requested.

✓ Persistence & Privilege

always is false and the skill does not request permanent elevated platform privileges. It writes backups and store files only to the configured output directory and snapshots a user-provided Zotero DB path; it does not alter other skills or system-wide agent settings.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install zotero-vectorize
After installation, invoke the skill by name or use /zotero-vectorize
Provide required inputs per the skill's parameter spec and get structured output

Version History

v0.1.0

- Initial release of zotero-vectorize. - Build and maintain a local, cross-platform Zotero semantic index using embeddings for metadata and PDF full-text chunks. - Strictly read-only input from Zotero; no modifications to user data. - Comprehensive, step-by-step workflow for builds, incremental updates, backups, and verification. - Detailed integration with reference files per platform and troubleshooting guidance. - Automated reporting of counts, file sizes, and operational summaries after actions.

Metadata

Slug zotero-vectorize

Version 0.1.0

License MIT-0

All-time Installs 1

Active Installs 1

Total Versions 1

Frequently Asked Questions

What is Zotero Vectorize?

Build and maintain a cross-platform local Zotero semantic index using metadata embeddings and PDF full-text chunk embeddings. Use when the user asks to vecto... It is an AI Agent Skill for Claude Code / OpenClaw, with 294 downloads so far.

How do I install Zotero Vectorize?

Run "/install zotero-vectorize" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Zotero Vectorize free?

Yes, Zotero Vectorize is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Zotero Vectorize support?

Zotero Vectorize is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Zotero Vectorize?

It is built and maintained by yckbz (@yckbz); the current version is v0.1.0.

More Skills