← Back to Skills Marketplace
294
Downloads
0
Stars
1
Active Installs
1
Versions
Install in OpenClaw
/install zotero-vectorize
Description
Build and maintain a cross-platform local Zotero semantic index using metadata embeddings and PDF full-text chunk embeddings. Use when the user asks to vecto...
Usage Guidance
This bundle appears to do what it says: it reads your local Zotero DB/storage, snapshots the DB, extracts PDF text, and builds local JSON vector stores. Before running: (1) review/confirm the output directory to avoid overwriting files; (2) run in a Python virtualenv and install the listed packages; (3) expect the embedding model to be downloaded (may be large and require network access) — you can pre-download models if offline; (4) close Zotero while snapshotting to avoid SQLite lock errors; (5) the safety rule to obtain user confirmation before applying incremental updates is procedural (the apply script does not itself prompt), so only run apply_incremental_updates after reviewing the check_incremental_updates report. If you need the skill to never download models automatically, inspect or modify get_embedding_model/encode_texts to load local models only.
Capability Analysis
Type: OpenClaw Skill
Name: zotero-vectorize
Version: 0.1.0
The zotero-vectorize skill bundle is a legitimate tool designed to create a local semantic index of a Zotero library using embeddings. The scripts (e.g., build_metadata_vectors.py, build_fulltext_vectors.py) and the shared library (zotero_vectorize_lib.py) perform standard file operations, SQLite database snapshotting, and text extraction using well-known libraries like PyMuPDF and sentence-transformers. The instructions in SKILL.md emphasize safety, such as treating the Zotero database as read-only and requiring user confirmation before applying incremental updates, with no evidence of malicious intent, data exfiltration, or prompt injection.
Capability Assessment
Purpose & Capability
Name/description match the included scripts and references: the code reads a Zotero SQLite DB and storage, extracts metadata and PDF text, creates embeddings, and writes local vector store files. Environment variables referenced (ZOTERO_DATA_DIR, ZOTERO_DB, ZOTERO_STORAGE, ZOTERO_VECTORS_DIR) are appropriate for locating Zotero data and output and are optional. No unrelated credentials, binaries, or config paths are requested.
Instruction Scope
SKILL.md describes and limits runtime behavior to detecting Zotero paths, snapshotting the DB, extracting text, creating embeddings, verifying counts, and backing up/writing vector store files. The included scripts follow that flow and do not attempt to read unrelated system files or contact external endpoints (beyond embedding/model loading which is expected). Note: the workflow relies on creating DB snapshots and writing backups/output files; the 'ask for user confirmation before applying updates' is a process rule (scripts don't prompt interactively) — the agent/user must run check_incremental_updates first and then explicitly run apply_incremental_updates.
Install Mechanism
No install spec is included (instruction-only installation), and all code is provided. Dependencies are standard Python packages for embedding and PDF extraction (sentence-transformers, torch, PyMuPDF, numpy). There are no downloads from obscure URLs in the install phase. Runtime model downloads (HuggingFace/SentenceTransformers) may occur when loading embeddings — this is expected for the purpose, but is a network operation to be aware of.
Credentials
The skill requests no secrets or privileged environment variables. It optionally reads path-related env vars for convenience (ZOTERO_*), which are coherent with its function. There are no unrelated tokens, passwords, or cloud credentials requested.
Persistence & Privilege
always is false and the skill does not request permanent elevated platform privileges. It writes backups and store files only to the configured output directory and snapshots a user-provided Zotero DB path; it does not alter other skills or system-wide agent settings.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install zotero-vectorize - After installation, invoke the skill by name or use
/zotero-vectorize - Provide required inputs per the skill's parameter spec and get structured output
Version History
v0.1.0
- Initial release of zotero-vectorize.
- Build and maintain a local, cross-platform Zotero semantic index using embeddings for metadata and PDF full-text chunks.
- Strictly read-only input from Zotero; no modifications to user data.
- Comprehensive, step-by-step workflow for builds, incremental updates, backups, and verification.
- Detailed integration with reference files per platform and troubleshooting guidance.
- Automated reporting of counts, file sizes, and operational summaries after actions.
Metadata
Frequently Asked Questions
What is Zotero Vectorize?
Build and maintain a cross-platform local Zotero semantic index using metadata embeddings and PDF full-text chunk embeddings. Use when the user asks to vecto... It is an AI Agent Skill for Claude Code / OpenClaw, with 294 downloads so far.
How do I install Zotero Vectorize?
Run "/install zotero-vectorize" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is Zotero Vectorize free?
Yes, Zotero Vectorize is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does Zotero Vectorize support?
Zotero Vectorize is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created Zotero Vectorize?
It is built and maintained by yckbz (@yckbz); the current version is v0.1.0.
More Skills