
Rag Retriever

Author: yuyonghao-123 · GitHub · v0.1.0 · MIT-0

cross-platform ⚠ suspicious

Total downloads: 139

Current installs: 0

Versions: 1

Install in OpenClaw

/install rag-retriever

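Description

A RAG 2.0 retrieval system built on document chunking, simple term-frequency embeddings, and hybrid vector-plus-keyword search, with support for Chinese and English text and source citations.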
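To make "simple term-frequency embeddings" concrete, here is a minimal Python sketch of the general idea. It is an illustration only and is not taken from the skill's JavaScript source: the function names are hypothetical, and splitting Chinese text into single characters is an assumed, crude stand-in for real word segmentation.

```python
import math
import re
from collections import Counter


def tokenize(text: str) -> list[str]:
    """Lowercase ASCII words plus individual CJK characters.

    Assumption: per-character splitting stands in for proper Chinese
    word segmentation, which the skill itself may do differently.
    """
    return re.findall(r"[a-z0-9]+|[\u4e00-\u9fff]", text.lower())


def tf_vector(text: str) -> dict[str, float]:
    """Sparse term-frequency vector: token -> count / total tokens."""
    tokens = tokenize(text)
    total = len(tokens)
    if total == 0:
        return {}
    return {tok: cnt / total for tok, cnt in Counter(tokens).items()}


def cosine(a: dict[str, float], b: dict[str, float]) -> float:
    """Cosine similarity between two sparse vectors (0.0 if either is empty)."""
    dot = sum(weight * b.get(tok, 0.0) for tok, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)
```

A query and each chunk would be embedded with `tf_vector`, then chunks ranked by `cosine` against the query. Vectors like these capture only word overlap, not meaning, which is one reason a retriever of this kind pairs them with keyword search.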

Security guidance

This skill is largely coherent with its advertised RAG retriever purpose, but please consider the following before installing:
1) npm install will fetch non-trivial native-heavy packages (transformers, onnxruntime, sharp). Verify you are comfortable with the build time and disk usage, and run inside a controlled environment if unsure.
2) The code includes an OpenAI embedding provider that will call https://api.openai.com/v1/embeddings if you supply an API key (process.env.OPENAI_API_KEY). The registry metadata doesn't declare that env var, so only provide keys if you intend to use OpenAI and understand that network calls will be made.
3) The skill writes cache and database files under its local data/ directory (embedding-cache.json, LanceDB files), so ensure you trust the documents you add.
4) If you need higher assurance, review package-lock.json for dependency origins, run the skill in a sandbox/container, or audit the few JS files that perform network calls (embeddings.js) and file writes before granting credentials.

Functional analysis

Type: OpenClaw Skill
Name: rag-retriever
Version: 0.1.0

The bundle implements a functional RAG (Retrieval-Augmented Generation) system providing document chunking, hybrid search (BM25 and vector), and citation management. It supports multiple embedding providers, including OpenAI and local models via Transformers.js, and utilizes LanceDB for vector storage. The code logic in files like `src/rag2.js`, `src/hybrid-search.js`, and `src/retriever.js` is consistent with the stated purpose of document retrieval and context augmentation. No evidence of malicious intent, data exfiltration, or harmful prompt injection was found; the network requests to OpenAI and Hugging Face mirrors (hf-mirror.com) are standard for this type of application.
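Hybrid search has to merge a vector ranking and a BM25 ranking into one list; the version history on this page names RRF (reciprocal rank fusion) as the fusion step. The Python sketch below shows the standard RRF formula and is not the code in `src/hybrid-search.js`. The function name and the constant `k = 60` (a commonly used default) are assumptions.

```python
def rrf_fuse(rankings: list[list[str]], k: int = 60) -> list[tuple[str, float]]:
    """Fuse several ranked lists of document ids with reciprocal rank fusion.

    Each document earns 1 / (k + rank) from every list it appears in,
    where rank starts at 1. Higher fused scores rank first; ties are
    broken by document id so the output is deterministic.
    """
    scores: dict[str, float] = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    return sorted(scores.items(), key=lambda item: (-item[1], item[0]))


# Hypothetical result lists from the two retrievers.
vector_hits = ["a", "b", "c"]
bm25_hits = ["b", "c", "d"]
fused_ids = [doc_id for doc_id, _ in rrf_fuse([vector_hits, bm25_hits])]
```

Because RRF uses only rank positions, it needs no score normalization between cosine similarities and BM25 scores, which live on different scales.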

Capability assessment

ℹ Purpose & Capability

Name/description, README and SKILL.md match the included code and data: document chunking, simple TF embeddings, BM25, LanceDB storage and hybrid search are implemented. Minor mismatch: the code contains an OpenAIEmbedding provider that will call OpenAI's embeddings endpoint if used, but the skill's registry metadata does not declare OPENAI_API_KEY (no required env vars). This is optional behavior and coherent with the stated plan to optionally integrate OpenAI embeddings.
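For readers unfamiliar with the keyword half of the search, the following Python sketch scores pre-tokenized documents with the textbook Okapi BM25 formula. It is a generic illustration, not the skill's implementation; the function name and the parameter defaults `k1 = 1.5` and `b = 0.75` are assumptions.

```python
import math
from collections import Counter


def bm25_scores(query_tokens: list[str],
                docs_tokens: list[list[str]],
                k1: float = 1.5,
                b: float = 0.75) -> list[float]:
    """Return one BM25 score per document for the given query tokens."""
    n_docs = len(docs_tokens)
    if n_docs == 0:
        return []
    avg_len = sum(len(doc) for doc in docs_tokens) / n_docs

    # Document frequency: in how many documents each term occurs.
    doc_freq: Counter = Counter()
    for doc in docs_tokens:
        doc_freq.update(set(doc))

    scores = []
    for doc in docs_tokens:
        term_freq = Counter(doc)
        score = 0.0
        for term in query_tokens:
            tf = term_freq[term]
            if tf == 0:
                continue  # term absent from this document
            df = doc_freq[term]
            idf = math.log(1 + (n_docs - df + 0.5) / (df + 0.5))
            length_norm = 1 - b + b * len(doc) / avg_len
            score += idf * tf * (k1 + 1) / (tf + k1 * length_norm)
        scores.append(score)
    return scores
```

Sorting document indices by these scores gives the keyword ranking that a hybrid retriever can then combine with the vector ranking.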

✓ Instruction Scope

SKILL.md instructs only to run local CLI commands (init/add/search/rag) and use the provided JavaScript API; runtime instructions and implementation operate on local files and the included LanceDB path. There are no instructions to scan arbitrary system files or to exfiltrate agent data. The only external network call in code is to the OpenAI embeddings API when the OpenAIEmbedding provider is used, which aligns with an embedding provider.

ℹ Install Mechanism

There is no special install spec in registry metadata (instruction-only), but package.json and package-lock indicate npm install is expected. npm will pull sizeable dependencies (e.g., @huggingface/transformers, onnxruntime variants, sharp) which may compile native modules. The repository also includes local model/tokenizer JSON files (large assets) — this increases disk usage but is not inherently malicious. Review native dependency installation and disk requirements before installing.

ℹ Credentials

The skill declares no required environment variables, which is reasonable for its local/simple-embedding default. However, the OpenAIEmbedding implementation will use process.env.OPENAI_API_KEY if present or supplied — that credential is not declared in the metadata. No other credentials or unrelated env vars are requested. If you plan to use OpenAI embeddings, you must provide an API key; otherwise the default SimpleEmbedding is used.
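The opt-in behavior described above can be sketched as follows. This is a hypothetical Python illustration of the policy, not the skill's actual selection logic in JavaScript: the function name, the `requested` parameter, and the returned dictionary shape are all assumptions. The point is that the key is read only when the OpenAI provider is explicitly requested.

```python
import os
from typing import Mapping, Optional


def resolve_provider(requested: str = "simple",
                     api_key: Optional[str] = None,
                     env: Optional[Mapping[str, str]] = None) -> dict:
    """Pick an embedding provider without touching credentials by default.

    Assumed policy: the local "simple" provider is the default and never
    reads OPENAI_API_KEY; "openai" must be requested explicitly and fails
    fast when no key was passed in or found in the environment.
    """
    env = os.environ if env is None else env
    if requested != "openai":
        return {"provider": "simple", "api_key": None}
    key = api_key or env.get("OPENAI_API_KEY")
    if not key:
        raise ValueError("OPENAI_API_KEY is required for the OpenAI embedding provider")
    return {"provider": "openai", "api_key": key}
```

Under this policy a key that happens to be present in the environment has no effect unless the caller asks for OpenAI embeddings, so no network call is made by default.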

✓ Persistence & Privilege

The skill is not always-enabled, is user-invocable, and does not request system-wide privileges or modify other skills. It writes caches and LanceDB files to its local data/ directory (e.g., data/embedding-cache.json, data/lancedb), which is expected for a retriever and is scoped to the skill's folder.

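How to use

1) Make sure OpenClaw is installed (locally or via Docker).
2) Enter the install command in the chat box: /install rag-retriever
3) Once installed, invoke the skill by name or trigger it with /rag-retriever.
4) Provide the inputs described in the skill's parameter documentation to receive structured output.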

Version history

v0.1.0

Initial release of RAG 2.0 retrieval system for OpenClaw.
- Implements document chunking with configurable size and overlap.
- Supports simple text embedding (TF-based) and LanceDB vector storage.
- Provides hybrid search: vector similarity plus BM25 keyword search (RRF fusion).
- Enables source citation tracking and context-augmented RAG prompts.
- Adds Chinese segmentation for multilingual search.
- Includes CLI tool, JavaScript API, and basic tests.
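As an illustration of "chunking with configurable size and overlap", here is a minimal Python sketch of a character-based sliding window. The skill's real chunker may split on sentences or tokens instead. The function name, the default values, and the `start`/`end` fields are assumptions; the offsets are kept so that a chunk can later be cited back to its position in the source.

```python
def chunk_text(text: str, size: int = 200, overlap: int = 50) -> list[dict]:
    """Split text into windows of `size` characters that overlap by `overlap`.

    Each chunk records its character offsets in the original text.
    """
    if size <= 0:
        raise ValueError("size must be positive")
    if not 0 <= overlap < size:
        raise ValueError("overlap must satisfy 0 <= overlap < size")

    step = size - overlap
    chunks = []
    for start in range(0, len(text), step):
        piece = text[start:start + size]
        chunks.append({"start": start, "end": start + len(piece), "text": piece})
        if start + size >= len(text):
            break  # this window already reached the end of the text
    return chunks


pieces = [c["text"] for c in chunk_text("abcdefghij", size=4, overlap=2)]
# pieces == ["abcd", "cdef", "efgh", "ghij"]
```

Overlap keeps a sentence that straddles a boundary retrievable from at least one chunk, at the cost of storing some text twice.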

Metadata

Slug: rag-retriever

Version: 0.1.0

License: MIT-0

Total installs: 0

Current installs: 0

Historical versions: 1

FAQ

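What is Rag Retriever?

Rag Retriever is a RAG 2.0 retrieval system built on document chunking, simple term-frequency embeddings, and hybrid vector-plus-keyword search, with support for Chinese and English text and source citations. It is an AI agent skill plugin for Claude Code / OpenClaw and has 139 total downloads to date.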

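How do I install Rag Retriever?

Run /install rag-retriever in the OpenClaw or Claude Code chat box to install it in one step. No extra configuration is needed for the default SimpleEmbedding provider; the optional OpenAI embedding provider requires an API key.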

Is Rag Retriever free?

Yes. Rag Retriever is completely free and released under the MIT-0 license, so you can download, install, and use it freely.

Which platforms does Rag Retriever support?

Rag Retriever is cross-platform and can be used in any environment where OpenClaw / Claude Code is deployed.

Who develops Rag Retriever?

Rag Retriever is developed and maintained by yuyonghao-123 (@yuyonghao-123). The current version is v0.1.0.