← Back to Skills Marketplace

Rag Retriever

Name: Rag Retriever
Author: yuyonghao-123

by yuyonghao-123 · GitHub ↗ · v0.1.1 · MIT-0

cross-platform ⚠ suspicious

158

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install yuyonghao-rag-retriever

Description

RAG 2.0 检索系统，支持中文分词，混合向量与关键词搜索，智能文档分块与加权重排，自动生成 RAG 格式输出。

Usage Guidance

This skill appears to implement a legitimate RAG retriever, but exercise caution because: (1) the code will call the OpenAI Embeddings API if an OPENAI_API_KEY is available — that is not declared in the registry metadata or SKILL.md; providing a key will cause your texts to be sent to OpenAI. (2) npm install pulls heavy native deps (transformers, onnxruntime, sharp), so review resource/compatibility and run in a sandbox if possible. Before installing, either: (A) review src/embeddings.js and other source files to confirm what data is sent externally, (B) avoid setting OPENAI_API_KEY to force use of the local SimpleEmbedding fallback, or (C) run the skill in an isolated environment and inspect network traffic. If you plan to provide an API key, make sure you understand what documents/queries will be transmitted and consider data-sanitization or policy constraints. If you want to be extra safe, run tests locally and audit the code paths that call external APIs and write to ./data/.

Capability Analysis

Type: OpenClaw Skill Name: yuyonghao-rag-retriever Version: 0.1.1 The skill bundle implements a legitimate RAG (Retrieval-Augmented Generation) system featuring hybrid search (BM25 and vector), document chunking, and support for both OpenAI and local Transformers.js embeddings. The code (src/rag2.js, src/retriever.js) is well-structured, includes extensive tests, and follows standard practices for document retrieval systems. No evidence of data exfiltration, malicious execution, or harmful prompt injection was found; the use of a Hugging Face mirror (hf-mirror.com) and standard environment variables for API keys are consistent with the tool's stated purpose.

Capability Assessment

ℹ Purpose & Capability

The libraries and files (LanceDB, jieba, HuggingFace transformers, BM25 implementation) are appropriate for a RAG retriever and align with the description. The included local model/tokenizer artifacts also make sense for local embedding support. No obvious unrelated dependencies are present.

⚠ Instruction Scope

SKILL.md instructs npm install and running the CLI and JS APIs and documents using external embedding providers, but it does not mention that the code will call the OpenAI Embeddings API. The runtime instructions do not declare or warn about network requests or the need to supply an OpenAI API key, which the code will use if present.

ℹ Install Mechanism

There is no formal install spec in the registry entry, but SKILL.md/README instruct users to run 'npm install'. package.json and package-lock.json will pull many native and heavy dependencies (transformers, onnxruntime, sharp, etc.), which is expected for local transformer support but can be large and may require native build/runtime dependencies. This is expected for the skill's purpose but worth knowing.

⚠ Credentials

Registry metadata declares no required environment variables, but src/embeddings.js reads process.env.OPENAI_API_KEY and will POST user text to https://api.openai.com/v1/embeddings if used. That means sensitive data (documents or queries) can be sent to an external API if a key is provided — this external credential access is not declared in the skill metadata.

✓ Persistence & Privilege

The skill does write local caches and model/tokenizer artifacts under ./data/ (embedding cache, lancedb, model-cache). It does not request permanent 'always: true' privileges and does not attempt to modify other skills or global agent settings. Local file writes are expected for this functionality.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install yuyonghao-rag-retriever
After installation, invoke the skill by name or use /yuyonghao-rag-retriever
Provide required inputs per the skill's parameter spec and get structured output

Version History

v0.1.1

- Updated package version to 0.1.1 in package.json. - No other functional or documentation changes in this release.

v0.1.0

Initial release of RAG 2.0 document retrieval system for OpenClaw. - Implements document chunking with configurable size and overlap - Supports simple TF-normalized vector embedding and LanceDB storage - Provides hybrid retrieval: vector search combined with BM25 keyword search (RRF fusion) - Features source citation, context enhancement for RAG prompts, and Chinese/English mixed search with jieba tokenizer - Includes CLI tools, JavaScript API, and basic testing - Lays out roadmap for embedding model integration, hybrid search, reranker, and multi-collection management

Metadata

Slug yuyonghao-rag-retriever

Version 0.1.1

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 2

Frequently Asked Questions

What is Rag Retriever?

RAG 2.0 检索系统，支持中文分词，混合向量与关键词搜索，智能文档分块与加权重排，自动生成 RAG 格式输出。 It is an AI Agent Skill for Claude Code / OpenClaw, with 158 downloads so far.

How do I install Rag Retriever?

Run "/install yuyonghao-rag-retriever" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Rag Retriever free?

Yes, Rag Retriever is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Rag Retriever support?

Rag Retriever is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Rag Retriever?

It is built and maintained by yuyonghao-123 (@yuyonghao-123); the current version is v0.1.1.

More Skills