← Back to Skills Marketplace
redredrrred

2026-3-22dataset

by redredrrred · GitHub ↗ · v1.0.5 · MIT-0
cross-platform ✓ Security Clean
122
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install 2026-3-22dataset
Description
Use for RAGFlow dataset tasks: create, list, inspect, update, or delete datasets; upload, list, update, or delete documents; start or stop parsing; check par...
README (SKILL.md)

RAGFlow Dataset And Retrieval

Use only the bundled scripts in scripts/. Prefer --json so returned fields can be relayed exactly. Follow reference.md for all user-facing output.

Use This Skill When

  • the user wants to create, list, inspect, update, or delete RAGFlow datasets
  • the user wants to upload, list, update, or delete documents in a dataset
  • the user wants to start parsing, stop parsing, or check parse progress
  • the user wants to retrieve chunks from one or more datasets
  • the user wants to list configured RAGFlow models

Core Workflow

  1. Resolve the target dataset or document IDs first.
  2. Run the matching script from scripts/.
  3. Use --json unless a script only needs a simple text response.
  4. Return API fields exactly; do not guess missing details.

Common commands:

python3 scripts/datasets.py list --json
python3 scripts/datasets.py info DATASET_ID --json
python3 scripts/datasets.py create "Example Dataset" --description "Quarterly reports" --json
python3 scripts/update_dataset.py DATASET_ID --name "Updated Dataset" --json
python3 scripts/upload.py DATASET_ID /path/to/file.pdf --json
python3 scripts/upload.py list DATASET_ID --json
python3 scripts/update_document.py DATASET_ID DOC_ID --name "Updated Document" --json
python3 scripts/parse.py DATASET_ID DOC_ID1 [DOC_ID2 ...] --json
python3 scripts/stop_parse_documents.py DATASET_ID DOC_ID1 [DOC_ID2 ...] --json
python3 scripts/parse_status.py DATASET_ID --json
python3 scripts/search.py "query" --json
python3 scripts/search.py "query" DATASET_ID --json
python3 scripts/search.py --dataset-ids DATASET_ID1,DATASET_ID2 --doc-ids DOC_ID1,DOC_ID2 "query" --json
python3 scripts/search.py --retrieval-test --kb-id DATASET_ID "query" --json
python3 scripts/list_models.py --json

Guardrails

  • For any delete action, list the exact items first and require explicit user confirmation before executing.
  • Delete only by explicit dataset IDs or document IDs. If the user gives names or fuzzy descriptions, resolve IDs first.
  • Upload does not start parsing. Start parsing only when the user asks for it.
  • parse.py returns immediately after the start request; use parse_status.py for progress.
  • For progress requests, use parse_status.py on the most specific scope available:
    • dataset specified: inspect that dataset
    • document IDs specified: pass --doc-ids
    • no dataset specified: list datasets first, then aggregate status across datasets
  • If a parse status result includes progress_msg, surface it directly. For FAIL, treat it as the primary error detail.
  • Use --retrieval-test only for single-dataset debugging or when the user explicitly asks for that endpoint.

Output Rules

  • Follow reference.md.
  • Use tables for 3+ items when possible.
  • Preserve api_error, error, message, and related fields exactly as returned.
  • Never fabricate progress percentages or inferred causes.
Usage Guidance
This skill appears to do what it advertises: run the included Python scripts against a RAGFlow API. Before installing or enabling it, make sure RAGFLOW_API_URL points to a trusted RAGFlow server and provide only an API key you intend to allow for dataset/document operations. The skill will perform HTTP requests to that URL using the provided key, including endpoints that can list configured LLMs (list_models exposes api_base and other model metadata). The agent's SKILL.md asks you (the agent) to require explicit confirmation for deletes — verify that your agent actually prompts users for confirmation before destructive actions. If you need tighter control, restrict the API key's permissions (principle of least privilege) or use a read-only key for listing/status use cases.
Capability Analysis
Type: OpenClaw Skill Name: 2026-3-22dataset Version: 1.0.5 The skill bundle provides a legitimate set of tools for interacting with the RAGFlow API to manage datasets and documents. The Python scripts in the `scripts/` directory (e.g., `datasets.py`, `upload.py`, `search.py`) use standard libraries like `urllib.request` to perform API operations and follow the instructions outlined in `SKILL.md`. There is no evidence of malicious intent, data exfiltration to unauthorized endpoints, or harmful prompt injection.
Capability Assessment
Purpose & Capability
Name/description match the included scripts and env vars: the bundle implements dataset create/list/info/update/delete, document upload/list/update/delete, parsing control, status, search, and model listing. Required env vars (RAGFLOW_API_URL, RAGFLOW_API_KEY) and python3 are appropriate for interacting with a RAGFlow HTTP API.
Instruction Scope
SKILL.md instructs the agent to run the shipped scripts under scripts/ and to prefer --json, and the scripts perform only HTTP calls to the configured RAGFlow URL using the declared env vars. Scripts do not read unrelated system files or other environment variables, and guardrails (resolve IDs first, require explicit delete IDs) are present in policy text.
Install Mechanism
No install spec is provided (instruction-only for runtime usage); scripts are shipped with the skill and executed with python3. No remote downloads or archive extraction are performed by an installer.
Credentials
The only sensitive items requested are RAGFLOW_API_URL and RAGFLOW_API_KEY (primaryEnv), which are directly required to call the service the skill targets. The number and type of env vars are proportionate to the skill's functionality.
Persistence & Privilege
The skill does not request always: true and does not attempt to modify other skills or system-wide agent settings. It requires no persistent installation steps or special privileges beyond running the bundled scripts.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install 2026-3-22dataset
  3. After installation, invoke the skill by name or use /2026-3-22dataset
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.5
Version 1.0.5 Changelog - Expanded documentation on usage, guardrails, and core workflow for RAGFlow dataset management. - Clarified command scripts for creating, listing, updating, deleting datasets and documents, including upload and parsing workflows. - Added strict output and deletion confirmation rules to prevent accidental data loss and ensure accurate API field reporting. - Detailed handling instructions for parsing status requests, error propagation, and retrieval query options. - Updated environment variable and dependency requirements in metadata.
Metadata
Slug 2026-3-22dataset
Version 1.0.5
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is 2026-3-22dataset?

Use for RAGFlow dataset tasks: create, list, inspect, update, or delete datasets; upload, list, update, or delete documents; start or stop parsing; check par... It is an AI Agent Skill for Claude Code / OpenClaw, with 122 downloads so far.

How do I install 2026-3-22dataset?

Run "/install 2026-3-22dataset" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is 2026-3-22dataset free?

Yes, 2026-3-22dataset is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does 2026-3-22dataset support?

2026-3-22dataset is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created 2026-3-22dataset?

It is built and maintained by redredrrred (@redredrrred); the current version is v1.0.5.

💬 Comments