← 返回 Skills 市场
kaiasdobi

Feishu Knowledge Ingest

作者 kaiasdobi · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ⚠ suspicious
105
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install feishu-knowledge-ingest
功能描述
batch ingest feishu folders and single attachments into report-first knowledge artifacts. use when chatgpt needs to read a feishu directory or a single share...
使用说明 (SKILL.md)

Feishu Knowledge Ingest

Use this skill to turn a Feishu folder or a single shared attachment into structured, reviewable knowledge outputs.

What this skill does

  • Accept a Feishu folder link/token or a single shared attachment.
  • Classify files into direct-read, download-and-parse, manual-review, or permission-blocked.
  • Parse .docx and .pdf in v0.1.
  • Produce report-first outputs instead of writing MEMORY.md directly.
  • Preserve failures and uncertainty instead of guessing content.

Supported v0.1 scope

Inputs

  • Feishu folder link or folder_token
  • Single shared attachment link or token

Parsing

  • .docx
  • .pdf

Outputs

  • ingest-report.md
  • kb-items.jsonl
  • failed-items.jsonl
  • MEMORY.candidate.md

Required behavior

  1. Distinguish Feishu native docs from uploaded attachments.
    • Native docs: doc, sheet, wiki, bitable
    • Uploaded attachments: .docx, .pdf, .pptx, other files
  2. Do not claim attachment content was learned unless text was actually extracted.
  3. Default to report-first. Do not write MEMORY.md in v0.1.
  4. Record every failed file with a concrete reason.
  5. Prefer plain-text summaries over complex Feishu cards when reporting progress.

File routing rules

Direct-read

Treat these as direct-read only when the runtime has a reliable native-reader path:

  • doc
  • sheet
  • wiki
  • bitable

Download-and-parse

Treat these as download-and-parse:

  • .docx
  • .pdf

Manual-review

Route here when the file is out of scope or low-confidence in v0.1:

  • .pptx
  • images
  • scans with no extractable text
  • archives
  • unusual file types

Permission-blocked

Route here when listing is possible but the file cannot be downloaded or read.

Standard workflow

  1. Resolve input type.
    • Folder link/token -> enumerate files.
    • Single file link/token -> build a one-file manifest.
  2. Create a batch record.
    • Generate batch_id.
    • Record started_at.
  3. Build a manifest.
    • File name
    • File token/link
    • file type
    • route decision
  4. Attempt extraction.
    • .docx -> use parsers/parse_docx.py
    • .pdf -> use parsers/parse_pdf.py
  5. Produce structured outputs.
    • success -> append to kb-items.jsonl
    • failure -> append to failed-items.jsonl
  6. Summarize the batch.
    • Write ingest-report.md
    • Write MEMORY.candidate.md
  7. Finish the batch.
    • Record finished_at
    • Never auto-write MEMORY.md

Output contracts

kb-items.jsonl

Write one JSON object per successfully extracted knowledge item with at least:

  • batch_id
  • source_file
  • source_token
  • file_type
  • topic
  • content_type
  • summary
  • extracted_at
  • confidence

failed-items.jsonl

Write one JSON object per failed or blocked file with at least:

  • batch_id
  • source_file
  • source_token
  • file_type
  • failure_reason
  • error_detail
  • suggested_action
  • failed_at

MEMORY.candidate.md

Include:

  • batch header (batch_id, started_at, finished_at, source_directory or source_file)
  • grouped knowledge summaries
  • source references
  • confidence notes
  • items needing review

ingest-report.md

Include:

  1. Batch summary
  2. Input scope
  3. File counts and routing counts
  4. Successful extraction summary
  5. Failures and risks
  6. Recommended next actions

Safety rules

  • Never invent text that was not extracted.
  • If parsing fails, say so plainly and log it.
  • Treat filenames as hints only, never as proof of document contents.
  • Keep sensitive data out of MEMORY.candidate.md unless the workflow explicitly allows it.

Included files

  • run.py: minimal batch runner for local testing
  • parsers/parse_docx.py: docx text extraction helper
  • parsers/parse_pdf.py: pdf text extraction helper
  • references/output_examples.md: sample output shapes and field guidance
  • README.md: setup and usage notes
安全使用建议
This package is a local, v0.1 skeleton for parsing .docx/.pdf files and producing report-first outputs — it does NOT actually connect to Feishu or accept Feishu tokens. If you install this expecting automatic Feishu folder ingestion, you will be disappointed: you'll need to add a Feishu listing/download adapter and credential handling. Before using: inspect run.py and the parser files (they only read local files and do not send data externally), install the Python dependencies (python-docx, pypdf) in a controlled environment, and be careful that any files you ingest don't contain sensitive information you don't want written into the generated outputs (kb-items.jsonl, failed-items.jsonl, MEMORY.candidate.md, ingest-report.md). If you want real Feishu integration, ask the author for the connector code or add secure credential requirements (and review any network/download code for where data is sent).
功能分析
Type: OpenClaw Skill Name: feishu-knowledge-ingest Version: 1.0.0 The skill bundle is a legitimate tool designed to parse local .docx and .pdf files and generate structured knowledge reports. The code in run.py and the parsers (parse_docx.py, parse_pdf.py) uses standard libraries (python-docx, pypdf) to extract text without any evidence of data exfiltration, malicious execution, or obfuscation. The SKILL.md instructions focus on maintaining data integrity and transparency by requiring a 'report-first' workflow and preventing direct modification of the agent's memory.
能力评估
Purpose & Capability
The name and description promise batch ingestion from Feishu folder links or shared-attachment tokens, but the code and README contain no Feishu API adapter, no network/download logic, and no environment variables for Feishu credentials. The runtime only processes local files in a directory, so someone expecting automatic Feishu integration would be misled.
Instruction Scope
SKILL.md describes resolving folder tokens, enumerating files, routing permission-blocked items, and preserving source tokens; run.py implements a local directory loop and sets source_token to an empty string. The instructions thus give the agent responsibilities (handle Feishu tokens, list remote files) that the provided code does not implement.
Install Mechanism
There is no install spec (instruction-only), but a requirements.txt lists python-docx and pypdf which are reasonable for the included parsers. No remote downloads or unusual install steps are present, so install risk is low — but the skill does not document how to install those requirements in the SKILL.md.
Credentials
SKILL.md expects inputs like 'folder_token' or shared-attachment links, yet requires.env is empty and there are no declared primary credentials. If you intend to enable live Feishu ingestion you would need to add credentials (which is not currently handled). The absence of any credential requirement is inconsistent with the stated purpose.
Persistence & Privilege
The skill is not always-on, does not request elevated platform privileges, and does not modify other skills or system-wide settings. It writes output files to a specified output directory (local disk) which is expected behaviour for an ingestion tool.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install feishu-knowledge-ingest
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /feishu-knowledge-ingest 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
feishu-knowledge-ingest 1.0.0 - Initial release of Feishu Knowledge Ingest for batch processing Feishu folders or single attachments into structured knowledge artifacts. - Supports parsing and extracting text from `.docx` and `.pdf` files; other file types are routed for manual review or failure logging. - Outputs include `ingest-report.md`, `kb-items.jsonl`, `failed-items.jsonl`, and `MEMORY.candidate.md`—never writes `MEMORY.md` directly. - Provides clear reporting of successes, failures, file routing decisions, and promotes review-first workflows. - Includes safety rules to prevent content invention and maintain data integrity.
元数据
Slug feishu-knowledge-ingest
版本 1.0.0
许可证 MIT-0
累计安装 0
当前安装数 0
历史版本数 1
常见问题

Feishu Knowledge Ingest 是什么?

batch ingest feishu folders and single attachments into report-first knowledge artifacts. use when chatgpt needs to read a feishu directory or a single share... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 105 次。

如何安装 Feishu Knowledge Ingest?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install feishu-knowledge-ingest」即可一键安装,无需额外配置。

Feishu Knowledge Ingest 是免费的吗?

是的,Feishu Knowledge Ingest 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Feishu Knowledge Ingest 支持哪些平台?

Feishu Knowledge Ingest 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Feishu Knowledge Ingest?

由 kaiasdobi(@kaiasdobi)开发并维护,当前版本 v1.0.0。

💬 留言讨论