← 返回 Skills 市场
ezhencacao-dotcom

Bid Reader

作者 ezhencacao-dotcom · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ✓ 安全检测通过
67
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install bid-reader
功能描述
Extracts and returns plain text from PDF, Word (.docx), and Excel (.xlsx/.xls) bid documents for analysis, search, or summarisation.
使用说明 (SKILL.md)

bid-reader Skill

Overview

A lightweight skill to extract readable text from bid and tender documents in PDF, Word (.docx), and Excel (.xlsx/.xls) formats. It can be invoked from the OpenClaw UI or other agents to quickly pull the full textual content of a file for analysis, search, or summarisation.

Usage

bid-read \x3Cfile-path>
  • \x3Cfile-path> should be an absolute or workspace‑relative path to a document.
  • The skill prints the extracted plain‑text to stdout, which OpenClaw captures and returns to the caller.

Example

bid-read /home/zhenxing/投标文件/招投标项目1/13.上海联通/投标文件.pdf

The command returns the full text of the PDF, ready for further processing (e.g., keyword search, summarisation).

Installation

Copy the skill folder into your workspace under skills/bid-reader. Install required Python packages:

pip install -r $(pwd)/skills/bid-reader/requirements.txt

The skill is then available as an agent command.

Implementation Details

  • PDF: Uses pdfplumber to extract text page‑by‑page.
  • Word: Uses python-docx to read paragraphs.
  • Excel: Uses pandas (with openpyxl/xlrd) to read all sheets and concatenate cell values.

Limitations

  • Only .pdf, .docx, .xlsx, and .xls are supported. Other formats will be ignored.
  • Large files may take a few seconds to process.
  • Tables are flattened into whitespace‑separated rows; complex formatting is not preserved.

Future Enhancements

  • Add OCR fallback for scanned PDFs (e.g., via pytesseract).
  • Support selective page or sheet extraction.
  • Provide a JSON output mode with structural metadata.
安全使用建议
This skill appears internally consistent and limited to extracting text from files. Before installing: (1) install the Python deps in an isolated virtual environment to avoid polluting your system/site packages, (2) be aware the script will read any file path you pass it — do not point it at sensitive files you don't want processed, (3) the repo does not provide a 'bid-read' executable wrapper; run the script directly (python scripts/read.py /path/to/file) or add your own launcher, and (4) note that .doc support is advertised but likely unsupported by python-docx (use .docx). If you need OCR for scanned PDFs or stricter sandboxing, add those safeguards before using on untrusted documents.
能力评估
Purpose & Capability
Name/description match the code and requirements: pdfplumber, python-docx, pandas/openpyxl are appropriate for extracting text from PDF, .docx and Excel files. Minor mismatch: SKILL.md advertises a 'bid-read' command but no wrapper is provided in the bundle (you'd need to invoke scripts/read.py or create an executable entrypoint). The code also accepts '.doc' extension but uses python-docx (which only supports .docx), so '.doc' handling is likely to fail — a correctness issue, not a security one.
Instruction Scope
SKILL.md instructs the agent to read a workspace or absolute file path and print extracted text to stdout. The implementation only reads the provided file and does not access other files, network endpoints, or environment variables. There is no broad or vague instruction to gather extra context or scan unrelated paths.
Install Mechanism
No packaged install spec is provided (instruction-only plus a local script). Installation relies on pip installing requirements.txt from PyPI, which is normal for a Python utility but will fetch third-party packages over the network; consider installing in a virtual environment. Nothing in the install steps pulls code from arbitrary URLs or personal servers.
Credentials
The skill declares no required environment variables, credentials, or config paths and the code does not read any env vars. Requested permissions are minimal and proportional to the task.
Persistence & Privilege
always is false and the skill is user-invocable; it does not request permanent presence or modify other skills or system-wide settings. Autonomous invocation is allowed by default but the skill's operations are limited to local file reading when invoked.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install bid-reader
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /bid-reader 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
Initial release of bid-reader skill. - Extracts readable text from PDF, Word (.docx), and Excel (.xlsx/.xls) bid/tender documents. - Simple command-line usage: bid-read <file-path>. - Outputs plain text for search, summarisation, or further analysis. - Supports invocation from OpenClaw UI or other agents. - Handles PDF via pdfplumber, Word via python-docx, Excel via pandas/openpyxl/xlrd. - Ignores unsupported formats; flattens tables and does not preserve complex formatting.
元数据
Slug bid-reader
版本 1.0.0
许可证 MIT-0
累计安装 0
当前安装数 0
历史版本数 1
常见问题

Bid Reader 是什么?

Extracts and returns plain text from PDF, Word (.docx), and Excel (.xlsx/.xls) bid documents for analysis, search, or summarisation. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 67 次。

如何安装 Bid Reader?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install bid-reader」即可一键安装,无需额外配置。

Bid Reader 是免费的吗?

是的,Bid Reader 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Bid Reader 支持哪些平台?

Bid Reader 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Bid Reader?

由 ezhencacao-dotcom(@ezhencacao-dotcom)开发并维护,当前版本 v1.0.0。

💬 留言讨论