← Back to Skills Marketplace
iasgu

环评知识库提炼

by iasgu · GitHub ↗ · v2.3.0 · MIT-0
cross-platform ✓ Security Clean
176
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install eia-knowledge-extractor
Description
环评报告知识库提炼工具 - 从环评报告表中提取结构化知识库文件,支持PDF/DOCX解析
README (SKILL.md)

环评知识库提炼

从环境影响评价报告表中自动提取环境数据,生成结构化知识库文件。

输入

环评报告表文件(支持PDF、DOCX、DOC、TXT格式)

输出

生成5个知识库CSV文件:

1. 污染因子知识库(19字段)

污染物ID、行业、区域、产污工段、产污设施、原辅材料、污染物条件1-3、污染物名称、污染因子名称、污染物种类、排放位置、标准条件1-3、适用标准、标准限值-浓度/速率/高度/其他、备注、出处

2. 废气源强核算知识库(15字段)

污染物ID、污染物种类、污染因子种类、行业、区域、核算污染因子、产生量核算方法类型/依据/方法、核算公式、所需参数、产污系数、类比项目规模信息/污染物量、出处

3. 废水源强核算知识库(15字段)

同上

4. 固废源强核算知识库(16字段)

污染物ID、污染物种类、污染因子种类、固废类型、危废代码、行业、区域、核算污染因子、产生量核算方法类型/依据/方法、核算公式、所需参数、产污系数、类比项目规模信息/污染物量、出处

5. 噪声源强核算知识库(13字段)

污染物ID、污染物种类、污染因子种类、行业、区域、噪声源、规格型号、声源类型、计量单位/方式、声源源强值、降噪措施、降噪后源强值、出处

数据规范

  • 污染物ID格式行业_原辅料_产污工段_其他条件_污染物
  • 示例通用设备制造业_铸件_抛丸_/_抛丸粉尘
  • 污染物种类:废气、废水、固废、噪声
  • 空值:用 / 表示

质量核查

生成后需核查:文件完整性(5个CSV)、命名规则、字段完整性(19/15/15/16/13)、与原报告一致性(不可缩写/缩减/缺漏)

使用方法

python scripts/main.py report.pdf -o output_dir

依赖

pip install pymupdf pandas python-docx
Usage Guidance
This skill appears to do what it says: parse EIA report files and produce CSV knowledge bases. Before installing or running it: 1) run it in an isolated environment (virtualenv/container) and inspect outputs; 2) test with non-sensitive sample PDFs to confirm the extractor and table parsing meet your needs — many extract_* functions are currently stubs and may produce empty CSVs; 3) note the small doc mismatch (SKILL.md says 5 CSVs, code writes an additional emission-standards CSV) and verify the exact files produced; 4) review produced CSVs for correctness before using them in downstream systems; and 5) install dependencies from PyPI only (pip install -r requirements.txt) and avoid running untrusted binaries.
Capability Analysis
Type: OpenClaw Skill Name: eia-knowledge-extractor Version: 2.3.0 The skill bundle is a specialized tool designed to extract structured environmental data from PDF and DOCX reports into CSV format. The code in scripts/main.py and associated extraction scripts uses legitimate libraries (PyMuPDF, Pandas, python-docx) for document parsing and includes a robust data validation framework. No evidence of malicious intent, data exfiltration, or prompt injection was found; the logic is consistent with the stated purpose of environmental knowledge extraction.
Capability Assessment
Purpose & Capability
Name/description (environment impact report → structured CSV knowledge bases) matches the included Python scripts and declared dependencies (pymupdf, pandas, python-docx). The skill does not request unrelated credentials or system config paths. The presence of extract_* and main.py scripts is coherent with the stated purpose.
Instruction Scope
SKILL.md instructs running scripts/main.py on PDF/DOCX and installing the listed Python libs — that matches the code. However, multiple extraction functions are currently placeholders (they return empty lists or 'pass'), so in practice some outputs may be empty or require further implementation and manual review. Also SKILL.md lists '5' CSV knowledge files but the code supports writing an extra emission-standards CSV (i.e., 6 outputs) — a minor documentation mismatch to be aware of.
Install Mechanism
No install specification (instruction-only for environment setup) and the requirements.txt lists standard PyPI packages. No downloads from arbitrary URLs, no archive extraction, and no package managers invoked automatically. Risk is low; user should install dependencies in a virtualenv to limit scope.
Credentials
The skill requires no environment variables, no credentials, and reads only user-supplied report files and writes CSV/text output locally. There are no requests for unrelated secrets or system-level configs.
Persistence & Privilege
Registry flags (always: false, normal model invocation) are standard. The skill writes output files to disk (CSV, reports) under the specified output directory — expected for this tool. It does not modify other skills or system-wide agent settings.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install eia-knowledge-extractor
  3. After installation, invoke the skill by name or use /eia-knowledge-extractor
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v2.3.0
V2.3: 新增质量核查要求,精简文档
Metadata
Slug eia-knowledge-extractor
Version 2.3.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is 环评知识库提炼?

环评报告知识库提炼工具 - 从环评报告表中提取结构化知识库文件,支持PDF/DOCX解析. It is an AI Agent Skill for Claude Code / OpenClaw, with 176 downloads so far.

How do I install 环评知识库提炼?

Run "/install eia-knowledge-extractor" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is 环评知识库提炼 free?

Yes, 环评知识库提炼 is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does 环评知识库提炼 support?

环评知识库提炼 is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created 环评知识库提炼?

It is built and maintained by iasgu (@iasgu); the current version is v2.3.0.

💬 Comments