← 返回 Skills 市场

Visible Text Extractor

Name: Visible Text Extractor
Author: wunianze666-netizen

作者 wunianze666-netizen · GitHub ↗ · v1.2.0 · MIT-0

cross-platform ⚠ suspicious

153

总下载

当前安装

版本数

在 OpenClaw 中安装

/install visible-text-extractor

功能描述

Extract and reconstruct as much visible text as possible from webpage URLs, article pages, screenshots, long images, image directories, and GIFs. Use when th...

安全使用建议

This skill appears to do what it says, but before running it you should: (1) confirm required runtime tools exist (node, ffmpeg, Python, the local OCR stack referenced at /root/.openclaw/.../ocr-local and any virtualenv); the package metadata currently does not declare these dependencies, so expect runtime errors otherwise; (2) be aware it downloads arbitrary images/frames referenced by pages and writes temp files — if pages contain internal URLs this could cause server-side requests to your internal network (SSRF-like behavior); (3) the scripts optionally send the produced docx to Feishu when a receive-id is supplied — do not provide a receive-id unless you trust the destination; (4) review or supply trusted implementations for the referenced external scripts (ocr.js, feishu_file_sender.py) since the skill delegates OCR and delivery to them; (5) run the skill in an isolated environment (or sandbox) if you plan to process sensitive pages. If the publisher updates the package metadata to list required binaries and external script dependencies explicitly (and documents the optional Feishu delivery clearly), and if you verify the referenced local scripts are trusted, this assessment could be upgraded to benign.

功能分析

Type: OpenClaw Skill Name: visible-text-extractor Version: 1.2.0 The skill bundle provides extensive capabilities for web scraping, browser automation (Playwright), and OCR, which involve high-risk behaviors such as arbitrary network requests and shell command execution via subprocess. It includes a feature to send extracted documents to external Feishu recipients using a separate local skill (feishu-file-sender). While these capabilities are aligned with the stated purpose of extracting and delivering text from various media, the broad access to the network, file system, and local command execution environment warrants a suspicious classification under the provided criteria. Key files include extract_visible_text.py for web/OCR logic and build_authorized_capture_docx.py for the delivery pipeline.

能力评估

⚠ Purpose & Capability

The skill claims to extract visible text from pages/images and the included scripts implement that. However the package metadata lists no required binaries or credentials while scripts clearly call node, ffmpeg, and an external/local OCR stack (paths like /root/.openclaw/.../ocr-local/scripts/ocr.js and /root/.openclaw/venvs/ocrstack/bin/python). The absence of declared runtime requirements is an incoherence: a consumer would legitimately need those dependencies to run the skill.

ℹ Instruction Scope

SKILL.md and USAGE.md instruct the agent to download pages/images, render pages via a browser fallback, extract GIF frames, run OCR, and produce docx/JSON/markdown. That scope matches the stated purpose. Some scripts also reference other local skill scripts (feishu sender, ocr-local) and absolute workspace paths and will download arbitrary image URLs discovered in pages — expected for this task but worth noting because it increases the runtime network surface (and may access internal-only URLs if present).

✓ Install Mechanism

There is no install spec (instruction-only with bundled scripts) so nothing is fetched at install time. Runtime, however, depends on external binaries and other skill scripts (node, ffmpeg, local OCR scripts). The lack of an explicit install section or dependency declaration is the main issue, not the install mechanism itself.

⚠ Credentials

The skill declares no required env vars/credentials, but several scripts can invoke a Feishu file-sender script and will send a generated docx if a user-supplied --send-feishu-receive-id is passed. The code will also invoke external local tools under absolute paths. Requiring no credentials is coherent for read-only extraction, but the optional remote-send behavior and implicit dependencies on other local skill code are not documented in metadata and increase the risk of unintended data sharing or failure due to missing components.

✓ Persistence & Privilege

always is false and the skill does not request permanent inclusion or modify other skills' configs. It writes temporary files and output artifacts in specified output paths, which is expected for this workflow.

如何使用

确保已安装 OpenClaw（本地或 Docker 部署）
在对话框中输入安装命令：/install visible-text-extractor
安装完成后，直接呼叫该 Skill 的名称或使用 /visible-text-extractor 触发
根据 Skill 的参数说明提供必要输入，即可获得结构化输出

版本历史

v1.2.0

Visible Text Extractor 1.0.0 – Initial release - Initial public release with support for webpage, article, screenshot, long image, GIF, and image directory text extraction. - Adds dedicated scripts for WeChat article order reconstruction, high-accuracy OCR, and multi-stage cleanup. - Introduces specialized pipelines for clean, human-readable output (markdown/Word/JSON). - Provides workflow and reference documentation for usage, publishing, and release notes. - Includes multiple extraction and deliverable pipelines, especially tailored for WeChat articles and complex, image-heavy sources.

v1.1.1

Refine reading-order reconstruction guidance, strengthen deliverable quality targets, and improve the skill description around original article flow and user-facing comfort.

v1.1.0

Improve WeChat reading-order reconstruction, stabilize OCR fallback speed, and tighten deliverable quality for cleaner Word output.

v1.0.0

Initial polished public release. Added OCR cleanup, section reconstruction, WeChat handling, troubleshooting docs, and a one-step deliverable pipeline that outputs raw JSON, clean JSON, clean markdown, and Word documents.

元数据

Slug visible-text-extractor

版本 1.2.0

许可证 MIT-0

累计安装 0

当前安装数 0

历史版本数 4

常见问题

Visible Text Extractor 是什么？

Extract and reconstruct as much visible text as possible from webpage URLs, article pages, screenshots, long images, image directories, and GIFs. Use when th... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件，目前累计下载 153 次。

如何安装 Visible Text Extractor？

在 OpenClaw 或 Claude Code 对话框中运行命令「/install visible-text-extractor」即可一键安装，无需额外配置。

Visible Text Extractor 是免费的吗？

是的，Visible Text Extractor 完全免费，采用 MIT-0 许可证，可自由下载、安装和使用。

Visible Text Extractor 支持哪些平台？

Visible Text Extractor 跨平台运行，可在任意部署了 OpenClaw / Claude Code 的环境中使用（cross-platform）。

谁开发了 Visible Text Extractor？

由 wunianze666-netizen（@wunianze666-netizen）开发并维护，当前版本 v1.2.0。