← 返回 Skills 市场
131
总下载
1
收藏
0
当前安装
5
版本数
在 OpenClaw 中安装
/install web-reader-tts
功能描述
提取网页正文,自动检测语言,使用微软Edge TTS合成语音并通过Whisper识别,支持中英日及混合语言朗读。
安全使用建议
This skill appears to be what it says: it fetches pages, extracts text, synthesizes audio via Microsoft Edge TTS, and transcribes with Whisper. Before installing, consider: (1) Privacy: synthesized text is sent to Microsoft’s Edge TTS service (no API key required), so avoid sending highly sensitive content you don't want transmitted off-device. (2) Resource use: Playwright will download Chromium and Whisper may download very large models (medium/large), so ensure you have bandwidth and disk space. (3) Untrusted pages: the skill will load and render arbitrary URLs in a headless browser — do not point it at internal URLs or pages containing secrets. (4) The repo includes a transcript.txt containing many offensive/malicious command examples — this is likely sample data but underscores that the skill will faithfully read whatever text it extracts. If you need stricter privacy, consider an offline/local TTS engine or review/clean content before sending it to Edge TTS.
功能分析
Type: OpenClaw Skill
Name: web-reader-tts
Version: 2.0.0
The skill bundle provides a functional web-to-speech tool but includes a highly suspicious 'transcript.txt' file that contains an extensive collection of offensive security commands. This file includes instructions for credential dumping (Mimikatz, secretsdump), persistence (enabling the guest account), registry manipulation (wdigest), and remote exploitation (MS17-010, xp_cmdshell). While the Python script (web_reader_tts.py) and the instructions (SKILL.md) do not directly execute these commands, the presence of a 'Pentest Cheat Sheet' within a sample transcript file suggests a potential prompt-injection payload or an attempt to smuggle malicious instructions into the agent's context. The use of Playwright to fetch arbitrary web content further increases the risk that the agent might encounter and act upon similar malicious strings from external URLs.
能力评估
Purpose & Capability
Name/description (web page → TTS → Whisper) align with the provided files and SKILL.md. The Python script and Readability.js implement the described mixed-extraction pipeline (trafilatura, Readability, newspaper3k) and call edge-tts and openai-whisper as documented. No extraneous cloud credentials or unrelated binaries are requested.
Instruction Scope
SKILL.md and web_reader_tts.py stay within the stated scope (download/parse pages with Playwright/trafilatura/newspaper, synthesize via edge-tts, transcribe with Whisper). Two notes: (1) the skill fetches and renders arbitrary webpages in Playwright (the browser will execute page JS during navigation), which is expected but means untrusted pages could execute scripts inside the headless browser context; (2) the repository includes transcript.txt containing many hacking/offensive command-line snippets — this appears to be example output/data, not active code, but users should be aware the skill will read and reproduce whatever text is present on a target URL.
Install Mechanism
No installer that pulls arbitrary code from unknown hosts is declared (instruction-only install). SKILL.md tells users to pip install standard packages and to run 'python -m playwright install chromium' — these are expected. Caveat: Playwright will download a Chromium build and Whisper may download large models on first run; these are normal but require bandwidth/disk.
Credentials
The skill requests no environment variables or credentials, which is proportional. One privacy/telemetry implication: edge-tts performs network calls to Microsoft's TTS endpoints (documented in SKILL.md). That means page content sent to Edge TTS for synthesis will leave the host machine; the skill does not request explicit API keys but does transmit content over the network for TTS. Whisper is local and model downloads are local.
Persistence & Privilege
always is false and the skill does not request elevated or persistent system privileges. It contains code that writes local output files (audio.mp3, transcript.txt) which is consistent with its purpose; no modifications to other skills or system-wide configs are present.
如何使用
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install web-reader-tts - 安装完成后,直接呼叫该 Skill 的名称或使用
/web-reader-tts触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v2.0.0
🎉 重大升级:混合提取方案(Trafilatura + Readability + newspaper3k),正文准确率提升至 95%+,自动选择最佳引擎,支持元数据提取
v1.3.1
更新文档说明;优化示例代码
v1.3.0
优化内容提取算法,提升完整性至 95%+;增强多语言支持;改进文档结构
v1.2.0
- Added metadata and origin files for improved project management and ecosystem integration.
- No changes to skill features or functionality.
v1.0.0
Web Reader TTS 1.0.0 - Initial Release
- Read aloud web page content using TTS (Text-to-Speech), supporting Chinese, English, and Japanese.
- Automatically extract main content from web pages via Playwright.
- Choose high-quality voice (Edge TTS) based on detected language.
- Optional speech recognition with Whisper (supports multi-language, default medium model).
- Command line and natural language trigger support.
- Configurable voices, language, speed, and model size.
- Outputs audio (mp3) and recognized transcript (txt).
元数据
常见问题
Web Reader TTS 是什么?
提取网页正文,自动检测语言,使用微软Edge TTS合成语音并通过Whisper识别,支持中英日及混合语言朗读。 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 131 次。
如何安装 Web Reader TTS?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install web-reader-tts」即可一键安装,无需额外配置。
Web Reader TTS 是免费的吗?
是的,Web Reader TTS 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。
Web Reader TTS 支持哪些平台?
Web Reader TTS 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 Web Reader TTS?
由 Zach(@phentse)开发并维护,当前版本 v2.0.0。
推荐 Skills