← Back to Skills Marketplace
phentse

Web Reader TTS

by Zach · GitHub ↗ · v2.0.0 · MIT-0
cross-platform ⚠ suspicious
131
Downloads
1
Stars
0
Active Installs
5
Versions
Install in OpenClaw
/install web-reader-tts
Description
提取网页正文,自动检测语言,使用微软Edge TTS合成语音并通过Whisper识别,支持中英日及混合语言朗读。
Usage Guidance
This skill appears to be what it says: it fetches pages, extracts text, synthesizes audio via Microsoft Edge TTS, and transcribes with Whisper. Before installing, consider: (1) Privacy: synthesized text is sent to Microsoft’s Edge TTS service (no API key required), so avoid sending highly sensitive content you don't want transmitted off-device. (2) Resource use: Playwright will download Chromium and Whisper may download very large models (medium/large), so ensure you have bandwidth and disk space. (3) Untrusted pages: the skill will load and render arbitrary URLs in a headless browser — do not point it at internal URLs or pages containing secrets. (4) The repo includes a transcript.txt containing many offensive/malicious command examples — this is likely sample data but underscores that the skill will faithfully read whatever text it extracts. If you need stricter privacy, consider an offline/local TTS engine or review/clean content before sending it to Edge TTS.
Capability Analysis
Type: OpenClaw Skill Name: web-reader-tts Version: 2.0.0 The skill bundle provides a functional web-to-speech tool but includes a highly suspicious 'transcript.txt' file that contains an extensive collection of offensive security commands. This file includes instructions for credential dumping (Mimikatz, secretsdump), persistence (enabling the guest account), registry manipulation (wdigest), and remote exploitation (MS17-010, xp_cmdshell). While the Python script (web_reader_tts.py) and the instructions (SKILL.md) do not directly execute these commands, the presence of a 'Pentest Cheat Sheet' within a sample transcript file suggests a potential prompt-injection payload or an attempt to smuggle malicious instructions into the agent's context. The use of Playwright to fetch arbitrary web content further increases the risk that the agent might encounter and act upon similar malicious strings from external URLs.
Capability Assessment
Purpose & Capability
Name/description (web page → TTS → Whisper) align with the provided files and SKILL.md. The Python script and Readability.js implement the described mixed-extraction pipeline (trafilatura, Readability, newspaper3k) and call edge-tts and openai-whisper as documented. No extraneous cloud credentials or unrelated binaries are requested.
Instruction Scope
SKILL.md and web_reader_tts.py stay within the stated scope (download/parse pages with Playwright/trafilatura/newspaper, synthesize via edge-tts, transcribe with Whisper). Two notes: (1) the skill fetches and renders arbitrary webpages in Playwright (the browser will execute page JS during navigation), which is expected but means untrusted pages could execute scripts inside the headless browser context; (2) the repository includes transcript.txt containing many hacking/offensive command-line snippets — this appears to be example output/data, not active code, but users should be aware the skill will read and reproduce whatever text is present on a target URL.
Install Mechanism
No installer that pulls arbitrary code from unknown hosts is declared (instruction-only install). SKILL.md tells users to pip install standard packages and to run 'python -m playwright install chromium' — these are expected. Caveat: Playwright will download a Chromium build and Whisper may download large models on first run; these are normal but require bandwidth/disk.
Credentials
The skill requests no environment variables or credentials, which is proportional. One privacy/telemetry implication: edge-tts performs network calls to Microsoft's TTS endpoints (documented in SKILL.md). That means page content sent to Edge TTS for synthesis will leave the host machine; the skill does not request explicit API keys but does transmit content over the network for TTS. Whisper is local and model downloads are local.
Persistence & Privilege
always is false and the skill does not request elevated or persistent system privileges. It contains code that writes local output files (audio.mp3, transcript.txt) which is consistent with its purpose; no modifications to other skills or system-wide configs are present.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install web-reader-tts
  3. After installation, invoke the skill by name or use /web-reader-tts
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v2.0.0
🎉 重大升级:混合提取方案(Trafilatura + Readability + newspaper3k),正文准确率提升至 95%+,自动选择最佳引擎,支持元数据提取
v1.3.1
更新文档说明;优化示例代码
v1.3.0
优化内容提取算法,提升完整性至 95%+;增强多语言支持;改进文档结构
v1.2.0
- Added metadata and origin files for improved project management and ecosystem integration. - No changes to skill features or functionality.
v1.0.0
Web Reader TTS 1.0.0 - Initial Release - Read aloud web page content using TTS (Text-to-Speech), supporting Chinese, English, and Japanese. - Automatically extract main content from web pages via Playwright. - Choose high-quality voice (Edge TTS) based on detected language. - Optional speech recognition with Whisper (supports multi-language, default medium model). - Command line and natural language trigger support. - Configurable voices, language, speed, and model size. - Outputs audio (mp3) and recognized transcript (txt).
Metadata
Slug web-reader-tts
Version 2.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 5
Frequently Asked Questions

What is Web Reader TTS?

提取网页正文,自动检测语言,使用微软Edge TTS合成语音并通过Whisper识别,支持中英日及混合语言朗读。 It is an AI Agent Skill for Claude Code / OpenClaw, with 131 downloads so far.

How do I install Web Reader TTS?

Run "/install web-reader-tts" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Web Reader TTS free?

Yes, Web Reader TTS is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Web Reader TTS support?

Web Reader TTS is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Web Reader TTS?

It is built and maintained by Zach (@phentse); the current version is v2.0.0.

💬 Comments