← Back to Skills Marketplace
interccy-stack

微信频道语音+视频实现气泡自动播报模式

by interccy-stack · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ⚠ suspicious
31
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install voice-document-maker
Description
制作有声视频文档技能 - 将文字自动转为 TTS 语音,并配上 AI 生成的精美背景图,合成 MP4 视频文件。适用于微信视频卡片交付。
README (SKILL.md)

\r \r

有声视频制作师\r

\r

什么时候用\r

  • 需要将文字内容以"视频卡片"形式发送给用户时。\r
  • 用户要求"生成语音视频"、"做个有声海报"时。\r
  • 替代纯文本回复,提供"视觉 + 听觉"双重体验。\r \r

核心逻辑\r

  1. 语音生成:调用 edge_tts 生成高质量 MP3。\r
  2. 背景生成:调用 pollinations.ai 根据描述生成精美图片。\r
  3. 视频合成:使用 FFmpeg 将图片和音频合成 MP4。\r \r

如何使用\r

直接调用脚本:\r

C:\Users\lenovo\.copaw\venv\Scripts\python.exe skills\voice-document-maker\scripts\make_video.py --text "这里是朗读内容" --bg_prompt "背景描述" --output "output.mp4"\r
```\r
\r
## 示例\r
### 制作古风美女有声视频\r
```bash\r
python scripts/make_video.py --text "天行健,君子以自强不息。" --bg_prompt "Beautiful elegant Chinese lady, ancient style, soft lighting" --output "gufeng.mp4"\r
```\r
\r
### 制作科技风有声视频\r
```bash\r
python scripts/make_video.py --text "AI 技术正在改变世界。" --bg_prompt "Futuristic technology background, blue neon, cyberpunk" --output "tech.mp4"\r
```\r
\r
## 交付物\r
- MP4 视频文件,可直接通过 `send_file_to_user` 发送。\r
Usage Guidance
Install only if you are comfortable with the skill using FFmpeg, edge_tts, and pollinations.ai. Use simple output filenames like result.mp4, avoid sensitive background prompts, and consider fixing the script to prevent path traversal or accidental overwrites before relying on it.
Capability Analysis
Type: OpenClaw Skill Name: voice-document-maker Version: 1.0.0 The skill contains hardcoded absolute file paths to a specific user's directory (C:\Users\lenovo) for the Python interpreter and FFmpeg executable in scripts/make_video.py, which is highly irregular and suggests an environment-specific or poorly constructed bundle. It also performs external network requests to pollinations.ai to download images and uses subprocess.run to execute commands, which, combined with the lack of input validation on the output filename (potential path traversal), poses a security risk despite being aligned with the stated purpose.
Capability Assessment
Purpose & Capability
The described purpose—turn text into TTS audio, generate a background image, and combine them into an MP4—is coherent with the included script, but it depends on external/local tools and a remote image provider.
Instruction Scope
The script accepts a user-controlled output filename without restricting it to a safe basename or directory, and FFmpeg is invoked with overwrite enabled.
Install Mechanism
Registry requirements declare no binaries or install spec, but the script requires hard-coded Windows paths for FFmpeg and a Python environment running edge_tts.
Credentials
File creation is expected for this skill, but the artifacts do not clearly contain output writes to the skill output directory or prevent overwriting unintended paths.
Persistence & Privilege
No credentials, background persistence, startup hooks, or long-running behavior are shown; generated media files are expected for the skill purpose.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install voice-document-maker
  3. After installation, invoke the skill by name or use /voice-document-maker
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
在使用微信机器人时,你想要微信的绿泡小喇叭模式,作者试了无数次,生成你们所知道的所有语音格式,只有一种下载播放模式,你不知道多少秒的声音,下载了还不一定能手机打得开。这个痛点,有人解决了吗?我们换个思路,以卡片+声音制作有声视频文档,这个技能 - 将文字自动转为 TTS 语音,并配上 AI 生成的精美背景图,合成 MP4 视频文件。微信频道语音+视频实现气泡自动播报模式,适用于微信视频卡片交付。高维视频+声音解决低维的声音播放问题。让我们用微信机器人更友好,更可爱。创作者interccy When using WeChat bots, you want the green bubble with the small speaker icon. The author tried countless times to generate all the audio formats you know, but there’s only one download and playback mode. You have no idea how many seconds of sound it takes to download, and even then, it might not open on your phone. Has anyone solved this pain point? Let’s change our approach. Create audio video documents using cards and sound. This skill – automatically converts text into TTS speech and pairs it with AI-generated beautiful background images, synthesizing MP4 video files. WeChat channel audio + video enables automatic bubble broadcasting mode, suitable for WeChat video card delivery. High-dimensional video + sound solves the problem of low-dimensional sound playback. Let’s make WeChat bots more user-friendly and adorable. Creator: interccy
Metadata
Slug voice-document-maker
Version 1.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is 微信频道语音+视频实现气泡自动播报模式?

制作有声视频文档技能 - 将文字自动转为 TTS 语音,并配上 AI 生成的精美背景图,合成 MP4 视频文件。适用于微信视频卡片交付。 It is an AI Agent Skill for Claude Code / OpenClaw, with 31 downloads so far.

How do I install 微信频道语音+视频实现气泡自动播报模式?

Run "/install voice-document-maker" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is 微信频道语音+视频实现气泡自动播报模式 free?

Yes, 微信频道语音+视频实现气泡自动播报模式 is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does 微信频道语音+视频实现气泡自动播报模式 support?

微信频道语音+视频实现气泡自动播报模式 is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created 微信频道语音+视频实现气泡自动播报模式?

It is built and maintained by interccy-stack (@interccy-stack); the current version is v1.0.0.

💬 Comments