voice-text-to-meme

Name: voice-text-to-meme
Author: hei-maom

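Description

Generates a single meme image from speech-recognition text produced by an input method, or from a polished version of that text. It is meant for users who want to turn one sentence into a chat-ready sticker-style meme, a meme image, a captioned joke image, or a sticker. Two inputs are supported, the raw speech text and the polished text, and the polished text is preferred by default. The style is chosen automatically from the tone of the text. By default the caption is rendered into the image; the skill can also produce a caption-free image together with a caption template. Uses doubao-seedream...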
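The input handling described above (prefer polished text, then infer a tone) can be sketched as follows. This is only an illustration: the function names and the keyword table are hypothetical, and the actual rules in `scripts/generate_meme.py` are not shown in this listing.

```python
import re

# Hypothetical keyword table; the real script's tone rules are not documented here.
TONE_KEYWORDS = {
    "angry": ["气死", "烦", "angry", "annoyed"],
    "happy": ["哈哈", "开心", "yay", "happy"],
    "tired": ["累", "困", "tired", "sleepy"],
}


def pick_source_text(raw_text, polished_text=None):
    """Prefer the polished text; fall back to the raw speech-recognition text."""
    for candidate in (polished_text, raw_text):
        if candidate and candidate.strip():
            # Collapse runs of whitespace left over from dictation.
            return re.sub(r"\s+", " ", candidate).strip()
    raise ValueError("no usable text supplied")


def infer_tone(text):
    """Return the first tone whose keyword appears in the text, else 'neutral'."""
    lowered = text.lower()
    for tone, keywords in TONE_KEYWORDS.items():
        if any(word in lowered for word in keywords):
            return tone
    return "neutral"
```

A keyword lookup is the simplest possible stand-in for tone inference; it is used here only to make the "polished first, then tone" order concrete.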

Safe usage recommendations

What to check before installing:

- This skill sends user text/prompts and an API key to an external image model (default base URL https://models.audiozen.cn/v1). Verify that provider and endpoint are trustworthy and acceptable for your data/privacy needs.
- The script requires MEME_MODEL_API_KEY (a secret). The skill registry metadata does not declare this required env var. Do not paste a high-privilege or reused API key; create a dedicated, scoped token if possible.
- The package includes requirements.txt (openai, requests) but no install spec; ensure the runtime will install these dependencies or run the script in a safe environment.
- The script prints a JSON payload (including the prompt and base_url) to stdout, so logs may contain user text and model details. Avoid logging sensitive text or run in an environment with appropriate log controls.
- The skill writes image files to meme_outputs or temp directories; check file permissions and cleanup policies.
- If you need stronger assurance, ask the maintainer to: (1) declare MEME_MODEL_API_KEY in the registry metadata as a required credential, (2) provide an explicit install spec (or container) to install dependencies, and (3) document the model provider and data handling / retention policy for the default base URL.
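As a rough illustration of the credential handling these checks refer to, the sketch below reads the two environment variables named above and fails early when the key is missing. The function name `load_model_config` is hypothetical; only the variable names and the default URL come from this listing.

```python
import os

# Default endpoint as stated in this listing.
DEFAULT_BASE_URL = "https://models.audiozen.cn/v1"


def load_model_config(env=None):
    """Read the API key and base URL, refusing to continue without a key.

    `env` defaults to os.environ; passing a dict makes the function testable.
    """
    env = os.environ if env is None else env
    api_key = env.get("MEME_MODEL_API_KEY", "").strip()
    if not api_key:
        raise RuntimeError("MEME_MODEL_API_KEY is not set")
    base_url = env.get("MEME_MODEL_BASE_URL", "").strip() or DEFAULT_BASE_URL
    return {"api_key": api_key, "base_url": base_url}
```

Failing before any network call means a missing or empty key never results in user text being sent to the endpoint.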

Functional analysis

Type: OpenClaw Skill
Name: voice-text-to-meme
Version: 1.0.0

The skill is a legitimate tool designed to convert voice-recognized or polished text into meme images using the Doubao image model. The core logic in `scripts/generate_meme.py` uses the OpenAI Python SDK to interact with a specific backend (models.audiozen.cn) and includes standard functions for text cleaning, tone inference, and image retrieval. There is no evidence of data exfiltration, command injection, or malicious prompt manipulation; all file and network operations are strictly aligned with the stated purpose of generating and saving meme images.

Capability assessment

✓ Purpose & Capability

Name/description match the code: the script takes voice/text, builds a prompt and calls an image-generation model (default doubao-seedream via an OpenAI-compatible client). The external model API key and base URL are logically required for this purpose. However, the registry metadata claims no required env vars while the SKILL.md and script require MEME_MODEL_API_KEY — an inconsistency in declared requirements.

ℹ Instruction Scope

SKILL.md instructs the agent to call scripts/generate_meme.py and to read MEME_MODEL_API_KEY and the optional MEME_MODEL_BASE_URL; those actions are within the stated purpose (sending prompts to an image model). The script prints a JSON payload to stdout that includes the prompt, the model name, the base_url, and the possibly sensitive source_text, so user text and the model endpoint can end up in logs. The skill writes output images to disk (meme_outputs or a temp dir), which is expected.
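One way a host could keep user text out of logs is to redact the payload before it is printed. The sketch below is an assumption-laden illustration, not the script's behavior: the key names `prompt` and `source_text` are taken from the description above, and the helper names are hypothetical.

```python
import hashlib
import json

# Field names assumed from the payload description in this listing.
SENSITIVE_FIELDS = ("prompt", "source_text")


def redact_payload(payload):
    """Return a copy with sensitive string fields replaced by length + short hash."""
    redacted = dict(payload)
    for field in SENSITIVE_FIELDS:
        value = redacted.get(field)
        if isinstance(value, str):
            digest = hashlib.sha256(value.encode("utf-8")).hexdigest()[:12]
            redacted[field] = f"<redacted len={len(value)} sha256={digest}>"
    return redacted


def emit_payload(payload):
    """Serialize the redacted payload as the JSON line that would go to stdout."""
    return json.dumps(redact_payload(payload), ensure_ascii=False)
```

Keeping the length and a truncated hash lets two log lines be correlated without exposing the text itself.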

ℹ Install Mechanism

There is no install spec, but a requirements.txt and a runnable Python script are included. That means dependencies (openai, requests) must be installed by the host; the skill package does not declare how to install them. No remote downloads of arbitrary archives or suspicious installers are present.
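Because the package does not say how its dependencies get installed, a host may want to confirm they are importable before running the script. The following is a minimal sketch using the standard library; the helper name `missing_modules` is hypothetical, and the module names are the two listed in requirements.txt.

```python
import importlib.util

# Top-level modules named in requirements.txt according to this listing.
REQUIRED_MODULES = ("openai", "requests")


def missing_modules(modules=REQUIRED_MODULES):
    """Return the top-level module names that cannot be found in this environment."""
    return [name for name in modules if importlib.util.find_spec(name) is None]
```

If the returned list is non-empty, the host can install from requirements.txt in an isolated environment before invoking the skill.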

⚠ Credentials

The skill requires a sensitive credential (MEME_MODEL_API_KEY) to call an external image model; that is proportionate to its functionality. However, the registry metadata does not declare any required env vars or a primary credential — the required API key is only documented in SKILL.md and used by the code. Users should treat MEME_MODEL_API_KEY as a secret and avoid reusing high-privilege tokens. MEME_MODEL_BASE_URL defaults to a third-party host (https://models.audiozen.cn/v1), so confirm the provider before sending user content.
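Confirming the provider can be partly automated by checking the configured base URL against a list of hosts you have already reviewed. This is a sketch under that assumption; `is_trusted_base_url` is a hypothetical helper, and the allowlist contents are yours to decide.

```python
from urllib.parse import urlparse


def is_trusted_base_url(base_url, allowed_hosts):
    """Accept only HTTPS URLs whose hostname is in the reviewed allowlist."""
    parsed = urlparse(base_url)
    return parsed.scheme == "https" and parsed.hostname in allowed_hosts
```

For example, `is_trusted_base_url("https://models.audiozen.cn/v1", {"models.audiozen.cn"})` passes only after you have explicitly added that host to the allowlist.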

✓ Persistence & Privilege

The skill does not request always:true, does not modify other skills or global agent settings, and only writes its own output image files. Agent autonomous invocation is allowed by default but is not in itself a new risk here.

Version history

v1.0.0

- Initial release of the "voice-text-to-meme" skill.
- Converts speech-to-text input into a chat-friendly meme image.
- Supports both original and polished text input, prioritizing polished text.
- Auto-selects meme style based on input tone and intent.
- By default, generates a single meme image with embedded caption; also supports generating caption-free images with separate caption templates.
- Utilizes the doubao-seedream-4-5-251128 model for image generation.

Metadata

Slug: voice-text-to-meme

Version: 1.0.0

License: MIT-0

Total installs: 0

Current installs: 0

Number of released versions: 1

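FAQ

What is voice-text-to-meme?

It generates a single meme image from speech-recognition text produced by an input method, or from a polished version of that text. It is meant for users who want to turn one sentence into a chat-ready sticker-style meme, a meme image, a captioned joke image, or a sticker. Two inputs are supported, the raw speech text and the polished text, and the polished text is preferred by default. The style is chosen automatically from the tone of the text. By default the caption is rendered into the image; the skill can also produce a caption-free image together with a caption template. Uses doubao-seedream... It is an AI Agent Skill plugin for Claude Code / OpenClaw and has been downloaded 260 times in total.

How do I install voice-text-to-meme?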

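Run the command "/install voice-text-to-meme" in the OpenClaw or Claude Code chat box to install it in one step. The install itself needs no extra configuration, but the skill's script requires the MEME_MODEL_API_KEY environment variable to be set before it can call the image model.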

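Is voice-text-to-meme free?

Yes. voice-text-to-meme is completely free and released under the MIT-0 license, so you can download, install, and use it freely.

Which platforms does voice-text-to-meme support?

voice-text-to-meme is cross-platform and runs in any environment where OpenClaw / Claude Code is deployed.

Who developed voice-text-to-meme?

It is developed and maintained by HeiMaoM (@hei-maom). The current version is v1.0.0.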
