voice-text-to-meme

Name: voice-text-to-meme
Author: hei-maom

Description

根据输入法语音识别文本或润色后文本生成单张表情包图片。适用于用户想把一句话做成聊天可发送的表情包、meme 图、带字梗图或贴纸图时。支持原始语音文本和润色后文本两种输入，默认优先使用润色后文本；自动根据文本语气判断风格；默认直接生成带字图片，也支持生成无字图并同时给出配文模板；使用 doubao-seedream...

Usage Guidance

What to check before installing: - This skill sends user text/prompts and an API key to an external image model (default base URL https://models.audiozen.cn/v1). Verify that provider and endpoint are trustworthy and acceptable for your data/privacy needs. - The script requires MEME_MODEL_API_KEY (a secret). The skill registry metadata does not declare this required env var — do not paste a high-privilege or reused API key; create a dedicated, scoped token if possible. - The package includes requirements.txt (openai, requests) but no install spec; ensure the runtime will install these dependencies or run the script in a safe environment. - The script prints a JSON payload (including the prompt and base_url) to stdout; logs may therefore contain user text and model details—avoid logging sensitive text or run in an environment with appropriate log controls. - The skill writes image files to meme_outputs or temp directories; check file permissions and cleanup policies. - If you need stronger assurance, ask the maintainer to: (1) declare MEME_MODEL_API_KEY in the registry metadata as a required credential, (2) provide an explicit install spec (or container) to install dependencies, and (3) document the model provider and data handling / retention policy for the default base URL.

Capability Analysis

Type: OpenClaw Skill Name: voice-text-to-meme Version: 1.0.0 The skill is a legitimate tool designed to convert voice-recognized or polished text into meme images using the Doubao image model. The core logic in `scripts/generate_meme.py` uses the OpenAI Python SDK to interact with a specific backend (models.audiozen.cn) and includes standard functions for text cleaning, tone inference, and image retrieval. There is no evidence of data exfiltration, command injection, or malicious prompt manipulation; all file and network operations are strictly aligned with the stated purpose of generating and saving meme images.

Capability Assessment

✓ Purpose & Capability

Name/description match the code: the script takes voice/text, builds a prompt and calls an image-generation model (default doubao-seedream via an OpenAI-compatible client). The external model API key and base URL are logically required for this purpose. However, the registry metadata claims no required env vars while the SKILL.md and script require MEME_MODEL_API_KEY — an inconsistency in declared requirements.

ℹ Instruction Scope

SKILL.md instructs the agent to call scripts/generate_meme.py and to read MEME_MODEL_API_KEY and optional MEME_MODEL_BASE_URL; those actions are within the stated purpose (sending prompts to an image model). The script prints a JSON payload to stdout that includes the prompt, model name and base_url and the (possibly sensitive) source_text — that could surface user text and the model endpoint in logs/stdout. The skill writes output images to disk (meme_outputs or a temp dir) which is expected.

ℹ Install Mechanism

There is no install spec, but a requirements.txt and a runnable Python script are included. That means dependencies (openai, requests) must be installed by the host; the skill package does not declare how to install them. No remote downloads of arbitrary archives or suspicious installers are present.

⚠ Credentials

The skill requires a sensitive credential (MEME_MODEL_API_KEY) to call an external image model; that is proportionate to its functionality. However, the registry metadata does not declare any required env vars or a primary credential — the required API key is only documented in SKILL.md and used by the code. Users should treat MEME_MODEL_API_KEY as a secret and avoid reusing high-privilege tokens. MEME_MODEL_BASE_URL defaults to a third-party host (https://models.audiozen.cn/v1), so confirm the provider before sending user content.

✓ Persistence & Privilege

The skill does not request always:true, does not modify other skills or global agent settings, and only writes its own output image files. Agent autonomous invocation is allowed by default but is not in itself a new risk here.

Version History

v1.0.0

- Initial release of the "voice-text-to-meme" skill. - Converts speech-to-text input into a chat-friendly meme image. - Supports both original and polished text input, prioritizing polished text. - Auto-selects meme style based on input tone and intent. - By default, generates a single meme image with embedded caption; also supports generating caption-free images with separate caption templates. - Utilizes the doubao-seedream-4-5-251128 model for image generation.

Metadata

Slug voice-text-to-meme

Version 1.0.0

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 1

Frequently Asked Questions

What is voice-text-to-meme?

根据输入法语音识别文本或润色后文本生成单张表情包图片。适用于用户想把一句话做成聊天可发送的表情包、meme 图、带字梗图或贴纸图时。支持原始语音文本和润色后文本两种输入，默认优先使用润色后文本；自动根据文本语气判断风格；默认直接生成带字图片，也支持生成无字图并同时给出配文模板；使用 doubao-seedream... It is an AI Agent Skill for Claude Code / OpenClaw, with 260 downloads so far.

How do I install voice-text-to-meme?

Run "/install voice-text-to-meme" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is voice-text-to-meme free?

Yes, voice-text-to-meme is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does voice-text-to-meme support?

voice-text-to-meme is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created voice-text-to-meme?

It is built and maintained by HeiMaoM (@hei-maom); the current version is v1.0.0.

More Skills

What is voice-text-to-meme?

How do I install voice-text-to-meme?

Is voice-text-to-meme free?

Which platforms does voice-text-to-meme support?

Who created voice-text-to-meme?

💬 Comments