← 返回 Skills 市场

Pronunciation Coach

Name: Pronunciation Coach
Author: crazybuffon

作者 Crazybuffon · GitHub ↗ · v1.0.4

cross-platform ✓ 安全检测通过

622

总下载

当前安装

版本数

在 OpenClaw 中安装

/install pronunciation-coach

功能描述

Pronunciation coaching with real voice analysis using Azure Speech Services. Analyzes audio files for phoneme-level accuracy, fluency, prosody, and intonatio...

安全使用建议

This skill appears to do what it claims: it will read audio files from ~/.openclaw/media/inbound/, convert them (ffmpeg), and upload them to Microsoft Azure Speech for pronunciation assessment. Before installing: 1) Confirm you are comfortable sending users' audio to Microsoft (privacy and billing/usage matters). 2) Provide an Azure Speech key and region via AZURE_SPEECH_KEY and AZURE_SPEECH_REGION. 3) Ensure ffmpeg and Node.js are available in the agent environment. 4) Note the SKILL.md suggests sending results back to users (text and TTS) but the skill does not implement Telegram messaging or TTS — you will need the agent or other skills to have those permissions. 5) Fix the registry metadata mismatch (it should declare required env vars) and verify the skill's source/homepage if provenance matters. If you need stronger assurance, review the scripts locally or run them in a sandboxed environment before granting access to real user audio or credentials.

功能分析

Type: OpenClaw Skill Name: pronunciation-coach Version: 1.0.4 The skill is benign, transparently declaring its purpose to analyze audio using Azure Speech Services. It explicitly requests read access to `~/.openclaw/media/inbound/` and network access to `*.stt.speech.microsoft.com` in `skill.json`. The `pronunciation-assess.sh` script demonstrates good security practices by implementing checks against filename option injection (`case "$AUDIO_FILE" in -*)`, `ffmpeg -i -- "$AUDIO_FILE"`) and sanitizing the `REFERENCE_TEXT` to prevent JSON injection before sending it to Azure. The `SKILL.md` instructions guide the agent to perform its stated function without any evidence of prompt injection attempts to subvert its behavior or access unauthorized data.

能力评估

ℹ Purpose & Capability

The name, description, SKILL.md, scripts, and skill.json consistently describe using Azure Speech for pronunciation assessment and reading voice messages from ~/.openclaw/media/inbound/. This matches the capability. However, the top-level registry summary included with the evaluation stated 'Required env vars: none' while SKILL.md and skill.json clearly declare AZURE_SPEECH_KEY and AZURE_SPEECH_REGION as required — a metadata inconsistency that should be corrected.

ℹ Instruction Scope

The runtime instructions are narrowly scoped: locate latest .ogg files in ~/.openclaw/media/inbound/, convert to WAV via ffmpeg, call Azure Speech, and produce a human-readable report. These actions are consistent with the stated purpose. Notes: the SKILL.md instructs the agent to 'send a voice message (via TTS) demonstrating the correct pronunciation' and to 'send the text report to the user' but provides no code to perform Telegram messaging or TTS; those actions require the agent to have separate messaging/TTS capabilities or permissions not included in the skill files.

✓ Install Mechanism

No install spec is provided (instruction- and script-only). This is low-risk from an installation perspective, but scripts will be executed directly by the agent environment and depend on ffmpeg and Node.js being present on PATH.

ℹ Credentials

Only Azure Speech credentials (AZURE_SPEECH_KEY, AZURE_SPEECH_REGION) are required by the scripts and skill.json; this is proportionate to the declared function. The earlier registry metadata that listed no required env vars is inconsistent with the skill's own manifest and SKILL.md. No other unrelated secrets are requested.

✓ Persistence & Privilege

The skill does not request always:true and does not modify other skills or system settings. skill.json declares read permission for ~/.openclaw/media/inbound/ and outbound network access to *.stt.speech.microsoft.com, which are consistent with its behavior. Autonomous invocation is permitted (platform default) but not combined with other high-risk factors here.

如何使用

确保已安装 OpenClaw（本地或 Docker 部署）
在对话框中输入安装命令：/install pronunciation-coach
安装完成后，直接呼叫该 Skill 的名称或使用 /pronunciation-coach 触发
根据 Skill 的参数说明提供必要输入，即可获得结构化输出

版本历史

v1.0.4

- Removed support for passing Azure Speech API key and region directly via command line; now requires environment variables. - Updated SKILL.md to reflect simplified usage and prerequisite instructions. - scripts/pronunciation-assess.sh no longer accepts key/region as optional arguments.

v1.0.3

- Minor internal changes for maintainability; no user-facing feature changes. - Documentation content remains unchanged.

v1.0.2

- Updated skill.json with no changes to functionality or documentation. - No user-facing or workflow changes in this release.

v1.0.1

- Added privacy note specifying that voice messages are transmitted to Azure for analysis. - Declared environment variables in the skill metadata for easier configuration. - Streamlined skill description to focus on core functionality. - No changes to workflows or usage instructions.

v1.0.0

Initial release of Pronunciation Coach – provides actionable English pronunciation feedback using Azure Speech Services. - Analyzes user voice messages for pronunciation, fluency, prosody, and intonation. - Offers detailed reports with overall and word-level scores, highlighting problem sounds. - Supplies specific coaching tips and improvement feedback based on assessment results. - Generates practice exercises and demonstrations for targeted accent and pronunciation improvement.

元数据

Slug pronunciation-coach

版本 1.0.4

许可证 —

累计安装 1

当前安装数 1

历史版本数 5

常见问题

Pronunciation Coach 是什么？

Pronunciation coaching with real voice analysis using Azure Speech Services. Analyzes audio files for phoneme-level accuracy, fluency, prosody, and intonatio... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件，目前累计下载 622 次。

如何安装 Pronunciation Coach？

在 OpenClaw 或 Claude Code 对话框中运行命令「/install pronunciation-coach」即可一键安装，无需额外配置。

Pronunciation Coach 是免费的吗？

是的，Pronunciation Coach 完全免费（开源免费），可自由下载、安装和使用。

Pronunciation Coach 支持哪些平台？

Pronunciation Coach 跨平台运行，可在任意部署了 OpenClaw / Claude Code 的环境中使用（cross-platform）。

谁开发了 Pronunciation Coach？

由 Crazybuffon（@crazybuffon）开发并维护，当前版本 v1.0.4。