← Back to Skills Marketplace
guoqunabc

Speech Notes

by Madoka · GitHub ↗ · v1.0.0
cross-platform ⚠ suspicious
274
Downloads
0
Stars
1
Active Installs
1
Versions
Install in OpenClaw
/install speech-notes
Description
将录音/语音转写为结构化演讲纪要。适用于:会议讲话、内部分享、演讲录音的转写整理。 触发条件:用户发送音频文件并要求整理/转写/纪要,或要求将已有转写文本整理成结构化纪要。
Usage Guidance
This skill's functionality (audio → structured notes) is reasonable, but the runtime instructions assume tools and API access that aren't declared or included. Before installing or enabling it, ask the provider: (1) where do scripts like scripts/speech-to-text.sh come from, (2) what credentials are required and how will you supply them (Feishu file_key, Feishu API token, Google API key, Qwen token), and (3) whether ffmpeg/ffprobe are expected on your host. Only provide API keys with minimal scopes and consider testing with non-sensitive audio first. If you can't obtain answers or the skill asks you to paste credentials into an opaque place, treat it as high risk and do not proceed.
Capability Analysis
Type: OpenClaw Skill Name: speech-notes Version: 1.0.0 The 'speech-notes' skill is a legitimate tool designed to transcribe audio recordings and format them into professional, structured meeting notes, specifically optimized for Feishu (Lark) documents. It utilizes standard audio processing utilities (ffmpeg, ffprobe) and reputable AI APIs (Gemini, Qwen, Feishu) to perform its stated tasks. The instructions in SKILL.md are detailed and focused on high-quality document formatting, stylistic consistency, and user interaction, with no evidence of malicious intent, data exfiltration, or unauthorized system access.
Capability Assessment
Purpose & Capability
The name/description (transcribe + organize speech) aligns with the instructions. However the SKILL.md expects use of local binaries (ffmpeg/ffprobe), a local script (scripts/speech-to-text.sh for Feishu STT), and multiple external STT/LLM endpoints (Google Generative Language, Qwen) — none of these are declared in the skill metadata (no required binaries, no env vars, no scripts). That discrepancy is disproportionate and unexplained.
Instruction Scope
The instructions explicitly tell the agent to run ffprobe/ffmpeg (file splitting/compression), call a local script for Feishu STT, craft Python scripts to send base64 audio to generativelanguage.googleapis.com, and use Qwen endpoints and Feishu document APIs. They also instruct saving original transcriptions locally and updating Feishu documents via API. Those actions involve local file I/O and network calls to multiple external services; the skill did not limit or declare these resources or credentials and did not include the referenced scripts, so the runtime scope is broader than the metadata indicates.
Install Mechanism
There is no install spec and no code files — instruction-only. That minimizes supply-chain risk. However the SKILL.md instructs running external tools and scripts that are not provided; the absence of a provisioned install still leaves the agent expected to have certain binaries and scripts available on the host.
Credentials
The documentation prescribes calling Feishu STT (file_key), Google Generative Language, and Qwen services and also using Feishu document APIs, which normally require credentials/tokens. Yet the skill metadata lists no required environment variables or primary credential. Requiring multiple external API keys and a local script without declaring them is disproportionate and ambiguous — the user would need to supply keys/credentials out-of-band for the skill to work.
Persistence & Privilege
always is false and the skill is user-invocable; it does not request permanent presence or elevated platform privileges. The instructions do ask the agent to write local files (save original transcriptions) and to update remote Feishu docs, which are normal for this function but should be acknowledged by the user before installation.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install speech-notes
  3. After installation, invoke the skill by name or use /speech-notes
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
- Initial release of speech-notes skill. - Converts audio recordings or existing transcripts into structured meeting minutes. - Supports audio preprocessing, segmentation, and transcription using Feishu STT, Gemini, or Qwen. - Delivers high-quality, well-formatted minutes following detailed structure and formatting guidelines. - Ensures retention of original speaker's style and removes unnecessary filler for clarity. - Outputs documents ready for use in Feishu, following best practices for readability and formal presentation.
Metadata
Slug speech-notes
Version 1.0.0
License
All-time Installs 1
Active Installs 1
Total Versions 1
Frequently Asked Questions

What is Speech Notes?

将录音/语音转写为结构化演讲纪要。适用于:会议讲话、内部分享、演讲录音的转写整理。 触发条件:用户发送音频文件并要求整理/转写/纪要,或要求将已有转写文本整理成结构化纪要。 It is an AI Agent Skill for Claude Code / OpenClaw, with 274 downloads so far.

How do I install Speech Notes?

Run "/install speech-notes" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Speech Notes free?

Yes, Speech Notes is completely free (open-source). You can download, install and use it at no cost.

Which platforms does Speech Notes support?

Speech Notes is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Speech Notes?

It is built and maintained by Madoka (@guoqunabc); the current version is v1.0.0.

💬 Comments