franklu0819-lang

Coze Asr

by xiaofei · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ✓ Security Clean
Downloads: 216
Stars: 1
Active Installs: 0
Versions: 1
Install in OpenClaw
/install coze-asr
Description
Automatic Speech Recognition (ASR) using Coze API. Use when you need to transcribe audio files to text. Supports Chinese audio transcription via Coze's speec...
README (SKILL.md)

Coze Automatic Speech Recognition (ASR)

Transcribe audio files to text using Coze API.

Setup

1. Get your API Key: Get a key from Coze Platform

2. Set it in your environment:

export COZE_API_KEY="your-key-here"
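
Before running the script, it can help to confirm that the key is actually visible to the shell. A minimal sketch, assuming a hypothetical helper named `require_api_key` that is not part of the skill:

```shell
# Hypothetical helper (not shipped with the skill): fail early when the key is missing.
require_api_key() {
  if [ -z "${COZE_API_KEY:-}" ]; then
    echo "COZE_API_KEY is not set; export it before transcribing" >&2
    return 1
  fi
}
```

With this defined, `require_api_key && bash scripts/speech_to_text.sh recording.mp3` would skip the transcription call when the variable is empty or unset.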

Supported Audio Formats

  • MP3 - Recommended
  • WAV - Supported
  • OGG - Supported (including Opus-encoded audio)

Note: The Coze API natively supports the mp3, wav, and ogg formats, so no conversion is needed.
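
A quick pre-check on the file name can catch unsupported formats before anything is uploaded. The sketch below assumes a hypothetical helper, `is_supported_audio`, and looks only at the extension, not at the actual audio encoding:

```shell
# Hypothetical pre-check: accept only the extensions listed above.
is_supported_audio() {
  # Lower-case the name so "CLIP.MP3" matches too.
  name=$(printf '%s' "$1" | tr '[:upper:]' '[:lower:]')
  case "$name" in
    *.mp3|*.wav|*.ogg) return 0 ;;
    *) return 1 ;;
  esac
}
```

A file with a matching extension can still be rejected by the service if its contents are damaged or mislabeled, so this is only a first filter.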

Usage

Basic Transcription

Transcribe an audio file:

bash scripts/speech_to_text.sh recording.mp3

Full Options

bash scripts/speech_to_text.sh <audio_file> [language]

Parameters:

  • audio_file (required): Path to audio file
  • language (optional): Language code (default: zh)
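
The parameter handling described above could look roughly like the following. This is a sketch of one way to read the two positional arguments, not the script's actual code; the function name `parse_args` is hypothetical:

```shell
# Sketch of positional-argument handling; the real script may differ.
parse_args() {
  if [ "$#" -lt 1 ]; then
    echo "usage: speech_to_text.sh <audio_file> [language]" >&2
    return 1
  fi
  audio_file=$1
  language=${2:-zh}   # falls back to the documented default, zh
}
```

Calling `parse_args recording.mp3` leaves `language` set to `zh`, while `parse_args recording.mp3 en` overrides it.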

Output Format

The script outputs JSON with transcribed text.

Example output:

{
  "text": "你好,这是转录的文本内容"
}
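
To use only the transcript, the `text` field can be pulled out with jq, which the skill already lists as a requirement. A minimal sketch, assuming the output has the shape shown above; `extract_text` is a hypothetical name:

```shell
# Read JSON on stdin and print the "text" field as a raw string.
# Prints nothing if the field is absent or null.
extract_text() {
  jq -r '.text // empty'
}
```

For example, `bash scripts/speech_to_text.sh recording.mp3 | extract_text` would print just the transcribed sentence.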

Troubleshooting

File Size Issues:

  • Check Coze API documentation for file size limits
  • Reduce sample rate or bit depth if needed
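
A size check before upload can make this failure easier to diagnose. The sketch below assumes a hypothetical helper, `within_size_limit`, and takes the limit as an argument because this listing does not state the actual Coze limit:

```shell
# Hypothetical guard: compare a file's size with a caller-supplied limit in bytes.
# The real limit must come from the Coze documentation; none is assumed here.
within_size_limit() {
  file=$1
  max_bytes=$2
  [ -f "$file" ] || return 2
  # Strip the padding some wc implementations add around the count.
  size=$(wc -c < "$file" | tr -d ' ')
  [ "$size" -le "$max_bytes" ]
}
```

If a file is too large, an external tool such as ffmpeg (not part of this skill) can resample it, for instance `ffmpeg -i input.wav -ar 16000 -ac 1 output.wav` to produce 16 kHz mono audio.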

Poor Accuracy:

  • Improve audio quality
  • Ensure clear speech and minimal noise
  • Use appropriate language code

Format Issues:

  • Ensure file is not corrupted
  • Verify audio can be played by standard players
Usage Guidance
This skill appears to do what it says: it uploads a user-supplied audio file to Coze (https://api.coze.cn) and returns a JSON transcription. Before installing or using it:
  1. Be aware that your full audio file and any sensitive speech it contains will be sent to an external service. Review Coze's privacy/security policy and ensure this is acceptable.
  2. Provide a COZE_API_KEY with appropriate scope and rotate it if compromised.
  3. The script uses curl and jq; ensure curl is available (the manifest currently lists only jq).
  4. Run the script in a controlled environment (sandbox) for testing and verify that network egress is acceptable for your data.
  5. If you need local/offline transcription or stronger data controls, consider alternatives that keep audio on-device or in a trusted environment.
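
Since curl is needed but not declared in the manifest, a preflight check for both tools can surface a missing binary before any audio is processed. A minimal sketch; `check_deps` is a hypothetical helper, not part of the skill:

```shell
# Hypothetical preflight: report every required tool that is not on PATH.
check_deps() {
  missing=0
  for tool in "$@"; do
    if ! command -v "$tool" >/dev/null 2>&1; then
      echo "missing dependency: $tool" >&2
      missing=1
    fi
  done
  return "$missing"
}
```

Running `check_deps curl jq` returns a non-zero status and names each absent tool on stderr.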
Capability Analysis
Type: OpenClaw Skill
Name: coze-asr
Version: 1.0.0
The coze-asr skill is a legitimate tool for transcribing audio files using the Coze API. The script `scripts/speech_to_text.sh` correctly implements the transcription request by sending the audio file to the official Coze endpoint (api.coze.cn) using the provided API key. No malicious behaviors, such as unauthorized data exfiltration or hidden execution, were detected.
Capability Assessment
Purpose & Capability
Name, description, SKILL.md, and the provided script all consistently implement speech-to-text via Coze API. The declared requirement (jq) and required env var (COZE_API_KEY) are appropriate for this purpose.
Instruction Scope
Instructions and the script stick to the ASR task: validating the audio file, reading COZE_API_KEY, and POSTing the file to https://api.coze.cn/v1/audio/transcriptions. Note: the script invokes curl but the manifest only lists jq as a required binary — curl should be declared. Also be aware the script uploads entire audio content to an external service (Coze), which has privacy implications.
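
The flow described here (validate the file, read the key, POST to the endpoint) can be sketched as below. The endpoint URL comes from the assessment above; the multipart field name `file` and the Bearer authorization header are assumptions not confirmed by this listing, and `transcribe` is a hypothetical name rather than the script's own function:

```shell
# Sketch only: the multipart field name "file" and the Bearer header are
# assumptions; check the Coze API reference before relying on them.
transcribe() {
  audio_file=$1
  [ -f "$audio_file" ] || { echo "no such file: $audio_file" >&2; return 2; }
  [ -n "${COZE_API_KEY:-}" ] || { echo "COZE_API_KEY is not set" >&2; return 3; }
  # --fail makes curl exit non-zero on HTTP errors instead of printing the error body.
  curl --silent --show-error --fail \
    -H "Authorization: Bearer ${COZE_API_KEY}" \
    -F "file=@${audio_file}" \
    "https://api.coze.cn/v1/audio/transcriptions"
}
```

Both checks run before curl is invoked, so a missing file or key never triggers a network call.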
Install Mechanism
No install spec (instruction-only + a local script) — low installer risk. Nothing is downloaded from arbitrary URLs and no archives are extracted. The script will execute network calls at runtime (curl) but does not install additional software.
Credentials
Only COZE_API_KEY is required, which is proportional to calling an authenticated ASR API. No unrelated credentials, config paths, or excessive environment access are requested.
Persistence & Privilege
The skill does not request permanent presence or elevated platform privileges (always is false). It does not modify other skills or system-wide configs.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install coze-asr
  3. After installation, invoke the skill by name or use /coze-asr
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release: Coze speech recognition skill, supporting ogg/mp3/wav formats
Metadata
Slug coze-asr
Version 1.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is Coze Asr?

Automatic Speech Recognition (ASR) using Coze API. Use when you need to transcribe audio files to text. Supports Chinese audio transcription via Coze's speec... It is an AI Agent Skill for Claude Code / OpenClaw, with 216 downloads so far.

How do I install Coze Asr?

Run "/install coze-asr" in the OpenClaw or Claude Code chat to install it in one step. To use it, you also need to set the COZE_API_KEY environment variable as described under Setup.

Is Coze Asr free?

Yes, Coze Asr is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Coze Asr support?

Coze Asr is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created Coze Asr?

It is built and maintained by xiaofei (@franklu0819-lang); the current version is v1.0.0.
