← Back to Skills Marketplace

Speech Recognition Local

Name: Speech Recognition Local
Author: zktufo

by lllleo · GitHub ↗ · v1.0.3 · MIT-0

linuxdarwinwin32 ⚠ suspicious

273

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install speech-recognition-local

Description

本地语音转文字 / Local Speech-to-Text. 使用 faster-whisper 在本地运行 Whisper 模型，无需 API 费用，完全免费。收到语音消息(.ogg .m4a .mp3)自动触发转录，支持中文/英文/日语/自动检测。| Free local STT/TTS alternati...

Usage Guidance

This skill is generally coherent with its stated purpose (local STT using faster-whisper) but check a few things before installing: 1) The script imports faster_whisper but does not install it for you — install faster-whisper (and its dependencies) in your environment first. 2) The first run will typically download model weights from the internet (Hugging Face or the library's source), so it is not strictly 'fully offline' until you pre-download and cache the model; expect bandwidth and storage usage. 3) Review available disk space and the 25MB per-file limit. 4) If privacy is critical, pre-download model files manually in a controlled environment so the runtime doesn't fetch them from external servers. 5) If you want automated dependency installation, consider adding a vetted install step (pip/conda) or run in an isolated virtualenv. These mismatches look like sloppy documentation rather than malicious intent, but verify dependencies and initial model download behavior before trusting with sensitive audio.

Capability Analysis

Type: OpenClaw Skill Name: speech-recognition-local Version: 1.0.3 The skill provides local speech-to-text functionality using the legitimate faster-whisper library. The Python script (scripts/transcribe.py) implements standard file validation and transcription logic without any indicators of data exfiltration, malicious execution, or prompt injection. All operations are performed locally as described in SKILL.md.

Capability Assessment

ℹ Purpose & Capability

Name/description match the code: a local transcription script using faster-whisper. However the description's 'fully offline' claim conflicts with the documented '首次使用自动下载' (first-use auto-download) behavior — initial model download requires network access.

✓ Instruction Scope

SKILL.md instructs the agent to run a single local script with an audio file and optional language. The script only reads the provided audio file and model artifacts; it does not reference unrelated system files, environment variables, or external endpoints in its code. No data exfiltration is present.

ℹ Install Mechanism

There is no install spec (low installation risk). SKILL.md says 'faster-whisper（首次使用自动安装）', but the script contains no auto-install logic (it simply imports faster_whisper). That mismatch means the user/agent must ensure faster-whisper and model artifacts are present; otherwise the script will fail and the model download will be triggered by the library at runtime.

✓ Credentials

No environment variables, credentials, or config paths are requested. The skill does not ask for unrelated secrets or elevated access.

✓ Persistence & Privilege

The skill does not request permanent presence (always=false) and does not modify other skills or system-wide agent settings. It caches the model in memory during a run, which is expected behavior.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install speech-recognition-local
After installation, invoke the skill by name or use /speech-recognition-local
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.3

Fix English recognition: default to auto language detection, increase beam_size to 5 for better accuracy

v1.0.2

Optimized bilingual SEO description, added feature highlights and use cases

v1.0.1

1.0.1: 修复目录结构 + 模型缓存 + 错误处理 + 多语言支持

Metadata

Slug speech-recognition-local

Version 1.0.3

License MIT-0

All-time Installs 2

Active Installs 2

Total Versions 3

Frequently Asked Questions

What is Speech Recognition Local?

本地语音转文字 / Local Speech-to-Text. 使用 faster-whisper 在本地运行 Whisper 模型，无需 API 费用，完全免费。收到语音消息(.ogg .m4a .mp3)自动触发转录，支持中文/英文/日语/自动检测。| Free local STT/TTS alternati... It is an AI Agent Skill for Claude Code / OpenClaw, with 273 downloads so far.

How do I install Speech Recognition Local?

Run "/install speech-recognition-local" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Speech Recognition Local free?

Yes, Speech Recognition Local is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Speech Recognition Local support?

Speech Recognition Local is cross-platform and runs anywhere OpenClaw / Claude Code is available (linux, darwin, win32).

Who created Speech Recognition Local?

It is built and maintained by lllleo (@zktufo); the current version is v1.0.3.

More Skills