← Back to Skills Marketplace
impkind

Local Whisper

by ImpKind · GitHub ↗ · v1.5.0
cross-platform ✓ Security Clean
3019
Downloads
9
Stars
1
Active Installs
5
Versions
Install in OpenClaw
/install whisper-mlx-local
Description
Free local speech-to-text for Telegram and WhatsApp using MLX Whisper on Apple Silicon. Private, no API costs.
Usage Guidance
This skill appears to be what it claims: a local, Apple Silicon-optimized Whisper daemon and CLI. Before installing, consider the following: (1) The package will run a local HTTP daemon on 127.0.0.1:8787 — only allow trusted local callers (OpenClaw is intended), and be cautious about other local apps talking to that port. (2) The daemon accepts a JSON field containing a filesystem path and will open that path for transcription; ensure only trusted processes can instruct the daemon to avoid accidental processing of unintended files. (3) requirements.txt includes OpenAI and Groq client libraries; if you have OPENAI_API_KEY or GROQ_API_KEY set in your environment (or in a loaded .env), the code can use cloud backends — remove those env vars or avoid installing those packages if you require strict local-only operation. (4) Install via pip installs third-party packages from PyPI — review requirements.txt and consider using a virtualenv or isolated account. (5) If you want automatic startup, inspect the LaunchAgent plist before copying it into ~/Library/LaunchAgents. Overall this skill is coherent with its stated purpose, but take the above precautions if your threat model requires strict local-only privacy.
Capability Analysis
Type: OpenClaw Skill Name: whisper-mlx-local Version: 1.5.0 The skill provides local speech-to-text transcription using MLX Whisper, with optional cloud API fallbacks. All observed behaviors, including the use of `launchctl` for daemon persistence (documented in SKILL.md and README.md), local network communication via `curl` to `localhost:8787` (scripts/transcribe.sh), and optional external API calls to OpenAI/Groq (scripts/transcriber.py) for transcription, are clearly aligned with its stated purpose. There is no evidence of data exfiltration to unauthorized endpoints, malicious execution, obfuscation, or prompt injection attempts against the agent.
Capability Assessment
Purpose & Capability
The name/description (local Whisper on Apple Silicon) match the code: the project provides an MLX-backed daemon and CLI to transcribe audio locally. However, requirements.txt and transcriber.py also include optional cloud backends (OpenAI, Groq) and their client libraries; those are optional (used only if the packages are installed and API keys are present) but are not declared in SKILL.md as optional cloud-capable behavior. This is plausible design (local-first with optional fallbacks) but worth noting because the README emphasizes 'private, no API costs'.
Instruction Scope
SKILL.md's runtime instructions are focused on installing deps, running a local daemon, and wiring OpenClaw to call the provided CLI. The daemon accepts either raw audio uploads or JSON containing a local file path and will read that path off disk and transcribe it. Accepting file-system paths is required for the intended integration, but it also means the daemon can be directed (by local callers) to open arbitrary files on disk — a potential surprise if untrusted local processes can talk to the daemon. The daemon binds to 127.0.0.1, reducing remote exposure.
Install Mechanism
Install is manual via 'pip3 install -r requirements.txt' (PyPI). No downloads from untrusted URLs or archives. requirements.txt contains platform-conditional MLX package and optional cloud client libraries; pip installation will pull those packages from PyPI.
Credentials
The skill declares no required environment variables, which is accurate for basic local operation. The code uses dotenv and will consume environment variables if present (e.g., OPENAI_API_KEY, GROQ_API_KEY, CLAWD_WHISPER_* for port/backend/model, CLAWD_WHISPER_URL in the CLI). If API keys are present in the environment, the transcriber may use cloud backends, which conflicts with the 'private, no API costs' messaging unless you ensure no cloud keys are set. This is optional behavior but should be explicitly considered by users who expect strict local-only operation.
Persistence & Privilege
The skill is not always-enabled and does not autonomously modify other skills. It provides optional instructions to install a LaunchAgent plist for auto-start, which is normal for a user-installed local daemon. The daemon runs as a user process bound to 127.0.0.1; it does not request elevated privileges or modify other skill configurations automatically.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install whisper-mlx-local
  3. After installation, invoke the skill by name or use /whisper-mlx-local
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.5.0
Added OpenClaw configuration instructions
v1.2.0
Clearer messaging: free voice message transcription
v1.1.0
Rebrand: Clear positioning as free OpenAI Whisper replacement
v1.0.1
Clean release: removed attribution references
v1.0.0
Initial release: Free local speech-to-text for Apple Silicon
Metadata
Slug whisper-mlx-local
Version 1.5.0
License
All-time Installs 2
Active Installs 1
Total Versions 5
Frequently Asked Questions

What is Local Whisper?

Free local speech-to-text for Telegram and WhatsApp using MLX Whisper on Apple Silicon. Private, no API costs. It is an AI Agent Skill for Claude Code / OpenClaw, with 3019 downloads so far.

How do I install Local Whisper?

Run "/install whisper-mlx-local" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Local Whisper free?

Yes, Local Whisper is completely free (open-source). You can download, install and use it at no cost.

Which platforms does Local Whisper support?

Local Whisper is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Local Whisper?

It is built and maintained by ImpKind (@impkind); the current version is v1.5.0.

💬 Comments