stardusten

TencentCloud ASR

by Stardusten · GitHub ↗ · v0.1.5 · MIT-0
cross-platform ⚠ suspicious
1466
Downloads
5
Stars
17
Active Installs
5
Versions
Install in OpenClaw
/install tencentcloud-asr
Description
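Tencent Cloud speech recognition (ASR) Skill for speech-to-text, audio transcription, subtitle generation, meeting transcription, voice message recognition, and recognition of audio from local files or URLs. It includes three modes: sentence recognition (short audio, <=60s), flash recording-file recognition (medium to long audio, <=2h/100MB, fast synchronous return), and recording-file recognition (long audio, <=5h, asynchronous). Supports Mandarin, English, Cantonese, Japanese, Korean...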
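The three limits in the description suggest a simple routing rule. The following is a minimal sketch of that rule, assuming only the limits quoted above; the function name `choose_mode`, the mode labels, and the reading of 100MB as 100 * 1024 * 1024 bytes are illustrative assumptions, not taken from the skill's scripts, whose actual routing rules may differ.

```python
from typing import Optional

# Limits copied from the skill description. Names below are hypothetical.
SENTENCE_MAX_SECONDS = 60
FLASH_MAX_SECONDS = 2 * 60 * 60
FLASH_MAX_BYTES = 100 * 1024 * 1024  # assumption: "100MB" read as 100 MiB
FILE_MAX_SECONDS = 5 * 60 * 60


def choose_mode(duration_seconds: float, size_bytes: int) -> Optional[str]:
    """Return a mode label for the audio, or None if it exceeds every limit."""
    if duration_seconds <= SENTENCE_MAX_SECONDS:
        return "sentence"  # short audio, synchronous
    if duration_seconds <= FLASH_MAX_SECONDS and size_bytes <= FLASH_MAX_BYTES:
        return "flash"  # medium to long audio, fast synchronous return
    if duration_seconds <= FILE_MAX_SECONDS:
        return "file"  # long audio, asynchronous
    return None
```

For example, a one-hour recording of 200 MiB is too large for the flash limit in this sketch and falls through to the asynchronous file mode.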
Usage Guidance
Plain-language considerations before installing or using this skill:
- The skill actually needs your Tencent Cloud credentials (TENCENTCLOUD_SECRET_ID and TENCENTCLOUD_SECRET_KEY, and sometimes TENCENTCLOUD_APPID) even though the registry metadata says none. Do not paste secrets into group chat; prefer configuring them locally or providing them only in a secure, private channel.
- At runtime the skill may auto-install software: it can call system package managers (apt/dnf/yum/zypper/brew/winget/choco, possibly with sudo) to install ffmpeg/ffprobe and will pip-install the tencentcloud SDK. That means it can modify the host system and requires network access. Only run it on machines where you accept those changes (e.g., disposable VM, container, or developer machine), or inspect and run the scripts manually yourself.
- The code references and may read host file paths (e.g., QQ Bot downloads directory) for integrations. If you don't want that, don't enable the QQ Bot integration, or run the skill in an isolated environment.
- The SKILL.md includes helpful security guidance (prefer temporary env injection, don't write keys to shell profiles), but some reference docs also show how to persist keys. Decide your preferred credential handling policy and enforce it (temporary env vs. persistent profile).
- If you are not comfortable with automated package installs, review the scripts (ensure_ffmpeg.py, file_recognize.py, etc.) and run them manually under supervision, or set up ffmpeg and the Python SDK yourself before invoking the skill.
What would increase my confidence: updated registry metadata that correctly lists required environment variables/primary credential, and an explicit install manifest or an option to disable autonomous installation. If those aren't provided, treat this skill as requiring elevated trust and run it only in an isolated environment.
Capability Analysis
Type: OpenClaw Skill
Name: tencentcloud-asr
Version: 0.1.5
The skill bundle contains scripts designed for autonomous system-level configuration and dependency management, which are high-risk capabilities. Specifically, `ensure_ffmpeg.py` attempts to install FFmpeg using various system package managers (e.g., apt-get, yum, brew) and may use sudo or add external repositories like RPM Fusion. Additionally, several scripts including `file_recognize.py` and `flash_recognize.py` perform autonomous pip installations of dependencies. While these actions are aligned with the stated purpose of providing ASR functionality, the instructions in `SKILL.md` direct the AI agent to execute these high-privilege setup scripts autonomously, which could lead to unauthorized system modifications or be exploited if the agent's environment is not properly sandboxed.
Capability Assessment
Purpose & Capability
The skill name, docs, and scripts all implement Tencent Cloud ASR functionality (sentence/flash/file modes), so the stated purpose and the code are coherent. However, the registry metadata declares no required environment variables or primary credential, while the scripts and references explicitly require TENCENTCLOUD_SECRET_ID/TENCENTCLOUD_SECRET_KEY (and sometimes TENCENTCLOUD_APPID). This metadata omission is an inconsistency that could mislead users about the secrets the skill needs.
Instruction Scope
SKILL.md instructs the agent to run local scripts (inspect_audio.py, ensure_ffmpeg.py, self_check.py, and the various recognizers). The docs also include integration guidance that accesses system paths (e.g., /home/admin/.openclaw/qqbot/downloads for QQ Bot integration) and recommend accepting/processing user-provided credentials. The skill explicitly permits autonomous installation of system components (ffmpeg/ffprobe) and running pip installs; these behaviors go beyond purely analyzing an uploaded audio file and require careful user consent.
Install Mechanism
There is no static install spec in registry metadata (instruction-only), but scripts perform dynamic installs at runtime: ensure_ffmpeg.py drives system package managers (apt/dnf/yum/zypper/brew/winget/choco) and may run sudo, and file_recognize.py auto-installs the tencentcloud SDK via pip. ensure_ffmpeg.py also contains logic to fetch RPM Fusion RPMs via a mirror URL as a repo fallback. These dynamic install actions modify the host and involve network downloads. They are expected for full ASR functionality, but they are higher-risk than pure instruction-only skills and are not reflected in metadata.
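One way to see in advance whether those runtime installs would be triggered is to check for the dependencies yourself. The sketch below is a read-only check that installs nothing; the function name `preflight` is hypothetical and not part of the skill, and it assumes the SDK is importable as `tencentcloud` (the import name of the tencentcloud-sdk-python package).

```python
import importlib.util
import shutil
from typing import Dict


def preflight() -> Dict[str, bool]:
    """Report which dependencies are already present, without installing anything."""
    return {
        # shutil.which returns None when the executable is not on PATH.
        "ffmpeg": shutil.which("ffmpeg") is not None,
        "ffprobe": shutil.which("ffprobe") is not None,
        # find_spec returns None when the top-level module cannot be found.
        "tencentcloud": importlib.util.find_spec("tencentcloud") is not None,
    }


if __name__ == "__main__":
    for name, present in preflight().items():
        print(f"{name}: {'found' if present else 'missing'}")
```

If everything is reported as found, the skill's scripts presumably have nothing left to install, although that depends on how each script performs its own checks.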
Credentials
The skill requires Tencent Cloud credentials (SecretId/SecretKey) and optionally AppId, documented across references and enforced by scripts (require_credentials, get_credentials). The registry metadata reported 'Required env vars: none' and 'Primary credential: none', which is contradictory. The skill also references TENCENTCLOUD_TOKEN optionally. Asking for these secrets is proportional to the service, but the metadata omission and the skill's guidance around receiving credentials via chat (even while warning about risks) are notable issues for users with limited security awareness.
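As an illustration of handling these variables without exposing them, here is a minimal sketch that reads the credentials from the environment and masks them before logging. The helpers `load_credentials` and `mask` are hypothetical; they are not the skill's `require_credentials` or `get_credentials`, whose behavior is not shown in this listing.

```python
import os
from typing import Dict, Mapping

# Variable names as listed in the assessment above.
REQUIRED_VARS = ("TENCENTCLOUD_SECRET_ID", "TENCENTCLOUD_SECRET_KEY")
OPTIONAL_VARS = ("TENCENTCLOUD_APPID", "TENCENTCLOUD_TOKEN")


def load_credentials(env: Mapping[str, str] = os.environ) -> Dict[str, str]:
    """Hypothetical helper: collect credentials from the environment or raise."""
    missing = [name for name in REQUIRED_VARS if not env.get(name)]
    if missing:
        raise RuntimeError("Missing environment variables: " + ", ".join(missing))
    creds = {name: env[name] for name in REQUIRED_VARS}
    creds.update({name: env[name] for name in OPTIONAL_VARS if env.get(name)})
    return creds


def mask(value: str) -> str:
    """Hide all but the last four characters; hide short values entirely."""
    if len(value) <= 4:
        return "*" * len(value)
    return "*" * (len(value) - 4) + value[-4:]
```

Passing the variables only for the duration of one command, rather than writing them to a shell profile, matches the temporary env injection that the SKILL.md guidance is said to prefer.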
Persistence & Privilege
The skill is not marked always:true, and does not request persistent platform privileges. However, runtime behavior includes: attempting to install system packages (possibly with sudo), adding repos (rpmfusion fallback), and installing pip packages — all of which can alter the host system. The skill also contains guidance for integrating with host systems (OpenClaw/QQ Bot) that accesses host paths. That level of side-effecting system access is significant and should be explicitly consented to by the user/environment operator.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install tencentcloud-asr
  3. After installation, invoke the skill by name or use /tencentcloud-asr
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v0.1.5
minor
v0.1.4
Version 0.1.4
- Added self-diagnostic script (`scripts/self_check.py`) to validate Tencent Cloud credentials and ASR capability before real recognition.
- Introduced new `sentence_recognize.py` script, replacing the previous `main.py` for short audio recognition.
- Added comprehensive, step-by-step activation and configuration guides in references (e.g., `tencent_cloud_activation.md`, `env_config.md`).
- Included sample audio (`assets/16k.wav`) and visual guides for easier onboarding.
- Updated documentation for clearer workflow, routing rules, engine options, and common troubleshooting.
v0.1.2
python 3.6 support
v0.1.1
add cli transcription support
v0.1.0
Initial release
Metadata
Slug tencentcloud-asr
Version 0.1.5
License MIT-0
All-time Installs 17
Active Installs 17
Total Versions 5
Frequently Asked Questions

What is TencentCloud ASR?

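Tencent Cloud speech recognition (ASR) Skill for speech-to-text, audio transcription, subtitle generation, meeting transcription, voice message recognition, and recognition of audio from local files or URLs. It includes three modes: sentence recognition (short audio, <=60s), flash recording-file recognition (medium to long audio, <=2h/100MB, fast synchronous return), and recording-file recognition (long audio, <=5h, asynchronous). Supports Mandarin, English, Cantonese, Japanese, Korean... It is an AI Agent Skill for Claude Code / OpenClaw, with 1466 downloads so far.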

How do I install TencentCloud ASR?

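Run "/install tencentcloud-asr" in the OpenClaw or Claude Code chat to install it in one step. The install command itself needs no extra setup, but at runtime the skill requires Tencent Cloud credentials (TENCENTCLOUD_SECRET_ID and TENCENTCLOUD_SECRET_KEY, and sometimes TENCENTCLOUD_APPID) and may install ffmpeg/ffprobe and the tencentcloud SDK.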

Is TencentCloud ASR free?

Yes, TencentCloud ASR is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does TencentCloud ASR support?

TencentCloud ASR is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created TencentCloud ASR?

It is built and maintained by Stardusten (@stardusten); the current version is v0.1.5.
