toncall-videourl-text

Name: toncall-videourl-text
Author: juysoft

Description

唐潮Toncall工作室开发 - 视频URL语音转文字。支持任意可直接访问的视频URL，自动下载视频、提取音频、上传火山引擎TOS、调用火山引擎语音识别API识别文字，完成后自动清理本地和TOS的临时文件。使用：当用户发送视频URL并要求提取视频文案、语音转文字、转文字时直接执行；当用户只发送视频URL没有说明意...

Usage Guidance

This skill appears to do what it says: it downloads a provided video URL, extracts audio, uploads audio to your Volcengine TOS bucket, and calls Volcengine ASR to produce text. Before installing or running it: (1) Only use with videos you own or have permission to process — the audio is uploaded to third-party services. (2) Store credentials securely and consider using least-privilege/ephemeral keys for the TOS bucket (upload/delete permissions only for a dedicated bucket/folder). (3) Inspect the full script yourself — the provided file in the package is truncated in the listing and includes minor bugs (e.g., uses datetime.UTC which will fail in standard Python); fix or audit before production use. (4) Run the script in an isolated environment (container) and test with non-sensitive media first. (5) If you want keys kept out of disk, adapt the script to read them from a secure environment or secret store. If you want, I can (a) show the rest of the script if you paste it, (b) list the exact network endpoints and fields sent, or (c) suggest code changes to reduce privileges and improve safety.

Capability Analysis

Type: OpenClaw Skill Name: toncall-videourl-text Version: 1.0.0 The skill provides video-to-text transcription by downloading videos, extracting audio via ffmpeg, and utilizing Volcengine (ByteDance) TOS and ASR APIs. The Python script `video_url_to_text.py` implements standard AWS V4 signature logic for cloud storage interactions and includes robust cleanup routines for both local and remote temporary files. No evidence of malicious intent, credential exfiltration, or prompt injection was found; the code strictly follows the functionality described in `SKILL.md`.

Capability Assessment

✓ Purpose & Capability

The name/description (download video, extract audio, upload to Volcengine TOS, call Volcengine ASR) matches the code and SKILL.md. Required secrets are requested in a config.ini (TOS ak/sk, bucket, and ASR app/access keys) which is coherent with the stated purpose.

✓ Instruction Scope

Runtime instructions are limited and specific: run py scripts/video_url_to_text.py <url>. The script reads only the local config.ini, downloads the provided video URL, extracts audio with ffmpeg, uploads that audio to the user's TOS bucket, calls Volcengine ASR endpoints, writes a text result, and attempts to delete local and remote temp files. There are no instructions to scan arbitrary local files or exfiltrate unrelated data, but note that audio is uploaded to external services (Volcengine endpoints) — this is expected but privacy-relevant.

✓ Install Mechanism

This is an instruction-only skill with included Python script; there is no install spec or remote binary download. The only runtime dependency is ffmpeg and Python requests (both checked by the script), so installation risk is low.

ℹ Credentials

The script asks for TOS (object storage) credentials and Volcengine ASR keys in config.ini — these are proportional to uploading audio and calling the ASR API. Credentials are stored in a file rather than environment variables; that is a legitimate design choice but means you should secure config.ini. The skill does not request unrelated credentials.

✓ Persistence & Privilege

The skill does not request always:true or system-wide privileges. It does not modify other skills. It will use the provided TOS credentials to upload and delete objects in the configured bucket (full bucket-level privileges implied), so supply least-privilege credentials if possible.

Version History

v1.0.0

Initial release of the 视频URL语音转文字 skill: - Automatically processes any accessible video URL to extract audio, upload to TOS, perform speech-to-text with Volcano Engine API, and return transcription. - Cleans up all local and cloud temporary files after processing, regardless of success or failure. - Interactive: If user intent is unclear, asks whether video transcription is needed. - Easy setup with detailed config instructions; checks dependencies on startup. - Standalone Python script provided for integration or manual use.

Metadata

Slug toncall-videourl-text

Version 1.0.0

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 1

Frequently Asked Questions

What is toncall-videourl-text?

唐潮Toncall工作室开发 - 视频URL语音转文字。支持任意可直接访问的视频URL，自动下载视频、提取音频、上传火山引擎TOS、调用火山引擎语音识别API识别文字，完成后自动清理本地和TOS的临时文件。使用：当用户发送视频URL并要求提取视频文案、语音转文字、转文字时直接执行；当用户只发送视频URL没有说明意... It is an AI Agent Skill for Claude Code / OpenClaw, with 183 downloads so far.

How do I install toncall-videourl-text?

Run "/install toncall-videourl-text" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is toncall-videourl-text free?

Yes, toncall-videourl-text is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does toncall-videourl-text support?

toncall-videourl-text is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created toncall-videourl-text?

It is built and maintained by toncall (@juysoft); the current version is v1.0.0.

More Skills

What is toncall-videourl-text?

How do I install toncall-videourl-text?

Is toncall-videourl-text free?

Which platforms does toncall-videourl-text support?

Who created toncall-videourl-text?

💬 Comments