midasheng-audio-text-distance
by Junbo Zhang · v1.0.0 · MIT-0
167 Downloads · 0 Stars · 0 Active Installs · 1 Version
Install in OpenClaw
/install midasheng-audio-text-distance
Description
Multilingual audio-text retrieval and classification using GLAP (General Language Audio Pretraining). Use when user needs to search/match audio files against...
Usage Guidance
This skill appears to do what it says: it uploads audio files to a Xiaomi-hosted GLAP search API and returns similarity/classification results. Before installing or using it, consider: (1) Privacy: audio files are sent to https://llmplus.ai.xiaomi.com with no authentication in the examples, so do not upload sensitive or proprietary recordings unless you trust the service and its terms; (2) Network usage: the tool requires outbound network access; (3) Sanity check: test with non-sensitive samples first; (4) Alternatives: if you need on-device processing or encryption, prefer a local model or an API that supports authenticated, private uploads. Note that SKILL.md lists curl as a requirement while the included script uses the Python requests library: install curl to run the documented examples, or make sure requests is available to run the script.
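The curl/requests mismatch noted above can be checked before running anything. The following is a minimal sketch, not part of the skill itself; the helper name check_dependencies is hypothetical, and it only reports what is available without installing anything.

```python
import importlib.util
import shutil


def check_dependencies():
    """Report whether curl is on PATH and whether the requests package is importable.

    Hypothetical helper for illustration; it is not shipped with the skill.
    """
    return {
        # shutil.which returns None when the executable is not found on PATH
        "curl": shutil.which("curl") is not None,
        # find_spec returns None when the top-level package is not installed
        "requests": importlib.util.find_spec("requests") is not None,
    }


if __name__ == "__main__":
    for name, available in check_dependencies().items():
        print(f"{name}: {'available' if available else 'missing'}")
```

If curl is missing, the documented examples will not run as written; if requests is missing, the included script will presumably fail at import time.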
Capability Analysis
Type: OpenClaw Skill
Name: midasheng-audio-text-distance
Version: 1.0.0
The skill provides a legitimate interface for multilingual audio-text retrieval and classification using a Xiaomi AI API (llmplus.ai.xiaomi.com). The Python script (audiosearch.py) and instructions (SKILL.md) are well-documented, lack any signs of data exfiltration or unauthorized execution, and include functional queue management logic to help the AI agent handle service latency.
Capability Assessment
Purpose & Capability
The skill's name and description claim audio-text retrieval via GLAP and all required artifacts (SKILL.md examples and scripts/audiosearch.py) perform exactly that against the Xiaomi llmplus.ai.xiaomi.com/dasheng/audio/search endpoint. There are no unrelated binaries, config paths, or credentials requested.
Instruction Scope
Runtime instructions and the included script only read user-supplied audio files and call the documented remote API (and a metrics endpoint for queue status). They do not read arbitrary system files or environment variables beyond what the user supplies. The SKILL.md and script consistently show network calls to the stated endpoint.
Install Mechanism
This is an instruction-only skill with no install spec and a single small Python script; nothing is downloaded or written to disk by an installer, which minimizes install-time risk.
Credentials
No environment variables or credentials are requested (proportionate). However, the skill uploads audio files to a third-party endpoint (llmplus.ai.xiaomi.com) without any authentication in the provided examples, so sensitive audio will be transmitted off-host; users should consider privacy and trust of that endpoint before use.
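Because audio leaves the host without authentication, a cautious user might confirm that any upload URL points at the stated host over HTTPS before sending a file. The sketch below is one possible guard, assuming a simple allowlist; the names ALLOWED_HOSTS and is_expected_endpoint are illustrative and do not come from the skill.

```python
from urllib.parse import urlparse

# Host named in this assessment; treating it as an allowlist is an assumption
# made for illustration, not something the skill does itself.
ALLOWED_HOSTS = {"llmplus.ai.xiaomi.com"}


def is_expected_endpoint(url: str) -> bool:
    """Return True only for HTTPS URLs whose hostname is on the allowlist."""
    parsed = urlparse(url)
    return parsed.scheme == "https" and parsed.hostname in ALLOWED_HOSTS
```

A check like this does not make the upload private; it only guards against a file being sent somewhere other than the endpoint described here.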
Persistence & Privilege
always is false: the skill does not request persistent system presence or modify other skills or config. It behaves as a normal, non-persistent, user-invoked utility.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat: /install midasheng-audio-text-distance
- After installation, invoke the skill by name or use /midasheng-audio-text-distance
- Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
- Initial release of midasheng-audio-text-distance.
- Enables multilingual audio-text retrieval and classification using the GLAP model.
- Supports searching and matching audio files against text descriptions, classifying audio by text queries, and zero-shot audio event detection in 50+ languages.
- Provides queue status monitoring and guidance on interpreting service response delays.
- Supports multiple common audio formats (mp3, wav, flac, ogg, m4a).
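Since every upload goes to a remote service, it can help to screen files locally against the formats listed above before sending them. This is a minimal sketch that checks file extensions only; the helper names are hypothetical, and it does not verify the actual audio encoding or how the service handles other formats.

```python
from pathlib import Path

# Formats listed in the v1.0.0 release notes.
SUPPORTED_EXTENSIONS = {".mp3", ".wav", ".flac", ".ogg", ".m4a"}


def is_supported_audio(path: str) -> bool:
    """Return True if the file extension matches a listed format (case-insensitive)."""
    return Path(path).suffix.lower() in SUPPORTED_EXTENSIONS


def filter_supported(paths):
    """Split paths into (supported, rejected) lists, preserving input order."""
    supported, rejected = [], []
    for p in paths:
        (supported if is_supported_audio(p) else rejected).append(p)
    return supported, rejected
```

For example, filter_supported(["dog.wav", "notes.txt"]) separates the .wav file from the text file, so only the former would be considered for upload.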
Frequently Asked Questions
What is midasheng-audio-text-distance?
Multilingual audio-text retrieval and classification using GLAP (General Language Audio Pretraining). Use when user needs to search/match audio files against... It is an AI Agent Skill for Claude Code / OpenClaw, with 167 downloads so far.
How do I install midasheng-audio-text-distance?
Run "/install midasheng-audio-text-distance" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is midasheng-audio-text-distance free?
Yes, midasheng-audio-text-distance is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does midasheng-audio-text-distance support?
midasheng-audio-text-distance is cross-platform and runs anywhere OpenClaw / Claude Code is available.
Who created midasheng-audio-text-distance?
It is built and maintained by Junbo Zhang (@jimbozhang); the current version is v1.0.0.