midasheng-audio-text-distance

Name: midasheng-audio-text-distance
Author: jimbozhang

by Junbo Zhang · GitHub ↗ · v1.0.0 · MIT-0

cross-platform ✓ Security Clean

Downloads 167

Install in OpenClaw

/install midasheng-audio-text-distance

Description

Multilingual audio-text retrieval and classification using GLAP (General Language Audio Pretraining). Use when user needs to search/match audio files against...

Usage Guidance

This skill appears to do what it says: it uploads audio files to a Xiaomi-hosted GLAP search API and returns similarity and classification results. Before installing or using it, consider: (1) Privacy: audio files are sent to https://llmplus.ai.xiaomi.com, and the examples use no authentication, so do not upload sensitive or proprietary recordings unless you trust the service and its terms; (2) Network usage: the tool requires outbound network access; (3) Sanity check: test with non-sensitive samples first; (4) Alternatives: if you need on-device processing or encryption, prefer a local model or an API that supports authenticated, private uploads. Note that the SKILL.md lists curl as a requirement while the script uses Python requests, so install curl if you want to run the examples, or make sure the Python requests package is available if you want to run the included script.
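For the sanity check in point (3), one way to get a guaranteed non-sensitive sample is to synthesize it locally. The sketch below is not part of the skill; it uses only the Python standard library, and the function name, file name, and tone parameters are illustrative assumptions.

```python
import math
import struct
import wave


def write_test_tone(path, freq_hz=440.0, seconds=1.0, sample_rate=16000):
    """Write a mono 16-bit PCM sine tone to `path` and return the frame count.

    Hypothetical helper for producing a harmless test file; not part of the skill.
    """
    n_frames = int(seconds * sample_rate)
    frames = bytearray()
    for i in range(n_frames):
        # Half-scale amplitude leaves headroom and avoids clipping.
        sample = int(0.5 * 32767 * math.sin(2 * math.pi * freq_hz * i / sample_rate))
        frames += struct.pack("<h", sample)  # little-endian signed 16-bit
    with wave.open(path, "wb") as wav:
        wav.setnchannels(1)       # mono
        wav.setsampwidth(2)       # 2 bytes = 16-bit samples
        wav.setframerate(sample_rate)
        wav.writeframes(bytes(frames))
    return n_frames


if __name__ == "__main__":
    write_test_tone("test_tone.wav")
```

A synthetic tone carries no private information, so it can be sent to the remote endpoint to confirm that the round trip works before any real recording is uploaded.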

Capability Analysis

Type: OpenClaw Skill
Name: midasheng-audio-text-distance
Version: 1.0.0

The skill provides a legitimate interface for multilingual audio-text retrieval and classification using a Xiaomi AI API (llmplus.ai.xiaomi.com). The Python script (audiosearch.py) and instructions (SKILL.md) are well-documented, lack any signs of data exfiltration or unauthorized execution, and include functional queue management logic to help the AI agent handle service latency.
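This listing does not show the script's queue management logic or the response format of the metrics endpoint, so the following is only a generic sketch of the idea: poll a queue-depth reading and wait until the service looks idle enough. The function name, the `get_queue_depth` callable, and all thresholds are hypothetical and are not taken from audiosearch.py.

```python
import time


def wait_for_capacity(get_queue_depth, max_depth=5, poll_seconds=2.0,
                      timeout_seconds=60.0, sleep=time.sleep,
                      clock=time.monotonic):
    """Poll until the reported queue depth is at most `max_depth`.

    `get_queue_depth` is a caller-supplied function returning an integer;
    how it obtains that number (for example, from a metrics endpoint) is
    an assumption left to the caller.
    Returns True when capacity is available, False on timeout.
    """
    deadline = clock() + timeout_seconds
    while True:
        if get_queue_depth() <= max_depth:
            return True
        if clock() >= deadline:
            return False
        sleep(poll_seconds)
```

Passing `sleep` and `clock` as parameters keeps the loop testable without real delays or network access.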

Capability Assessment

✓ Purpose & Capability

The skill's name and description claim audio-text retrieval via GLAP, and all required artifacts (SKILL.md examples and scripts/audiosearch.py) perform exactly that against the Xiaomi llmplus.ai.xiaomi.com/dasheng/audio/search endpoint. There are no unrelated binaries, config paths, or credentials requested.

✓ Instruction Scope

Runtime instructions and the included script only read user-supplied audio files and call the documented remote API (and a metrics endpoint for queue status). They do not read arbitrary system files or environment variables beyond what the user supplies. The SKILL.md and script consistently show network calls to the stated endpoint.

✓ Install Mechanism

This is an instruction-only skill with no install spec and a single small Python script; nothing is downloaded or written to disk by an installer, which minimizes install-time risk.

ℹ Credentials

No environment variables or credentials are requested (proportionate). However, the skill uploads audio files to a third-party endpoint (llmplus.ai.xiaomi.com) without any authentication in the provided examples, so sensitive audio will be transmitted off-host; users should consider privacy and trust of that endpoint before use.

✓ Persistence & Privilege

The "always" setting is false, and the skill does not request persistent system presence or modify other skills or config. It behaves as a normal, non-persistent, user-invoked utility.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install midasheng-audio-text-distance
After installation, invoke the skill by name or use /midasheng-audio-text-distance
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.0

- Initial release of midasheng-audio-text-distance.
- Enables multilingual audio-text retrieval and classification using the GLAP model.
- Supports searching and matching audio files against text descriptions, classifying audio by text queries, and zero-shot audio event detection in 50+ languages.
- Provides queue status monitoring and guidance on interpreting service response delays.
- Supports multiple common audio formats (mp3, wav, flac, ogg, m4a).
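Since only the listed formats are supported, it can be worth filtering files locally before anything is uploaded. The following is a minimal sketch under the assumption that a file's extension reflects its format; the helper names are hypothetical and do not come from the skill's script.

```python
from pathlib import Path

# Extensions taken from the v1.0.0 release notes.
SUPPORTED_EXTENSIONS = {".mp3", ".wav", ".flac", ".ogg", ".m4a"}


def is_supported_audio(path):
    """Return True if the file extension is one of the listed formats."""
    return Path(path).suffix.lower() in SUPPORTED_EXTENSIONS


def split_by_support(paths):
    """Partition paths into (supported, rejected) lists, preserving order."""
    supported, rejected = [], []
    for p in paths:
        (supported if is_supported_audio(p) else rejected).append(p)
    return supported, rejected
```

The check is extension-based only; it does not inspect file contents, so a mislabeled file would still pass.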

Metadata

Slug midasheng-audio-text-distance

Version 1.0.0

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 1

Frequently Asked Questions

What is midasheng-audio-text-distance?

Multilingual audio-text retrieval and classification using GLAP (General Language Audio Pretraining). Use when user needs to search/match audio files against... It is an AI Agent Skill for Claude Code / OpenClaw, with 167 downloads so far.

How do I install midasheng-audio-text-distance?

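Run "/install midasheng-audio-text-distance" in the OpenClaw or Claude Code chat to install it in one step. The skill itself has no install spec, but running the included script requires Python requests, and the SKILL.md examples use curl.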

Is midasheng-audio-text-distance free?

Yes, midasheng-audio-text-distance is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does midasheng-audio-text-distance support?

midasheng-audio-text-distance is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created midasheng-audio-text-distance?

It is built and maintained by Junbo Zhang (@jimbozhang); the current version is v1.0.0.
