← Back to Skills Marketplace

MOSI Transcribe Diarize 多说话人转写

Name: MOSI Transcribe Diarize 多说话人转写
Author: mkkb473

by YYL · GitHub ↗ · v1.0.0 · MIT-0

cross-platform ⚠ suspicious

235

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install mosi-transcribe-diarize

Description

MOSS 多说话人转写技能。支持 URL / 本地文件 / Base64 音频输入，输出带时间戳与 speaker 的结构化转写结果（JSON、逐段文本、按说话人汇总）。用于会议纪要、访谈录音、多人对话整理。

Usage Guidance

What to consider before installing: - The script will upload any provided audio (including local files you point to) to https://studio.mosi.cn and sends your API key as a Bearer token. Only supply an API key you trust the service with and avoid sending highly sensitive audio unless you have an acceptable data policy. - There is a metadata inconsistency: the registry entry claims no required env vars but SKILL.md/script require MOSS_API_KEY (with fallbacks). Treat this as a documentation gap — confirm where to store/approve the key before installation. - The script requires the Python 'requests' package; the skill does not declare installation steps. Make sure your environment has required Python deps. - The code enforces HTTPS and restricts the host to studio.mosi.cn, which limits accidental exfiltration to other domains. That is good, but you should still verify the service (studio.mosi.cn) and the skill owner (currently 'unknown'). - If you need higher assurance: ask the publisher to (a) correct registry metadata to declare the required env vars, (b) provide provenance for the code (source repo, signing), and (c) document the data retention/privacy policy for uploaded audio. Rotate or scope API keys used with this skill where possible.

Capability Analysis

Type: OpenClaw Skill Name: mosi-transcribe-diarize Version: 1.0.0 The skill is a legitimate tool for audio transcription and diarization using the MOSI Studio API (studio.mosi.cn). The script `scripts/transcribe.py` includes a security validation check that restricts API requests to the official domain via HTTPS, effectively preventing the tool from being used for arbitrary data exfiltration. The logic is transparent, lacks obfuscation, and strictly follows the functionality described in the documentation.

Capability Assessment

ℹ Purpose & Capability

Name/description (multi‑speaker transcription) align with the included script and runtime instructions: the script sends audio (URL, local file encoded as data URL, or base64) to studio.mosi.cn for diarization. However, registry metadata claims no required env vars/credentials while the SKILL.md and script require an API key (MOSS_API_KEY / MOSI_TTS_API_KEY / MOSI_API_KEY). This metadata mismatch is an inconsistency to be aware of.

✓ Instruction Scope

SKILL.md instructs the agent to run scripts/transcribe.py with clearly scoped arguments (--audio-url, --file, --audio-data). The script only reads the provided audio file (if --file), base64-encodes it, and POSTs JSON to the enforced endpoint (https://studio.mosi.cn). It does not attempt to read arbitrary system files or other environment variables beyond the API keys described.

ℹ Install Mechanism

This is instruction-only with an included Python script (no network install). No external downloads or installers are used. The script imports the requests library but no install step or dependency declaration is provided in the registry metadata or SKILL.md — users will need Python and the requests package available in the environment.

⚠ Credentials

The script requires an API key (it checks MOSS_API_KEY and falls back to MOSI_TTS_API_KEY or MOSI_API_KEY) and will send that key as a Bearer token to studio.mosi.cn. That credential is appropriate for the declared purpose, but the registry metadata incorrectly lists 'Required env vars: none' and 'Primary credential: none' — an incoherence that could confuse permission/approval workflows. No unrelated secrets are requested.

✓ Persistence & Privilege

The skill does not request persistent/invisible presence (always:false). It does not alter other skills or agent-wide config. Autonomous invocation is allowed (platform default) but not combined with other concerning privileges.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install mosi-transcribe-diarize
After installation, invoke the skill by name or use /mosi-transcribe-diarize
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.0

Fork of moss-transcribe-diarize with env declaration for MOSI_TTS_API_KEY and homepage metadata

Metadata

Slug mosi-transcribe-diarize

Version 1.0.0

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 1

Frequently Asked Questions

What is MOSI Transcribe Diarize 多说话人转写?

MOSS 多说话人转写技能。支持 URL / 本地文件 / Base64 音频输入，输出带时间戳与 speaker 的结构化转写结果（JSON、逐段文本、按说话人汇总）。用于会议纪要、访谈录音、多人对话整理。 It is an AI Agent Skill for Claude Code / OpenClaw, with 235 downloads so far.

How do I install MOSI Transcribe Diarize 多说话人转写?

Run "/install mosi-transcribe-diarize" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is MOSI Transcribe Diarize 多说话人转写 free?

Yes, MOSI Transcribe Diarize 多说话人转写 is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does MOSI Transcribe Diarize 多说话人转写 support?

MOSI Transcribe Diarize 多说话人转写 is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created MOSI Transcribe Diarize 多说话人转写?

It is built and maintained by YYL (@mkkb473); the current version is v1.0.0.

More Skills