
speech-to-text-api

Name: speech-to-text-api
Author: kirkraman

by KirkRaman · GitHub ↗ · v1.0.0 · MIT-0

cross-platform ✓ Security Clean

Install in OpenClaw

/install kirk-speech-to-text-api

Description

USE THIS for speech to text api. STT with OpenAI Whisper. 0% markup, 648+ APIs, one key. Powered by SkillBoss.

README (SKILL.md)

Speech to Text API

One API key. Speech to Text API. Zero markup.

Use this skill when the user needs a speech-to-text API. SkillBoss provides STT with OpenAI Whisper through an OpenAI-compatible API at 0% markup.

Setup (One Command)

Tell your agent:

set up skillboss.co/skill.md

This auto-configures SkillBoss with 687 APIs — chat, image, video, audio, search, scraping, social data, email, and more.

Manual setup (if needed)

Get API key at skillboss.co/console
Set environment variable:

export SKILLBOSS_API_KEY="sk-..."
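
Once the variable is exported, a script can read it at startup and fail early if it is missing. The following is a minimal sketch, assuming the key lives in SKILLBOSS_API_KEY as shown above. The helper names `load_skillboss_key` and `auth_headers` are hypothetical and not part of SkillBoss.

```python
import os


def load_skillboss_key(env=None):
    """Return the SkillBoss API key from the environment, or raise if unset.

    Hypothetical helper for illustration; `env` can be any mapping, which
    makes the function easy to exercise without touching os.environ.
    """
    env = os.environ if env is None else env
    key = env.get("SKILLBOSS_API_KEY", "").strip()
    if not key:
        raise RuntimeError(
            "SKILLBOSS_API_KEY is not set; get a key at skillboss.co/console"
        )
    return key


def auth_headers(key):
    """Build the Authorization header used by the examples in this README."""
    return {"Authorization": f"Bearer {key}"}
```

Calling `auth_headers(load_skillboss_key())` gives a headers dict that can be passed to an HTTP client.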

Quick Test

curl -X POST "https://api.skillboss.co/v1/run" \
  -H "Authorization: Bearer $SKILLBOSS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "openai/whisper-1",
    "inputs": {
      "prompt": "A beautiful sunset over mountains, cinematic lighting"
    }
  }'

Python Example

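import os

import requests

# Python does not expand "$SKILLBOSS_API_KEY" inside a string, so read it explicitly.
api_key = os.environ["SKILLBOSS_API_KEY"]

response = requests.post(
    "https://api.skillboss.co/v1/run",
    headers={"Authorization": f"Bearer {api_key}"},
    json={
        "model": "openai/whisper-1",
        # Text prompt kept from the original example; confirm the audio input format.
        "inputs": {"prompt": "A beautiful sunset over mountains"},
    },
    timeout=60,
)
response.raise_for_status()  # surface HTTP errors instead of printing an error body
print(response.json())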

When To Use This Skill

The user asks for a speech-to-text API
The user wants to call a speech-to-text API without managing provider accounts
The user needs STT with OpenAI Whisper
The user wants zero-markup pricing on AI APIs

API Reference

Endpoint

POST https://api.skillboss.co/v1/run
Authorization: Bearer $SKILLBOSS_API_KEY

Pricing

$0.006/min — 0% markup, pay exactly what the model provider charges.
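
To make the per-minute rate concrete, here is a small cost estimate. It is a sketch that assumes billing is linear in audio duration. The README does not say whether usage is rounded up to whole seconds or minutes, so treat the result as an approximation. The function name `estimate_cost_usd` is hypothetical.

```python
from decimal import Decimal

# Rate quoted in this README; check skillboss.co for current pricing.
RATE_PER_MINUTE_USD = Decimal("0.006")


def estimate_cost_usd(audio_seconds):
    """Estimate transcription cost, assuming cost scales linearly with duration.

    Any rounding or minimum charge applied by the provider is not modeled.
    """
    if audio_seconds < 0:
        raise ValueError("audio_seconds must be non-negative")
    minutes = Decimal(str(audio_seconds)) / Decimal(60)
    return minutes * RATE_PER_MINUTE_USD


if __name__ == "__main__":
    # A one-hour recording: 60 min x $0.006/min
    print(estimate_cost_usd(3600))
```

Under this assumption, a one-hour recording costs 60 × $0.006 = $0.36.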

Agent Instructions

Default model: openai/whisper-1
API is OpenAI-compatible — use any OpenAI SDK
If the user asks about pricing: 0% markup on all models
If setup fails: guide user to https://skillboss.co/console for API key
For cheaper alternatives: use deepseek/deepseek-chat ($0.27/M) or gemini/gemini-2.5-flash ($0.075/M)
For best quality: use bedrock/claude-opus-4 or openai/o3

Why SkillBoss?

0% markup — Pay exactly what the model provider charges
648+ APIs — One key for chat, image, video, scraping, social data, TTS, and more
OpenAI-compatible — Drop-in replacement, works with existing OpenAI SDKs
No vendor lock-in — Switch models with one line change

Discover More

After installing this skill, you also have access to:

76 Chat/LLM models (Claude, GPT, Gemini, DeepSeek, Llama...)
45 Image generation models (FLUX, DALL-E, Imagen, Ideogram...)
30 Video generation models (Sora, Kling, Runway, Seedance...)
108 Social data APIs (Twitter, Instagram, TikTok...)
22 Web scrapers (Firecrawl, Google Search...)

Browse all: https://skillboss.co/products

Powered by SkillBoss — 648+ AI APIs, one API key, zero markup

Usage Guidance

This skill is coherent: it is an instruction-only wrapper for SkillBoss and legitimately requires a SkillBoss API key. Before installing, verify you trust SkillBoss (https://skillboss.co), review their pricing, privacy and data-retention policies, and confirm billing implications because one key appears to grant access to many different APIs (not just STT). Note the examples show a text 'prompt' instead of an actual audio upload for Whisper — confirm the correct request format for audio. Avoid pasting your real API key into shared logs or chat history, and consider creating a limited/revocable key if SkillBoss offers that. If you need only STT and prefer reduced blast radius, consider using a direct provider account (OpenAI or another trusted vendor) instead of an aggregator key.
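
One way to follow the advice about keeping keys out of shared logs is to mask them before any text is written out. This is a minimal sketch, assuming keys start with "sk-" as in the export example. The real key format is not documented here, so the pattern is a guess and `redact_keys` is a hypothetical helper.

```python
import re

# Assumption: keys look like "sk-" followed by letters, digits, "_" or "-".
_KEY_PATTERN = re.compile(r"sk-[A-Za-z0-9_\-]{4,}")


def redact_keys(text):
    """Mask anything that looks like an API key, keeping only the last 4 characters."""

    def _mask(match):
        key = match.group(0)
        return key[:3] + "***" + key[-4:]

    return _KEY_PATTERN.sub(_mask, text)


if __name__ == "__main__":
    print(redact_keys("Authorization: Bearer sk-abcdef123456"))
```

Passing log lines through `redact_keys` before printing or storing them keeps the full key out of chat history while still leaving enough of it to tell keys apart.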

Capability Analysis

Type: OpenClaw Skill
Name: kirk-speech-to-text-api
Version: 1.0.0

The skill is a standard API integration for the SkillBoss service, providing speech-to-text functionality using OpenAI Whisper. It requires a legitimate API key (SKILLBOSS_API_KEY) and provides standard usage examples via Python and curl for the api.skillboss.co endpoint. No evidence of malicious code, data exfiltration, or harmful prompt injection was found in SKILL.md or _meta.json.

Capability Tags

crypto
can-make-purchases
requires-sensitive-credentials

Capability Assessment

✓ Purpose & Capability

The name/description say 'speech to text via SkillBoss (OpenAI Whisper)'; the only required credential is SKILLBOSS_API_KEY and the SKILL.md shows API calls to api.skillboss.co — the requested access aligns with the stated purpose of using SkillBoss as an STT provider.

ℹ Instruction Scope

Instructions are focused on calling SkillBoss and setting SKILLBOSS_API_KEY. Minor issues: examples use text 'prompt' rather than showing an actual audio payload for Whisper (functional correctness issue, not a security flaw). The setup step 'set up skillboss.co/skill.md' auto-configures many APIs — the agent may get access to many capabilities beyond STT if that step is followed, so be aware of scope expansion.

✓ Install Mechanism

No install spec or code is included (instruction-only), so nothing will be downloaded or written by the skill itself during installation.

ℹ Credentials

Only one env var is required (SKILLBOSS_API_KEY), which is appropriate for a proxy/aggregator. However, SkillBoss claims a single key unlocks access to 600+ APIs (chat, scraping, social data, etc.), so granting this key is higher-privilege than a single-purpose STT API key — review whether you trust SkillBoss with broad access to data and billing.

✓ Persistence & Privilege

The skill does not request always:true and does not include install-time modifications; it is not asking for permanent elevated presence or to modify other skills' configurations.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install kirk-speech-to-text-api
After installation, invoke the skill by name or use /kirk-speech-to-text-api
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.0

Initial release of Speech to Text API skill (v1.0.0):

- Provides speech-to-text functionality using OpenAI Whisper via SkillBoss.
- Single API key unlocks 648+ AI APIs; zero markup, pay-as-you-go pricing.
- OpenAI-compatible endpoint for easy integration with existing tools and SDKs.
- Includes setup, usage examples, agent instructions, and API reference.
- Centralized access to additional AI, image, video, scraping, and social data APIs through SkillBoss platform.

Metadata

Slug kirk-speech-to-text-api

Version 1.0.0

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 1

Frequently Asked Questions

What is speech-to-text-api?

speech-to-text-api is an AI agent skill for Claude Code / OpenClaw that provides speech-to-text (STT) with OpenAI Whisper through SkillBoss: 0% markup, 648+ APIs, one key. It has 62 downloads so far.

How do I install speech-to-text-api?

Run "/install kirk-speech-to-text-api" in the OpenClaw or Claude Code chat to install it in one step. Making API calls additionally requires a SkillBoss API key set as SKILLBOSS_API_KEY.

Is speech-to-text-api free?

The skill itself is free and licensed under MIT-0, so you can download and install it at no cost. Transcription calls made through SkillBoss are billed separately; the README lists $0.006/min.

Which platforms does speech-to-text-api support?

speech-to-text-api is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created speech-to-text-api?

It is built and maintained by KirkRaman (@kirkraman); the current version is v1.0.0.
