Groq API Inference

Name: Groq API Inference
Author: ivangdavila

Description

Build and debug Groq API chat and speech workflows with low-latency routing, structured outputs, and production-safe patterns.

README (SKILL.md)

Setup

On first use, read setup.md for activation preferences, credential verification, and default workflow setup.

When to Use

User needs to build, integrate, or troubleshoot Groq API inference for chat, tool calling, or speech transcription. Agent handles request shaping, model routing, failure recovery, and safe production patterns.

Architecture

Memory lives in ~/groq-api/. See memory-template.md for structure.

~/groq-api/
├── memory.md           # Status, activation preference, and defaults
├── requests/           # Reusable payload snippets
├── logs/               # Optional debug snapshots
└── experiments/        # Prompt/model A-B notes

Quick Reference

Use these files as decision aids, not as static docs: pick the smallest file that resolves the current blocker.

Topic	File
Setup process	`setup.md`
Memory template	`memory-template.md`
Request patterns	`api-patterns.md`
Model routing	`model-selection.md`
Failures and recovery	`troubleshooting.md`

Core Rules

1. Verify Auth and Endpoint Before Any Work

Check GROQ_API_KEY first and use Authorization: Bearer $GROQ_API_KEY for every request. Use https://api.groq.com/openai/v1 as the base URL and confirm access with /models.

curl -s https://api.groq.com/openai/v1/models \
  -H "Authorization: Bearer $GROQ_API_KEY" | jq '.data[0].id'

2. Start with a Minimal Deterministic Payload

Begin with small prompts and explicit format instructions. Add complexity only after the baseline call is stable.

3. Route by Task, Not by Habit

Use separate model choices for:

Fast interactive chat
High-accuracy reasoning
Speech transcription

Choose from live /models output instead of hardcoding assumptions.

4. Design for Retry and Degradation

For 429 and 5xx, retry with exponential backoff and capped attempts. If a model is overloaded, fail over to a compatible backup model and log the swap.

5. Validate Output Before Downstream Actions

If output feeds code execution or data writes, enforce JSON schema or strict parsing before acting. Reject malformed output early.

6. Treat Speech as a Separate Reliability Path

Speech uploads have different failure modes than chat. Validate input format, check file size, and surface transcription confidence when available.

7. Keep Secrets and User Data Scoped

Never store API keys in files. Keep request logs sanitized and avoid persisting full sensitive prompts unless the user explicitly asks.

Common Traps

Using stale model IDs copied from old examples -> call /models and select available IDs at runtime.
Sending giant prompts without truncation -> latency spikes and timeout risk.
Ignoring 429 backoff guidance -> repeated failures under load.
Mixing chat and transcription assumptions -> wrong endpoint and payload format.
Trusting free-form text for automation -> parse and validate before executing.

External Endpoints

All network traffic should be limited to these Groq endpoints for explicit inference tasks requested by the user.

Endpoint	Data Sent	Purpose
https://api.groq.com/openai/v1/models	None (GET)	Discover available models
https://api.groq.com/openai/v1/chat/completions	Prompt messages and options	Chat completions
https://api.groq.com/openai/v1/audio/transcriptions	Audio file and transcription params	Speech-to-text

No other data is sent externally.

Security & Privacy

Data that leaves your machine:

Prompt content sent to Groq inference endpoints
Audio content sent to Groq transcription endpoint when requested

Data that stays local:

Workflow preferences in ~/groq-api/memory.md
Optional local debug notes in ~/groq-api/logs/

This skill does NOT:

Store GROQ_API_KEY in project files
Access files outside ~/groq-api/ for persistence
Call undeclared third-party endpoints
Modify itself or other skills

Trust

By using this skill, prompts and optional audio content are sent to Groq. Only install if you trust Groq with that data.

Related Skills

Install with clawhub install \x3Cslug> if user confirms:

api — reusable REST patterns, auth, and error handling
models — model comparison and selection heuristics
ai — current AI landscape checks before implementation decisions
fine-tuning — adaptation workflows when prompting is not enough
langchain — orchestration patterns for multi-step LLM pipelines

Feedback

If useful: clawhub star groq-api
Stay updated: clawhub sync

Usage Guidance

This skill appears coherent for Groq API work, but follow safe practices before installing: only provide a Groq API key (GROQ_API_KEY) and never paste it into saved files; restrict permissions on ~/groq-api/ (chmod 700) so local memory/log files are private; inspect any saved logs before sharing; confirm that the actual runtime agent you use will not autonomously exfiltrate data (the skill's docs say it limits network calls to Groq endpoints, but the agent environment enforces that); and avoid installing unrelated skills that request other credentials unless you need them. If you want extra assurance, run the example curl /models check yourself to verify expected behavior before enabling broader automation.

Capability Analysis

Type: OpenClaw Skill Name: groq-api Version: 1.0.0 The OpenClaw skill bundle for Groq API inference is benign. It clearly defines its purpose, limits network communication to declared Groq API endpoints, and restricts local file access to a dedicated `~/groq-api/` directory. The skill explicitly states that `GROQ_API_KEY` is not stored in files and guides the agent to handle credentials responsibly. Instructions in `SKILL.md` and `setup.md` are focused on legitimate API usage, error handling, and user interaction, with no evidence of prompt injection attempts to subvert the agent or perform malicious actions like data exfiltration, unauthorized execution, or persistence mechanisms. All `curl` commands are well-formed and target the declared Groq API.

Capability Assessment

✓ Purpose & Capability

Name and description match the declared requirements: curl + jq and GROQ_API_KEY are appropriate for calling Groq inference endpoints and parsing responses. No unrelated services, binaries, or config paths are requested.

ℹ Instruction Scope

Runtime instructions are narrowly scoped to calling Groq endpoints, validating output, and storing small workflow files under ~/groq-api/. The docs explicitly advise not to store GROQ_API_KEY in files and to sanitize logs. This is coherent; the only minor note is that local logs and memory files could accidentally capture sensitive prompts if the user or agent chooses to persist them, so follow the guidance to sanitize before saving.

✓ Install Mechanism

Instruction-only skill with no install spec and no downloads — lowest-risk installation model. It only relies on commonly-available tools (curl, jq) which are declared as required.

✓ Credentials

Requests a single API key (GROQ_API_KEY), which is exactly what the skill needs. The SKILL.md does not reference other env vars or credentials beyond the declared one.

✓ Persistence & Privilege

Does not request permanent 'always' inclusion, does not modify other skills, and only writes to its own directory under the user's home. Persistence is minimal and scoped to the skill's memory/log files.

Version History

v1.0.0

Initial release with Groq API workflows, model routing guidance, and troubleshooting playbooks for chat and speech.

Metadata

Slug groq-api

Version 1.0.0

License —

All-time Installs 3

Active Installs 3

Total Versions 1

Frequently Asked Questions

What is Groq API Inference?

Build and debug Groq API chat and speech workflows with low-latency routing, structured outputs, and production-safe patterns. It is an AI Agent Skill for Claude Code / OpenClaw, with 535 downloads so far.

How do I install Groq API Inference?

Run "/install groq-api" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Groq API Inference free?

Yes, Groq API Inference is completely free (open-source). You can download, install and use it at no cost.

Which platforms does Groq API Inference support?

Groq API Inference is cross-platform and runs anywhere OpenClaw / Claude Code is available (linux, darwin, win32).

Who created Groq API Inference?

It is built and maintained by Iván (@ivangdavila); the current version is v1.0.0.

More Skills