← Back to Skills Marketplace
twinsgeeks

Deepseek Deepseek Coder

by Twin Geeks · GitHub ↗ · v1.0.3 · MIT-0
darwinlinuxwindows ✓ Security Clean
143
Downloads
2
Stars
2
Active Installs
4
Versions
Install in OpenClaw
/install deepseek-deepseek-coder
Description
DeepSeek DeepSeek-Coder — run DeepSeek-V3, DeepSeek-R1, DeepSeek-Coder across your local fleet. 7-signal scoring routes every request to the best device. Cro...
README (SKILL.md)

DeepSeek — Run DeepSeek Models Across Your Local Fleet

Run DeepSeek-V3, DeepSeek-R1, and DeepSeek-Coder on your own hardware. The fleet router picks the best device for every request — no cloud API needed, zero per-token costs, all data stays on your machines.

Supported DeepSeek models

Model Parameters Ollama name Best for
DeepSeek-V3 671B MoE (37B active) deepseek-v3 General — matches GPT-4o on most benchmarks
DeepSeek-V3.1 671B MoE deepseek-v3.1 Hybrid thinking/non-thinking modes
DeepSeek-V3.2 671B MoE deepseek-v3.2 Improved reasoning + agent performance
DeepSeek-R1 1.5B–671B deepseek-r1 Reasoning — approaches O3 and Gemini 2.5 Pro
DeepSeek-Coder 1.3B–33B deepseek-coder Code generation (87% code, 13% NL training)
DeepSeek-Coder-V2 236B MoE (21B active) deepseek-coder-v2 Code — matches GPT-4 Turbo on code tasks

Setup

pip install ollama-herd
herd              # start the router (port 11435)
herd-node         # run on each machine

Package: ollama-herd | Repo: github.com/geeks-accelerator/ollama-herd

Models are pulled on demand — the router auto-pulls when a request arrives for a model not yet on any node, or you can pull manually via the dashboard. No models are downloaded during installation.

Use DeepSeek through the fleet

OpenAI SDK

from openai import OpenAI

client = OpenAI(base_url="http://localhost:11435/v1", api_key="not-needed")

# DeepSeek-R1 for reasoning
response = client.chat.completions.create(
    model="deepseek-r1:70b",
    messages=[{"role": "user", "content": "Prove that there are infinitely many primes"}],
    stream=True,
)
for chunk in response:
    print(chunk.choices[0].delta.content or "", end="")

DeepSeek-Coder for code

response = client.chat.completions.create(
    model="deepseek-coder-v2:16b",
    messages=[{"role": "user", "content": "Write a Redis cache decorator in Python"}],
)
print(response.choices[0].message.content)

Ollama API

# DeepSeek-V3 general chat
curl http://localhost:11435/api/chat -d '{
  "model": "deepseek-v3",
  "messages": [{"role": "user", "content": "Explain quantum computing"}],
  "stream": false
}'

# DeepSeek-R1 reasoning
curl http://localhost:11435/api/chat -d '{
  "model": "deepseek-r1:70b",
  "messages": [{"role": "user", "content": "Solve this step by step: ..."}],
  "stream": false
}'

Hardware recommendations (optional — choose models that fit your RAM)

Cross-platform: These are example configurations. Any device (Mac, Linux, Windows) with equivalent RAM works. The fleet router runs on all platforms.

DeepSeek offers models at every size. Pick the one that fits your available memory — smaller models work great for most tasks:

Model Min RAM Recommended hardware
deepseek-r1:1.5b 4GB Any Mac
deepseek-r1:7b 8GB Mac Mini M4 (16GB)
deepseek-r1:14b 12GB Mac Mini M4 (24GB)
deepseek-r1:32b 24GB Mac Mini M4 Pro (48GB)
deepseek-r1:70b 48GB Mac Studio M4 Max (128GB)
deepseek-coder-v2:16b 12GB Mac Mini M4 (24GB)
deepseek-v3 256GB+ Mac Studio M3 Ultra (512GB)

The fleet router automatically sends requests to the machine where the model is loaded — no manual routing needed.

Why run DeepSeek locally

  • Zero cost — DeepSeek API charges per token. Local is free after hardware.
  • Privacy — code and business data never leave your network.
  • No rate limits — DeepSeek API throttles during peak hours. Local has no throttle.
  • Availability — DeepSeek API has had outages. Your hardware doesn't depend on their servers.
  • Fleet routing — multiple machines share the load. One busy? Request goes to the next.

Fleet features

  • 7-signal scoring — picks the optimal node for every request
  • Auto-retry — fails over to next best node transparently
  • VRAM-aware fallback — routes to a loaded model in the same category instead of cold-loading
  • Context protection — prevents expensive model reloads from num_ctx changes
  • Request tagging — track per-project DeepSeek usage

Also available on this fleet

Other LLM models

Llama 3.3, Qwen 3.5, Phi 4, Mistral, Gemma 3 — any Ollama model routes through the same endpoint.

Image generation

curl -o image.png http://localhost:11435/api/generate-image \
  -H "Content-Type: application/json" \
  -d '{"model":"z-image-turbo","prompt":"a sunset","width":1024,"height":1024,"steps":4}'

Speech-to-text

curl http://localhost:11435/api/transcribe -F "[email protected]"

Embeddings

curl http://localhost:11435/api/embeddings -d '{"model":"nomic-embed-text","prompt":"query"}'

Dashboard

http://localhost:11435/dashboard — monitor DeepSeek requests alongside all other models. Per-model latency, token throughput, health checks.

Full documentation

Agent Setup Guide

Guardrails

  • Model downloads require explicit user confirmation — DeepSeek models range from 1GB (1.5B) to 400GB+ (671B). Always confirm before pulling.
  • Model deletion requires explicit user confirmation — never remove models without asking.
  • Never delete or modify files in ~/.fleet-manager/.
  • If a DeepSeek model is too large for available memory, suggest a smaller variant (e.g., deepseek-r1:7b instead of :70b).
  • No models are downloaded automatically — all pulls are user-initiated or require opt-in via the auto_pull setting.
Usage Guidance
This skill is internally consistent with its purpose, but before installing: (1) verify the PyPI package (ollama-herd) and the GitHub repo (review recent commits, maintainer identity, and issues); (2) install in a virtualenv or isolated VM if you are unsure; (3) be prepared for very large model downloads and ensure disk space and bandwidth; (4) confirm any model downloads prompt you before proceeding (the docs claim confirmation is required); (5) check what filesystem paths the herd service writes to (~/.fleet-manager/...) and restrict permissions if needed; (6) consider running the router behind a firewall or localhost-only interface to avoid exposing models to the network. If you want higher confidence, request the actual repository code or a pinned package artifact for review before installing.
Capability Assessment
Purpose & Capability
Name/description, examples (curl/OpenAI SDK), and required tools (curl/wget, optional python/pip) all align with running a local fleet router and calling a localhost Ollama-compatible API. The referenced config paths (~/.fleet-manager/latency.db, ~/.fleet-manager/logs/herd.jsonl) are consistent with a fleet manager's state and logs.
Instruction Scope
SKILL.md instructs installing 'ollama-herd' and running local processes (herd, herd-node) and making requests to localhost:11435. It does not instruct reading unrelated system files, exporting secrets, or sending data to external endpoints other than pulling models on demand (which the docs say requires confirmation).
Install Mechanism
This is instruction-only with no install spec; the docs instruct users to 'pip install ollama-herd'. That is coherent but introduces typical supply-chain risk because installing a PyPI package runs third-party code on your machine and may trigger on-demand model downloads. The SKILL itself does not include or pin any binaries.
Credentials
No environment variables or credentials are requested. Example code uses a localhost base_url and sets api_key to 'not-needed'. There are no unexpected credential requests in the SKILL.md or metadata.
Persistence & Privilege
always is false (not force-included). The skill does not request elevated platform privileges or attempt to modify other skills' configuration. Autonomous invocation is allowed by default, which is normal; nothing else indicates persistent privileged presence.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install deepseek-deepseek-coder
  3. After installation, invoke the skill by name or use /deepseek-deepseek-coder
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.3
Cross-platform support: macOS, Linux, and Windows. Updated OS metadata, descriptions, and hardware recommendations.
v1.0.2
- Model download and deletion guardrails clarified: models are now only downloaded with explicit user confirmation, never automatically on install. - Installation instructions updated: model downloads are handled on demand by the router or manually via the dashboard. - Hardware recommendations are now positioned as optional; users are encouraged to select models that fit available RAM. - Emphasized that no models are downloaded automatically—pulling requires opt-in or manual action. - Version bumped to 1.0.2.
v1.0.1
- Updated skill name and description to "deepseek-deepseek-coder" with a focus on DeepSeek-Coder. - No changes to dependencies, usage, features, or documentation content. - Version remains at 1.0.0; only descriptive and naming details updated.
v1.0.0
Initial release: Run DeepSeek models locally across your own fleet with optimized routing and zero cloud costs. - Supports DeepSeek-V3, V3.1, V3.2, R1, Coder, and Coder-V2 models via ollama-herd - Automatically routes requests to the best available device using 7-signal scoring - Enables OpenAI SDK and Ollama API compatibility for DeepSeek deployments - Provides hardware recommendations for running various DeepSeek model sizes - Includes fleet features like auto-retry, VRAM-aware fallback, context protection, and request tagging - Adds dashboard for monitoring model requests, latency, and throughput - No cloud API, no per-token charges, and keeps all data on your machines
Metadata
Slug deepseek-deepseek-coder
Version 1.0.3
License MIT-0
All-time Installs 2
Active Installs 2
Total Versions 4
Frequently Asked Questions

What is Deepseek Deepseek Coder?

DeepSeek DeepSeek-Coder — run DeepSeek-V3, DeepSeek-R1, DeepSeek-Coder across your local fleet. 7-signal scoring routes every request to the best device. Cro... It is an AI Agent Skill for Claude Code / OpenClaw, with 143 downloads so far.

How do I install Deepseek Deepseek Coder?

Run "/install deepseek-deepseek-coder" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Deepseek Deepseek Coder free?

Yes, Deepseek Deepseek Coder is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Deepseek Deepseek Coder support?

Deepseek Deepseek Coder is cross-platform and runs anywhere OpenClaw / Claude Code is available (darwin, linux, windows).

Who created Deepseek Deepseek Coder?

It is built and maintained by Twin Geeks (@twinsgeeks); the current version is v1.0.3.

💬 Comments