← Back to Skills Marketplace

Grok Imagine

Name: Grok Imagine
Author: ademczuk

by ademczuk · GitHub ↗ · v1.1.0

cross-platform ⚠ suspicious

467

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install grok-imagine-extended

Description

Generate images and videos using xAI Grok Imagine Extended. Text-to-image, image editing, text-to-video, image-to-video. Use when: user asks to generate, cre...

README (SKILL.md)

\r \r

Grok Imagine Extended (xAI Image & Video Generation)\r

\r Generate images and videos from text prompts using the xAI API.\r \r

Image Generation\r

python3 {baseDir}/scripts/generate_image.py --prompt "your image description" --filename "output.png"\r
```\r
\r
With options:\r
\r
```bash\r
python3 {baseDir}/scripts/generate_image.py --prompt "a cyberpunk city at night" --filename "city.png" --resolution 2k --aspect-ratio 16:9\r
```\r
\r
## Image Editing\r
\r
Single source image:\r
\r
```bash\r
python3 {baseDir}/scripts/generate_image.py --prompt "make it a watercolor painting" --filename "edited.png" -i "/path/to/source.jpg"\r
```\r
\r
Multiple source images (up to 3):\r
\r
```bash\r
python3 {baseDir}/scripts/generate_image.py --prompt "combine into one scene" --filename "combined.png" -i img1.png -i img2.png\r
```\r
\r
## Video Generation\r
\r
Text-to-video:\r
\r
```bash\r
python3 {baseDir}/scripts/generate_image.py --prompt "a cat walking through flowers" --filename "cat.mp4" --video --duration 5\r
```\r
\r
Image-to-video (animate a still):\r
\r
```bash\r
python3 {baseDir}/scripts/generate_image.py --prompt "add gentle camera zoom and wind" --filename "animated.mp4" --video -i photo.jpg --duration 5\r
```\r
\r
## Models\r
\r
| Model | Type | Cost |\r
|-------|------|------|\r
| `grok-imagine-image` | Image (default) | $0.02/img |\r
| `grok-imagine-image-pro` | Image (high quality) | $0.07/img |\r
| `grok-imagine-video` | Video (auto for --video) | $0.05/sec |\r
\r
Select model with `--model grok-imagine-image-pro`. Video mode always uses `grok-imagine-video`.\r
\r
## All Options\r
\r
| Flag | Description |\r
|------|-------------|\r
| `--prompt`, `-p` | Text description (required) |\r
| `--filename`, `-f` | Output path (required) |\r
| `-i` | Input image for editing/animation (repeatable, max 3 for images, 1 for video) |\r
| `--model`, `-m` | Image model (default: grok-imagine-image) |\r
| `--aspect-ratio`, `-a` | 1:1, 16:9, 9:16, 4:3, 3:4, etc. |\r
| `--resolution`, `-r` | Image: 1k/2k. Video: 480p/720p |\r
| `--n` | Number of images 1-10 (default 1) |\r
| `--video` | Generate video instead of image |\r
| `--duration`, `-d` | Video duration 1-15 seconds (default 5) |\r
| `--api-key`, `-k` | Override XAI_API_KEY |\r
\r
## API Key\r
\r
- `XAI_API_KEY` env var\r
- Or set `skills."grok-imagine".apiKey` / `skills."grok-imagine".env.XAI_API_KEY` in `~/.openclaw/openclaw.json`\r
- Or auto-read from `~/keys.txt`\r
\r
## Notes\r
\r
- Use timestamps in filenames: `2026-03-01-cyberpunk-city.png`\r
- The script prints a `MEDIA:` line for OpenClaw to auto-attach on supported chat providers\r
- Do not read the image back; report the saved path only\r
- Image URLs from xAI are temporary; the script downloads them immediately\r
- Video generation is async and polls until done (can take 1-5 minutes)\r
- 2k resolution returns PNG; 1k returns JPEG\r

Usage Guidance

This skill appears to do what it says: call xAI's image/video endpoints, download media, and save files. Before installing, ensure you: (1) only provide an XAI_API_KEY you trust for image generation and monitor usage/costs (video polling can incur charges while it runs); (2) are comfortable with the script reading ~/keys.txt if that file exists (remove or secure it if not); (3) understand the skill will download remote media and write files to the provided output paths; and (4) note the SKILL.md mentions reading ~/.openclaw/openclaw.json but the bundled script does not — if you rely on that behavior, verify it or set the env/--api-key explicitly. Overall the package is coherent with its stated purpose.

Capability Analysis

Type: OpenClaw Skill Name: grok-imagine-extended Version: 1.1.0 The skill is classified as suspicious primarily due to a critical path traversal vulnerability in `scripts/generate_image.py`. The `download_file` function writes to `output_path` which is directly derived from the user-controlled `--filename` argument without sanitization, allowing an attacker to write files to arbitrary locations (e.g., `../../../../tmp/malicious.txt`). Additionally, the script attempts to load the `XAI_API_KEY` from `~/keys.txt`, which, while intended for convenience, expands the attack surface for API key compromise if an attacker can manipulate that file via prompt injection.

Capability Assessment

✓ Purpose & Capability

Name/description, required XAI_API_KEY, endpoints (api.x.ai), and the provided script all align with an image/video generation skill. Required capabilities are proportional to the stated purpose.

ℹ Instruction Scope

Runtime instructions tell the agent to run the included script and only reference XAI_API_KEY, a local keys.txt fallback, and output paths. The script does network calls to the xAI API, downloads returned media to disk, and polls for video status as documented — all within the scope. Minor mismatch: SKILL.md also claims the key can be read from ~/.openclaw/openclaw.json, but the included script only implements env var, explicit --api-key, and ~/keys.txt lookup.

✓ Install Mechanism

No install spec; the skill is instruction-only with a bundled script. Nothing is downloaded or executed at install time.

ℹ Credentials

Only XAI_API_KEY is required (declared as primary). The script optionally reads ~/keys.txt for a fallback key (documented). There are no unrelated secrets requested. Users should note the script will read keys.txt if present — ensure that file's contents and permissions are acceptable.

✓ Persistence & Privilege

always is false and the skill does not request persistent/privileged presence or modify other skills or system-wide settings. Autonomous invocation is allowed but is the platform default.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install grok-imagine-extended
After installation, invoke the skill by name or use /grok-imagine-extended
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.1.0

- Switched all usage examples from uv to python3 for running scripts. - Updated requirements: removed need to install uv and related instructions. - Changed metadata to only require XAI_API_KEY environment variable (removed bins/uv). - Removed OpenClaw-specific notes and instructions about media line and script interaction. - No changes to command-line options, features, or overall functionality.

v1.0.2

No user-facing changes in this release. - No file changes detected compared to the previous version. - Functionality and documentation remain the same.

v1.0.1

Version 1.0.1 - Updated API key auto-discovery: removed support for reading from C:/Projects/keys.txt (Windows). - SKILL.md and script instructions updated to reflect this change.

v1.0.0

Initial release: text-to-image, image editing (up to 3 sources), text-to-video, image-to-video via xAI Grok Imagine API. Models: grok-imagine-image, grok-imagine-image-pro, grok-imagine-video.

Metadata

Slug grok-imagine-extended

Version 1.1.0

License —

All-time Installs 0

Active Installs 0

Total Versions 4

Frequently Asked Questions

What is Grok Imagine?

Generate images and videos using xAI Grok Imagine Extended. Text-to-image, image editing, text-to-video, image-to-video. Use when: user asks to generate, cre... It is an AI Agent Skill for Claude Code / OpenClaw, with 467 downloads so far.

How do I install Grok Imagine?

Run "/install grok-imagine-extended" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Grok Imagine free?

Yes, Grok Imagine is completely free (open-source). You can download, install and use it at no cost.

Which platforms does Grok Imagine support?

Grok Imagine is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Grok Imagine?

It is built and maintained by ademczuk (@ademczuk); the current version is v1.1.0.

More Skills