fanxi-ju

豆包AI图像生成 (Doubao AI Image Generation)

by FANXI-JU · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ⚠ suspicious
183 Downloads · 0 Stars · 0 Active Installs · 1 Version
Install in OpenClaw
/install doubao-ai-image
Description
Generate free AI images via Doubao web interface using automated browser interaction for detailed and styled visual content without API costs.
README (SKILL.md)

doubao-ai-image

Description

Free AI image generation using Doubao (豆包) web interface through browser automation. Mimics human interaction to generate and download images without API costs.

Usage Scenarios

  • Free AI image generation for personal or business use
  • When API-based image generation is not available or too expensive
  • Automated image creation for reports, presentations, or social media
  • Integration with other workflows requiring visual content

Workflow

  1. Navigate to Doubao: Open https://www.doubao.com/chat/ in browser
  2. Access Image Generation: Click on the "图像生成" (Image Generation) or "AI 创作" (AI Creation) feature
  3. Input Prompt: Enter detailed image description in the text box
  4. Generate Images: Submit the prompt and wait for AI generation
  5. Select Image: Choose from the generated options (typically 4 variations)
  6. Capture Image: Use browser automation to screenshot the large preview image (ref="preview") to avoid download issues
  7. Save Locally: Store image in /workspace/ai_images/doubao/ directory with timestamp
  8. Deliver: Send image directly to Feishu chat or other specified destination
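
The steps above can be sketched as a single function. The skill does not name an automation tool, so the `BrowserDriver` interface below and every method on it are hypothetical stand-ins for whatever the agent runtime provides. Only the URL, the two menu labels, and the `preview` ref come from the workflow itself. Saving and delivery (steps 7 and 8) are left out, since the skill does not describe how Feishu is authenticated.

```python
from typing import Protocol

DOUBAO_CHAT_URL = "https://www.doubao.com/chat/"
IMAGE_MENU_LABELS = ("图像生成", "AI 创作")  # labels named in step 2


class BrowserDriver(Protocol):
    """Hypothetical interface; map these onto your real automation tool."""

    def open(self, url: str) -> None: ...
    def click_text(self, label: str) -> bool: ...  # True if the label was found
    def submit_prompt(self, prompt: str) -> None: ...
    def wait_for_images(self) -> int: ...  # number of generated options
    def select_image(self, index: int) -> None: ...
    def screenshot_ref(self, ref: str) -> bytes: ...


def generate_image(driver: BrowserDriver, prompt: str, choice: int = 0) -> bytes:
    """Run steps 1-6 and return the screenshot bytes of the preview."""
    driver.open(DOUBAO_CHAT_URL)  # step 1
    # Step 2: try each known label and stop at the first one that is found.
    if not any(driver.click_text(label) for label in IMAGE_MENU_LABELS):
        raise RuntimeError("image generation entry not found; the UI may have changed")
    driver.submit_prompt(prompt)  # steps 3-4
    count = driver.wait_for_images()
    if not 0 <= choice < count:
        raise IndexError(f"choice {choice} out of range for {count} images")
    driver.select_image(choice)  # step 5
    return driver.screenshot_ref("preview")  # step 6
```

Keeping the driver behind an interface means the flow can be exercised with a fake driver before it ever touches the live site.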

Key Features

  • Completely free AI image generation (no API costs)
  • Human-like browser interaction to avoid bot detection
  • Supports detailed prompts with style, composition, and quality specifications
  • Automatic file management and delivery integration
  • Works with Doubao's Seedream 4.5 model and various aspect ratios/styles
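
As an illustration of a prompt that carries style, composition, and aspect-ratio details, here is a minimal sketch of a prompt builder. The function name, its parameters, and the comma-separated phrasing are assumptions made for this example; the skill does not specify how Doubao interprets these hints in free text.

```python
from typing import Optional


def build_prompt(
    subject: str,
    style: Optional[str] = None,
    composition: Optional[str] = None,
    aspect_ratio: Optional[str] = None,
) -> str:
    """Join the subject with optional style, composition, and ratio hints."""
    subject = subject.strip()
    if not subject:
        raise ValueError("subject must not be empty")
    parts = [subject]
    if style:
        parts.append(f"{style.strip()} style")
    if composition:
        parts.append(composition.strip())
    if aspect_ratio:
        parts.append(f"aspect ratio {aspect_ratio.strip()}")
    return ", ".join(parts)


print(build_prompt("a lighthouse at dusk", style="realistic", aspect_ratio="16:9"))
# prints: a lighthouse at dusk, realistic style, aspect ratio 16:9
```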

Technical Requirements

  • Browser automation capability with proper user agent
  • Element targeting for precise screenshot capture (using ARIA refs)
  • Image format handling (PNG, JPG, WebP)
  • Error handling for generation failures or UI changes
  • Wait mechanisms for AI generation completion
  • Local file system access for image storage
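
The wait mechanism and the failure handling listed above could be implemented as a bounded polling loop. This is a minimal sketch, not the skill's own method: `wait_until`, the default timeout and interval, and the `check` callback (which would wrap a real "is the preview ready?" query) are all assumptions.

```python
import time
from typing import Callable, Optional, TypeVar

T = TypeVar("T")


def wait_until(
    check: Callable[[], Optional[T]],
    timeout_s: float = 120.0,
    interval_s: float = 2.0,
    clock: Callable[[], float] = time.monotonic,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Poll `check` until it returns a non-None value or the timeout expires."""
    deadline = clock() + timeout_s
    while True:
        result = check()
        if result is not None:
            return result
        if clock() >= deadline:
            raise TimeoutError(f"no result after {timeout_s} seconds")
        sleep(interval_s)
```

Raising `TimeoutError` gives the caller one place to decide whether to retry or report the failure, which matters because generation time varies with server load.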

Best Practices

  • Use specific, detailed prompts for better results
  • Include style references when needed (e.g., "realistic", "cartoon", "anime")
  • Specify aspect ratio if important for the use case
  • Target the "preview" element (ref="preview") for clean screenshot capture
  • Verify image quality and composition before delivery
  • Handle cases where generation fails or produces unsuitable results
  • Save images to standardized directory structure: /workspace/ai_images/doubao/YYYY-MM-DD/
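
The dated directory layout can be sketched as follows. The base path and the `YYYY-MM-DD` folder come from the list above; the `doubao_HHMMSS` file name pattern and the function names are assumptions for illustration. The extension is picked from the file's leading bytes, using the standard PNG, JPEG, and WebP signatures.

```python
from datetime import datetime
from pathlib import Path
from typing import Optional

BASE_DIR = Path("/workspace/ai_images/doubao")


def sniff_extension(data: bytes) -> str:
    """Return a file extension based on the image's magic bytes."""
    if data.startswith(b"\x89PNG\r\n\x1a\n"):
        return ".png"
    if data.startswith(b"\xff\xd8\xff"):
        return ".jpg"
    if data[:4] == b"RIFF" and data[8:12] == b"WEBP":
        return ".webp"
    raise ValueError("unrecognized image format")


def build_save_path(data: bytes, now: datetime, base_dir: Path = BASE_DIR) -> Path:
    """Build <base_dir>/YYYY-MM-DD/doubao_HHMMSS.<ext> (file name is hypothetical)."""
    ext = sniff_extension(data)
    return base_dir / now.strftime("%Y-%m-%d") / f"doubao_{now.strftime('%H%M%S')}{ext}"


def save_image(data: bytes, now: Optional[datetime] = None, base_dir: Path = BASE_DIR) -> Path:
    """Write the image bytes to the dated directory and return the path."""
    path = build_save_path(data, now or datetime.now(), base_dir)
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_bytes(data)
    return path
```

Rejecting unknown byte signatures early also guards against saving an error page or an empty screenshot as if it were an image.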

Limitations

  • Dependent on Doubao web interface availability
  • Subject to Doubao's usage limits and terms of service
  • Download location may vary by system/browser configuration
  • Generation time varies based on server load
  • No guaranteed consistency in image quality or style

Integration Points

  • Can be combined with report generation skills (e.g., xiaoxiyouxi-5min-report)
  • Suitable for avatar creation, concept visualization, or illustration needs
  • Can feed into social media content pipelines
  • Useful for rapid prototyping of visual concepts
Usage Guidance
This skill is inconsistent and raises red flags. Before installing, ask the publisher for: (1) a list of required automation tools (Puppeteer/Selenium, browser binary) and an install spec; (2) how Feishu (or other) delivery is authenticated and where tokens are stored; (3) confirmation they are not instructing the agent to bypass Doubao's terms or bot-detection defenses. Consider legal/ToS risk: automating to 'avoid bot detection' can violate the target site's terms and lead to account suspension or legal issues. If you still want to proceed, only run it in a controlled environment, provide minimal, well-scoped credentials (not long-lived account owner secrets), and prefer using official APIs or partnering with the service instead of automated UI scraping.
Capability Analysis
Type: OpenClaw Skill
Name: doubao-ai-image
Version: 1.0.0
The skill bundle describes a legitimate workflow for automating AI image generation via the Doubao (doubao.com) web interface using browser automation. The instructions in SKILL.md focus on navigating the UI, capturing screenshots of generated images, and saving them to a local workspace directory. No indicators of malicious intent, data exfiltration, or harmful prompt injection were found; the activities are consistent with the stated purpose of providing a free alternative to image generation APIs.
Capability Assessment
Purpose & Capability
The stated purpose (browser-automated image generation on Doubao) matches the instructions, but the skill does not declare or document required automation tooling (e.g., Puppeteer/Selenium, headless browser) or credentials needed for delivery endpoints. Asking the agent to 'mimic human interaction to avoid bot detection' is disproportionate and suggests actions beyond straightforward automation.
Instruction Scope
SKILL.md instructs browser navigation, DOM element targeting, screenshot capture, local file writes to /workspace/ai_images/doubao/, and delivering images to external chat (Feishu) — but it does not declare how the agent will obtain Feishu credentials or the browser automation runtime. The explicit guidance to avoid bot detection is scope-creep and ethically questionable; it may imply bypassing site protections or terms of service.
Install Mechanism
There is no install spec (instruction-only), which reduces file-written-to-disk risk. However, the technical requirements (browser automation, element targeting, user-agent control) imply additional tooling that the skill does not declare or install, creating an operational gap the deployer must fill.
Credentials
The skill requests no environment variables or credentials in metadata but refers to delivering images to Feishu chat and using a user agent/cookies for automation. That delivery/integration normally requires API tokens or web session credentials — their absence in the declared requirements is an inconsistency and could lead to ad-hoc credential handling or insecure storage.
Persistence & Privilege
The skill is not marked always:true and does not request persistent system-wide privileges. It writes to a local workspace path per its instructions, which is expected for an image-generation task. The default ability for the agent to invoke the skill autonomously is enabled (disable-model-invocation: false), which is normal but worth noting.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install doubao-ai-image
  3. After installation, invoke the skill by name or use /doubao-ai-image
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release
Metadata
Slug doubao-ai-image
Version 1.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is 豆包AI图像生成?

Generate free AI images via Doubao web interface using automated browser interaction for detailed and styled visual content without API costs. It is an AI Agent Skill for Claude Code / OpenClaw, with 183 downloads so far.

How do I install 豆包AI图像生成?

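Run "/install doubao-ai-image" in the OpenClaw or Claude Code chat to install it in one step. The skill ships no install spec, so the browser automation tooling listed under Technical Requirements has to be set up separately.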

Is 豆包AI图像生成 free?

Yes, 豆包AI图像生成 is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does 豆包AI图像生成 support?

豆包AI图像生成 is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created 豆包AI图像生成?

It is built and maintained by FANXI-JU (@fanxi-ju); the current version is v1.0.0.
