← Back to Skills Marketplace
wangjipeng977

screenshot-to-prototype

by 王继鹏 · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ✓ Security Clean
33
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install wangjipeng-screenshot-to-prototype
Description
Use when (1) user pastes a UI screenshot and needs editable frontend code prototype.
README (SKILL.md)

Screenshot To Prototype

Use when (1) user pastes a UI screenshot and needs editable frontend code prototype.

Core Position

This skill solves the specific engineering problem of: user pastes a UI screenshot and needs editable frontend code prototype

This skill is NOT:

  • A general-purpose capability that activates on anything
  • A replacement for manual human judgment
  • A tool that stores state or remembers across sessions

This skill IS activated ONLY when the trigger conditions are explicitly met.

Modes

/screenshot-to-prototype

Default mode. Performs the core task end-to-end.

When to use: User provides input matching the trigger conditions above.

Execution Steps

  1. Receive screenshot — User pastes a UI screenshot (PNG/JPG/WebP)

    • Confirm the image is a UI mockup, not a photo or diagram
    • If the image is unclear or not a UI, state: "This skill converts UI screenshots to editable frontend prototypes. Please provide a clear screenshot of a UI design."
  2. Analyze UI structure — Identify key elements from the screenshot:

    • Detect layout structure (header, sidebar, main content, footer)
    • Identify component types: buttons, input fields, cards, tables, navigation
    • Estimate spacing, font sizes, and color scheme from visual cues
  3. Generate component map — Map detected elements to frontend structure:

    • Container elements (div/section) based on layout regions
    • Interactive elements (button, input, select) with appropriate attributes
    • Text content areas with estimated typography (size, weight, color)
  4. Output prototype code — Generate clean, well-structured HTML/CSS (or React/Vue):

    • Use semantic HTML tags matching the detected structure
    • Include inline CSS or basic class names for styling
    • Add placeholder content that reflects the screenshot layout
    • Structure the code so it is immediately readable and editable
  5. Deliver with guidance — Return the code with a brief explanation:

    • State what framework/libraries the output uses
    • Note any assumptions made about sizing or positioning
    • Offer to refine specific components if the user provides more detail

Mandatory Rules

Do not

  • Do not make up facts or claim actions were taken that were not
  • Do not hardcode API keys — use os.getenv("API_KEY") instead
  • Do not store sensitive user data beyond the current session
  • Do not exceed token budget without warning the user first
  • Do not activate for off-topic requests — return a brief decline message

Do

  • Validate all inputs before acting
  • Handle errors gracefully with actionable error messages
  • Log actions taken for auditability
  • State explicitly when you are uncertain or data is insufficient

Quality Bar

A good output:

  • Solves exactly the problem described in the trigger conditions
  • Provides actionable result in the expected format within 3 turns
  • Handles error cases with specific guidance, not generic "try again"
  • States assumptions explicitly when input is ambiguous

A bad output:

  • Solves a different problem than the one triggered
  • Provides a generic "I can't help with that" without explaining why
  • Crashes, hangs, or returns malformed output on valid input
  • Activates for off-topic requests (false positive)

Good vs. Bad Examples

Scenario Bad Output Good Output
Trigger matched "I can help with that." + no action Correct transformation delivered in structured format
Invalid input Crash or wrong result "Missing required field: [X]. Please provide [Y]."
Ambiguous input Guesses and might be wrong States assumption and asks for confirmation
Off-topic request Attempts to help anyway "This skill activates when [trigger]. Please restate your request."

References

  • references/ — Detailed templates, schemas, and edge-case rules for this skill
Usage Guidance
Reasonable to install if you want help turning UI screenshots into editable frontend code. Review generated code before using it, and be aware that the README's API_KEY and write-mode examples look like generic boilerplate rather than actual required behavior.
Capability Tags
requires-sensitive-credentials
Capability Assessment
Purpose & Capability
The stated purpose is narrow and coherent: activate when a user provides a UI screenshot and wants editable frontend prototype code. The main SKILL.md instructions align with that purpose and emphasize input validation and off-topic refusal.
Instruction Scope
README and README_zh contain boilerplate placeholders and an unexplained read/write mode example, but the authoritative SKILL.md narrows activation to UI screenshots, instructs against storing sensitive data, and does not direct autonomous file mutation.
Install Mechanism
Installation is ordinary ClawHub/manual skill-copy guidance. No install hook, dependency install, background service, or persistence mechanism is present in the artifact.
Credentials
Metadata marks the skill as requiring sensitive credentials and README lists API_KEY, but the skill itself does not actually call an external service or require credentials; this is overbroad documentation rather than harmful behavior.
Persistence & Privilege
The skill explicitly says it does not store state or remember across sessions and tells the agent not to store sensitive user data beyond the current session. No privileged, persistent, or credential-reading code was found.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install wangjipeng-screenshot-to-prototype
  3. After installation, invoke the skill by name or use /wangjipeng-screenshot-to-prototype
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release: Converts UI screenshots into editable frontend code prototypes. - Detects layout and UI elements from user-provided screenshots (PNG/JPG/WebP). - Generates structured HTML/CSS (or React/Vue) code reflecting the screenshot design. - Includes clear validation and guidance for ambiguous or invalid inputs. - Provides brief explanations, notes assumptions, and offers refinements on request. - Strictly activates only when a clear UI screenshot is provided.
Metadata
Slug wangjipeng-screenshot-to-prototype
Version 1.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is screenshot-to-prototype?

Use when (1) user pastes a UI screenshot and needs editable frontend code prototype. It is an AI Agent Skill for Claude Code / OpenClaw, with 33 downloads so far.

How do I install screenshot-to-prototype?

Run "/install wangjipeng-screenshot-to-prototype" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is screenshot-to-prototype free?

Yes, screenshot-to-prototype is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does screenshot-to-prototype support?

screenshot-to-prototype is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created screenshot-to-prototype?

It is built and maintained by 王继鹏 (@wangjipeng977); the current version is v1.0.0.

💬 Comments