← Back to Skills Marketplace
cx6226301

Auto Captcha Solver

by bigchen · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ⚠ suspicious
537
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install auto-captcha-solver
Description
Detect and solve simple image captchas during browser automation. Use when flows encounter 4-6 character text, distorted alphanumeric, numeric, rotated, or a...
README (SKILL.md)

Auto Captcha Solver

Use this skill to solve simple captcha images in browser automation.

Supported Captcha Types

  • 4 to 6 character text captchas
  • Distorted alphanumeric captchas
  • Numeric captchas
  • Simple rotated characters
  • Arithmetic captchas (example: 3+8)

Do not use this skill for reCAPTCHA, hCaptcha, sliders, or click-object challenges.

Workflow

  1. Detect a captcha image element from the page.
  2. Capture a screenshot buffer of the captcha.
  3. Run preprocessing (grayscale, contrast normalization, resize, noise reduction).
  4. Run OCR and clean output.
  5. Detect arithmetic patterns and evaluate if needed.
  6. Fill the captcha input and optionally submit.

Capture Guidance

  • Prefer screenshotting only the captcha element, not the full page.
  • Accept only trusted http or https image URLs when reading captcha image source.
  • Reject suspicious schemes like javascript: or file:.
  • Enforce image size and pixel limits before OCR.

Return Format

Return a result object with:

  • solved: boolean
  • value: solved captcha text
  • type: alphanumeric, numeric, arithmetic, or unknown
  • confidence: OCR confidence score
  • hash: SHA1 image hash (cache key)
  • fromCache: optional boolean when a cached answer is used

Module Map

  • solve.js: main entry for solving an image buffer
  • preprocess.js: image normalization pipeline
  • ocr.js: OCR and text cleanup with multiple passes
  • cache.js: SHA1 captcha cache
  • browser.js: automation helpers for Playwright, Puppeteer, and Selenium
Usage Guidance
This package appears to implement what it says (OCR-based solving for simple image captchas), but before installing or running it you should: 1) Be aware the registry metadata omits an install step while package.json lists native dependencies (sharp, tesseract.js) — run npm install in a controlled environment and expect native build artifact downloads. 2) Inspect or avoid executing developer/debug scripts (brute-*.js, debug-run.js) — they contain hard-coded absolute paths (D:/...) that access local files and are likely leftover dev/testing code. 3) Note the skill writes .captcha-verified.json to the current working directory (stores calibrated answers); if that directory is shared/backed-up, solved captcha values will be persisted. 4) If you plan to use the fallbackVision option, treat it cautiously: it can send image buffers to an external vision provider (possible data exfiltration) — only provide a trusted implementation. 5) Run the code in a sandboxed environment first (or review/strip debug scripts) and confirm the dependency installation behavior (native binaries) meets your operational/security policies.
Capability Analysis
Type: OpenClaw Skill Name: auto-captcha-solver Version: 1.0.0 The skill bundle is a legitimate implementation of a captcha solver using Tesseract OCR and the Sharp image processing library. It includes robust security controls such as image buffer validation, size limits, and URL scheme filtering (restricting to http/https/data) in `browser.js` and `preprocess.js`. While the bundle includes several development scripts (`brute.js`, `brute-color.js`, `debug-run.js`) with hardcoded local file paths, these are clearly for offline tuning and are not part of the main execution logic. No evidence of data exfiltration, malicious persistence, or prompt injection was found.
Capability Assessment
Purpose & Capability
Name/description align with the code: modules implement preprocessing, OCR (tesseract.js), caching, and browser helpers for Playwright/Puppeteer/Selenium. However the registry metadata claimed 'instruction-only / no install' while the package includes multiple Node source files and package.json with native dependencies (sharp, tesseract.js) — this mismatch (no install spec but code present) should be explained.
Instruction Scope
SKILL.md describes a scoped workflow that does not request secrets or broad system access, and the main entry points (solve.js, browser.js) stick to that. But several included helper/debug scripts (brute-*.js, debug-run.js) read a hard-coded absolute Windows path (D:/www/openclaw/captcha-solver/captcha.png). Those scripts are not used by the main API but are present in the package and could be run by someone accidentally — they access local filesystem and print OCR output. Also the code writes a local cache file (.captcha-verified.json) in process.cwd(). The skill also supports a fallbackVision hook that, if supplied by a caller, could send images to an external vision provider (this is an optional capability but can lead to external transmission of captcha images).
Install Mechanism
There is no declared install spec in the registry metadata, yet package.json/package-lock are present specifying dependencies (sharp, tesseract.js). Installing will pull native libraries (sharp/libvips) and tesseract.js runtime pieces; these are expected for this functionality but require native build artifacts and may be heavy. The lack of an explicit install step in registry metadata is an inconsistency the user should be aware of.
Credentials
The skill requests no environment variables or credentials and does not embed network endpoints for exfiltration. No suspicious credential access patterns were found. The only I/O is reading image buffers provided by the caller and writing a local verified-cache file.
Persistence & Privilege
The skill is not force-enabled (always:false) and does not request elevated privileges. It does persist solved answers in a file (.captcha-verified.json) in the working directory via the cache module; this is expected for a caching/calibration feature but is persistent on disk and could reveal solved captchas if the directory is shared.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install auto-captcha-solver
  3. After installation, invoke the skill by name or use /auto-captcha-solver
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release
Metadata
Slug auto-captcha-solver
Version 1.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is Auto Captcha Solver?

Detect and solve simple image captchas during browser automation. Use when flows encounter 4-6 character text, distorted alphanumeric, numeric, rotated, or a... It is an AI Agent Skill for Claude Code / OpenClaw, with 537 downloads so far.

How do I install Auto Captcha Solver?

Run "/install auto-captcha-solver" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Auto Captcha Solver free?

Yes, Auto Captcha Solver is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Auto Captcha Solver support?

Auto Captcha Solver is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Auto Captcha Solver?

It is built and maintained by bigchen (@cx6226301); the current version is v1.0.0.

💬 Comments