← Back to Skills Marketplace
ccmagia2-gif

google-image-api-skill

by Maggia · GitHub ↗ · v1.0.0
cross-platform ✓ Security Clean
360
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install google-image-api-skill
Description
This skill helps users automatically extract structured image data from Google Images via BrowserAct API. Agent should proactively apply this skill when user...
README (SKILL.md)

\r \r

Google Image API Automation Skill\r

\r

📖 Introduction\r

This skill provides users with one-click image data extraction directly from Google Images using the BrowserAct Google Image API template. It allows you to search with keywords, set country and language, control scroll depth and result limits, returning clean, structured image metadata directly via API.\r \r

✨ Features\r

  1. No hallucinations, ensuring stable and accurate data extraction: Pre-set workflows avoid generative AI hallucinations.\r
  2. No CAPTCHA issues: No need to deal with reCAPTCHA or other verification challenges.\r
  3. No IP restrictions or geo-blocking: No need to handle regional IP limitations.\r
  4. Agile execution speed: Faster task execution compared to pure AI-driven browser automation solutions.\r
  5. High cost-effectiveness: Significantly reduces data acquisition costs compared to AI solutions that consume a large number of tokens.\r \r

🔑 API Key Guide\r

Before running, you must check the BROWSERACT_API_KEY environment variable. If it is not set, do not take any further action; you should request and wait for the user to provide it collaboratively.\r The Agent must inform the user at this point:\r

"Since you haven't configured the BrowserAct API Key yet, please go to the BrowserAct Console to get your Key first."\r \r

🛠️ Input Parameters\r

The Agent should flexibly configure the following parameters according to user needs when calling the script:\r \r

  1. KeyWords (Search keywords)\r
    • Type: string\r
    • Description: Search keywords used on Google Images.\r
    • Example: flower, ai agent, tesla\r \r
  2. Country (Country or region bias)\r
    • Type: string\r
    • Description: Country or region bias for results.\r
    • Supported values: us, gb, ca, au, de, fr, es, jp, kr\r
    • Default: us\r \r
  3. Language (UI language)\r
    • Type: string\r
    • Description: UI language for the Google Images session and returned text.\r
    • Supported values: en, zh-CN, zh-TW, ja, ko, fr, de, es\r
    • Default: en\r \r
  4. Scroll_count (Number of scroll actions)\r
    • Type: number\r
    • Description: Number of scroll actions to load more image results.\r
    • Default: 5\r \r
  5. Datelimit (Maximum items)\r
    • Type: number\r
    • Description: Maximum number of items to extract from the results list.\r
    • Default: 50\r \r

🚀 Invocation (Recommended)\r

The Agent should execute the following independent script to achieve "results with one command":\r \r

# Example invocation\r
python -u ./scripts/google_image_api.py "KeyWords" "Country" "Language" Scroll_count Datelimit\r
```\r
\r
### ⏳ Execution Status Monitoring\r
Since this task involves automated browser operations, it may take a considerable amount of time (several minutes). The script will **continuously output status logs with timestamps** while running (e.g., `[14:30:05] Task Status: running`).\r
**Agent Notice**:\r
- While waiting for the script to return results, please keep an eye on the terminal output.\r
- As long as the terminal is outputting new status logs, it means the task is running normally; do not mistake it for a deadlock or unresponsiveness.\r
- If the status remains unchanged for a long time or the script stops outputting without returning a result, then consider triggering the retry mechanism.\r
\r
## 📊 Data Output\r
After successful execution, the script will parse and print the results directly from the API response. The results include:\r
- `is_product`: Whether the result is detected as a product-style listing\r
- `link`: Click-through URL associated with the result\r
- `title`: Image result title or caption text\r
- `source_logo`: Source site logo URL\r
- `source`: Source site name shown in results\r
- `related_content_id`: Google Images related content identifier\r
- `thumbnail`: Thumbnail image URL\r
- `index`: Result index in the list\r
\r
## ⚠️ Error Handling & Retry\r
During the execution of the script, if an error occurs (such as network fluctuation or task failure), the Agent should follow this logic:\r
\r
1. **Check the output**:\r
   - If the output **contains** `"Invalid authorization"`, it means the API Key is invalid or expired. In this case, **do not retry**; guide the user to check and provide the correct API Key.\r
   - If the output **does not contain** `"Invalid authorization"` but the task execution fails (for example, the output starts with `Error:` or the result is empty), the Agent should **automatically try executing the script one more time**.\r
\r
2. **Retry limit**:\r
   - Automatic retry is limited to **once**. If the second attempt still fails, stop retrying and report the specific error message to the user.\r
\r
## 🌟 Typical Use Cases\r
1. **Visual Content Sourcing**: Finding specific imagery for creative research and design content.\r
2. **Competitor Asset Monitoring**: Scanning Google Images for competitor product styles and logos.\r
3. **Market Visual Research**: Building datasets of product listings across various countries.\r
4. **Localized Image Trends**: Tracking what images appear for specific terms in Japan (`jp`) or France (`fr`).\r
5. **E-commerce Discovery**: Extracting click-through links to track down where products are sold.\r
6. **Data Enrichment**: Fetching thumbnails and high-level titles associated with keywords.\r
7. **Brand Tracking**: Finding instances of specific brands appearing as image results.\r
8. **SEO Keyword Visualization**: Checking the visual results that rank for chosen SEO keywords.\r
9. **Automated Content Aggregation**: Delivering daily list-level visual metadata for specific topics.\r
10. **Global Image Search**: Finding images related to global events or personalities in their native languages.\r
Usage Guidance
This skill appears to do what it says: it calls BrowserAct workflows to extract image metadata and returns the results. Before installing or providing an API key: 1) Fix or verify the required binary: the manifest lists 'pyhon' — make sure your agent environment has Python available and the agent knows to run 'python'. 2) Verify you trust BrowserAct (https://www.browseract.com) — the script will send your search parameters and receive scraped results via their API, so the key grants that service access to run tasks on your behalf. 3) Limit exposure: use a scoped or dedicated API key, monitor usage, and rotate the key if possible. 4) Confirm legality and terms: scraping search results or storing images may have legal/ToS implications in your use case. 5) Be skeptical of marketing claims in SKILL.md (e.g., 'No CAPTCHA' or 'No IP restrictions') — they may not hold in all cases; test with non-sensitive queries first. If you need higher assurance, inspect network logs while running and review your BrowserAct account activity after first use.
Capability Analysis
Type: OpenClaw Skill Name: google-image-api-skill Version: 1.0.0 The skill is designed to interact with the BrowserAct Google Image API to extract structured image data. The Python script correctly retrieves the API key from environment variables and sends user-provided parameters as JSON data to the `api.browseract.com` endpoint. There is no evidence of data exfiltration beyond the stated purpose, unauthorized execution, persistence mechanisms, or obfuscation. The SKILL.md instructions are clear, align with the stated purpose, and do not contain any prompt injection attempts. A minor typo ('pyhon' instead of 'python') exists in the `SKILL.md` metadata, but this is a benign bug, not a security vulnerability or malicious indicator. The script itself is not vulnerable to shell injection from its command-line arguments, as they are used as data in an API request.
Capability Assessment
Purpose & Capability
The skill's name/description match what the included script does: it calls BrowserAct workflow endpoints to run a Google Images scraping workflow. One incoherence: the declared required binary is spelled 'pyhon' (likely intended to be 'python'), which is a minor but actionable mismatch that could prevent environment checks or automated installs from detecting the real requirement.
Instruction Scope
SKILL.md instructs the agent to check BROWSERACT_API_KEY, run the included python script with search parameters, monitor stdout logs, and retry once on non-auth errors. It does not ask the agent to read unrelated files, other environment variables, or transmit data to unexpected endpoints.
Install Mechanism
No install spec is provided (instruction-only with an included script). The bundled script is small and straightforward; it uses Python and the requests library and makes HTTPS calls to api.browseract.com. There are no downloads from untrusted URLs or archive extraction steps.
Credentials
Only BROWSERACT_API_KEY is required, which is appropriate for calling the BrowserAct API. The skill does not request unrelated secrets or config paths. The README correctly instructs to halt if the key is not present.
Persistence & Privilege
The skill does not request permanent 'always' inclusion, does not modify other skills or system configs, and does not request elevated privileges beyond running a script and making outbound HTTPS calls.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install google-image-api-skill
  3. After installation, invoke the skill by name or use /google-image-api-skill
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
- Initial release of the google-image-api-skill, enabling one-click extraction of structured image data from Google Images using the BrowserAct API. - Supports keyword search, country and language selection, scroll control, and result limit customization. - Outputs clean metadata (e.g., product detection, titles, links, thumbnail, source info). - Handles errors and retries automatically; guides user if API Key is missing or invalid. - Designed for tasks like visual content sourcing, competitor monitoring, market research, and trend tracking.
Metadata
Slug google-image-api-skill
Version 1.0.0
License
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is google-image-api-skill?

This skill helps users automatically extract structured image data from Google Images via BrowserAct API. Agent should proactively apply this skill when user... It is an AI Agent Skill for Claude Code / OpenClaw, with 360 downloads so far.

How do I install google-image-api-skill?

Run "/install google-image-api-skill" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is google-image-api-skill free?

Yes, google-image-api-skill is completely free (open-source). You can download, install and use it at no cost.

Which platforms does google-image-api-skill support?

google-image-api-skill is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created google-image-api-skill?

It is built and maintained by Maggia (@ccmagia2-gif); the current version is v1.0.0.

💬 Comments