← Back to Skills Marketplace
donatasdecodo

Decodo Scraper

by DonatasDecodo · GitHub ↗ · v1.1.0
cross-platform ✓ Security Clean
1467
Downloads
10
Stars
10
Active Installs
3
Versions
Install in OpenClaw
/install decodo-scraper
Description
Search Google, scrape web pages, Amazon product pages, YouTube subtitles, or Reddit (post/subreddit) using the Decodo Scraper OpenClaw Skill.
README (SKILL.md)

Decodo Scraper OpenClaw Skill

Use this skill to search Google, scrape any URL, or fetch YouTube subtitles via the Decodo Web Scraping API. Search outputs a JSON object of result sections; Scrape URL outputs plain markdown; Amazon and Amazon search output parsed product-page or search results (JSON). Amazon search uses --query. YouTube subtitles outputs transcript/subtitles. Reddit post and Reddit subreddit output post/listing content (JSON).

Authentication: Set DECODO_AUTH_TOKEN (Basic auth token from Decodo Dashboard → Scraping APIs) in your environment or in a .env file in the repo root.

Errors: On failure the script writes a JSON error to stderr and exits with code 1.


Tools

1. Search Google

Use this to find URLs, answers, or structured search results. The API returns a JSON object whose results key contains several sections (not all may be present for every query):

Section Description
organic Main search results (titles, links, snippets).
ai_overviews AI-generated overviews or summaries when Google shows them.
paid Paid/sponsored results (ads).
related_questions “People also ask”–style questions and answers.
related_searches Suggested related search queries.
discussions_and_forums Forum or discussion results (e.g. Reddit, Stack Exchange).

The script outputs only the inner results object (these sections); pagination info (page, last_visible_page, parse_status_code) is not included.

Command:

python3 tools/scrape.py --target google_search --query "your search query"

Examples:

python3 tools/scrape.py --target google_search --query "best laptops 2025"
python3 tools/scrape.py --target google_search --query "python requests tutorial"

Optional: --geo us or --locale en for location/language.


2. Scrape URL

Use this to get the content of a specific web page. By default the API returns content as Markdown (cleaner for LLMs and lower token usage).

Command:

python3 tools/scrape.py --target universal --url "https://example.com"

Examples:

python3 tools/scrape.py --target universal --url "https://example.com"
python3 tools/scrape.py --target universal --url "https://news.ycombinator.com/"

3. Amazon product page

Use this to get parsed data from an Amazon product (or other Amazon) page. Pass the product page URL as --url. The script sends parse: true and outputs the inner results object (e.g. ads, product details, etc.).

Command:

python3 tools/scrape.py --target amazon --url "https://www.amazon.com/dp/PRODUCT_ID"

Examples:

python3 tools/scrape.py --target amazon --url "https://www.amazon.com/dp/B09H74FXNW"

4. Amazon search

Use this to search Amazon and get parsed results (search results list, delivery_postcode, etc.). Pass the search query as --query.

Command:

python3 tools/scrape.py --target amazon_search --query "your search query"

Examples:

python3 tools/scrape.py --target amazon_search --query "laptop"

5. YouTube subtitles

Use this to get subtitles/transcript for a YouTube video. Pass the video ID (e.g. from youtube.com/watch?v=VIDEO_ID) as --query.

Command:

python3 tools/scrape.py --target youtube_subtitles --query "VIDEO_ID"

Examples:

python3 tools/scrape.py --target youtube_subtitles --query "dFu9aKJoqGg"

6. Reddit post

Use this to get the content of a Reddit post (thread). Pass the full post URL as --url.

Command:

python3 tools/scrape.py --target reddit_post --url "https://www.reddit.com/r/SUBREDDIT/comments/ID/..."

Examples:

python3 tools/scrape.py --target reddit_post --url "https://www.reddit.com/r/nba/comments/17jrqc5/serious_next_day_thread_postgame_discussion/"

7. Reddit subreddit

Use this to get the listing (posts) of a Reddit subreddit. Pass the subreddit URL as --url.

Command:

python3 tools/scrape.py --target reddit_subreddit --url "https://www.reddit.com/r/SUBREDDIT/"

Examples:

python3 tools/scrape.py --target reddit_subreddit --url "https://www.reddit.com/r/nba/"

Summary

Action Target Argument Example command
Search google_search --query python3 tools/scrape.py --target google_search --query "laptop"
Scrape page universal --url python3 tools/scrape.py --target universal --url "https://example.com"
Amazon product amazon --url python3 tools/scrape.py --target amazon --url "https://www.amazon.com/dp/B09H74FXNW"
Amazon search amazon_search --query python3 tools/scrape.py --target amazon_search --query "laptop"
YouTube subtitles youtube_subtitles --query python3 tools/scrape.py --target youtube_subtitles --query "dFu9aKJoqGg"
Reddit post reddit_post --url python3 tools/scrape.py --target reddit_post --url "https://www.reddit.com/r/nba/comments/17jrqc5/..."
Reddit subreddit reddit_subreddit --url python3 tools/scrape.py --target reddit_subreddit --url "https://www.reddit.com/r/nba/"

Output: Search → JSON (sections). Scrape URL → markdown. Amazon / Amazon search → JSON (results e.g. ads, product info, delivery_postcode). YouTube → transcript. Reddit → JSON (content).

Usage Guidance
This skill implements a Decodo-hosted web-scraping client and requires a Decodo auth token (DECODO_AUTH_TOKEN). Before installing: 1) Verify you trust Decodo (https://decodo.com) because requests and scraped content are sent to scraper-api.decodo.com using your token. 2) Do not store the token in a committed .env file in a public repo — keep it in your environment or a secure secret store. 3) Reconcile the registry metadata mismatch (the registry claims no env vars while SKILL.md and the script require DECODO_AUTH_TOKEN). 4) Install dependencies locally (pip install -r requirements.txt) and inspect the code if you plan to run it in an environment with sensitive data. 5) If you need to prevent autonomous network calls, consider disabling model invocation or only invoking the skill manually. Overall the skill appears coherent for its described purpose, but only proceed if you trust the external Decodo service and follow best practices for handling the token.
Capability Analysis
Type: OpenClaw Skill Name: decodo-scraper Version: 1.1.0 The OpenClaw skill is designed for web scraping via the Decodo API. The `SKILL.md` and `README.md` provide clear, non-malicious instructions for the AI agent and user, with no evidence of prompt injection attempts to subvert agent behavior. The `tools/scrape.py` script uses `argparse` for secure command-line argument parsing and the `requests` library for API communication to `https://scraper-api.decodo.com`. It correctly handles the `DECODO_AUTH_TOKEN` environment variable for authentication. There is no evidence of data exfiltration beyond the stated purpose, persistence mechanisms, obfuscation, or suspicious dependencies in `requirements.txt`. The code's functionality is directly aligned with its described purpose.
Capability Assessment
Purpose & Capability
Name/description, README, SKILL.md, and the included Python tool all consistently implement a Decodo-backed web scraper (Google, Amazon, YouTube subtitles, Reddit, universal pages). The declared runtime requirement (DECODO_AUTH_TOKEN) is appropriate. Small inconsistency: the registry metadata at the top of the report lists no required env vars/primary credential, but SKILL.md and the script both require DECODO_AUTH_TOKEN.
Instruction Scope
SKILL.md and tools/scrape.py only instruct the agent to read a single DECODO_AUTH_TOKEN (or a .env file in the repo root), construct JSON payloads, and POST to scraper-api.decodo.com. The script does not read unrelated system files, other environment variables, or modify other skills or system settings.
Install Mechanism
There is no install spec (instruction-only), and included requirements.txt lists only requests and python-dotenv. No external arbitrary downloads, obscure URLs, or archive extraction are present. The user must install Python deps themselves (pip install -r requirements.txt).
Credentials
The only secret required is DECODO_AUTH_TOKEN (Basic token used in Authorization header) which is proportional to the task. Note the registry metadata did not list this required env var while SKILL.md and the code do — this mismatch should be reconciled before trusting the registry entry.
Persistence & Privilege
Skill is not always-enabled and is user-invocable. It does not request persistent elevated privileges, does not modify other skills or global agent configs, and does not request unrelated credentials.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install decodo-scraper
  3. After installation, invoke the skill by name or use /decodo-scraper
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.1.0
Expanded support for new content types: now includes Amazon, YouTube, and Reddit scraping. - Added commands to scrape Amazon product pages and Amazon search results (outputs structured JSON). - Added the ability to fetch YouTube video subtitles using the video ID. - Added support for scraping content and listings from Reddit posts and subreddits. - Google search output is now a structured JSON object with clearly described sections. - Documentation improved with detailed usage examples and updated summary table.
v0.1.0
- Added homepage and credentials metadata fields to SKILL.md. - Declared DECODO_AUTH_TOKEN as a required environment variable.
v1.0.0
Initial release of the Decodo Scraper OpenClaw Skill. - Search Google or scrape any web page using the Decodo Web Scraping API. - "Search" outputs a JSON array of results; "Scrape URL" outputs markdown content. - Easy command-line usage with clear examples for searching and scraping. - Requires `DECODO_AUTH_TOKEN` authentication. - Errors output as JSON to stderr with process exit code 1.
Metadata
Slug decodo-scraper
Version 1.1.0
License
All-time Installs 10
Active Installs 10
Total Versions 3
Frequently Asked Questions

What is Decodo Scraper?

Search Google, scrape web pages, Amazon product pages, YouTube subtitles, or Reddit (post/subreddit) using the Decodo Scraper OpenClaw Skill. It is an AI Agent Skill for Claude Code / OpenClaw, with 1467 downloads so far.

How do I install Decodo Scraper?

Run "/install decodo-scraper" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Decodo Scraper free?

Yes, Decodo Scraper is completely free (open-source). You can download, install and use it at no cost.

Which platforms does Decodo Scraper support?

Decodo Scraper is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Decodo Scraper?

It is built and maintained by DonatasDecodo (@donatasdecodo); the current version is v1.1.0.

💬 Comments